Skip to main page content
Ideas
  1. Course Search
  2. COMPSCI 1090A

Data Science 1: Introduction to Data Science
COMPSCI 1090A

Jointly Offered with: Faculty of Arts & Sciences as STAT 109A, Faculty of Arts & Sciences as APCOMP 209A

Course Information

Description

Data Science 1 is the first half of a one-year introduction to data science. The course focuses on the analysis of messy, real-life data to make predictions using statistical and machine learning methods. Material covered integrates the five key facets of an investigation using data: (1) data collection – data wrangling and cleaning to obtain a suitable dataset; (2) data management – accessing data quickly and reliably; (3) exploratory data analysis – generating hypotheses and building intuition; (4) prediction or statistical learning – developing and applying models such as linear and logistic regression, k-nearest neighbors, decision trees, and probabilistic approaches based on Bayes’ rule; and (5) communication – summarizing results through visualization, storytelling, and interpretable summaries.

This is the first part of a two-course sequence. The curriculum builds throughout the academic year, and students are strongly encouraged to enroll in both the fall and spring courses within the same academic year.

Course Notes

Only one of CS 1090a (previously CS 109a), AC 209a, Stat 109a, or Stat 121a can be taken for credit. This course was previously numbered CS 109a.

School Faculty of Arts & Sciences
Credits 4
Cross Reg

Available for Harvard Cross Registration

Course Component Lecture
Grading Basis FAS Letter Graded
Course Requirements Not to be taken in addition to Applied Computation 209, or Applied Computation 209A, or Statistics 109A, or Statistics 121, or Statistics 121A.
General Education N/A
Quantitative Reasoning with Data Yes
Divisional Distribution Science & Engineering & Applied Science
Course Level For Undergraduate and Graduate Students