Skip to main page content
Ideas
  1. Course Search
  2. APCOMP 209A

Data Science 1: Introduction to Data Science
APCOMP 209A

Jointly Offered with: Faculty of Arts & Sciences as COMPSCI 1090A, Faculty of Arts & Sciences as STAT 109A

Course Information

Description

Data Science 1 is the first half of a one-year introduction to data science. The course focuses on the analysis of messy, real-life data to make predictions using statistical and machine learning methods. Material covered integrates the five key facets of an investigation using data: (1) data collection – data wrangling and cleaning to obtain a suitable dataset; (2) data management – accessing data quickly and reliably; (3) exploratory data analysis – generating hypotheses and building intuition; (4) prediction or statistical learning – developing and applying models such as linear and logistic regression, k-nearest neighbors, decision trees, and probabilistic approaches based on Bayes’ rule; and (5) communication – summarizing results through visualization, storytelling, and interpretable summaries.

This is the first part of a two-course sequence. The curriculum builds throughout the academic year, and students are strongly encouraged to enroll in both the fall and spring courses within the same academic year.

Course Notes

Only one of CS 1090a, CS 109a, AC 209a, Stat 109a, or Stat 121a can be taken for credit.

School Faculty of Arts & Sciences
Credits 4
Cross Reg

Available for Harvard Cross Registration

Course Component Lecture
Grading Basis FAS Letter Graded
Course Requirements Not to be taken in addition to Computer Science 1090A, or Statistics 109A, or Statistics 121, or Statistics 121A.
General Education N/A
Quantitative Reasoning with Data Yes
Divisional Distribution Science & Engineering & Applied Science
Course Level Primarily for Graduate Students