Welcome to CS109: Data Science
Assignments
- Homework 0: Hello, world (solutions)
- Homework 1: Which of two things is larger? (solutions)
- Homework 2: Desperately Seeking Silver (solutions)
- Homework 3: Bayesian Tomatoes (solutions)
- Homework 4: Do We Really Need Chocolate Recommendations? (solutions)
- Homework 5: Networks and Congress (solutions)
Lecture Supplements
- A gallery of statistical graphs with matplotlib (see also the version with default matplotlib styles)
- A rubric for data wrangling and exploratory data analysis
- Web Scraping and Parsing Demo
- Cross Validation: The Right and Wrong Way
Labs
- Lab 2: Web Scraping
- Lab 3: EDA, Pandas, Matplotlib
- Lab 4: Scikit-Learn, Regression, PCA
- Lab 5: Bias, Variance, Cross-Validation
- Lab 6: Bayes, Linear Regression, and Metropolis Sampling
- Lab 7: Gibbs Sampling
- Lab 8: MapReduce
- Lab 9: Networks
- Lab 10: Support Vector Machines