This is the course website for a previous iteration of the course. If you're looking for the most recent course website, look at practicaldsc.org.
Practical Data Science
EECS 398-003, Fall 2024 at the University of Michigan
Week 1: Introduction, Python
- Tue Aug 27
LEC 1 Introduction
filled html, GitHub, recording, Read: LDS 1
Come meet the staff from 3-4PM in the Diag in front of BBB!
SUR Welcome Survey
- Thu Aug 29
LEC 2 Python and Jupyter Notebooks
EX HW Example Homework (not due!)
- Fri Aug 30
DISC 1 Introductions, Python Review
Week 2: NumPy and Pandas
- Tue Sep 3
LEC 3 NumPy
- Thu Sep 5
LEC 4 Simulation, DataFrame Fundamentals
HW 1 Python Fundamentals
- Fri Sep 6
DISC 2 Arrays and DataFrames
Week 3: More Pandas
- Tue Sep 10
LEC 5 Querying and Grouping
- Thu Sep 12
LEC 6 Grouping, Pivoting, and Merging
- Fri Sep 13
DIS 3 Grouping, Pivoting, and Merging
HW 2 Arrays and DataFrames
Week 4: Exploratory Data Analysis
- Tue Sep 17
LEC 7 EDA, Data Cleaning, and Visualization
- Thu Sep 19
LEC 8 More Visualization, Missing Values
filled html, more visualization examples, GitHub, recording, Read: LDS 11, Read: Missingness, extra videos
HW 3 Grouping, Pivoting, and Merging
- Fri Sep 20
DIS 4 Visualization, Missing Values, More Practice
Week 5: Web Scraping and APIs
- Tue Sep 24
LEC 9 Web Scraping
filled html, GitHub, recording, Read: LDS 14.3, Read: DSP 7.3
Looking for other sources of data for personal projects? Check out our Resources page.
- Thu Sep 26
LEC 10 APIs, Spreadsheets, and SQL
filled html, GitHub, recording, Read: LDS 7, Read: LDS 14.2, extra videos
The extra video linked above walks through the spreadsheets section of the lecture. The Google Sheet used in the video is linked here.
HW 4 Exploratory Data Analysis and Missing Values
- Fri Sep 27
DIS 5 Web Scraping
Week 6: Text Processing
- Tue Oct 1
LEC 11 Regular Expressions
- Thu Oct 3
LEC 12 Text as Data
HW 5 Web Scraping and APIs
- Fri Oct 4
DIS 6 Regular Expressions and Text Features
Week 7: Midterm Exam
- Mon Oct 7
REV Midterm Review (6-8PM, FXB 1109, led by TAs)
- Tue Oct 8
LEC 13 Midterm Review (during lecture, led by Suraj)
OH Office Hours After Lecture
filled slides, filled html, GitHub, recording
In office hours, we walked through questions from past discussions and discussed some of the intricacies of the groupby method. The annotated slides, Jupyter Notebook, and a recording of the session can be found here.
- Wed Oct 9
EXAM Midterm Exam (7-9PM)
- Thu Oct 10
No Lecture: (Early) Fall Break
- Fri Oct 11
No Discussion: (Early) Fall Break
Week 8: Fall Break; Introduction to Modeling
- Tue Oct 15
No Lecture: Fall Break
- Thu Oct 17
LEC 14 Introduction to Modeling
blank slides, filled slides, recording, Read: LDS 4, Read: UCSD 1.1-1.2, extra videos
HW 6 SQL, Regular Expressions, and GPTEECS
- Fri Oct 18
DIS 7 Summary Statistics and the Constant Model
Week 9: Regression
- Tue Oct 22
LEC 15 Simple Linear Regression
- Thu Oct 24
LEC 16 Regression using Linear Algebra
blank slides, filled slides, filled html, GitHub, recording, Read: LDS 15.4-15.5, Read: UCSD 2.2
In addition to the resources above, it's a good idea to review Section 8 of Linear Algebra Review for Data Science (LARDS), titled "Projecting onto the span of multiple vectors", again.
HW 7 Loss Functions and Linear Algebra
GitHub, Read: Maximum Likelihood Estimation (for Question 4), solutions
Note that you will submit Homework 7 as a PDF containing your answers to math questions; you will not submit any code!
- Fri Oct 25
DIS 8 Linear Regression
Week 10: More Regression, Feature Engineering
- Tue Oct 29
LEC 17 Multiple Linear Regression and Feature Engineering
filled html, GitHub, annotations, recording
The "annotations" link contains a PDF of the annotations I drew on today's lecture slides. The formatting issues from today's lecture are fixed.
- Thu Oct 31
LEC 18 Feature Engineering, Continued
- Fri Nov 1
DIS 9 Multiple Linear Regression and Feature Engineering
HW 8 Linear Regression
Week 11: Generalization
- Tue Nov 5
LEC 19 Pipelines, Generalization
filled html, GitHub, annotations, recording, Read: LDS 16
We will still have class on Election Day, but as always, lecture is recorded, so don't hesitate to miss class if you need the time to vote.
- Thu Nov 7
LEC 20 Cross-Validation and Regularization
- Fri Nov 8
DIS 10 Generalization, Cross-Validation, Regularization
Week 12: Regularization, Gradient Descent
- Mon Nov 11
HW 9 Multiple Linear Regression, Feature Engineering
- Tue Nov 12
LEC 21 Regularization, Gradient Descent
- Thu Nov 14
LEC 22 Gradient Descent
- Fri Nov 15
DIS 11 Gradient Descent and Convexity
Week 13: Classification and Logistic Regression
- Tue Nov 19
LEC 23 Introduction to Classification
filled html, GitHub, annotations, recording, Read: LDS 19
There are some slides involving logistic regression in this lecture, but we greatly improved them for Lecture 24, so look there for logistic regression!
- Thu Nov 21
LEC 24 Logistic Regression
- Fri Nov 22
DIS 12 Classifier Evaluation and Logistic Regression
Week 14: More Classification; Thanksgiving
- Mon Nov 25
PR HW Portfolio Homework Checkpoint (no slip days!)
- Tue Nov 26
LEC 25 Thresholds, Multiclass Classification
- Thu Nov 28
No Lecture: Thanksgiving Break
- Fri Nov 29
No Discussion: Thanksgiving Break
Week 15: Conclusion
- Mon Dec 2
HW 10 CV, Regularization, Grad. Desc., and Log. Reg.
GitHub, solutions
See this thread on Ed for an important update! There's an updated notebook (hw10-updated.ipynb), the autograder denominator has been lowered from 24 to 22, and the (optional) prediction competition is extended until 12/9.
- Tue Dec 3
LEC 26 Clustering
- Thu Dec 5
LEC 27 Computer Vision, Conclusion
filled html, GitHub, annotations, recording, Read: Stanford Softmax Notes, Read: PCA TA Notes
I wrote the "PCA TA Notes" above when I was a TA for a similar class; take a look if you're curious, and read this if you want to learn even more about PCA. Neither PCA nor softmax regression is in scope for the Final Exam.
- Fri Dec 6
DIS 13 Group Office Hours (come with questions!)
- Sat Dec 7
PR HW Portfolio Homework (no slip days!)
Week 16: Final Exam
- Mon Dec 9
REV 1 Take Up Midterm Exam (6:30-8:30PM, 1017 DOW)
HW 10 HW 10 Prediction Competition
- Tue Dec 10
REV 2 Review Post-Midterm Content (5-8PM, 1670 BBB)
filled slides, problems, recording
The last hour of the review session doesn't appear in the recording. For that material, watch this video.
SUR End-of-Semester Survey and Official Evals
If at least 85% of the class fills out both by 12/10 at 11:59PM, we'll add 1% of extra credit to everyone's overall grade.
- Thu Dec 12
EXAM Final Exam (4-6PM)