Syllabus
Syllabus
Instructor:
Course Assistants:
Course Overview
This course is an introduction to machine learning and statistics. The course presents
motivation, methods, implementation and some supporting theory for several types of data
analysis, including classification and regression, clustering, unsupervised feature learning, and
multi-layer networks. The main objective of the course is for students to gain an understanding
of and experience with some essential statistical machine learning methodology and practice.
The course will also touch on social impacts of the use of machine learning.
The course will not follow a textbook closely. However, the following book contains some of the
course material: The Elements of Statistical Learning: Data Mining, Inference, and Prediction, by
T. Hastie, R. Tibshirani, and J. Friedman, Springer, 2nd edition. The book is available at:
https://web.stanford.edu/~hastie/ElemStatLearn/index.html
What you will need to succeed in this course:
Some degree of:
Course Structure
Classes will be in person. The slides for each class will be posted before class.
Grading
Assignments: Assignments will include a mix of problem solving and data analysis (coding).
Python will be the course programming language. And all programming assignments must be
handed in using jupyter notebook including cell outputs.
Quizzes
We will hold 2, multiple choice 20 minute quizzes during regular class hours, you will need to
have access to canvas either on a laptop or on your phone.
Exams
Assignments will be due in one week, submitted online on Gradescope by 2PM before start of
class.
Collaboration on homework assignments (other than HW1) with fellow students is encouraged.
However, such collaboration should be clearly acknowledged, by listing the names of the
students with whom you have had any discussions concerning the problem. You may not share
written work or code---after discussing a problem with others, the solution must be written by
yourself.
Tentative Schedule: Blue - supervised learning, Red - unsupervised
learning. Purple - both.
Week 1 Introduction
Week 2 PCA
Classification/Bayes classifier
Week 4 Perceptrons.
SVMs
Midterm - April 25