Data 201 Syllabus Spring25 Section02
Data 201 Syllabus Spring25 Section02
Syllabus
Course Overview:
In this course, you will learn the fundamentals of data processing and modeling in the
context of Data Science. Emphasis will be placed on careful planning and deliberate
decision-making when working with data and building models. Programming will be
done in Python, and we will extensively use the scikit-learn collection.
This course will introduce you to various supervised and unsupervised machine-learning
techniques, including regression, classification, and clustering methods. By the end of the
course, you are not expected to be an expert on any particular technique. Still, you should
exhibit a solid high-level understanding of the goals of each method, be able to
determine when a specific type of model is more or less suitable to a real-world problem,
and, most importantly, demonstrate keen attention to detail when working with data. A
very strong emphasis will be placed on understanding why we are doing what we are
doing.
Learning Objectives:
1. The course will provide you with a critical understanding of the essential aspects and
techniques in data preprocessing, modeling, and data use.
2. You will learn to apply specific methods/algorithms and interpret the results.
3. You will work with real-world problems that are being engaged with by contemporary
problem-solvers and decision-makers.
Class Format:
The class will be in-person. Each class will take the following format:
● Reading: Before the class, you will be pointed to some reading materials (see
"Reading Materials" in the course schedule) . Reading is not required but highly
recommended to do before the class.
● Summary/Elaboration/Q&A: In the class, the instructor will summarize
important points from the reading material, elaborating on details that were not
included in the reading while fielding any questions.
Grading:
Your final grade will be dependent on:
● Midterm: 25%. An online midterm (due on March 20th) will test your
knowledge of the concepts and Machine Learning algorithms presented during
the course. The midterm will be administered via Blackboard.
● Final: 25%. An online final (due on May 11th) will test your knowledge of the
concepts and Machine Learning algorithms presented during the course. The final
will be administered via Blackboard.
● Assignments: 45%. There will be seven assignments in this class. Each
assignment must be completed independently. The assignments will be posted on
Blackboard in a timely manner before the due date. You will have at least five
days to complete each assignment. If you have any questions, please actively
engage in Q/As on Piazza. Assignments will involve a lot of short answers and
developing code to determine the answer.
○ A1: Pandas and Numpy (6%).
○ A2: Statistics and Scaling (6%).
○ A3: Linear Regression (6%).
○ A4: Model Validation and Regularization (6%).
○ A5: Dimensionality Reduction (7%).
○ A6: Classification: Logistic Regression & KNN (7%).
○ A7: Classification: Decision Trees (7%).
● Participation: 5%. We will have occasional classwork that will count towards
participation. Participation does not necessarily mean volunteering in class
– Sometimes I will call on you randomly, sometimes I will use PollEverywhere
to allow you to respond electronically, and you can participate on the course
Piazza page by answering questions posed by other students. You also participate
by being engaged – this means not working on other coursework doing class, and
basically not doing anything else besides paying attention and taking notes.
● There will be extra credit opportunities in the Assignments and Exams. Keep a
sharp eye.
Readings:
Students should be able to understand the course content just by following the lecture and
by doing the readings. However, the following textbooks serve as good references.
● Introduction to Data mining, 1E [online][pdf];
● Introduction to Data mining, 2E [online];
● Python for Data Analysis, 3E [online];
Course Schedule:
3 01/30 Python and Pandas Data Frames Python for Data Analysis Chapter 5
(.pdf provided on BB)
26 04/29 Clustering
Important Dates:
1. The first day of add-and-drop is January 21st.
2. The first day of classes is January 22nd.
3. The last day of add-and-drop is January 31st.
4. The first day of the withdrawal period is February 1st.
5. Spring Break (no classes) March 8th - March 16th.
6. The last day of the withdrawal period is March 24th.
7. The last day of classes is May 2nd.
8. Final grades due for all students by 9 a.m is May 20th.
Artificial Intelligence:
The use or incorporation of any AI-generated content (from ChatGPT, Bard, Dall-e, etc.)
in graded assignments or exams is prohibited.
Letter Grades:
95 – 100 % A 72 – 74.99% C
90 – 94.99% A- 70 – 71.99% C-
85 – 89.99% B+ 60 – 69.99% D+
82 – 84.99% B
80 – 81.99% B-
75 – 79.99% C+ 0 – 59.99% F
Honor Code:
Academic integrity is at the heart of the university, and we all are responsible for
upholding the ideals of honor and integrity. The student-led honor system is responsible
for resolving any suspected violations of the Honor Code, and I will report all suspected
instances of academic dishonesty to the honor system. The Student Handbook
(www.wm.edu/studenthandbook) includes your responsibilities as a student. Your full
participation and observance of the Honor Code are expected. To read the Honor Code,
see www.wm.edu/honor.
Student Health:
William & Mary recognizes that students juggle different responsibilities and can face
challenges that make learning difficult. There are many resources available at W&M to
help students navigate emotional/psychological, physical/medical, material/accessibility
concerns, including:
1. The W&M Counseling Center at (757) 221-3620. Services are free and confidential.
2. The W&M Health Center at (757) 221-4386.
3. To seek assistance for interpersonal, academic, and wellness challenges, please contact
Care Support Services at wm.edu/care ([email protected]).
Student Accessibility:
William & Mary accommodates students with disabilities in accordance with federal laws
and university policy. Any student who feels they may need an accommodation based on
the impact of a learning, psychiatric, physical, or chronic health diagnosis should contact
Student Accessibility Services staff at 757-221-2512 or at [email protected] to determine if
accommodations are warranted and to obtain an official letter of accommodation. For
more information, please see www.wm.edu/sas.
Expectations:
All students are strongly encouraged to bring their laptops/notebook computers to class
since we plan on having a significant amount of active coding. Do not forget pen and
papers.
Possible Changes:
Any changes to the following course schedule or due dates will be announced in class
and on Blackboard ahead of time.
Student Success:
Student success supports each student's personal growth, development, engagement, and
belonging by providing holistic guidance as they navigate their own W&M journey. It
encompasses the offices of Academic Wellbeing, Care Support Services, and Student
Accessibility Services.
For academic support such as tutoring, time management, study skills, and academic
coaching, please contact Academic Wellbeing at wm.edu/academicwellbeing
([email protected]).
For concerns about the wellbeing of a member of the William & Mary community or to
seek assistance for interpersonal, academic, and wellness challenges, please contact Care
Support Services at wm.edu/care ([email protected]).
Additional Information:
William & Mary values inclusiveness most highly and believes that diversity is critical to
equity and the pursuit of academic excellence. Our goal is to create and sustain an
environment in which diversity can thrive. A diverse faculty, student body,
administration, and curriculum together foster learning and enhance excellence. We seek
to promote an environment of inclusion and to maintain a safe, nurturing community that
is respectful of our differences and what we share.