DA517 Machine Learning II: Syllabus (Spring 2020-2021)

Download as pdf or txt
Download as pdf or txt
You are on page 1of 3

Syllabus (Spring 2020-2021)

DA517 Machine Learning II

Meeting Times
Saturday 13:00 - 16:00

Wednesday 19:00 - 22:00

Instructor
İnanç Arın e-mail: [email protected]

Teaching Assistant
Yalçın Can Kılıç email: [email protected]

Course Description, Aim and Content


In the scope of this course we will cover various machine learning and data mining methods,
namely the associations, clustering and classification applied to different data types. We will look
at recommendation systems and personalization where data mining models play an important
role. Students are expected to understand the fundamental theory behind each technique, as
well as implementing them using an environment such as Python. Some machine learning
concepts will be recovered by applying them on Python. This course also aims to improve
practical programming skills of the students for implementing basic/advanced machine learning
techniques. Therefore, we will be developing lots of Python applications to reinforce our
theoretical knowledge.

1
Schedule1
1. (Stochastic) Gradient Descent
2. Some Machine Learning Concepts (Recap with Python)
a. End to end machine learning application, Imbalanced dataset
b. Overfitting/Underfitting
c. Handling missing values
3. Association Rules (maybe)
4. Clustering Algorithms
a. Partition Based Clustering Algorithms
b. Hierarchical Clustering Algorithms
c. Density Based Clustering Algorithms and Outlier Detection with DBSCAN
d. How to use unsupervised learning to improve supervised learning results
5. Text Mining
a. Text Clustering
i. Longest Common Subsequence/Substring
b. Text Classification
c. Data Structures/Indexing Methods to Improve Text Clustering (Locality Sensitive
Hashing)
6. Dimensionality Reduction
a. Linear Projection
i. PCA (Incremental & Kernel PCA)
ii. SVD
b. Manifold Learning
i. t-SNE
7. Data Mining for Recommendation Systems and Personalization
8. Artificial Neural Network with Keras
a. Sequential API
b. Functional API
c. Wide & Deep Neural Networks
d. Callbacks
9. Advanced end to end machine learning application development

1
Instructor may change the contents and the order of the topics during the semester

2
Grading
● Take-Home Exams 60% (Group & individual exercises)
● Final Exam 40% (Both theoretical and practical with Python)

References
● Data Mining Concepts and Techniques, Jiawei Han & Micheline Kamber & Jian Pei,
Morgan Kaufmann (3rd edition)
● Patel, A. A. (2019). Hands-On Unsupervised Learning Using Python: How to Build Applied
Machine Learning Solutions from Unlabeled Data. O'Reilly Media.
● Hapke, H. M., Lane, H., & Howard, C. (2019). Natural language processing in action.
● Géron, Aurélien. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow:
Concepts, Tools, and Techniques to Build Intelligent Systems. O'Reilly Media, 2019.
● Scikit-Learn: Machine Learning in Python https://scikit-learn.org/stable/index.html
● Some other additional resources will be shared in our course Google Drive folder.

Announcements and SUCourse+


Students are responsible for all announcements made during the regular class meetings.
Students should check their Sabanci University e-mail accounts (mymail.sabanciuniv.edu) and
follow the SUCourse+ site (https://sucourse.sabanciuniv.edu/plus/) for this class regularly as they
are responsible for all announcements and postings delivered through these sources.

Important Note: This document may be modified during the semester (Spring 2020-2021) due to
unforeseen reasons.

You might also like