DM Lecture 1 Introudction and Policies
DM Lecture 1 Introudction and Policies
Warehousing
Faizad Ullah
1
About Me
Faizad Ullah
Ph.D. Student at LUMS
Specialization
Natural Language Processing (NLP)
Machine Learning
Data Science
Contributions
Medical Image Analysis
Graph Analysis
Text Analytics of Low-Resourced Language
2
Course Description
Theoretical (3 Credit Hrs Course)
Emphasis on hands-on
*3-5 Assignments
Programming Assignments
*One Project
Programming Environment
Python (Pytoch, TensorFlow, Colab)
Midterm 20%
4
Policies
Quizzes
Most quizzes are announced (50% quizzes will be unannounced)
Announcements will be made during the class and on slides, so check slides regularly if you miss
lectures. No announcement will be sent via email
Sharing
Copying is not allowed for assignments. Discussions are encouraged; however, you must submit your
own work.
Violators would be reported to the Disciplinary Committee or face marks reduction penalties
Plagiarism
Do NOT pass someone else’s work as your own!
Write in your own words and cite the reference if you use someone else’s material.
5
Policies (2)
Submission Policy
Submissions are due at the day and time specified
Late submissions will result in 50% marks deduction per day from obtained marks (i.e., 2 days
late submission will get zero credit).
Attendance Policy
You are advised to attend all lectures.
It’s the students’ responsibility to recover any information or announcements posted during a
lecture from which they were absent.
Classroom behavior
Maintain classroom sanctity by remaining quiet and attentive
Asking questions is encouraged.
You are not allowed to use a Laptop/mobile phone, etc., during class.
6
Policies (3)
Retakes
No retakes for quizzes, assignments, exams, or projects
In case of any medical emergency or unavoidable circumstances, inform before hand and seek a formal
approval. You need to share medical reports for departmental record.
Do not wait for the final exam to seek approval for retakes
7
Course Material
All course material (i.e.,Books, class handouts, reading
assignments) will be shared on Moodle
Text Book
Data Mining: Concepts and Techniques, 3rd ed, by Jiawein Han, Micheline Kamber, Jian Pei
Pattern Classification, 2nd Edition, by Richard O. Duda (Author), Peter E. Hart (Author), David G. Stork
Reference Book
Mining Massive Datasets, by Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman
8
Contact
How to contact me?
E-mail: Will share soon
Office:
Office Hours: Mentioned on office door
9
Most Important
11
Data is Everywhere
• There has been enormous data growth in both
commercial and scientific databases due to advances
in data generation and collection technologies
E-Commerce
Traffic Patterns
Gather whatever data you can
whenever and wherever possible.
• Alternative Names:
• Knowledge discovery (mining) in databases (KDD), knowledge extraction,
data/pattern analysis, data archeology, data dredging, information
harvesting, business intelligence, etc. 15
Data Mining: Confluence of Multiple Disciplines
16
• Comments