Week 1 Introduction To ML
Week 1 Introduction To ML
1 03/01/2022
What is this course about
2
Course planning
Assignment
3
Course overview …..
4
Course overview
5
1.1
Introduction to Data Science
6
The Era of Big Data
90% of the information ever generated was generated in the last two
years?
8
What is Data Science ?
Data Science is an umbrella term and it's basically the marriage of
many different fields.
9
What is Data Science ?
Definition of Data Science according to “Drew Conway”
10
What is Data Science ?
11
1.2
Introduction to Machine
Learning
12
What is Machine Learning ?
Artificial Intelligence (AI) and Machine Learning (ML) are the part of
computer science that are correlated with each other.
These two technologies are the most trending technologies which are
used for creating intelligent systems.
13
What is Machine Learning ?
Researchers interested in artificial intelligence wanted to see if
computers could learn from data.
14
What is Machine Learning ?
BUT, it is a science that’s gaining fresh momentum: the ability to
automatically apply complex mathematical calculations to big data –over and
over, faster and faster – is a recent development
15
15
What is Machine Learning ?
Google trends for the term “Machine Learning”
Definition of Machine Learning
Machine learning is the subfield of computer science that "gives computers the
ability to learn without being explicitly programmed" (Arthur Samuel, 1959)
Regression
Machine Learning in this course
Classification (Logistic Regression)
Reinforcement
Supervised learning Unsupervised learning
learning
Dimensionality
Regression Classification Clustering Q-learning
reduction
Logistic Random
SVM k-NN
regression forest
1.3
Machine Learning Tools
31
Machine Learning Tools
Python
Python is a high level language
Java & C: write code, compile and run, and then watch the output
Full featured
Not just a statistics language, but has full capabilities for data acquisition,
cleaning, databases, high performance computing, and more
Libraries:
Pandas
Numpy
Scipy
Scikit-Learn
Interactive tools:
Spyder: IDE for python
Jupyter Notebook: A web application that allows to:
create and share documents that contain live code, equations, visualizations and
explanatory text. Uses include: data cleaning and transformation, numerical simulation,
statistical modeling, machine learning and much more.
Pandas
https://pandas.pydata.org/pandas-docs/stable/
Pandas Series
Pandas DataFrame
Pandas DataFrame
Thank you for your
attention
41
Practical work
42