ML1-Introduction To Machine Learning
ML1-Introduction To Machine Learning
1
Giới thiệu về AI
2
Giới thiệu về AI
3
Giới thiệu về AI
4
Overview: AI, ML, vs Deep Learning
5
What is Machine Learning?
6
What is Machine Learning?
7
Traditional vs Machine Learning
Approach
8
Traditional Programming
9
Machine Learning Programming
10
What is Machine Learning?
11
Machine Learning in Action
12
Machine Learning in Action
13
Machine Learning in Action
• Document Classification
14
Machine Learning in Action
15
Machine Learning in Action
The self-driving
16
Machine Learning in Action
Many, many more…
Machine Learning.
Data Science, Mining and Knowledge Discovery.
Computer Vision.
Natural Language Processing.
Speech Recognition.
Evolutionary and Natural Computation.
Fuzzy Computation and Technologies.
Artificial Life.
Knowledge-Based Systems.
Automated Reasoning.
Logic and Constraint Programming.
Intelligent Planning.
........ 17
What this course is about
18
Machine Learning Tasks
Broad categories-
• Supervised learning: classification, regression
• Unsupervised learning: Clustering, density estimation
• Semi-supervised learning
• Reinforcement learning
• Many more…
19
Algorithms
Supervised Unsupervised
learning learning
Semi-supervised 20
Machine learning structure
• Supervised learning
Machine learning structure
• Unsupervised learning
Supervised Learning
23
Supervised Learning - Classification
24
Supervised Learning Problems
• Loan application
25
Supervised Learning
27
Unsupervised Learning - Density Estimation
28
Unsupervised Learning - Clustering
29
Unsupervised Learning - Clustering
30
Defining the Learning Task
Improve on task, T, with respect to performance metric, P, based on
experience, E.
T: Playing checkers
P: Percentage of games won against an arbitrary opponent
E: Playing practice games against itself
Learner
Environment/
Experience Knowledge
Performance
Element
32
Train, Validation, Test Set in ML
33
Training Set
34
Validation Set
36
Validation Set: Overfitting Problem
• As the picture
below, dots are
training data and
curves are your
algorithms. The
green curve is
overfitting and the
black is good.
37
Test Set
38
Some terminology in machine learning
41
Learning
• Machine learning is indeed about automated
learning with algorithms:
• Induction: Machine learning algorithms learn
through a process called induction or inductive
learning.
• Generalization make predictions or decisions
based on specific data instances that were not
seen during training
• Over-Learning is poor performance on data
other than the training dataset. This is also called
42
over-fitting
Learning
• Under-Learning: When a model has not learned
enough structure from the database. This is also
called under-fitting.
• Online Learning is when a method is updated
with data instances from the domain as they
become available
• Offline Learning is when a method is created
on pre-prepared data and is then used
operationally on unobserved data
43
Learning
• Supervised Learning: This is a learning process
for generalizing on problems where a prediction
is required. A “teaching process” compares
predictions by the model to known answers and
makes corrections in the model.
• Unsupervised Learning: This is a learning
process for generalizing the structure in the data
where no prediction is required. Natural
structures are identified and exploited for
relating instances to each other.
44
Modeling
• Model Selection is the process of configuring
and training the model as a model selection
process. Each iteration we have a new model.
• Inductive Bias: Bias is the limits imposed on
the selected model. All models are biased which
introduces error in the model, and by definition
all models have error
• Model Variance: Variance is how sensitive the
model is to the data on which it was trained. A
machine learning method can have a high or a
low variance when creating a model on a dataset
45
Modeling
• Bias-Variance Tradeoff: Model selection can
be thought of as a the trade-off of the bias and
variance. A low bias model will have a high
variance and will need to be trained for a long
time or many times to get a usable model. A
high bias model will have a low variance and
will train quickly, but suffer poor and limited
performance
46