Chapter 1 - Introduction
Chapter 1 - Introduction
Chapter 1 - Introduction
• Arthur Samuel (1959): "Field of study that gives computers the ability to learn without
being explicitly programmed"
• Tom Mitchell (1997): "A computer program is said to learn from experience E with
respect to some class of tasks T and performance measure P, if its performance at tasks in
T, as measured by P, improves with experience E".
Machine Learning
Machine Learning
Machine Learning
Experience
Prediction
7 + + + - + ?
8 + - + - + ?
9 + + + - - ?
Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 3 / 23
Machine Learning
What is learning?
Data + Label
• Supervised learning: the learner (learning algorithm) are trained on labeled examples, i.e.,
input where the desired output is known.
• Unsupervised learning: the learner operates on unlabeled examples, i.e., input where the
desired output is unknown.
grouping/clustering
BRK O YP ?KMRS O OK S Q
F KS S Q FO S Q 4 VcS Q
F KS S Q FO S Q DOKV
7K K 7K K 7K K
Validation set
avg performance
• K-fold cross validation: (for small model and small data)
– Randomly partitioned k equal sized sub-samples.
– k - 1 for training and 1 for testing.
– k times (folds) of validation and taking the average.
good checkpoint
Overfitting
• Precision:
number of correct system answers
P =
number of system answers
• Recall:
number of correct system answers
R=
number of correct problem answers
số ng dự đoán bị thật sự / số ng bị trong dataset
TP
P recision =
TP + FP
TP
Recall =
TP + FN
TP + TN
Accuracy =
TP + TN + FP + FN
Price
N ??
Example Quality Price Buy
Quality 1 Good Low Yes
2 Bad High No
Y 3 Good High ?
??
4 Bad Low ?
• A learner that makes no prior assumptions regarding the identity of the target
concept cannot classify any unseen instances.
• A learner that makes no prior assumptions regarding the identity of the target
concept cannot classify any unseen instances.
• A learner that makes no a priori assumptions regarding the identity of the target concept
has no rational basic for classifying any unseen instances.
• A learner that makes no prior assumptions regarding the identity of the target
concept cannot classify any unseen instances.
• A learner that makes no a priori assumptions regarding the identity of the target concept
has no rational basic for classifying any unseen instances.
• The inductive bias (learning bias): the set of assumptions that the learner uses to predict
outputs given inputs that it has not encountered.