0% found this document useful (0 votes)

12 views

Chapter 1 - Introduction

This document is the introduction to a lecture on machine learning. It defines machine learning as programs that improve automatically through experience, and as the study of algorithms and models that learn patterns from data to perform tasks without explicit instructions. It discusses supervised, unsupervised, reinforcement, and evolutionary learning. Key phases of machine learning projects are presented as training, validation, and testing data sets to evaluate performance and avoid overfitting. Common performance measures like precision, recall, and F1 score are also introduced.

Uploaded by

Gia Khang Tạ

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

Chapter 1 - Introduction

Uploaded by

Gia Khang Tạ

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Machine Learning

Chapter 1 - Introduction

Lecturer: Duc Dung Nguyen, PhD.

Contact: [email protected]

Faculty of Computer Science and Engineering

Hochiminh city University of Technology
Machine Learning

What is Machine learning?

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 1 / 23

Machine Learning

What is Machine learning?

• Arthur Samuel (1959): "Field of study that gives computers the ability to learn without
being explicitly programmed"
• Tom Mitchell (1997): "A computer program is said to learn from experience E with
respect to some class of tasks T and performance measure P, if its performance at tasks in
T, as measured by P, improves with experience E".

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 1 / 23

Machine Learning

• How to construct programs that automatically improve with experience.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 2 / 23

Machine Learning

• How to construct programs that automatically improve with experience.

• The scientific study of algorithms and statistical models that computer systems use to
perform a specific task without using explicit instructions, relying on patterns and
inference instead.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 2 / 23

Machine Learning

• How to construct programs that automatically improve with experience.

• The scientific study of algorithms and statistical models that computer systems use to
perform a specific task without using explicit instructions, relying on patterns and
inference instead.
• A subset of artificial intelligence.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 2 / 23

Example

Experience

Example Gray? Mammal? Large? Vegetarian? Wild? Elephant

1 + + + + + +
2 + + + - + +
3 + + - + + -
4 - + + + + -
5 + - + - + -
1 + + + + - +

Prediction

7 + + + - + ?
8 + - + - + ?
9 + + + - - ?
Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 3 / 23
Machine Learning

What is learning?

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 4 / 23

Machine Learning

Learning is an (endless) generalization or

induction process.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 5 / 23

Types of Machine Learning

Data + Label
• Supervised learning: the learner (learning algorithm) are trained on labeled examples, i.e.,
input where the desired output is known.
• Unsupervised learning: the learner operates on unlabeled examples, i.e., input where the
desired output is unknown.
grouping/clustering

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 6 / 23

Types of Machine Learning

• Reinforcement learning: between supervised and unsupervised learning. It is told when an

answer is wrong, but not how to correct it.
• Evolutionary learning: biological evolution can be seen as a learning process, to improve
survival rates and chance of having offspring.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 7 / 23

Types of Machine Learning

• The most common type: supervised learning.

– Classification: to find the class of an instance given its selected features.
– Regression: to find a function whose curve passes as close as possible to all of the given
data points.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 8 / 23

Phases of Machine Learning

How many phase do we have in machine

learning?

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 9 / 23

Phases of Machine Learning

BRK O YP ?KMRS O OK S Q

F KS S Q FO S Q 4 VcS Q

F KS S Q FO S Q DOKV
7K K 7K K 7K K
Validation set

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 10 / 23

Phases of Machine Learning

avg performance
• K-fold cross validation: (for small model and small data)
– Randomly partitioned k equal sized sub-samples.
– k - 1 for training and 1 for testing.
– k times (folds) of validation and taking the average.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 11 / 23

Phases of Machine Learning

Statistical significance test: to reject the null-hypothesis that the two

compared systems are equivalently efficient although their performance measures
are different.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 12 / 23

Phases of Machine Learning

loss of test increase

underfitting
-> overfitting

good checkpoint

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 13 / 23

Phases of Machine Learning

Overfitting

• There is noise in the data

• The number of training examples is too small to produce a representative sample of the
target concept.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 14 / 23

Performance Measures

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 15 / 23

Performance Measures

số ng dự đoán bị thật sự / số ng hệ thống dự đoán bị

• Precision:
number of correct system answers
P =
number of system answers
• Recall:
number of correct system answers
R=
number of correct problem answers
số ng dự đoán bị thật sự / số ng bị trong dataset

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 16 / 23

Performance Measures

Trade-off Precision vs Recall

TP
P recision =
TP + FP
TP
Recall =
TP + FN
TP + TN
Accuracy =
TP + TN + FP + FN

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 17 / 23

Performance Measures

F1 score: want to seek a balance between Precision and Recall

It is good when there is an uneven class distribution.
P ∗R
F1 = 2
P +R

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 18 / 23

Inductive Bias

Price

N ??
Example Quality Price Buy
Quality 1 Good Low Yes
2 Bad High No
Y 3 Good High ?
??
4 Bad Low ?

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 19 / 23

Inductive Bias

• A learner that makes no prior assumptions regarding the identity of the target
concept cannot classify any unseen instances.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 20 / 23

Inductive Bias

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 20 / 23

Inductive Bias

• A learner that makes no prior assumptions regarding the identity of the target
concept cannot classify any unseen instances.
• A learner that makes no a priori assumptions regarding the identity of the target concept
has no rational basic for classifying any unseen instances.
• The inductive bias (learning bias): the set of assumptions that the learner uses to predict
outputs given inputs that it has not encountered.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 20 / 23

Inductive Bias

Common inductive bias in ML:

• Maximum conditional independence: if the hypothesis can be cast in a Bayesian

framework, try to maximize conditional independence (Naive Bayes classifier).

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 21 / 23

Inductive Bias

Common inductive bias in ML:

• Maximum conditional independence: if the hypothesis can be cast in a Bayesian

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 21 / 23

Inductive Bias

Common inductive bias in ML:

• Maximum conditional independence: if the hypothesis can be cast in a Bayesian

framework, try to maximize conditional independence (Naive Bayes classifier).
• Minimum cross-validation error: when trying to choose among hypotheses, select the
hypothesis with the lowest cross-validation error.
• Maximum margin: when drawing a boundary between two classes, attempt to maximize
the width of the boundary (SVM). The assumption is that distinct classes tend to be
separated by wide boundaries.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 21 / 23

Inductive Bias

Common inductive bias in ML:

• Minimum description length: when forming a hypothesis, attempt to minimize the

length of the description of the hypothesis. The assumption is that simpler hypotheses are
more likely to be true.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 22 / 23

Inductive Bias

Common inductive bias in ML:

• Minimum description length: when forming a hypothesis, attempt to minimize the

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 22 / 23

Inductive Bias

Common inductive bias in ML:

• Minimum description length: when forming a hypothesis, attempt to minimize the

length of the description of the hypothesis. The assumption is that simpler hypotheses are
more likely to be true.
• Minimum features: unless there is good evidence that a feature is useful, it should be
deleted.
• Nearest neighbors: assume that most of the cases in a small neighborhood in feature
space belong to the same class.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 22 / 23

ML Tech Neo Study
No ratings yet
ML Tech Neo Study
146 pages
Stats 101c Final Project
100% (1)
Stats 101c Final Project
16 pages
Chapter 3 - Bayesian Learning
No ratings yet
Chapter 3 - Bayesian Learning
40 pages
ML1-Introduction To Machine Learning
No ratings yet
ML1-Introduction To Machine Learning
46 pages
ML.1-Overview of ML (Week 1)
No ratings yet
ML.1-Overview of ML (Week 1)
24 pages
Chap-6 Machine Learning Introduction
No ratings yet
Chap-6 Machine Learning Introduction
49 pages
DL-ppt
No ratings yet
DL-ppt
100 pages
Schapire MachineLearning
No ratings yet
Schapire MachineLearning
38 pages
Module 2_Deep_Learning_Fundamentals
No ratings yet
Module 2_Deep_Learning_Fundamentals
98 pages
Machine Learning Notes (1)
No ratings yet
Machine Learning Notes (1)
19 pages
03-Introduction To Machine Learning - DNN
No ratings yet
03-Introduction To Machine Learning - DNN
35 pages
Chapter 5 Machine Learning
No ratings yet
Chapter 5 Machine Learning
96 pages
ML 1 2 3
No ratings yet
ML 1 2 3
54 pages
Module 1 ML
No ratings yet
Module 1 ML
51 pages
Chapter 5 - Graphical Models
No ratings yet
Chapter 5 - Graphical Models
65 pages
Chapter 5 - Machine Learning Basics
No ratings yet
Chapter 5 - Machine Learning Basics
58 pages
Machine Learning
No ratings yet
Machine Learning
44 pages
Lecture 3 Deep Learning
No ratings yet
Lecture 3 Deep Learning
98 pages
Key Ideas in Machine Learning
No ratings yet
Key Ideas in Machine Learning
11 pages
Introductiontomachinelearning 230723174746 1a0e5edc
No ratings yet
Introductiontomachinelearning 230723174746 1a0e5edc
27 pages
unit 1
100% (1)
unit 1
13 pages
Elements of Machine Learning
No ratings yet
Elements of Machine Learning
116 pages
Lesson 4 -Introduction Machine Learning
No ratings yet
Lesson 4 -Introduction Machine Learning
44 pages
Karthik
No ratings yet
Karthik
10 pages
Machine Learning Practical File
No ratings yet
Machine Learning Practical File
41 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
11 pages
Module 4
No ratings yet
Module 4
28 pages
AI321: Theoretical Foundations of Machine Learning: Dr. Motaz El-Saban
No ratings yet
AI321: Theoretical Foundations of Machine Learning: Dr. Motaz El-Saban
44 pages
Machine Learning: Louis Fippo Fitime
No ratings yet
Machine Learning: Louis Fippo Fitime
37 pages
Unit1 ML NGP
No ratings yet
Unit1 ML NGP
106 pages
UNIT-I
No ratings yet
UNIT-I
132 pages
Project Report 2
No ratings yet
Project Report 2
11 pages
AI.5 Machine Learning (21 26)
No ratings yet
AI.5 Machine Learning (21 26)
176 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
45 pages
Unit I
No ratings yet
Unit I
150 pages
Unit 1
No ratings yet
Unit 1
62 pages
3 - Machine Learning Overview
No ratings yet
3 - Machine Learning Overview
30 pages
ML-chap-2
No ratings yet
ML-chap-2
60 pages
Introduction to ML Unit-1 PPT
No ratings yet
Introduction to ML Unit-1 PPT
90 pages
Lec2 Intro to ML
No ratings yet
Lec2 Intro to ML
35 pages
Unit-1 Introduction To Machine Learning
No ratings yet
Unit-1 Introduction To Machine Learning
24 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
27 pages
Machine Learning Moudle - 1: There Are Three Main Types of Machine Learning
No ratings yet
Machine Learning Moudle - 1: There Are Three Main Types of Machine Learning
86 pages
Lesson3-IntroML
No ratings yet
Lesson3-IntroML
46 pages
Introduction To Machine Learning EECS 6327
No ratings yet
Introduction To Machine Learning EECS 6327
22 pages
18.Overview
No ratings yet
18.Overview
18 pages
Unit 1
No ratings yet
Unit 1
51 pages
Machine Learning Models: by Mayuri Bhandari
No ratings yet
Machine Learning Models: by Mayuri Bhandari
48 pages
machineLearning-unit1
No ratings yet
machineLearning-unit1
9 pages
ML Lectures Summary 2
No ratings yet
ML Lectures Summary 2
52 pages
21AI63 Module 1
No ratings yet
21AI63 Module 1
38 pages
ML m1-m5 NOTES
No ratings yet
ML m1-m5 NOTES
160 pages
Unit 1-2
No ratings yet
Unit 1-2
22 pages
UNIT 1
No ratings yet
UNIT 1
38 pages
ML
No ratings yet
ML
19 pages
Basics of Machine Learning
100% (4)
Basics of Machine Learning
22 pages
LINFO2262: Machine Learning: Classification and Evaluation
No ratings yet
LINFO2262: Machine Learning: Classification and Evaluation
39 pages
21ai63 Mod 1
No ratings yet
21ai63 Mod 1
38 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
15 pages
ML Lec 1
No ratings yet
ML Lec 1
47 pages
Machine Learning: Fundamentals and Applications
From Everand
Machine Learning: Fundamentals and Applications
Fouad Sabry
No ratings yet
An Explainable Transformer-Based Model For Phishing Email Detection: A Large Language Model Approach
No ratings yet
An Explainable Transformer-Based Model For Phishing Email Detection: A Large Language Model Approach
15 pages
Face Sketch Construction and Recognition Synopsis
No ratings yet
Face Sketch Construction and Recognition Synopsis
11 pages
Mobile ALOHA - Learning Bimanual Mobile Manipulation With Low-Cost Whole-Body Teleoperation
No ratings yet
Mobile ALOHA - Learning Bimanual Mobile Manipulation With Low-Cost Whole-Body Teleoperation
20 pages
AI900_CERT1
No ratings yet
AI900_CERT1
23 pages
A I IN FINANCE UT COURSE SYLLABUS & BIOS
No ratings yet
A I IN FINANCE UT COURSE SYLLABUS & BIOS
10 pages
CB Design Optimisation
No ratings yet
CB Design Optimisation
12 pages
unit 1 ai reflection , project cycle and ethics
No ratings yet
unit 1 ai reflection , project cycle and ethics
11 pages
Evaluation of Liquid Loading in Gas Wells Using Machine Learning
No ratings yet
Evaluation of Liquid Loading in Gas Wells Using Machine Learning
12 pages
Deep Learning Based Optimization in Massive MIMO Systems
No ratings yet
Deep Learning Based Optimization in Massive MIMO Systems
34 pages
Logistic Regression A Brief Primer
No ratings yet
Logistic Regression A Brief Primer
6 pages
Early Detection of Cardiovascular Diseases Using Machine Learning 2
No ratings yet
Early Detection of Cardiovascular Diseases Using Machine Learning 2
38 pages
Chapter#10 (Part#01) SL (K-NN)
No ratings yet
Chapter#10 (Part#01) SL (K-NN)
27 pages
Data Mining: Concepts and Techniques
No ratings yet
Data Mining: Concepts and Techniques
59 pages
IT 323 Lectures by Ruchika Pharswan Till Midterm V2
No ratings yet
IT 323 Lectures by Ruchika Pharswan Till Midterm V2
157 pages
10999-Manuscript (Word) - 48093-2-15-20231227
No ratings yet
10999-Manuscript (Word) - 48093-2-15-20231227
8 pages
Report Technical Seminar
No ratings yet
Report Technical Seminar
30 pages
Data Science & Analytics Paper
No ratings yet
Data Science & Analytics Paper
55 pages
ML - Chapter 6 - Model Evaluation
No ratings yet
ML - Chapter 6 - Model Evaluation
65 pages
Airfare_Estimation_31.1.25
No ratings yet
Airfare_Estimation_31.1.25
6 pages
Machine Learning Introduction
No ratings yet
Machine Learning Introduction
56 pages
ML MU Unit 2
100% (3)
ML MU Unit 2
84 pages
(Ebook) Artificial Intelligence for Materials Science by Yuan Cheng, Tian Wang, Gang Zhang, (eds.) ISBN 9783030683092, 3030683095 download pdf
100% (12)
(Ebook) Artificial Intelligence for Materials Science by Yuan Cheng, Tian Wang, Gang Zhang, (eds.) ISBN 9783030683092, 3030683095 download pdf
81 pages
R7 Yuyty
100% (1)
R7 Yuyty
9 pages
Shrinkage Method
No ratings yet
Shrinkage Method
2 pages
Computers and Electronics in Agriculture: Anna Chlingaryan, Salah Sukkarieh, Brett Whelan
No ratings yet
Computers and Electronics in Agriculture: Anna Chlingaryan, Salah Sukkarieh, Brett Whelan
9 pages
Regularization: The Problem of Overfitting
No ratings yet
Regularization: The Problem of Overfitting
23 pages
GR No-01-Project-Report
No ratings yet
GR No-01-Project-Report
51 pages
Infrastructures 09 00003
No ratings yet
Infrastructures 09 00003
16 pages
Top 50 Artificial Intelligence Questions and Answers (2023) - Javatpoint
100% (1)
Top 50 Artificial Intelligence Questions and Answers (2023) - Javatpoint
27 pages

Chapter 1 - Introduction

Uploaded by

Chapter 1 - Introduction

Uploaded by

Machine Learning

Lecturer: Duc Dung Nguyen, PhD.

Faculty of Computer Science and Engineering

What is Machine learning?

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 1 / 23

What is Machine learning?

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 1 / 23

• How to construct programs that automatically improve with experience.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 2 / 23

• How to construct programs that automatically improve with experience.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 2 / 23

• How to construct programs that automatically improve with experience.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 2 / 23

Example Gray? Mammal? Large? Vegetarian? Wild? Elephant

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 4 / 23

Learning is an (endless) generalization or

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 5 / 23

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 6 / 23

• Reinforcement learning: between supervised and unsupervised learning. It is told when an

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 7 / 23

• The most common type: supervised learning.

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 8 / 23

How many phase do we have in machine

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 9 / 23

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 10 / 23

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 11 / 23

Statistical significance test: to reject the null-hypothesis that the two

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 12 / 23

loss of test increase

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 13 / 23

• There is noise in the data

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 14 / 23

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 15 / 23

số ng dự đoán bị thật sự / số ng hệ thống dự đoán bị

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 16 / 23

Trade-off Precision vs Recall

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 17 / 23

F1 score: want to seek a balance between Precision and Recall

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 18 / 23

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 19 / 23

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 20 / 23

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 20 / 23

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 20 / 23

Common inductive bias in ML:

• Maximum conditional independence: if the hypothesis can be cast in a Bayesian

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 21 / 23

Common inductive bias in ML:

• Maximum conditional independence: if the hypothesis can be cast in a Bayesian

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 21 / 23

Common inductive bias in ML:

• Maximum conditional independence: if the hypothesis can be cast in a Bayesian

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 21 / 23

Common inductive bias in ML:

• Minimum description length: when forming a hypothesis, attempt to minimize the

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 22 / 23

Common inductive bias in ML:

• Minimum description length: when forming a hypothesis, attempt to minimize the

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 22 / 23

Common inductive bias in ML:

• Minimum description length: when forming a hypothesis, attempt to minimize the

Lecturer: Duc Dung Nguyen, PhD. Contact: [email protected] Machine Learning 22 / 23

You might also like