Supervised Learning - 1
Artificial Intelligence
(ME3181)
Supervised Learning
o Supervised Learning: learns from labeled training data to make predictions or decisions.
o Regression: finding the relationship between a dependent variable (label, target, output, outcome variable) and one or more independent variables (also known as predictors or features).
o Classification: assigning input data points to one of several predefined categories or classes.
o Unsupervised Learning: finds patterns, relationships, or structures in a dataset without labeled output or target variables.
[Figure: supervised learning pipeline — the training set feeds a learning algorithm, which produces a hypothesis/model $h$ mapping input data $x$ to an estimated value $y$. Adapted from the lecture notes of Andrew Ng; image: https://www.amybergquist.com/]
Linear Regression
$y = \hat{y} + \epsilon, \quad \epsilon \sim N(0, \sigma^2)$
$y \approx \hat{y} = wx + b$
Simple Linear Regression
Assumption:
$y = f(x) \approx \hat{y} = ax + b$
or
$y = \hat{y} + \epsilon$, where $\epsilon$ is the residual and $\epsilon \sim N(0, \sigma^2)$.
Dataset: $\{(x^{(i)}, y^{(i)})\}_{i=1}^{m}$, i.e. the pairs $(x^{(1)}, y^{(1)}), (x^{(2)}, y^{(2)}), \dots, (x^{(m)}, y^{(m)})$.
o Substituting the least-squares intercept $b = \bar{y} - a\bar{x}$, the loss becomes
$L = \sum_{i=1}^{m} \left( y_i - \bar{y} - a(x_i - \bar{x}) \right)^2$
o To minimize $L$:
$\frac{\partial L}{\partial a} = -2\left[ \sum_{i=1}^{m} (y_i - \bar{y})(x_i - \bar{x}) - a \sum_{i=1}^{m} (x_i - \bar{x})^2 \right] = 0$
which gives
$a = \dfrac{\sum_{i=1}^{m} (y_i - \bar{y})(x_i - \bar{x})}{\sum_{i=1}^{m} (x_i - \bar{x})^2}, \qquad b = \bar{y} - a\bar{x}$
[Figure: data points with the fitted line — real values $y$ vs. estimated values $\hat{y}$ plotted against $x$]
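A minimal NumPy sketch of these closed-form estimates; the data and function names are illustrative, not from the lecture:

```python
import numpy as np

def fit_simple_linear(x, y):
    """Least-squares fit y ≈ a*x + b using the closed-form estimates above."""
    x_bar, y_bar = x.mean(), y.mean()
    a = np.sum((y - y_bar) * (x - x_bar)) / np.sum((x - x_bar) ** 2)
    b = y_bar - a * x_bar
    return a, b

# Toy data: y = 2x + 1 plus Gaussian noise
rng = np.random.default_rng(0)
x = np.linspace(0, 10, 50)
y = 2 * x + 1 + rng.normal(0, 0.5, size=x.shape)
a, b = fit_simple_linear(x, y)
print(a, b)  # close to 2 and 1
```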
Residual Analysis
Total sum of squares: $TSS = \sum_{i=1}^{m} (y_i - \bar{y})^2$
Residual sum of squares: $RSS = \sum_{i=1}^{m} (y_i - \hat{y}_i)^2$
$R^2 = 1 - \dfrac{RSS}{TSS}$
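A short sketch computing $R^2$ directly from its definition; the toy arrays are illustrative:

```python
import numpy as np

def r_squared(y, y_hat):
    """Coefficient of determination: R^2 = 1 - RSS/TSS."""
    rss = np.sum((y - y_hat) ** 2)        # residual sum of squares
    tss = np.sum((y - y.mean()) ** 2)     # total sum of squares
    return 1.0 - rss / tss

y = np.array([1.0, 2.1, 2.9, 4.2])
y_hat = np.array([1.1, 2.0, 3.0, 4.0])   # predictions from some fitted model
print(r_squared(y, y_hat))               # close to 1 for a good fit
```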
General Linear Regression
- Generalization to multiple variables (multiple features):
estimate: $\hat{y} = f(x) = b + w_1 x_1 + w_2 x_2 + \dots + w_n x_n$
Note: $y = wx + b$ is not linear, it is affine.
- To make the equation linear, let $b = w_0$ and $x_0 = 1$:
$\hat{y} = w_0 x_0 + w_1 x_1 + \dots + w_n x_n = w^T x$
where $x = (x_0, \dots, x_n)$ and $w = (w_0, \dots, w_n)$ are $(n+1) \times 1$ vectors.
- The problem is to find $w$; a small sketch of the augmentation trick follows below.
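A small sketch of the augmentation trick, prepending $x_0 = 1$ to the features so the affine map becomes a single dot product (all names and values are illustrative):

```python
import numpy as np

x = np.array([3.0, 5.0])            # original features x1, x2
w = np.array([0.5, 2.0, -1.0])      # [w0 (bias), w1, w2]

x_aug = np.concatenate(([1.0], x))  # prepend x0 = 1
y_hat = w @ x_aug                   # w^T x, equal to b + w1*x1 + w2*x2
print(y_hat)                        # 0.5 + 2.0*3.0 - 1.0*5.0 = 1.5
```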
General Linear Regression
Solving the model
Data: $\{(x^{(i)}, y^{(i)})\}_{i=1}^{m}$
Hypothesis: $y^{(i)} \approx w^T x^{(i)}$
Loss function:
$\min \frac{1}{m} \sum_{i=1}^{m} \left( y^{(i)} - \hat{y}^{(i)} \right)^2 \;\rightarrow\; \min \sum_{i=1}^{m} \left( y^{(i)} - w^T x^{(i)} \right)^2$
Analytical solution:
$L(w) = \frac{1}{m} \sum_{i=1}^{m} \left( y^{(i)} - w^T x^{(i)} \right)^2 = \frac{1}{m} \left\| y - X^T w \right\|_2^2$
where the samples are stacked as
$y = \begin{bmatrix} y^{(1)} \\ y^{(2)} \\ \vdots \\ y^{(m)} \end{bmatrix}, \quad X = \begin{bmatrix} x^{(1)} & \cdots & x^{(m)} \end{bmatrix} = \begin{bmatrix} x_0^{(1)} & \cdots & x_0^{(m)} \\ x_1^{(1)} & \cdots & x_1^{(m)} \\ \vdots & & \vdots \\ x_n^{(1)} & \cdots & x_n^{(m)} \end{bmatrix}, \quad w = \begin{bmatrix} w_0 \\ w_1 \\ \vdots \\ w_n \end{bmatrix}, \quad x_0^{(i)} = 1$
General Linear Regression
Analytical solution
$L(w) = \frac{1}{m} \left\| y - X^T w \right\|_2^2$
To minimize $L(w)$, set $\frac{\partial L(w)}{\partial w} = 0$:
$\nabla_w L(w) = -\frac{2}{m} X \left( y - X^T w \right) = 0 \;\Rightarrow\; X X^T w = X y$
$w = \left( X X^T \right)^{-1} X y$
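A NumPy sketch of the normal-equation solution above, following the slide's convention that samples are the columns of $X$; `np.linalg.solve` replaces the explicit inverse for numerical stability, and the data here is synthetic:

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 100, 2                               # m samples, n features
features = rng.normal(size=(n, m))          # raw features, one column per sample
X = np.vstack([np.ones((1, m)), features])  # add row x0 = 1 -> shape (n+1, m)
w_true = np.array([1.0, 2.0, -3.0])         # [b, w1, w2] used to generate data
y = X.T @ w_true + rng.normal(0, 0.1, m)    # y = X^T w + noise

# w = (X X^T)^{-1} X y, solved as a linear system
w = np.linalg.solve(X @ X.T, X @ y)
print(w)                                    # close to w_true
```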
Introduction to Gradient Descent
Numerical solution of the loss function
- Minimizing a cost function amounts to solving $\frac{\partial L(w)}{\partial w} = 0$
- When $w$ is a vector: $\nabla_w L(w) = 0$
- Analytical solutions may be difficult to find and solve
Gradient descent (with learning rate $\alpha$):
$w^{next} = w^{current} - \alpha \nabla_w L(w)$
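A minimal batch gradient-descent sketch for this least-squares loss; the learning rate and iteration count are illustrative choices:

```python
import numpy as np

def gradient_descent(X, y, alpha=0.1, n_iters=1000):
    """Minimize L(w) = (1/m)||y - X^T w||^2 by batch gradient descent."""
    n_plus_1, m = X.shape
    w = np.zeros(n_plus_1)
    for _ in range(n_iters):
        grad = -(2.0 / m) * X @ (y - X.T @ w)  # ∇_w L(w)
        w = w - alpha * grad                    # w_next = w_current - α ∇L
    return w

rng = np.random.default_rng(0)
m = 50
X = np.vstack([np.ones(m), rng.normal(size=m)])  # row x0 = 1 plus one feature
y = X.T @ np.array([1.0, 2.0])                   # noiseless y for clarity
print(gradient_descent(X, y))                    # approaches [1, 2]
```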
Introduction to Gradient Descent
Batch Gradient Descent (or Gradient Descent, GD)
$w^{next} = w^{current} - \alpha \nabla_w L(w)$
- The whole training set is used for each update.
Stochastic Gradient Descent (SGD)
- One randomly chosen sample is used per update.
Mini-batch Gradient Descent
- The dataset is split into small batches; one batch is used per update (see the sketch below).
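A sketch contrasting the three variants in a single update loop, where the batch size selects between them; all names and hyperparameters are illustrative:

```python
import numpy as np

def sgd(X, y, alpha=0.01, epochs=50, batch_size=1, rng=None):
    """Gradient descent on L(w) = (1/m)||y - X^T w||^2.

    batch_size = 1       -> SGD (one random sample per update)
    batch_size = m       -> batch gradient descent
    1 < batch_size < m   -> mini-batch gradient descent
    """
    rng = rng or np.random.default_rng(0)
    n_plus_1, m = X.shape
    w = np.zeros(n_plus_1)
    for _ in range(epochs):
        order = rng.permutation(m)                # shuffle samples each epoch
        for start in range(0, m, batch_size):
            idx = order[start:start + batch_size]
            Xb, yb = X[:, idx], y[idx]
            grad = -(2.0 / len(idx)) * Xb @ (yb - Xb.T @ w)
            w -= alpha * grad
    return w
```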
Related forms
Log-linear regression: let $\ln x_i \to x_i'$, so the model becomes linear in the transformed feature $x_i'$.
https://analystprep.com/study-notes/cfa-level-2/linear-or-log-linear-model/
Related forms
Polynomial regression
$y \approx \hat{y} = w_n x^n + \dots + w_1 x + w_0$; let $x^i \to x_i$, so the model is linear in the features $x_1, \dots, x_n$ (a sketch follows below).
[Figure: polynomial fits of increasing degree, e.g. degree = 2]
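A sketch of the substitution $x^i \to x_i$: build the powers of $x$ as feature columns and reuse ordinary least squares (the degree and data are illustrative):

```python
import numpy as np

def poly_features(x, degree):
    """Map scalar x to [1, x, x^2, ..., x^degree] per sample (rows = samples)."""
    return np.vander(x, degree + 1, increasing=True)

rng = np.random.default_rng(0)
x = np.linspace(-2, 2, 40)
y = 0.5 * x**2 - x + 1 + rng.normal(0, 0.1, x.shape)  # quadratic + noise

A = poly_features(x, degree=2)             # columns: x0 = 1, x, x^2
w, *_ = np.linalg.lstsq(A, y, rcond=None)  # least-squares fit, w = [w0, w1, w2]
print(w)                                   # close to [1, -1, 0.5]
```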
Model evaluation
Overfitting and Underfitting
- Underfitting: the model is too simple to capture the underlying pattern, so it performs poorly even on the training data.
- Overfitting: the model is too complex and fits noise in the training data, so it generalizes poorly to new data.
[Figure: example fits illustrating underfitting and overfitting]
Model evaluation
Learning curves
[Figure: learning curves for an underfitting model vs. an overfitting model]
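A sketch of how a learning curve can be computed: fit on growing subsets of the training data and record training and validation error (the simple linear model and names here are illustrative):

```python
import numpy as np

def learning_curve(x_train, y_train, x_val, y_val):
    """Train/validation MSE as a function of training-set size (linear model)."""
    sizes, train_err, val_err = [], [], []
    for m in range(2, len(x_train) + 1):
        A = np.column_stack([np.ones(m), x_train[:m]])      # fit on first m samples
        w, *_ = np.linalg.lstsq(A, y_train[:m], rcond=None)
        predict = lambda x: w[0] + w[1] * x
        sizes.append(m)
        train_err.append(np.mean((predict(x_train[:m]) - y_train[:m]) ** 2))
        val_err.append(np.mean((predict(x_val) - y_val) ** 2))
    return sizes, train_err, val_err
```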
Model evaluation
Bias/Variance trade-off
Error = Bias² + Variance + Irreducible noise
Trade-off:
- Increasing a model's complexity increases variance and reduces bias.
- Reducing a model's complexity reduces variance and increases bias.
Regularization
Ridge Regression
$L_{Ridge}(w) = L(w) + \frac{\alpha}{2} \sum_{i=1}^{n} w_i^2$
Lasso Regression
$L_{Lasso}(w) = L(w) + \alpha \sum_{i=1}^{n} |w_i|$
Elastic Net
$L_{ElasticNet}(w) = L(w) + r\alpha \sum_{i=1}^{n} |w_i| + \frac{1-r}{2} \alpha \sum_{i=1}^{n} w_i^2$
(The bias term $w_0$ is conventionally left unregularized; $r$ mixes the L1 and L2 penalties.)
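A sketch of these three penalties using scikit-learn (assuming it is installed); its `alpha` and `l1_ratio` parameters play the roles of $\alpha$ and $r$ above, though the library's exact scaling conventions differ slightly from the formulas:

```python
import numpy as np
from sklearn.linear_model import Ridge, Lasso, ElasticNet

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))  # 100 samples, 5 features
y = X @ np.array([2.0, 0.0, -1.0, 0.0, 0.5]) + rng.normal(0, 0.1, 100)

ridge = Ridge(alpha=1.0).fit(X, y)   # L2 penalty: shrinks weights
lasso = Lasso(alpha=0.1).fit(X, y)   # L1 penalty: drives some weights to 0
enet = ElasticNet(alpha=0.1, l1_ratio=0.5).fit(X, y)  # l1_ratio plays the role of r

print(ridge.coef_)
print(lasso.coef_)  # note the exact zeros on the uninformative features
print(enet.coef_)
```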
Regularization
Early Stopping
Stop training as soon as the validation error stops decreasing, and keep the weights from the epoch with the lowest validation error; this regularizes the model by preventing it from continuing to fit noise in the training data.
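A minimal early-stopping sketch wrapped around a gradient-descent loop: keep the weights with the best validation error and stop after `patience` epochs without improvement (all names, thresholds, and the columns-as-samples convention are illustrative):

```python
import numpy as np

def train_with_early_stopping(X_tr, y_tr, X_val, y_val,
                              alpha=0.05, max_epochs=5000, patience=20):
    """Batch GD on MSE; stop when validation error stops improving."""
    w = np.zeros(X_tr.shape[0])
    best_w, best_val, stale = w.copy(), np.inf, 0
    for _ in range(max_epochs):
        grad = -(2.0 / X_tr.shape[1]) * X_tr @ (y_tr - X_tr.T @ w)
        w -= alpha * grad
        val_mse = np.mean((y_val - X_val.T @ w) ** 2)
        if val_mse < best_val:        # new best: remember these weights
            best_w, best_val, stale = w.copy(), val_mse, 0
        else:
            stale += 1
            if stale >= patience:     # no improvement for `patience` epochs
                break
    return best_w
```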