2 Linear Regression
• Supervised Learning
• Generic equation
• Unsupervised Learning
Linear Regression
• We aim to predict a continuous target value given an input feature vector.
• The $d$-dimensional feature vector is denoted by $\mathbf{x} \in \mathbb{R}^d$, while $y \in \mathbb{R}$ is the output variable.
• The hypothesis function is defined by $\hat{y} = \mathbf{w}^T \mathbf{x} + w_0$ (worked out below for $d = 2$).
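As a small added illustration (not from the slides), with $d = 2$ the hypothesis is just a weighted sum of the two features plus an intercept:

$$\hat{y} = w_1 x_1 + w_2 x_2 + w_0$$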
Linear Regression
Figure: a fitted regression line, labelled with its slope and its intercept.
Optimization: Linear Regression
• Mapping of the $i$-th instance: $\hat{y}_i = \mathbf{w}^T \mathbf{x}_i + w_0$
• Overall deviation of the predictions $\hat{y}_i$ from the targets $y_i$ over all $N$ instances
Optimization: Linear Regression
• Overall deviation: $J = \frac{1}{2} \sum_{i=1}^{N} \left( y_i - \hat{y}_i \right)^2$
• Optimization problem: find the coefficients $\mathbf{w}$ and $w_0$ that minimize $J$
Optimization: Linear Regression
• We minimize the squared deviation rather than its absolute value.
• Gradient descent (a minimal code sketch follows this list):
1. Start with an initial guess for $\mathbf{w}$, say $\mathbf{w}^{(0)}$.
• Gradient descent lets you initialize the search for the minimum from any starting point.
2. Iterate until convergence:
• Compute the gradient of $J$ with respect to the linear coefficients $\mathbf{w}^{(t)}$ at step $t$.
• Update $\mathbf{w}^{(t)}$ to get $\mathbf{w}^{(t+1)}$ by taking a step in the opposite direction of the gradient.
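The following is a minimal NumPy sketch of this loop, not the lecture's own code. It assumes the layout used on the later slides ($X \in \mathbb{R}^{d \times N}$ with a constant first row absorbing the intercept) and uses the gradient $-X(\mathbf{y} - X^T \mathbf{w})$ derived below; names such as `learning_rate` and `n_steps` are illustrative.

```python
import numpy as np

def gradient_descent(X, y, learning_rate=0.01, n_steps=2000):
    """Fit linear-regression weights by gradient descent.

    X: (d, N) data matrix, one instance per column, with a first row of
       ones so that w[0] plays the role of the intercept w_0.
    y: (N,) vector of targets.
    """
    d, _ = X.shape
    w = np.zeros(d)                       # step 1: initial guess w^(0)
    for _ in range(n_steps):              # step 2: iterate (fixed budget here)
        gradient = -X @ (y - X.T @ w)     # dJ/dw = -X (y - X^T w)
        w = w - learning_rate * gradient  # move opposite to the gradient
    return w

# Toy usage: recover y = 2x + 1 from noisy samples.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=50)
y = 2.0 * x + 1.0 + 0.05 * rng.standard_normal(50)
X = np.vstack([np.ones_like(x), x])       # shape (2, 50): bias row + feature row
print(gradient_descent(X, y))             # approximately [1.0, 2.0]
```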
In matrix form, the $N$ feature vectors are stacked as the columns of a data matrix $X \in \mathbb{R}^{d \times N}$, and each prediction is

$$\hat{y}_i = \mathbf{w}^T \mathbf{x}_i + w_0$$
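A small sketch of this layout (an added illustration, assuming NumPy; the variable names are not from the slides): with instances as columns, all $N$ predictions come from one matrix product.

```python
import numpy as np

d, N = 3, 5
rng = np.random.default_rng(1)
X = rng.standard_normal((d, N))   # data matrix: column i is the feature vector x_i
w = rng.standard_normal(d)        # linear coefficients
w0 = 0.5                          # intercept

y_hat = X.T @ w + w0              # y_hat[i] = w^T x_i + w_0, shape (N,)
print(y_hat)
```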
Optimization: Linear Regression
$$\operatorname*{argmin}_{w_0, w_1, \dots, w_d} \; J = \frac{1}{2} \sum_{i=1}^{N} \left( y_i - \left( \mathbf{w}^T \mathbf{x}_i + w_0 \right) \right)^2$$

$$\hat{y}_i = \mathbf{w}^T \mathbf{x}_i, \qquad X \in \mathbb{R}^{d \times N}$$
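The step from the objective above (with an explicit $w_0$) to the compact form on the next slide is the usual bias-absorption trick, which the later derivation assumes via the constant feature $x_{0i}$:

$$\tilde{\mathbf{x}}_i = \begin{bmatrix} 1 \\ \mathbf{x}_i \end{bmatrix}, \qquad \tilde{\mathbf{w}} = \begin{bmatrix} w_0 \\ \mathbf{w} \end{bmatrix}, \qquad \hat{y}_i = \mathbf{w}^T \mathbf{x}_i + w_0 = \tilde{\mathbf{w}}^T \tilde{\mathbf{x}}_i$$

so that $x_{0i} = 1$ for every instance and $w_0$ becomes an ordinary coordinate of the weight vector.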
Optimization: Linear Regression
$$\operatorname*{argmin}_{\mathbf{w}} \; J = \frac{1}{2} \sum_{i=1}^{N} \left( y_i - \mathbf{w}^T \mathbf{x}_i \right)^2 = \frac{1}{2} \left\lVert \mathbf{y} - X^T \mathbf{w} \right\rVert_F^2$$

$$\operatorname*{argmin}_{\mathbf{w}} \; J = \frac{1}{2} \left[ \left( y_1 - \mathbf{w}^T \mathbf{x}_1 \right)^2 + \left( y_2 - \mathbf{w}^T \mathbf{x}_2 \right)^2 + \dots + \left( y_N - \mathbf{w}^T \mathbf{x}_N \right)^2 \right]$$

By the chain rule, for the first term:

$$\frac{\partial \left( y_1 - \mathbf{w}^T \mathbf{x}_1 \right)^2}{\partial w_0} = \frac{\partial \left( y_1 - \mathbf{w}^T \mathbf{x}_1 \right)^2}{\partial \left( y_1 - \mathbf{w}^T \mathbf{x}_1 \right)} \times \frac{\partial \left( y_1 - \mathbf{w}^T \mathbf{x}_1 \right)}{\partial w_0} = -2 \left( y_1 - \mathbf{w}^T \mathbf{x}_1 \right) x_{01}$$
Optimization: Linear Regression
With the same objective and per-term derivative as above, summing the contribution of every term gives

$$\frac{\partial J}{\partial w_0} = \frac{1}{2} \left[ -2 \left( y_1 - \mathbf{w}^T \mathbf{x}_1 \right) x_{01} - 2 \left( y_2 - \mathbf{w}^T \mathbf{x}_2 \right) x_{02} - \dots - 2 \left( y_N - \mathbf{w}^T \mathbf{x}_N \right) x_{0N} \right]$$
Optimization: Linear Regression
Cancelling the factor of 2 against the $\frac{1}{2}$:

$$\frac{\partial J}{\partial w_0} = - \left[ \left( y_1 - \mathbf{w}^T \mathbf{x}_1 \right) x_{01} + \left( y_2 - \mathbf{w}^T \mathbf{x}_2 \right) x_{02} + \dots + \left( y_N - \mathbf{w}^T \mathbf{x}_N \right) x_{0N} \right]$$

This is a row of $X$ times the residual vector, so in matrix form

$$\frac{\partial J}{\partial w_0} = - X_{0\cdot} \left( \mathbf{y} - X^T \mathbf{w} \right)$$

where $X_{0\cdot}$ denotes the row of $X$ holding the constant feature $x_0$.
Optimization: Linear Regression
$$\frac{\partial J}{\partial w_0} = - X_{0\cdot} \left( \mathbf{y} - X^T \mathbf{w} \right)$$

The same derivation applies to every coefficient $w_j$:

$$\frac{\partial J}{\partial w_j} = - X_{j\cdot} \left( \mathbf{y} - X^T \mathbf{w} \right)$$

Stacking all the partial derivatives gives the full gradient:

$$\frac{\partial J}{\partial \mathbf{w}} = - X \left( \mathbf{y} - X^T \mathbf{w} \right)$$
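As a sanity check (an added sketch, not part of the lecture; it assumes NumPy), the analytic gradient above can be compared against finite differences of $J$:

```python
import numpy as np

rng = np.random.default_rng(2)
d, N = 4, 20
X = rng.standard_normal((d, N))            # features as columns, as on the slides
y = rng.standard_normal(N)
w = rng.standard_normal(d)

def J(w):
    residual = y - X.T @ w
    return 0.5 * residual @ residual       # J = 1/2 * ||y - X^T w||^2

analytic = -X @ (y - X.T @ w)              # dJ/dw = -X (y - X^T w)

eps = 1e-6
numeric = np.array([
    (J(w + eps * np.eye(d)[j]) - J(w - eps * np.eye(d)[j])) / (2 * eps)
    for j in range(d)
])
print(np.max(np.abs(analytic - numeric)))  # tiny, e.g. on the order of 1e-9
```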
Optimization: Closed-Form Solution
At the point of minimization, the gradient
of the loss function with respect to the
model parameters is zero, indicating that
the best fit has been found.
$$\frac{\partial J}{\partial \mathbf{w}} = - X \left( \mathbf{y} - X^T \mathbf{w} \right) = \mathbf{0}$$
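The slide stops at the gradient; solving the zero-gradient condition (assuming $X X^T$ is invertible, an assumption added here) gives the usual normal-equation form:

$$X \left( \mathbf{y} - X^T \mathbf{w} \right) = \mathbf{0} \;\;\Rightarrow\;\; X X^T \mathbf{w} = X \mathbf{y} \;\;\Rightarrow\;\; \mathbf{w} = \left( X X^T \right)^{-1} X \mathbf{y}$$

In code one would typically solve the linear system directly, e.g. `np.linalg.solve(X @ X.T, X @ y)`, rather than forming the inverse explicitly.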