
Lecture 6: Linear Regression

Md. Shahriar Hussain


ECE Department, NSU



What is Linear Regression?

• Linear regression is an algorithm that models a linear relationship between an independent variable and a dependent variable, which can then be used to predict the outcome of future, unseen events



Linear Regression Example

A line of best fit (regression line) is a straight line that best approximates the trend in a scatter plot of data points.


Linear Regression Example

Estimated/predicted value: ŷ = h(x). Actual/true value: y, the ground truth.




Data Set Description

(x, y) = one training example
(x⁽ⁱ⁾, y⁽ⁱ⁾) = the i-th training example

x⁽¹⁾ = 2104    y⁽¹⁾ = 460
x⁽²⁾ = 1416    y⁽²⁾ = 232


Hypothesis

Training Set → Learning Algorithm → h (hypothesis)

New/unseen data x (size of house) → h → estimated price ŷ = h(x)
Hypothesis

• How do we represent h ?

hθ(x) = θ0 + θ1x

θ0 and θ1: parameters/weights that will be trained/determined by the ML model (not hyperparameters)
θ0 = intercept/bias/constant
θ1 = slope/coefficient/gradient

This is linear regression with one variable, also called univariate linear regression.
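As a quick illustration, here is a minimal Python sketch of this hypothesis (the names hypothesis, theta0, theta1 and the numeric values are illustrative choices, not from the slides):

```python
# Minimal sketch of the univariate hypothesis h(x) = theta0 + theta1 * x.
def hypothesis(x, theta0, theta1):
    """Return the predicted value y-hat for input x."""
    return theta0 + theta1 * x

# theta0 = 50 and theta1 = 0.2 are arbitrary illustrative values:
# a 2104 ft^2 house is predicted at 50 + 0.2 * 2104 = 470.8 ($ in 1000's).
print(hypothesis(2104, 50, 0.2))  # 470.8
```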


Hypothesis

The goal is to choose θ0 and θ1 properly so that hθ(x) is close to y.

• A cost function lets us figure out how to fit the best straight line to our data


Hypothesis

Size in feet² (x)    Price ($) in 1000's (y)
2104                 460
1416                 232
1534                 315
852                  178
…                    …

Hypothesis:  hθ(x) = θ0 + θ1x
θ0, θ1: parameters
How do we choose θ0 and θ1?


Cost Function

• We need to choose θ0 and θ1 so that hθ(x⁽ⁱ⁾) is close to y⁽ⁱ⁾ for all m training examples. The function we minimize is called the cost function:

J(θ0, θ1) = (1/2m) · Σᵢ₌₁ᵐ (hθ(x⁽ⁱ⁾) − y⁽ⁱ⁾)²

Goal:  minimize J(θ0, θ1) over θ0, θ1


Cost Function

Cost Function:  J(θ0, θ1) = (1/2m) Σᵢ₌₁ᵐ (hθ(x⁽ⁱ⁾) − y⁽ⁱ⁾)²

Goal:  minimize J(θ0, θ1)

• This cost function is called the squared error cost function
• It minimizes the squared difference between the predicted house price and the actual house price
• The 1/m means we take the average over the training examples
• The 2 in 1/2m makes the math a bit easier and doesn't change the parameters we determine at all (half the smallest value is still the smallest value!)
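A minimal NumPy sketch of this squared error cost function (compute_cost and the array-based data layout are my own naming choices, not from the slides):

```python
import numpy as np

def compute_cost(x, y, theta0, theta1):
    """Squared error cost J(theta0, theta1) = (1/2m) * sum((h(x_i) - y_i)^2)."""
    m = len(x)                          # number of training examples
    predictions = theta0 + theta1 * x   # h_theta(x) for every example
    errors = predictions - y            # predicted minus actual (ground truth)
    return np.sum(errors ** 2) / (2 * m)
```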


Cost Function Calculation

• For simplicity, assume θ0 = 0, so the hypothesis becomes hθ(x) = θ1x

Find the value of θ1 for which J(θ1) is minimum


Cost Function Calculation

[Plots: left, the training points (1, 1), (2, 2), (3, 3) with the line hθ(x) = θ1x; right, J(θ1) vs θ1]

For θ1 = 1:
J(θ1) = 1/(2·3) · [0² + 0² + 0²] = 0
Cost Function Calculation

For θ1 = 0.5:
J(θ1) = ?
Cost Function Calculation

For θ1 = 0.5 (with the training points (1, 1), (2, 2), (3, 3)):
J(0.5) = 1/(2·3) · [(0.5 − 1)² + (1 − 2)² + (1.5 − 3)²] = 1/6 · [0.25 + 1 + 2.25] = 3.5/6 ≈ 0.58


Cost Function Calculation

For θ1 = 0:
J(θ1) = ?
Cost Function Calculation

For θ1 = 0, every prediction is 0:
J(0) = 1/(2·3) · [1² + 2² + 3²] = 14/6 ≈ 2.33
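Reusing the compute_cost sketch from earlier, the worked values above can be checked directly (assuming the toy training points (1, 1), (2, 2), (3, 3)):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0])
y = np.array([1.0, 2.0, 3.0])

print(compute_cost(x, y, 0.0, 1.0))  # 0.0    -> theta1 = 1 fits perfectly
print(compute_cost(x, y, 0.0, 0.5))  # ~0.583 -> matches 3.5/6
print(compute_cost(x, y, 0.0, 0.0))  # ~2.333 -> matches 14/6
```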


Cost Function Calculation

• If we compute J(θ1) over a range of values and plot J(θ1) vs θ1, we get a bowl-shaped curve (for the squared error cost it is a quadratic)
• The optimization objective for the learning algorithm is to find the value of θ1 which minimizes J(θ1)
   So here θ1 = 1 is the best value for θ1
   The line which has the least sum of squared errors is the best-fit line


Important Equations

Hypothesis:     hθ(x) = θ0 + θ1x

Parameters:     θ0, θ1

Cost Function:  J(θ0, θ1) = (1/2m) Σᵢ₌₁ᵐ (hθ(x⁽ⁱ⁾) − y⁽ⁱ⁾)²

Goal:           minimize J(θ0, θ1)


Cost Function for two parameters

[Plots: left, for fixed θ0 and θ1, hθ(x) as a function of x — Price ($) in 1000's (0–500) vs Size in feet² (0–3000); right, J(θ0, θ1) as a function of the parameters]


Cost Function for two parameters

• Previously we plotted our cost function by plotting θ1 vs J(θ1)
• Now we have two parameters
  – The plot becomes a bit more complicated
  – It generates a 3D surface plot, where the axes are
    • x = θ1
    • z = θ0
    • y = J(θ0, θ1)


Cost Function for two parameters

• We can see that the height (y) of the surface indicates the value of the cost function
• We need to find where y is at a minimum


Cost Function for two parameters

• A contour plot is a graphical technique for representing a 3-dimensional surface by plotting constant-z slices, called contours, in a 2-dimensional format
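As a sketch of how such a contour plot of the cost can be produced (assuming the compute_cost function and toy data from earlier; the grid ranges are arbitrary illustrative choices):

```python
import numpy as np
import matplotlib.pyplot as plt

x = np.array([1.0, 2.0, 3.0])
y = np.array([1.0, 2.0, 3.0])

theta0_vals = np.linspace(-2.0, 2.0, 100)
theta1_vals = np.linspace(-1.0, 3.0, 100)

# Evaluate J on the grid; row i corresponds to theta1_vals[i].
J = np.array([[compute_cost(x, y, t0, t1) for t0 in theta0_vals]
              for t1 in theta1_vals])

# Each contour line connects (theta0, theta1) pairs with equal cost.
plt.contour(theta0_vals, theta1_vals, J, levels=20)
plt.xlabel("theta0")
plt.ylabel("theta1")
plt.show()
```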




Gradient descent

• We want to find min J(θ0, θ1)

• Gradient descent
  – Used all over machine learning for minimization

• Outline:
  • Start with some initial θ0, θ1
  • Keep changing θ0, θ1 to reduce J(θ0, θ1) until we hopefully end up at a minimum


Gradient descent

 Start with initial guesses
   Start at θ0 = 0, θ1 = 0 (or any other values)
 Keep changing θ0 and θ1 a little bit to try to reduce J(θ0, θ1)
   Each time you change the parameters, you move in the direction (the negative gradient) that reduces J(θ0, θ1) the most
 Repeat
   Do so until you converge to a local minimum
 Gradient descent has an interesting property
   Where you start can determine which minimum you end up in
   Here we can see one initialization point led to one local minimum
   The other led to a different one


Gradient descent

• One initialization point led to one local minimum; the other led to a different one
Gradient Descent Algorithm
• Gradient descent minimizes the cost (e.g., the MSE) by repeatedly taking a step along the negative gradient of the cost function:

repeat until convergence {
    θⱼ := θⱼ − α · ∂/∂θⱼ J(θ0, θ1)    (for j = 0 and j = 1)
}

Correct (simultaneous update):
  temp0 := θ0 − α · ∂/∂θ0 J(θ0, θ1)
  temp1 := θ1 − α · ∂/∂θ1 J(θ0, θ1)
  θ0 := temp0
  θ1 := temp1

Incorrect (sequential update):
  temp0 := θ0 − α · ∂/∂θ0 J(θ0, θ1)
  θ0 := temp0
  temp1 := θ1 − α · ∂/∂θ1 J(θ0, θ1)   ← uses the already-updated θ0
  θ1 := temp1
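A minimal sketch of batch gradient descent with the simultaneous update, using the gradients of the squared error cost (gradient_descent, alpha, and num_iters are illustrative names; the derivatives are sketched in the Gradient Descent Calculation section below):

```python
import numpy as np

def gradient_descent(x, y, theta0=0.0, theta1=0.0, alpha=0.1, num_iters=1000):
    m = len(x)
    for _ in range(num_iters):
        errors = (theta0 + theta1 * x) - y   # h(x_i) - y_i for every example
        # Compute BOTH gradients before updating EITHER parameter
        # (this is the simultaneous update shown above).
        grad0 = np.sum(errors) / m           # dJ/dtheta0
        grad1 = np.sum(errors * x) / m       # dJ/dtheta1
        theta0 -= alpha * grad0
        theta1 -= alpha * grad1
    return theta0, theta1

x = np.array([1.0, 2.0, 3.0])
y = np.array([1.0, 2.0, 3.0])
print(gradient_descent(x, y))  # approaches (0.0, 1.0) on this toy data
```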




Learning Rate

• Here, α is the learning rate, a hyperparameter

• It controls how big a step we take on each update
• If α is small, we take tiny steps
• If α is big, we get an aggressive gradient descent


Learning Rate

If α is too small, gradient descent can be slow
  → higher training time

If α is too large, gradient descent can overshoot the minimum. It may fail to converge, or even diverge.
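A quick sketch of this effect, reusing the gradient_descent function and toy data from above (the α values are arbitrary illustrative choices):

```python
for alpha in (0.01, 0.1, 0.9):
    t0, t1 = gradient_descent(x, y, alpha=alpha, num_iters=100)
    print(f"alpha={alpha}: theta0={t0:.3f}, theta1={t1:.3f}")

# alpha=0.01: still noticeably off (0, 1) after 100 steps -> slow, higher training time
# alpha=0.1:  close to (0, 1) -> converges
# alpha=0.9:  the parameters blow up -> overshoots the minimum and diverges on this data
```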




Local Minima

• Local minimum: a point where the value of the loss function is minimum within a local region
• Global minimum: the point where the value of the loss function is minimum across the entire domain of the loss function
• (For linear regression with the squared error cost, J is convex, so its only local minimum is the global minimum)


Local Minima

At a local minimum the slope is zero, so the update θ1 := θ1 − α · 0 leaves θ1 unchanged: gradient descent can get stuck there even though a lower global minimum exists elsewhere.


Gradient Descent Calculation
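Substituting hθ(x) = θ0 + θ1x into J(θ0, θ1) and taking partial derivatives gives the gradients used in the update rule (a standard derivation, sketched here):

```latex
% Gradients of the squared error cost for univariate linear regression
\frac{\partial}{\partial \theta_0} J(\theta_0, \theta_1)
  = \frac{1}{m} \sum_{i=1}^{m} \bigl(h_\theta(x^{(i)}) - y^{(i)}\bigr)

\frac{\partial}{\partial \theta_1} J(\theta_0, \theta_1)
  = \frac{1}{m} \sum_{i=1}^{m} \bigl(h_\theta(x^{(i)}) - y^{(i)}\bigr)\, x^{(i)}
```

Each gradient descent step therefore becomes θ0 := θ0 − α·(1/m)Σ(hθ(x⁽ⁱ⁾) − y⁽ⁱ⁾) and θ1 := θ1 − α·(1/m)Σ(hθ(x⁽ⁱ⁾) − y⁽ⁱ⁾)·x⁽ⁱ⁾. Note that the 2 produced by differentiating the square cancels the 1/2 in 1/2m, which is why that factor was included in the cost.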



• Reference:
  – Andrew Ng, Lectures on Machine Learning, Stanford University
