Lec 3: Regression

Linear regression is a supervised machine learning algorithm that finds the best linear relationship between a dependent variable and one or more independent variables. It works by minimizing the sum of the squares of the differences between the actual and predicted values. Overfitting and underfitting occur when the model is too complex or too simple, respectively, and techniques such as regularization can help address overfitting.


Linear Regression

 Linear Regression is a supervised machine learning algorithm.


 It finds the best linear relationship that describes the data you have.
 It assumes that a linear relationship exists between a dependent variable and one or more independent variables.
 The dependent variable of a linear regression model takes continuous values, i.e., real numbers.
Linear Regression
We want to find the best line (a linear function y = f(X)) to explain the data.

[Figure: scatter of data points with a fitted line; y on the vertical axis, X on the horizontal axis]
Simple Linear Regression
Simple Linear Regression Equation

The equation below describes how individual y values relate to X.


The predicted value of y is given by: ŷ = b0 + b1X, where:

 y is the dependent variable.
 ŷ is the predicted value of y.
 X is an independent variable.
 b0 and b1 are the regression coefficients.
 b0 is the intercept, or bias, which fixes the offset of the line.
 b1 is the slope, or weight, which specifies the factor by which X has an impact on y.
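
As a quick sketch, the prediction equation translates directly into a one-line Python function. The coefficient values below are illustrative assumptions, not estimates fitted from data:

```python
# Prediction for simple linear regression: y_hat = b0 + b1 * X
def predict(X, b0, b1):
    return b0 + b1 * X

# Illustrative coefficients (assumed, not estimated from data)
print(predict(82, b0=2.0, b1=0.5))  # 2.0 + 0.5 * 82 = 43.0
```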
Error for Simple Linear Regression model

 Y = β0 + β1X + ε is called the regression model.
 ε reflects how individual Y values deviate from others with the same value of X.

For example, at X = 82 the fitted value is Ŷ₈₂ = b0 + b1(82), and the residual is e₈₂ = Y₈₂ − Ŷ₈₂.
Estimated Simple Linear Regression Equation
 Recall: The estimated simple linear regression equation is:
Ŷ = b0 + b1X
 b0 is the estimate for β0

 b1 is the estimate for β1

 Ŷ is the estimated (predicted) value of Y for a given X value.


Least Squares method

• Least Squares Criterion: Choose the “best” β0 and β1 to minimize

• S = Σ(Yi − (β0 + β1Xi))²

• Use calculus: take the derivative with respect to β0 and with respect to β1, set the two resulting equations equal to zero, and solve for β0 and β1.

• Of all possible lines, pick the one that minimizes the sum of the squared vertical distances of each point from the line.
Least Squares Solution

Slope: b1 = Σ(Xi − X̄)(Yi − Ȳ) / Σ(Xi − X̄)²

Intercept: b0 = Ȳ − b1X̄
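
A minimal NumPy sketch of these two formulas on a small made-up data set (the X and Y values here are assumptions for illustration):

```python
import numpy as np

# Example data (made up for illustration)
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

x_bar, y_bar = X.mean(), Y.mean()

# Slope: b1 = sum((Xi - x_bar)(Yi - y_bar)) / sum((Xi - x_bar)^2)
b1 = np.sum((X - x_bar) * (Y - y_bar)) / np.sum((X - x_bar) ** 2)

# Intercept: b0 = y_bar - b1 * x_bar
b0 = y_bar - b1 * x_bar

print(b0, b1)  # fitted intercept and slope
```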
Estimating the Variance s²

• An estimate of s²:
The mean square error (MSE) provides the estimate of s², and the notation s² is also used.
s² = MSE = SSE/(n − 2)
where SSE is the Sum of Squared Errors: SSE = Σ(Yi − Ŷi)²

If points are close to the regression line, SSE will be small.
If points are far from the regression line, SSE will be large.
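
Continuing the NumPy sketch above, the variance estimate follows directly from the fitted line:

```python
# Estimate the error variance: s^2 = MSE = SSE / (n - 2)
Y_hat = b0 + b1 * X                # predictions from the fitted line
SSE = np.sum((Y - Y_hat) ** 2)     # sum of squared errors
MSE = SSE / (len(Y) - 2)           # estimate of s^2
print(SSE, MSE)
```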


Bias, variance tradeoff

Bias
 Regression predictions should be unbiased. That is:
"average of predictions" should ≈ "average of observations"
 Bias measures how far the mean of predictions is from the mean of actual
values
Bias = mean of predictions − mean of actual values (ground truth labels)

 A high-bias model can't fit the data (it is usually too simplistic).

 Increase the complexity of the model to reduce bias.
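
Using the fitted values from the earlier NumPy sketch, this definition of bias is a one-line computation:

```python
# Bias = mean of predictions - mean of actual values
bias = np.mean(Y_hat) - np.mean(Y)
print(bias)  # essentially 0 here: a least squares line always passes through (x_bar, y_bar)
```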
Variance
 Variance indicates how much the estimate of the target function would change if different training data were used.
 It describes how much a random variable differs from its expected value.
 Any single fit is based on a single training set; variance measures the inconsistency of predictions obtained using different training sets.
 Different samples of training data yield different model fits.

 Increase the size of the training data set to reduce variance.
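
One way to see this is to refit the line on several independently drawn training samples and watch the slope estimate change. The data-generating process below (a true line plus Gaussian noise) is an assumption for illustration:

```python
# Refit on different random training samples; the spread of the
# fitted slopes reflects the model's variance.
rng = np.random.default_rng(0)
slopes = []
for _ in range(5):
    Xs = rng.uniform(0, 10, size=20)
    Ys = 1.0 + 2.0 * Xs + rng.normal(0.0, 2.0, size=20)  # assumed true line + noise
    xb, yb = Xs.mean(), Ys.mean()
    slopes.append(np.sum((Xs - xb) * (Ys - yb)) / np.sum((Xs - xb) ** 2))
print(slopes)  # five slightly different slope estimates around the true value 2.0
```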
Overfitting vs. Model Complexity

• Models with high bias will have low variance.

• Models with high variance will have low bias.
Over-fitting

 Overfitting is an undesirable behavior where a learning model gives accurate predictions for training data but not for new data.
 The model fits all the data points, or more data points than required, in the training set.
 The model starts capturing noise.
How to minimize Overfitting

 Reduce model complexity
 Train with more data
 Remove features
 Stop training early
 Apply regularization
Regularization and Over-fitting
Adding a regularizer:

[Figure: model error vs. number of iterations, with and without a regularizer]
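
As a concrete sketch of regularization, ridge regression adds an L2 penalty λ·Σβj² to the least squares objective, which shrinks the weights. The data (reused from the earlier sketch) and the λ value below are assumptions for illustration:

```python
# Ridge regression: minimize sum((Y - Xd @ b)^2) + lam * ||b||^2.
# Closed-form solution: b = (Xd^T Xd + lam * I)^(-1) Xd^T Y.
lam = 1.0
Xd = np.column_stack([np.ones_like(X), X])   # design matrix with intercept column
I = np.eye(Xd.shape[1])
I[0, 0] = 0.0                                # common convention: do not penalize the intercept
b_ridge = np.linalg.solve(Xd.T @ Xd + lam * I, Xd.T @ Y)
print(b_ridge)  # [b0, b1] with the slope shrunk toward zero
```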
Under-fitting

 The model is not able to capture the underlying trend of the data.
How to minimize underfitting

• Increase the training time of the model.
• Increase the number of features.
• Increase model complexity.
2. Multiple Linear Regression
 In multiple linear regression, the dependent variable depends on more than one independent variable.

 The predicted value of y is given by:

ŷ = β0 + β1X1 + β2X2 + β3X3 + … + βnXn
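
A short NumPy sketch, fitting the coefficients by least squares on a made-up two-feature data set (the values are assumptions for illustration):

```python
# Multiple linear regression: y_hat = b0 + b1*X1 + b2*X2
Xm = np.array([[1.0, 0.5],
               [2.0, 1.5],
               [3.0, 2.0],
               [4.0, 2.5],
               [5.0, 3.0]])          # two features per sample (assumed data)
ym = np.array([3.1, 6.0, 8.2, 10.1, 12.3])

Xd = np.column_stack([np.ones(len(Xm)), Xm])    # prepend intercept column
beta, *_ = np.linalg.lstsq(Xd, ym, rcond=None)  # [b0, b1, b2]
print(beta)
print(Xd @ beta)                                # predicted values
```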
