Regression PPT
Linear Regression
• Got a bunch of points $(x_i, y_i)$.
• Want to fit a line $y = ax + b$ that describes the trend.
• We define a cost function that computes the total squared error of our
predictions w.r.t. the observed values $y_i$, $J(a, b) = \sum_i (a x_i + b - y_i)^2$, that we want to
minimize.
• See it as a function of $a$ and $b$: compute both partial derivatives, set them equal
to zero, and solve for $a$ and $b$.
• The coefficients you get give you the minimum squared error.
• Can do this for specific points, or in general to derive the closed-form formulas (a sketch follows below).
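A minimal sketch of that closed-form recipe in Python (the data points are made up for illustration; setting dJ/da = 0 and dJ/db = 0 yields the expressions used below):

```python
import numpy as np

# Made-up example points (x_i, y_i)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])

# Solving dJ/db = 0 gives b = y_mean - a * x_mean; substituting it into
# dJ/da = 0 gives the slope as a ratio of centered sums.
x_mean, y_mean = x.mean(), y.mean()
a = np.sum((x - x_mean) * (y - y_mean)) / np.sum((x - x_mean) ** 2)
b = y_mean - a * x_mean

print(f"fitted line: y = {a:.3f}x + {b:.3f}")  # minimizes J(a, b)
```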
Sum of Squared Error
• In order to fit the best line through the points in the scatter plot, we use a
metric called the "Sum of Squared Errors" (SSE) and
• compare candidate lines to find the best fit by reducing the error. The error is
the sum of the squared differences between the actual values and the predicted values.
• To find the error for each dependent value, we use the formula
below: $SSE = \sum_i (y_i - \hat{y}_i)^2$, where $\hat{y}_i$ is the value predicted by the line.
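In code, this metric is a couple of lines (the function name and arguments are illustrative):

```python
import numpy as np

def sse(y_actual, y_predicted):
    """Sum of the squared differences between observed and predicted values."""
    residuals = np.asarray(y_actual) - np.asarray(y_predicted)
    return float(np.sum(residuals ** 2))
```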
Sum of Squared Error
For the example data, the sum of squared errors (SSE) comes out to 5226.19. To find the best-fit line,
we apply a linear regression model that makes the SSE as small as possible.
Sum of Squared Error
We will use the Ordinary Least Squares (OLS) method to find the best-fit line's
intercept (b) and slope (m).
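For a single input variable, the OLS estimates have a well-known closed form, with $\bar{x}$ and $\bar{y}$ denoting the sample means:

$m = \dfrac{\sum_i (x_i - \bar{x})(y_i - \bar{y})}{\sum_i (x_i - \bar{x})^2}, \qquad b = \bar{y} - m\bar{x}$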
• The main reason gradient descent is used for linear regression is
computational complexity: in some cases it is computationally cheaper (faster) to
find the solution using gradient descent.
• The formula above looks very simple, even computationally,
because it only works for the univariate case, i.e. when you have only one
variable. In the multivariate case, when you have many variables, the
formula is slightly more complicated on paper and requires much more
calculation when you implement it in software, as shown below:
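Presumably the multivariate formula in question is the normal equation, where X is the n × k matrix of inputs and y the vector of observed values:

$\hat{\beta} = (X^T X)^{-1} X^T y$

Forming and inverting $X^T X$ costs roughly $O(nk^2 + k^3)$, which is the computation gradient descent avoids.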
• Picture a person walking down into a valley: he goes down the slope and takes large steps when the slope is steep
and small steps when the slope is less steep.
• He decides his next position based on his current position, and stops
when he gets to the bottom of the valley, which was his goal.
1. Initially, let m = 0 and c = 0. Let L be our learning rate. This controls how
much the value of m changes with each step. L could be a small value
like 0.0001 for good accuracy.
2. Calculate the partial derivatives of the loss function $E = \frac{1}{n}\sum_i (y_i - \hat{y}_i)^2$, where $\hat{y}_i = m x_i + c$ is the predicted value.
$D_m = \frac{-2}{n}\sum_i x_i (y_i - \hat{y}_i)$ is the value of the partial derivative with respect to m. Similarly, the
partial derivative with respect to c is $D_c = \frac{-2}{n}\sum_i (y_i - \hat{y}_i)$.
3. Now we update the current values of m and c using the following equations: $m \leftarrow m - L \cdot D_m$ and $c \leftarrow c - L \cdot D_c$.
Gradient Descent Algorithm
4. We repeat this process until our loss function is a very small value or
ideally 0 (which means 0 error, or 100% accuracy). The values of m and c
that we are left with will be the optimum values.
• Now, going back to our analogy, m can be considered the current position of
the person. D is equivalent to the steepness of the slope, and L can be the
speed with which he moves.
• The new value of m that we calculate using the above equation will be
his next position, and L×D will be the size of the step he takes.
• When the slope is steeper (D is larger) he takes longer steps, and when it
is less steep (D is smaller) he takes smaller steps. Finally he arrives at the
bottom of the valley, which corresponds to our loss = 0.
Now, with the optimum values of m and c, our model is ready to make
predictions! The sketch below puts the four steps together.
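A minimal, self-contained sketch of the loop (the data and hyperparameters are illustrative, not taken from the slides):

```python
import numpy as np

# Illustrative data
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.0, 4.1, 5.9, 8.2, 9.8])

m, c = 0.0, 0.0   # step 1: start with m = 0 and c = 0
L = 0.0001        # learning rate
n = float(len(X))

for _ in range(100_000):                         # step 4: repeat until converged
    Y_pred = m * X + c                           # current predictions
    D_m = (-2.0 / n) * np.sum(X * (Y - Y_pred))  # step 2: dE/dm
    D_c = (-2.0 / n) * np.sum(Y - Y_pred)        # step 2: dE/dc
    m -= L * D_m                                 # step 3: update m
    c -= L * D_c                                 # step 3: update c

print(f"m = {m:.4f}, c = {c:.4f}")  # approximately optimum values
```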
Logistic Regression
• Logistic regression is a classification algorithm used to assign observations to a
discrete set of classes.
Linear regression could help us predict a student's test score on a scale of 0-100;
its predictions are continuous (numbers in a range).
Logistic regression could help us predict whether the student passed or failed; its predictions are discrete (specific classes only).
Types of Logistic Regression
• Binary (Pass/Fail)
• Multinomial (more than two unordered classes)
• Ordinal (more than two ordered classes)
$S(z) = \dfrac{1}{1 + e^{-z}}$
Decision boundary
Making predictions
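A minimal sketch tying the sigmoid, the decision boundary, and prediction together (the 0.5 threshold is the conventional default, and the weights are made up for illustration, not fitted values):

```python
import numpy as np

def sigmoid(z):
    """S(z) = 1 / (1 + e^(-z)): squashes any score into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def predict(features, weights, bias, threshold=0.5):
    """Classify as 1 (e.g. Pass) when the probability crosses the decision boundary."""
    probability = sigmoid(np.dot(features, weights) + bias)
    return probability >= threshold

# Illustrative usage: one feature (hours studied), made-up weight and bias.
print(predict(np.array([4.0]), weights=np.array([1.2]), bias=-3.0))  # True -> Pass
```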