
Regression Analysis

• Regression analysis is used in statistics to find trends in data. For example, you might guess that there's a connection between how much you eat and how much you weigh.
• In statistical modelling, regression analysis is a set of statistical procedures for estimating the relationships between a dependent variable (outcome: weight) and one or more independent variables (predictors, covariates, features: the amount of food we eat).
• It provides an equation for a graph so that you can make predictions
about your data.
Regression Analysis Example

Year vs Rainfall Record (chart)

The fitted line has the form y = mx + b. Regression gives you a useful equation, which for this chart is:
y = -2.2923x + 4624.4

• Best of all, you can use the equation to make predictions. For example, how much rain will fall in 2017?
y = -2.2923(2017) + 4624.4 = 0.8 inches.
• Regression also gives you an R² value, which for this graph is 0.702. This number tells you how well the model fits the data.
• The values range from 0 to 1, with 0 being a terrible model and 1 being a perfect model. As you can see, 0.702 is a fairly decent model, so you can be fairly confident in your weather prediction!
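A minimal Python sketch of this prediction, plugging the chart's reported slope and intercept into y = mx + b:

m, b = -2.2923, 4624.4   # slope and intercept reported for the chart above
year = 2017
print(m * year + b)      # ~0.8 inches of rain predicted for 2017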
Regression Analysis
There are multiple benefits of using regression analysis:
• It indicates the significant relationships between the dependent variable and the independent variables.
• It indicates the strength of the impact of multiple independent variables on a dependent variable.

In regression analysis, we fit a curve/line to the data points in such a manner that the sum of the distances of the data points from the curve or line is minimized.
Regression Techniques
These techniques are mostly driven by three factors: the number of independent variables, the type of dependent variable, and the shape of the regression line.
Regression Techniques

• Simple linear regression (univariate) analysis uses a single x variable for each dependent y variable, for example: (x, Y).
• Multiple linear regression (multivariate) uses multiple x variables for each dependent variable, for example: (x1, x2, x3, Y).
• Nonlinear regression: if the regression curve is nonlinear, the relationship between the variables is modelled with nonlinear regression.
Regression Types
1.Simple and Multiple Linear Regression
2.Logistic Regression
3.Polynomial Regression
4.Ridge Regression and Lasso Regression (upgrades to Linear
Regression)
Linear Regression
• Dependent variable is continuous
• Independent variable(s) can be continuous or discrete
• Nature of regression line is linear.
• It establishes a relationship between dependent variable (Y) and one or
more independent variables (X) using a best fit straight line (also known
as regression line).
• Equation: Y = m*X + c + e, where 'c' is the intercept, 'm' is the slope of the line, and 'e' is the error term.
How to obtain best fit line (Value of m and c)?
• Least Square Method: the most common method used for fitting a regression line.
• It calculates the best-fit line for the observed data by minimizing the sum of
the squares of the vertical deviations from each data point to the line.
• Error is the difference between the actual value and the predicted value, and the goal is to reduce this difference.
How to obtain best fit line (Value of m and c)?
• The vertical distance between the data point and the regression line is
known as Error or Residual.
• Each data point has one residual and the sum of all the differences is
known as the Sum of Residuals/Errors.
How to obtain best fit line (Value of m and c)?
Residual/Error = Actual value − Predicted value
Sum of Residuals/Errors = Sum(Actual − Predicted)
Sum of Squared Residuals/Errors = Sum((Actual − Predicted)²)

The aim is to minimize this sum of squared errors, also called the Cost Function.

Least Square Method / Ordinary Least Squares (OLS):
For the equation y = mx + b, use the least square method to determine the equation of the line of best fit for the data, then plot the line.
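For a single x variable, the closed-form least-squares solution (a standard textbook result, with x̄ and ȳ denoting the sample means) is:

m = Σ(xᵢ − x̄)(yᵢ − ȳ) / Σ(xᵢ − x̄)²
b = ȳ − m·x̄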
Metrics for Model Evaluation
• Residual Sum of Squares (RSS): the sum of the squared differences between each actual output and the predicted output.
• Mean Squared Error (MSE) is computed as RSS divided by the total number of data points.
• Root Mean Squared Error (RMSE) is the square root of MSE.
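A minimal numpy sketch of these three metrics; the actual and predicted values here are hypothetical, purely for illustration:

import numpy as np

y = np.array([85, 95, 70, 65, 70], dtype=float)   # actual values
y_hat = np.array([87.9, 81.5, 78.3, 71.9, 65.4])  # hypothetical predictions
rss = np.sum((y - y_hat) ** 2)   # Residual Sum of Squares
mse = rss / len(y)               # Mean Squared Error
rmse = np.sqrt(mse)              # Root Mean Squared Error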
Metrics for Model Evaluation
• R-squared (the coefficient of determination) is the proportion of the variance in the dependent variable that is predictable from the independent variable.
• It ranges from 0 to 1.
• With linear regression, the coefficient of determination is equal to the
square of the correlation between the x and y variables.
• If R2 is equal to 0, then the dependent variable cannot be predicted from
the independent variable.
• If R2 is equal to 1, then the dependent variable can be predicted from
the independent variable without any error.
Metrics for Model Evaluation
R-squared:
Formula 1: Using the correlation coefficient
r = Σ(xᵢ − x̄)(yᵢ − ȳ) / √(Σ(xᵢ − x̄)² · Σ(yᵢ − ȳ)²)
Square this value to get the coefficient of determination: R² = r².
Formula 2: Using sums of squares
R² = 1 − RSS/TSS, where TSS = Σ(yᵢ − ȳ)² is the total sum of squares.
Example:
• Last year, five randomly selected students took a math aptitude test before
they began their statistics course. The Statistics Department has three
questions:
• What linear regression equation best predicts statistics performance,
based on math aptitude scores?
• If a student made an 80 on the aptitude test, what grade would we expect
her to make in statistics?
• How well does the regression equation fit the data?

x (aptitude):   95  85  80  70  60
y (statistics): 85  95  70  65  70
What linear regression equation best predicts statistics performance, based on math aptitude scores?
If a student made an 80 on the aptitude test, what grade would we expect her to make in statistics?
How well does the regression equation fit the data?

Calculate the value of R².
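A worked numpy sketch answering all three questions with the data above:

import numpy as np

x = np.array([95, 85, 80, 70, 60], dtype=float)
y = np.array([85, 95, 70, 65, 70], dtype=float)
m = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b = y.mean() - m * x.mean()
print(m, b)        # ~0.644 and ~26.77, i.e. yhat = 26.77 + 0.644x
print(b + m * 80)  # ~78.3: expected statistics grade for an aptitude score of 80
r = np.corrcoef(x, y)[0, 1]
print(r ** 2)      # ~0.48, so the line explains about 48% of the variance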


Linear Regression: Assumptions
• Linearity: the dependent variable Y should be linearly related to
independent variables.
• Normality: The X and Y variables should be normally distributed
• Homoscedasticity: The variance of the error terms should be constant
• Independence / No multicollinearity: the independent variables should not be correlated with each other.
• The error terms should be normally distributed
• No Autocorrelation: The error terms should be independent of each
other.
Cost Function
• The whole idea of linear regression is to find the best fit line, which has the lowest possible error (cost function value).

Properties of the Regression line:


• 1. The line minimizes the sum of squared differences between the observed values (actual y-values) and the predicted values (ŷ values).
• 2. The line passes through the means of the independent and dependent features.
Cost Function vs Loss Function
• The loss function calculates the error per observation, whilst the cost
function calculates the error over the whole dataset.
Linear Regression: Gradient Descent
• Gradient descent is an optimization algorithm used to find the values of the parameters (coefficients) of a function that minimize a cost function.
• The idea is to start with random m and b values and then iteratively update them until the cost reaches its minimum.
• Steps:
1. Initially, the values of m and b are set to 0 and a learning rate (α) is introduced to the function. The value of the learning rate (α) is taken very small, typically between 0.0001 and 0.01.
The learning rate is a tuning parameter in an optimization algorithm that determines the step size at each iteration while moving toward a minimum of the cost function.
Linear Regression: Gradient Descent
2. Partial derivatives of the cost function are calculated with respect to the slope (m) and with respect to the intercept (b).

3. After the derivatives are calculated, the slope (m) and intercept (b) are updated with the help of the following equations:
m = m − α · (derivative with respect to m)
b = b − α · (derivative with respect to b)

4. The process of updating the values of m and b continues until the cost function converges to its minimum, ideally 0 or close to 0.
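A minimal Python sketch of these steps, assuming the MSE cost J(m, b) = (1/n)·Σ(y − (mx + b))²:

import numpy as np

def gradient_descent(x, y, alpha=0.0001, iterations=100000):
    m, b = 0.0, 0.0                              # step 1: start at zero
    n = len(x)
    for _ in range(iterations):
        y_hat = m * x + b
        dm = (-2 / n) * np.sum(x * (y - y_hat))  # step 2: dJ/dm
        db = (-2 / n) * np.sum(y - y_hat)        #         dJ/db
        m -= alpha * dm                          # step 3: update m and b
        b -= alpha * db
    return m, b                                  # step 4: near-minimum cost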
Multiple Linear Regression
• The main difference is the number of independent variables that they
take as inputs. Simple linear regression just takes a single feature,
while multiple linear regression takes multiple x values.

• Another way to obtain the coefficients is to use the Normal Equation with multiple independent variables, shown below.
Multiple Linear Regression
ŷ = b0 + b1x1 + b2x2 + … + bkxk

In matrix form: Y = Xb, with least-squares solution
b = (X'X)⁻¹X'Y
Multiple Linear Regression: Example

Student   Test score   IQ    Study hours
1         100          110   40
2         90           120   30
3         80           100   20
4         70           90    0
5         60           80    10
Multiple Linear Regression: Example

• Define X.
• Define X'.
• Compute X'X.
• Find the inverse of X'X.
• Define Y.
Multiple Linear Regression: Example
b = (X'X)⁻¹X'Y

ŷ = 20 + 0.5x1 + 0.5x2
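A numpy sketch verifying this result with the table above; the first column of X is all ones for the intercept:

import numpy as np

X = np.array([[1, 110, 40],
              [1, 120, 30],
              [1, 100, 20],
              [1,  90,  0],
              [1,  80, 10]], dtype=float)        # [intercept, IQ, study hours]
Y = np.array([100, 90, 80, 70, 60], dtype=float) # test scores
b = np.linalg.inv(X.T @ X) @ X.T @ Y
print(b)  # -> [20.  0.5  0.5], i.e. yhat = 20 + 0.5*IQ + 0.5*hours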
Regression Types
1.Simple and Multiple Linear Regression
2.Logistic Regression
3.Polynomial Regression
4.Ridge Regression and Lasso Regression (upgrades to
Linear Regression)
5.Decision Trees Regression
6.Support Vector Regression (SVR)
Logistic Regression

• Logistic regression is used when the dependent variable (target) is categorical or binary. For example:
• To predict whether an email is spam (1) or not (0)
• Whether a tumor is malignant (1) or not (0)
• Logistic regression is widely used for classification problems.
• Logistic regression doesn't require a linear relationship between the dependent and independent variables.
• Logistic regression estimates the probability of an event belonging to a class, such as voted or didn't vote.
Logistic Regression Example
HR Analytics: IT firms recruit a large number of people, but one problem they encounter is that many candidates do not join after accepting the job offer. This results in cost overruns because the firm has to repeat the entire process. Now, when you get an application, can you predict whether that applicant is likely to join the organization (binary outcome: Join / Not Join)?
Why Logistic Regression

In linear regression, we draw a straight line (the best fit line) L1 such that the sum of the distances of all the data points to the line is minimal. For a binary target, however, such a line produces predictions below 0 and above 1, which cannot be interpreted as probabilities, so a different model is needed.
Logistic Regression Equation
It uses the sigmoid function to map any predicted value to a probability between 0 and 1.

Linear model: ŷ = b0 + b1x

Sigmoid function: σ(z) = 1/(1 + e^(−z))
Logistic regression model:
ŷ = σ(b0 + b1x) = 1/(1 + e^(−(b0 + b1x)))

The sigmoid is also called the logistic function; its inverse is the logit function.
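A minimal Python sketch of the model; the coefficients b0 and b1 here are hypothetical, purely for illustration:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))   # maps any real z to (0, 1)

b0, b1 = -4.0, 0.05   # hypothetical fitted coefficients
x = 100.0
p = sigmoid(b0 + b1 * x)   # predicted probability of the positive class
print(p)                   # ~0.73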


Logistic Regression Equation
Start from a probability p, which lies in the range 0 to 1.
To get a quantity in the range 0 to +infinity, take the odds: p/(1 − p).
To get a quantity in the range −infinity to +infinity, take the log-odds: ln(p/(1 − p)).
Setting the log-odds equal to the linear model, ln(p/(1 − p)) = b0 + b1x, and since we want to calculate the value of p, solving for p gives:
p = 1/(1 + e^(−(b0 + b1x)))
Logistic Regression Evaluation
In logistic regression, as the output is a probability value between 0 and 1, mean squared error wouldn't be the right choice. Instead, we use the log loss function, which is derived from the maximum likelihood estimation method.
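For true labels y in {0, 1} and predicted probabilities p, log loss is −(1/N)·Σ[y·ln(p) + (1 − y)·ln(1 − p)]. A minimal numpy sketch with hypothetical labels and probabilities:

import numpy as np

def log_loss(y, p):
    # Binary cross-entropy averaged over all observations; lower is better.
    return -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))

y = np.array([1, 0, 1, 1])          # hypothetical true labels
p = np.array([0.9, 0.2, 0.7, 0.6])  # hypothetical predicted probabilities
print(log_loss(y, p))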
Overfitting and Underfitting
Underfitting is a situation when your model is too simple for your data, or your hypothesis about the data distribution is wrong and too simple. For example, your data is quadratic and your model is linear.

This situation is also called high bias. It means that your algorithm makes consistent predictions, but the initial assumption about the data is incorrect, so the predictions are systematically off.
Overfitting and Underfitting
Overfitting is a situation when your model is too complex for your data. For example, your data is linear and your model is a high-degree polynomial.
This situation is also called high variance: changing the input data only a little changes the model output very much.
• low bias, low variance — a good result, just right.
• low bias, high variance — overfitting — the algorithm outputs very different predictions for similar data.
• high bias, low variance — underfitting — the algorithm outputs similar predictions for similar data, but the predictions are wrong (the algorithm "misses").
• high bias, high variance — a very bad algorithm; you will most likely never see this.
Overfitting and Underfitting
Regularization
• Regularization is one of the ways to improve a model's performance on unseen data, by down-weighting the less important features.
• Regularization helps reduce the validation loss and tries to improve the accuracy of the model.
• It avoids overfitting by adding a penalty to models with high variance.
Lasso Regression
LASSO stands for Least Absolute Shrinkage and Selection Operator.

Lasso regression performs L1 regularization, i.e. it adds a penalty equal to the sum of the absolute values of the coefficients to the optimization objective.

This type of regularization (L1) can lead to zero coefficients, i.e. some of the features are completely neglected when evaluating the output. So lasso regression not only helps in reducing over-fitting, it can also help us with feature selection.
Ridge Regression
In ridge regression, the cost function is altered by adding a penalty equivalent to the sum of the squares of the magnitudes of the coefficients.

The penalty weight (lambda) regularizes the coefficients, so that the optimization objective is penalized whenever the coefficients take large values.
Ridge regression shrinks the coefficients, which helps to reduce model complexity and multicollinearity.
It is also called "L2 regularization".
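A minimal scikit-learn sketch contrasting the two penalties on made-up data (alpha plays the role of lambda):

import numpy as np
from sklearn.linear_model import Lasso, Ridge

rng = np.random.default_rng(0)
X = rng.random((50, 5))
# Hypothetical target in which only features 0 and 3 actually matter.
y = X @ np.array([3.0, 0.0, 0.0, 1.5, 0.0]) + 0.1 * rng.standard_normal(50)

print(Lasso(alpha=0.1).fit(X, y).coef_)  # L1: typically drives irrelevant coefficients to exactly 0
print(Ridge(alpha=1.0).fit(X, y).coef_)  # L2: shrinks coefficients, but none become exactly 0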
Polynomial Regression
• A regression equation is a polynomial regression equation if the power of the independent variable is more than 1. For example:
y = b0 + b1x + b2x² + … + bnxⁿ
• In this regression technique, the best fit line is not a straight line. It is rather a curve that fits the data points.
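A minimal numpy sketch fitting a degree-2 polynomial to made-up data:

import numpy as np

x = np.array([1, 2, 3, 4, 5], dtype=float)
y = np.array([1.1, 4.2, 8.9, 16.1, 25.2])  # roughly quadratic, made up
coeffs = np.polyfit(x, y, deg=2)           # highest-degree coefficient first
y_hat = np.polyval(coeffs, x)              # fitted curve at the data points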
Lasso vs Ridge
In short: the L1 penalty (lasso) can shrink coefficients exactly to zero and so performs feature selection, while the L2 penalty (ridge) only shrinks coefficients toward zero.
Regression vs Classification
• Classification predictions can be evaluated using accuracy, whereas regression predictions cannot.
• Regression predictions can be evaluated using root mean squared error, whereas classification predictions cannot.
• A classification algorithm may predict a continuous value, but the continuous value is in the form of a probability for a class label.
• A regression algorithm may predict a discrete value, but the discrete value is in the form of an integer quantity.
