Unit 2 Topic 1: REGRESSION

Dr. Anil Kumar Dubey


Associate Professor,
Computer Science & Engineering Department,
ABES EC, Ghaziabad
Affiliated to Dr. A.P.J. Abdul Kalam Technical University,
Uttar Pradesh, Lucknow
Basic
• Regression is a statistical method that tries to determine the strength and character of the relationship between one dependent variable and a series of other variables. It is used in finance, investing, and other disciplines.

• Regression analysis is a set of statistical methods used for the estimation of relationships between a dependent variable and one or more independent variables. It can be utilized to assess the strength of the relationship between variables and for modeling the future relationship between them.
Conti…
Regression, a statistical approach, dissects
the relationship between dependent and
independent variables, enabling predictions
through various regression models.
Regression
Regression is a statistical approach used to
analyze the relationship between a
dependent variable (target variable) and one
or more independent variables (predictor
variables).
The objective is to determine the most suitable function that characterizes the connection between these variables.
Conti…
• It is a supervised machine learning technique used to predict the value of the dependent variable for new, unseen data. It models the relationship between the input features and the target variable, allowing for the estimation or prediction of numerical values.

• A problem is treated as a regression problem when the output variable is a real or continuous value, such as "salary" or "weight". Many different models can be used; the simplest is linear regression, which tries to fit the data with the best hyperplane through the points.
Linear Model Assumptions
• Linear regression analysis is based on six fundamental assumptions:
  ◦ The dependent and independent variables show a linear relationship (the model is linear in the slope and intercept).
  ◦ The independent variable is not random.
  ◦ The mean of the residual (error) is zero.
  ◦ The variance of the residual (error) is constant across all observations (homoscedasticity).
  ◦ The residual (error) values are not correlated across observations (no autocorrelation).
  ◦ The residual (error) values follow the normal distribution.
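As a quick, informal illustration of checking the residual-related assumptions above, the sketch below fits a line to synthetic data (all values are assumed for the example) and inspects the residuals' mean and spread; a proper diagnosis would use dedicated statistical tests and residual plots.

```python
# Minimal sketch (synthetic data): fitting a line and inspecting residuals
# to informally check the zero-mean and constant-variance assumptions.
import numpy as np

rng = np.random.default_rng(7)
x = np.linspace(0, 10, 100)
y = 1.5 + 2.0 * x + rng.normal(0, 1, size=x.size)   # assumed true line + noise

slope, intercept = np.polyfit(x, y, deg=1)           # ordinary least squares fit
residuals = y - (intercept + slope * x)

print("residual mean (should be ~0):", np.round(residuals.mean(), 3))
# Compare residual spread in the first and second half of the data
print("std, first half :", np.round(residuals[:50].std(), 3))
print("std, second half:", np.round(residuals[50:].std(), 3))
```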
Regression Types
• Simple Regression
  ◦ Used to predict a continuous dependent variable based on a single independent variable.
  ◦ Simple linear regression should be used when there is only a single independent variable.
Conti…
• Multiple Regression
  ◦ Used to predict a continuous dependent variable based on multiple independent variables.
  ◦ Multiple linear regression should be used when there are multiple independent variables.

• Nonlinear Regression
  ◦ The relationship between the dependent variable and independent variable(s) follows a nonlinear pattern.
  ◦ Provides flexibility in modeling a wide range of functional forms.
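As a hedged illustration of capturing a nonlinear pattern, the sketch below fits a quadratic trend to synthetic data with NumPy's polyfit; the data, noise level, and polynomial degree are all assumptions chosen only for the example.

```python
# Minimal sketch (synthetic data): fitting a quadratic (nonlinear) trend.
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 10, 50)
y = 2.0 + 0.5 * x**2 + rng.normal(0, 2, size=x.size)   # curved data + noise

# polyfit returns the best-fitting polynomial coefficients, highest degree first
coeffs = np.polyfit(x, y, deg=2)
y_hat = np.polyval(coeffs, x)
print("fitted coefficients:", coeffs)
```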
Simple Linear Regression
• Simple linear regression is a model that assesses the
relationship between a dependent variable and an
independent variable. The simple linear model is
expressed using the following equation:
Y = a + bX + ϵ
Where:
Y – Dependent variable
X – Independent (explanatory) variable
a – Intercept
b – Slope
ϵ – Residual (error)
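A minimal sketch of the equation above, assuming small synthetic data: it estimates the intercept a and slope b with scikit-learn and makes one prediction.

```python
# Minimal sketch (synthetic data): estimating a (intercept) and b (slope)
# for Y = a + bX + error using scikit-learn.
import numpy as np
from sklearn.linear_model import LinearRegression

X = np.array([[1], [2], [3], [4], [5]])      # single independent variable
y = np.array([3.1, 4.9, 7.2, 9.1, 10.8])     # dependent variable

model = LinearRegression().fit(X, y)
print("intercept a:", model.intercept_)
print("slope b:", model.coef_[0])
print("prediction for X=6:", model.predict([[6]])[0])
```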
Multiple Linear Regression
• Multiple linear regression analysis is essentially similar
to the simple linear model, with the exception that
multiple independent variables are used in the model.
The mathematical representation of multiple linear
regression is:
Y = a + bX1 + cX2 + dX3 + ϵ
Where:
Y – Dependent variable
X1, X2, X3 – Independent (explanatory) variables
a – Intercept
b, c, d – Slopes
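A minimal sketch of this multi-variable form, again on assumed synthetic data: scikit-learn estimates the intercept a and the slopes b, c, d for three predictor columns X1, X2, X3.

```python
# Minimal sketch (synthetic data): multiple linear regression
# Y = a + b*X1 + c*X2 + d*X3 + error with scikit-learn.
import numpy as np
from sklearn.linear_model import LinearRegression

X = np.array([[1, 4, 7],
              [2, 5, 8],
              [3, 6, 9],
              [4, 8, 10],
              [5, 9, 12]])                   # columns: X1, X2, X3
y = np.array([10.2, 13.1, 16.0, 19.4, 22.1])

model = LinearRegression().fit(X, y)
print("intercept a:", model.intercept_)
print("slopes b, c, d:", model.coef_)
```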
Terminologies related to Regression
• Response Variable: The primary factor to predict or understand in regression, also known as the dependent variable or target variable.

• Predictor Variable: Factors influencing the response variable, used to predict its values; also called independent variables.

• Outliers: Observations with significantly low or high values compared to others, potentially impacting results and best avoided.
Conti…
• Multicollinearity: High correlation among independent variables, which can complicate the ranking of influential variables (a small correlation check is sketched after this list).

• Underfitting and Overfitting: Overfitting occurs when an algorithm performs well on training but poorly on testing, while underfitting indicates poor performance on both datasets.
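A minimal sketch of the multicollinearity check mentioned above, using assumed synthetic features: it inspects the predictors' correlation matrix, where a pairwise correlation near 1 or -1 signals that two features carry nearly the same information.

```python
# Minimal sketch (synthetic features): detecting multicollinearity
# by inspecting the correlation matrix of the predictors.
import numpy as np

rng = np.random.default_rng(1)
x1 = rng.normal(size=100)
x2 = 2 * x1 + rng.normal(scale=0.1, size=100)   # nearly a copy of x1
x3 = rng.normal(size=100)                        # independent feature

X = np.column_stack([x1, x2, x3])
corr = np.corrcoef(X, rowvar=False)              # 3x3 correlation matrix
print(np.round(corr, 2))   # corr[0, 1] close to 1 => x1 and x2 are collinear
```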
Characteristics of Regression
• Continuous Target Variable: Regression deals with predicting continuous target variables that represent numerical values. Examples include predicting house prices, forecasting sales figures, or estimating patient recovery times.

• Error Measurement: Regression models are evaluated based on their ability to minimize the error between the predicted and actual values of the target variable. Common error metrics include mean absolute error (MAE), mean squared error (MSE), and root mean squared error (RMSE).
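A minimal sketch of the error metrics named above, with assumed example values for the true and predicted targets:

```python
# Minimal sketch (assumed example values): computing the common
# regression error metrics MAE, MSE, and RMSE.
import numpy as np

y_true = np.array([3.0, 5.0, 7.5, 9.0])
y_pred = np.array([2.5, 5.5, 7.0, 9.8])

errors = y_true - y_pred
mae = np.mean(np.abs(errors))        # mean absolute error
mse = np.mean(errors ** 2)           # mean squared error
rmse = np.sqrt(mse)                  # root mean squared error
print(f"MAE={mae:.3f}, MSE={mse:.3f}, RMSE={rmse:.3f}")
```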
Conti…
• Model Complexity: Regression models range from simple linear models to more complex nonlinear models. The choice of model complexity depends on the complexity of the relationship between the input features and the target variable.

• Overfitting and Underfitting: Regression models are susceptible to overfitting and underfitting.

• Interpretability: The interpretability of regression models varies depending on the algorithm used. Simple linear models are highly interpretable, while more complex models may be more difficult to interpret.
Conti…
• In ML, "overfitting" means a model has a high accuracy on the training data but a significantly lower accuracy on the testing data, indicating that the model has learned the training data too closely and cannot generalize well to new data.

• In ML, "underfitting" refers to a situation where both the training accuracy and testing accuracy are low, indicating that the model is too simple to capture the patterns in the data and is not performing well on either the training data or new unseen data.
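To make the train-versus-test gap concrete, here is a small sketch on synthetic data (the data and polynomial degrees are assumptions): a degree-1 model tends to underfit the curved signal, while a very high-degree model tends to overfit, which shows up as a high training R² but a lower test R².

```python
# Minimal sketch (synthetic data): illustrating underfitting vs. overfitting
# by comparing train/test R^2 scores of polynomial models of different degrees.
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(42)
X = np.sort(rng.uniform(0, 3, 60)).reshape(-1, 1)
y = np.sin(2 * X).ravel() + rng.normal(0, 0.2, 60)   # nonlinear signal + noise

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

for degree in (1, 4, 15):   # too simple, reasonable, too flexible
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    print(f"degree={degree:2d}  train R^2={model.score(X_train, y_train):.2f}"
          f"  test R^2={model.score(X_test, y_test):.2f}")
```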
Examples
• Predicting the age of a person
• Predicting the nationality of a person
• Predicting whether the stock price of a company will increase tomorrow
• Predicting whether a document is related to sightings of UFOs

• Note: Predicting the age of a person is a regression problem because the output is a real value. Predicting nationality is categorical, whether the stock price will increase is a discrete yes/no answer, and whether a document is related to UFOs is also a yes/no answer, so these three are classification problems rather than regression.
Linear Regression
Logistic Regression
Thanks
