Simple Linear Regression Analysis

Regression analysis is used to understand the relationship between two variables and predict the value of one variable based on another. A regression model contains an independent (predictor) variable and a dependent (response) variable. Linear regression estimates the coefficients of the linear equation that best predicts the dependent variable from the independent variables. The method of least squares is used to determine the regression line that minimizes the sum of the squared residuals.

Regression Analysis

Regression analysis is used to:
1) understand the relation between two variables;
2) predict the value of one variable based on another variable.

A regression model is comprised of a dependent (response) variable and an independent (predictor) variable.

Independent Variable(s) → Dependent Variable (prediction relationship)
Regression Analysis

Linear regression estimates the coefficients of the linear equation, involving one or more independent variables, that best predict the value of the dependent variable.

If you believe that none of your predictor variables is correlated with the errors in your dependent variable, you can use the linear regression procedure.
Simple Linear Regression

The scatter diagram is used to graphically investigate the relationship between the dependent and independent variables: a plot of all (X_i, Y_i) pairs.

[Figure: scatter plot of Y (0-100) against X (0-60)]
Types of Regression Models

[Figure: four scatter-plot panels — positive linear relationship, negative linear relationship, relationship not linear, no relationship]
Simple Linear Regression Model

Regression models are used to test whether a relationship exists between variables, that is, to use one variable to predict another. However, there is a random error that cannot be predicted.

Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i

where Y_i is the dependent (response) variable, X_i is the independent (predictor/explanatory) variable, \beta_0 is the Y-intercept, \beta_1 is the slope, and \varepsilon_i is the random error.
Population Linear Regression Model

Y_i = \beta_0 + \beta_1 X_i + \varepsilon_i

The observed value Y_i differs from the population regression line \mu_{Y|X} = \beta_0 + \beta_1 X_i by the random error \varepsilon_i.

[Figure: population regression line with an observed value and its random error \varepsilon_i marked]
Sample Linear Regression Model

\hat{y}_i = b_0 + b_1 x_i

\hat{y}_i = predicted value of Y for observation i
x_i = value of X for observation i
b_0 = sample Y-intercept, used as an estimate of the population \beta_0
b_1 = sample slope, used as an estimate of the population \beta_1
Sample Linear Regression Model

Sample data are used to estimate the true values for the intercept and slope:

\hat{y}_i = b_0 + b_1 x_i

The difference between the actual value of Y and the predicted value (using sample data) is known as the error (residual):

error = actual value − predicted value = y_i − \hat{y}_i
Sample Linear Regression Model

\hat{y}_i = b_0 + b_1 x_i

b_1 = \frac{n \sum_{i=1}^{n} x_i y_i - \left(\sum_{i=1}^{n} x_i\right)\left(\sum_{i=1}^{n} y_i\right)}{n \sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2}

b_0 = \bar{y} - b_1 \bar{x}
Table 3.1. Intelligence Test Scores and Freshmen Chemistry Grades

Student   Test Score (x)   Chemistry Grade (y)
   1           65                 85
   2           50                 74
   3           55                 76
   4           65                 90
   5           55                 85
   6           70                 87
   7           65                 94
   8           70                 98
   9           55                 81
  10           70                 91
  11           50                 76
  12           55                 74
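As a sketch, the least-squares formulas above can be applied to the Table 3.1 data directly in Python (the variable names are mine, not from the slides):

```python
# Table 3.1 data: intelligence test scores (x) and chemistry grades (y)
x = [65, 50, 55, 65, 55, 70, 65, 70, 55, 70, 50, 55]
y = [85, 74, 76, 90, 85, 87, 94, 98, 81, 91, 76, 74]
n = len(x)

# b1 = (n*sum(xy) - sum(x)*sum(y)) / (n*sum(x^2) - (sum(x))^2)
sum_x, sum_y = sum(x), sum(y)
sum_xy = sum(xi * yi for xi, yi in zip(x, y))
sum_x2 = sum(xi ** 2 for xi in x)

b1 = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
b0 = sum_y / n - b1 * sum_x / n  # b0 = ybar - b1 * xbar

print(round(b1, 3), round(b0, 3))  # 0.897 30.043
```

Full-precision arithmetic gives b_0 ≈ 30.043 (the value SPSS reports); the slides' 30.056 comes from rounding b_1 to 0.897 before computing b_0.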
Figure 3.1. Scatter diagram with regression line

[Figure: Chemistry Grade (70-100) plotted against Intelligence Test Score (40-75), with the fitted line \hat{y}_i = b_0 + b_1 x_i; the point estimates of b_0 and b_1 are determined using the method of least squares]
Measures of Variation: The Sum of Squares

SST = \sum (Y_i - \bar{Y})^2    (total sum of squares)
SSR = \sum (\hat{Y}_i - \bar{Y})^2    (regression sum of squares)
SSE = \sum (Y_i - \hat{Y}_i)^2    (error sum of squares)

where \hat{Y}_i = b_0 + b_1 X_i.
Method of Least Squares

SSE = \sum_{i=1}^{n} e_i^2 = \sum_{i=1}^{n} (y_i - b_0 - b_1 x_i)^2

Differentiating \sum_{i=1}^{n} e_i^2 with respect to b_0 and b_1 and equating the derivatives to zero yields

b_1 = \frac{n \sum_{i=1}^{n} x_i y_i - \left(\sum_{i=1}^{n} x_i\right)\left(\sum_{i=1}^{n} y_i\right)}{n \sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2}

b_0 = \bar{y} - b_1 \bar{x}
Method of Least Squares

The slope formula can be written equivalently in centered (deviation) form:

b_1 = \frac{n \sum_{i=1}^{n} x_i y_i - \left(\sum_{i=1}^{n} x_i\right)\left(\sum_{i=1}^{n} y_i\right)}{n \sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}
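The two forms are algebraically identical; a quick numerical check on the Table 3.1 data (a sketch, names mine):

```python
x = [65, 50, 55, 65, 55, 70, 65, 70, 55, 70, 50, 55]
y = [85, 74, 76, 90, 85, 87, 94, 98, 81, 91, 76, 74]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n

# centered (deviation) form
b1_centered = (sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
               / sum((xi - xbar) ** 2 for xi in x))

# raw-sums form
b1_raw = ((n * sum(xi * yi for xi, yi in zip(x, y)) - sum(x) * sum(y))
          / (n * sum(xi ** 2 for xi in x) - sum(x) ** 2))

print(round(b1_centered, 3), abs(b1_centered - b1_raw) < 1e-9)  # 0.897 True
```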
Applying the least-squares formulas to the data of Table 3.1 gives

b_1 = 0.897
b_0 = 30.056

\hat{y}_i = b_0 + b_1 x_i
\hat{y}_i = 30.056 + 0.897 x_i
Figure 3.1. Scatter diagram with regression line

[Figure: Chemistry Grade (70-100) against Intelligence Test Score (40-75), with the fitted line \hat{y}_i = 30.056 + 0.897 x_i]

The slope of 0.897 means that for each increase of one unit in Intelligence Test Score (X), the Chemistry Grade (Y) is estimated to increase by 0.897 units.
Using SPSS

Graphs → Scatter → Simple. To add the regression line, use the SPSS Chart Editor: Chart → Options → Fit Line → Regression.

[Figure: scatter plot of Chemistry Grade (70-100) against Test Score (40-80) with the regression (prediction) line; Rsq = 0.7438]
Using SPSS

Analyze → Regression → Linear

Coefficients(a)
                   Unstandardized        Standardized
                   Coefficients          Coefficients
Model              B         Std. Error  Beta      t       Sig.
1   (Constant)     30.043    10.137                2.964   .014
    Test Score       .897      .167      .862      5.389   .000
a. Dependent Variable: Chemistry Grade

\hat{y}_i = 30.043 + 0.897 x_i
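As a small usage sketch of the fitted equation (the function name and the example score of 60 are mine, not from the slides):

```python
# Fitted model from the SPSS output: y-hat = 30.043 + 0.897 * x
b0, b1 = 30.043, 0.897

def predict_grade(test_score):
    """Predicted Chemistry Grade for a given Test Score."""
    return b0 + b1 * test_score

print(round(predict_grade(60), 2))  # 83.86
```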
Using SPSS

Analyze → Regression → Linear

Model Summary(b)
Model   R        R Square   Adjusted R Square   Std. Error of the Estimate
1       .862(a)  .744       .718                4.319
a. Predictors: (Constant), Test Score
b. Dependent Variable: Chemistry Grade

R is the coefficient of correlation, R Square is the coefficient of determination, and the standard error of the estimate is the standard deviation around the regression line.

ANOVA(b)
Model           Sum of Squares   df   Mean Square   F        Sig.
1   Regression       541.693      1      541.693    29.036   .000(a)
    Residual         186.557     10       18.656
    Total            728.250     11
a. Predictors: (Constant), Test Score
b. Dependent Variable: Chemistry Grade

The ANOVA table reports the measures of variation: SSR = 541.693, SSE = 186.557, SST = 728.250.
Testing the Significance of b_1

Similar to a test on r in the one-predictor case:

t = (0.8972136 - 0)/0.1665043 = 5.39

H_0: \beta_1 = 0 is rejected, i.e. the regression line has a nonzero slope.
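The standard error and t above can be reproduced from the raw data using se(b_1) = \sqrt{MSE / S_{xx}} with MSE = SSE/(n-2) — a sketch; the formula and names are the standard ones, not taken from the slides:

```python
import math

x = [65, 50, 55, 65, 55, 70, 65, 70, 55, 70, 50, 55]
y = [85, 74, 76, 90, 85, 87, 94, 98, 81, 91, 76, 74]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n

Sxx = sum((xi - xbar) ** 2 for xi in x)
b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / Sxx
b0 = ybar - b1 * xbar

sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
mse = sse / (n - 2)                # residual mean square
se_b1 = math.sqrt(mse / Sxx)       # standard error of the slope
t = b1 / se_b1

print(round(se_b1, 4), round(t, 2))  # 0.1665 5.39
```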
Variance Explained - r^2

r^2 tells us the proportion of variance in Y which is explained by X:

r^2 = \frac{SS_{regression}}{SS_{total}} = \frac{\sum (\hat{Y}_i - \bar{Y})^2}{\sum (Y_i - \bar{Y})^2}

• a ratio reflecting the proportion of variance captured by our model relative to the overall variance in our data
• highly interpretable: r^2 = .50 means 50% of the variance in Y is explained by X
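The sums of squares for the Table 3.1 data can be checked against the SPSS ANOVA table (a sketch, names mine):

```python
x = [65, 50, 55, 65, 55, 70, 65, 70, 55, 70, 50, 55]
y = [85, 74, 76, 90, 85, 87, 94, 98, 81, 91, 76, 74]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
b1 = (sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
      / sum((xi - xbar) ** 2 for xi in x))
b0 = ybar - b1 * xbar
yhat = [b0 + b1 * xi for xi in x]

sst = sum((yi - ybar) ** 2 for yi in y)                # total
ssr = sum((yh - ybar) ** 2 for yh in yhat)             # regression
sse = sum((yi - yh) ** 2 for yi, yh in zip(y, yhat))   # error

r2 = ssr / sst
print(round(sst, 2), round(ssr, 3), round(r2, 3))  # 728.25 541.693 0.744
```

Note that SST = SSR + SSE, the decomposition the formula above relies on.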
Linear Regression Assumptions

For linear models:

1. Normality
   - Y values are normally distributed for each X
   - the probability distribution of the error is normal
2. Homoscedasticity (constant variance)
3. Independence of errors
Variation of Errors Around the Regression Line

y values are normally distributed around the regression line, and for each x value the "spread" or variance around the regression line is the same.

[Figure: normal error densities f(e) with equal spread centered on the regression line at X1 and X2]
Residual Analysis

Purposes:
• examine linearity
• evaluate violations of assumptions

Graphical analysis of residuals:
• plot residuals vs. X_i values
• residual = difference between the actual Y_i and the predicted \hat{Y}_i

Studentized residuals:
• allow consideration for the magnitude of the residuals
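A sketch of the residual quantities listed above for the one-predictor case, assuming the usual definitions (standardized residual e_i/s; internally studentized residual e_i/(s·sqrt(1 - h_ii)) with leverage h_ii = 1/n + (x_i - x̄)²/S_xx) — these formulas and names are my additions, not from the slides:

```python
import math

x = [65, 50, 55, 65, 55, 70, 65, 70, 55, 70, 50, 55]
y = [85, 74, 76, 90, 85, 87, 94, 98, 81, 91, 76, 74]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
Sxx = sum((xi - xbar) ** 2 for xi in x)
b1 = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / Sxx
b0 = ybar - b1 * xbar

e = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]   # raw residuals
s = math.sqrt(sum(ei ** 2 for ei in e) / (n - 2))   # root MSE

standardized = [ei / s for ei in e]
leverage = [1 / n + (xi - xbar) ** 2 / Sxx for xi in x]
studentized = [ei / (s * math.sqrt(1 - h)) for ei, h in zip(e, leverage)]
```

Studentizing divides by sqrt(1 - h_ii) < 1, so each studentized residual is at least as large in magnitude as the corresponding standardized one.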
Residual Analysis for Linearity

[Figure: two residual-vs-X plots — a patterned (curved) residual plot indicating "not linear" and a patternless one indicating "linear"]
Residual Analysis for Homoscedasticity

[Figure: two standardized-residual-vs-X plots — a funnel-shaped spread indicating heteroscedasticity and an even band indicating homoscedasticity]
Using Standardized Residuals

After fitting the model, predict:
• the Chemistry Grade (fitted value)
• the residual
• the studentized residual
• the standardized residual
Residual Analysis for Normality

kdensity r, normal
swilk r

[Figure: kernel density estimate of the residuals (kernel = epanechnikov, bandwidth = 2.25), range −10 to 10, overlaid with the normal density — approximately normal]
Residual Analysis for Linearity

scatter r X, yline(0)

[Figure: residuals (−5 to 5) vs. Test Score (50-70), scattered evenly about the zero line — linear]
Residual Analysis for Homoscedasticity

scatter r1 X, yline(0)   (using standardized residuals)
scatter sr X, yline(0)   (using studentized residuals)

[Figure: standardized and studentized residuals (−2 to 2) vs. Test Score (50-70), even bands about zero — homoscedasticity]
Residual Analysis for Homoscedasticity

hettest

→ Homoscedasticity
Residual Analysis for Independence

scatter r obs, yline(0)

[Figure: residuals (−5 to 5) vs. observation number (0-15), no pattern across observations — independent]
Residual Analysis for Independence

Durbin-Watson statistic. The D-W statistic is defined as

d = \frac{\sum_{t=2}^{n} (e_t - e_{t-1})^2}{\sum_{t=1}^{n} e_t^2}

where 0 ≤ d ≤ 4; values of d near 2 indicate independent errors.
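A sketch of the D-W computation on the example regression's residuals, assuming the observations are in time/collection order (names mine):

```python
# Table 3.1 data in observation order
x = [65, 50, 55, 65, 55, 70, 65, 70, 55, 70, 50, 55]
y = [85, 74, 76, 90, 85, 87, 94, 98, 81, 91, 76, 74]
n = len(x)
xbar, ybar = sum(x) / n, sum(y) / n
b1 = (sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
      / sum((xi - xbar) ** 2 for xi in x))
b0 = ybar - b1 * xbar
e = [yi - (b0 + b1 * xi) for xi, yi in zip(x, y)]  # residuals

# d = sum_{t=2}^{n} (e_t - e_{t-1})^2 / sum_{t=1}^{n} e_t^2
d = sum((e[t] - e[t - 1]) ** 2 for t in range(1, n)) / sum(et ** 2 for et in e)
print(round(d, 2))  # 0 <= d <= 4; values near 2 suggest uncorrelated errors
```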