Data Driven Decision Making:

Simple Linear Regression Analysis

GSBA 545, Fall 2024
Professor Dawn Porter
Simple Linear Regression Analysis
• Simple linear regression (SLR) model

• Regression model assumptions

⎼ normality, independence, linearity & constant variance

• Inference
⎼ F and t-testing

• Confidence & prediction intervals

Simple Linear Regression Model
\[ Y = \beta_0 + \beta_1 X + \varepsilon \]

β0 is the Y-intercept, or mean of Y when X is 0*
β1 is the slope, or change in the mean of Y per unit change in X
ε is the error term describing the leftover effect on Y

* Note: Be careful… you need to have data where X is 0 for this to make sense.
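
To make the three pieces of the model concrete, here is a minimal simulation sketch. The parameter values, the X values, and the function name `simulate_slr` are hypothetical choices for illustration and are not taken from the course data. It assumes normally distributed errors with a constant standard deviation `sigma`.

```python
import random

def simulate_slr(xs, beta0, beta1, sigma, seed=0):
    """Generate Y = beta0 + beta1 * X + eps, with eps drawn from Normal(0, sigma)."""
    rng = random.Random(seed)
    return [beta0 + beta1 * x + rng.gauss(0.0, sigma) for x in xs]

# Hypothetical horsepower-like X values and made-up parameters.
xs = [60, 90, 120, 150, 180, 210, 240]
ys_exact = simulate_slr(xs, beta0=-1400.0, beta1=145.0, sigma=0.0)     # line of means only
ys_noisy = simulate_slr(xs, beta0=-1400.0, beta1=145.0, sigma=6000.0)  # line plus error term

for x, y_mean, y_obs in zip(xs, ys_exact, ys_noisy):
    print(f"X={x:3d}  mean of Y={y_mean:9.1f}  observed Y={y_obs:9.1f}")
```

With `sigma=0.0` every point falls exactly on the line of means, so the gap between the two printed columns is the error term ε for each simulated observation.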
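Ordinary Least Squares (OLS) Estimation
Fitted model: \( \widehat{\text{Ave\$}} = -1398.77 + 145.371\,HP \)

[Figure: plot showing the true line of means and the least squares line, \( \hat{Y} = -1399 + 145X \)]

Optional background calculations:

Slope (b1)
\[ b_1 = \frac{\sum (X_i - \bar{X})(Y_i - \bar{Y})}{\sum (X_i - \bar{X})^2} = r_{XY}\,\frac{s_Y}{s_X} = 145.37 \]

Y-Intercept (b0)
\[ b_0 = \bar{Y} - b_1 \bar{X} = 19509.68 - 145.37(143.83) = -1399, \quad \text{where } \bar{Y} = 19509.68,\ \bar{X} = 143.83 \]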
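
The slope and intercept formulas can be sketched in a few lines of plain Python. The toy `hp` and `price` lists and the helper name `ols_fit` are hypothetical (the Ave$/HP data set itself is not reproduced in the slides). Only the last step uses numbers printed on the slide.

```python
from statistics import mean

def ols_fit(xs, ys):
    """Return (b0, b1) for the least squares line Y-hat = b0 + b1 * X."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b1 = sxy / sxx              # slope
    b0 = y_bar - b1 * x_bar     # intercept
    return b0, b1

# Hypothetical toy data, NOT the Ave$/HP data from the slides.
hp = [100, 120, 140, 160, 180]
price = [13000, 16500, 18500, 22500, 24500]

b0, b1 = ols_fit(hp, price)
print(f"toy fit: b0 = {b0:.2f}, b1 = {b1:.2f}")

# Intercept step repeated with the summary values shown on the slide.
b0_slide = 19509.68 - 145.37 * 143.83
print(f"slide intercept: {b0_slide:.2f}")
```

The slide intercept comes out slightly different from the software value of -1398.77 because the slide's means and slope are rounded before the subtraction.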
Ave$ and HP Model: Excel
[Figure: Excel model setup screenshot]
Ave$ and HP Output: Excel
[Figure: Excel regression output screenshot, annotated as follows]
• % of variation in Ave Price explained when incorporating HP
• How much we expect to be off, on average, when predicting Ave Price
• Coefficients for regression equation
• Determines if model, overall, is significant or useful for predicting Ave Price
• Determines if HP is significant or useful for predicting Ave Price
Ave$ and HP Model: JMP
[Figure: JMP Fit Model setup screenshot]
Ave$ and HP Output: JMP
[Figure: JMP output screenshot, annotated as follows]
• % of variation in Ave Price explained when incorporating HP
• How much we expect to be off, on average, when predicting Ave Price
• Determines if entire model, overall, is significant or useful for predicting Ave Price
• Coefficients for regression equation
• Determines if HP is significant or useful for predicting Ave Price
Ave$ and HP Model: Python
[Figure: Python model-fitting code screenshot]
Ave$ and HP Output: Python (ANOVA)

Python doesn’t automatically create a full ANOVA table or report the RMSE (Standard Error) value, so a little more code is necessary.
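
The slides do not reproduce the extra code, so the following is only a sketch of what "a little more code" could look like. It builds the ANOVA quantities and the RMSE by hand with the standard library rather than with whichever package the course used. The function name `anova_table` and the toy data are hypothetical.

```python
from math import sqrt
from statistics import mean

def anova_table(xs, ys):
    """Regression ANOVA quantities for a one-predictor (k = 1) least squares fit."""
    n, k = len(xs), 1
    x_bar, y_bar = mean(xs), mean(ys)
    b1 = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
          / sum((x - x_bar) ** 2 for x in xs))
    b0 = y_bar - b1 * x_bar
    fitted = [b0 + b1 * x for x in xs]
    ssr = sum((f - y_bar) ** 2 for f in fitted)           # explained variation
    sse = sum((y - f) ** 2 for y, f in zip(ys, fitted))   # unexplained variation
    msr, mse = ssr / k, sse / (n - k - 1)
    return {
        "df_model": k, "df_error": n - k - 1,
        "SSR": ssr, "SSE": sse, "SST": ssr + sse,
        "MSR": msr, "MSE": mse,
        "F": msr / mse, "RMSE": sqrt(mse), "R2": ssr / (ssr + sse),
    }

# Hypothetical toy data, NOT the Ave$/HP data from the slides.
hp = [100, 120, 140, 160, 180]
price = [13000, 16500, 18500, 22500, 24500]

table = anova_table(hp, price)
for name, value in table.items():
    print(f"{name:>8}: {value:,.4f}")
```

Each entry maps onto a callout in the output slides: `R2` is the % of variation explained, `RMSE` is how far off we expect to be on average, and `F` feeds the overall model test.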
Ave$ and HP Output: Python
[Figure: Python regression summary screenshot, dated Fri, 11 Oct 2024, annotated as follows]
• % of variation in Ave Price explained when incorporating HP
• Determines if entire model, overall, is significant or useful for predicting Ave Price
• How much we expect to be off, on average, when predicting Ave Price (RMSE)
• Coefficients for regression equation
• Determines if HP is significant or useful for predicting Ave Price
Model Assumptions
Assumptions about the model error terms
1. Constant Variance (Homoscedasticity): Variance of error terms, σ², is the same for all values of X.
2. Normality: Error terms follow a normal distribution for all values of X.
3. Independence: Error terms are statistically independent of each other.
4. Linearity: Linear in parameters.
Model Assumptions: Constant Variance

If non-constant variance exists, output results cannot be fully trusted.

Measures should be taken to obtain random error scattering.
Model Assumptions: Normality

An approximately normal distribution of Y values is assumed at each level of X, allowing us to create confidence and prediction intervals.
Model Assumptions: Independence
If there is a dependence between rows, usually seen in data assessed over time, there will probably be an issue with independence.

Other methods need to be employed to incorporate the dependence.
Model Assumptions: Linearity
If there is a nonlinear relationship in the data, OLS will not perform well.

Incorporating transformations to variables may help uncover the true relationships.
Standard Error of Estimate (RMSE)

\[ \text{Std Error} = RMSE = S_e = \sqrt{\frac{\sum (Y_i - \hat{Y}_i)^2}{n-k-1}} = \sqrt{\frac{\sum e_i^2}{n-k-1}} \]

• Measures the standard deviation of the residuals (actual minus predicted values)
• Measures average error of estimate
• Denoted by “Standard Error” in Excel and “Root Mean Square Error (RMSE)” in JMP and other programs
• Affects parameter significance & prediction accuracy

* n = number of observations and k = number of independent variables in the model.
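
As a small illustration of the formula, the sketch below computes the standard error of estimate from actual and predicted values. The helper name `rmse` and the toy values are hypothetical. The last lines take the square root of the MSE reported in the course output (35723966) to show where the RMSE of about 5977 used in the interval slides comes from.

```python
from math import sqrt

def rmse(actual, predicted, k=1):
    """Standard error of estimate: sqrt(SSE / (n - k - 1))."""
    n = len(actual)
    if n - k - 1 <= 0:
        raise ValueError("need more observations than estimated parameters")
    sse = sum((y - y_hat) ** 2 for y, y_hat in zip(actual, predicted))
    return sqrt(sse / (n - k - 1))

# Hypothetical actual and predicted prices, NOT the course data.
actual = [13000, 16500, 18500, 22500, 24500]
predicted = [13200, 16100, 19000, 21900, 24800]
print(f"toy RMSE: {rmse(actual, predicted):.2f}")

# MSE value as reported in the course output; its square root is the RMSE.
mse_from_output = 35723966
print(f"course RMSE: {sqrt(mse_from_output):.0f}")
```

Dividing by n - k - 1 rather than n is what makes this a standard error of estimate and not a plain root mean square of the residuals.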

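Measures of Variation
Prediction (x = 210): \( \hat{Y} = b_0 + b_1 X = -1399 + 145.37(210) = \$29{,}129 \)

[Figure: plot at x = 210 marking \( Y_{52} = \$36{,}100 \), \( \hat{Y}_{52} = \$29{,}129 \), and \( \bar{Y} = \$19{,}510 \)]

Total deviation: \( Y_{52} - \bar{Y} = 36{,}100 - 19{,}510 = \$16{,}590 \)
Unexplained deviation (residual): \( e_{52} = Y_{52} - \hat{Y}_{52} = 36{,}100 - 29{,}129 = \$6971 \)
Explained deviation: \( \hat{Y}_{52} - \bar{Y} = 29{,}129 - 19{,}510 = \$9619 \)

The model improved our prediction for that car by $9619 versus using just the mean.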
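
The three dollar amounts on this slide fit together as one identity: the distance from the mean splits into the part the model explains and the part it leaves as error. A short arithmetic check, using only the values printed on the slide for observation 52 (the variable names are illustrative):

```python
# Values for observation 52 as printed on the slide (dollars).
y_actual = 36100   # observed Ave$
y_hat = 29129      # predicted Ave$ at HP = 210
y_bar = 19510      # mean Ave$

total = y_actual - y_bar         # deviation from the mean
explained = y_hat - y_bar        # part captured by the regression line
unexplained = y_actual - y_hat   # residual e_52

print(f"total = {total}, explained = {explained}, unexplained = {unexplained}")
print("total equals explained + unexplained:", total == explained + unexplained)
```

Squaring and summing these pieces over all observations gives the sums of squares (total, regression, error) that appear in the ANOVA table.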
F-test for Overall Model: Excel
Testing \( H_0: \beta_1 = 0 \) vs \( H_a: \beta_1 \neq 0 \) at the α level of significance.
Reject H0 if: p-value (Sig F) < α

p-value ≈ 0.000 < 0.05 = α

⇒ Reject H0, so HP is useful
F-test for Overall Model: JMP

\[ F = \frac{MSR}{MSE} = \frac{5.3331 \times 10^{9}}{35723966} = 149.29 > 3.946 = F_{0.05,\,1,\,91} \]

(numerator df = k = 1, denominator df = n − k − 1 = 91)

p-value ≈ 0.000 < 0.05 = α
⇒ Reject H0, so HP is useful
F-test for Overall Model: Python

\[ F = \frac{MSR}{MSE} = \frac{5.3331 \times 10^{9}}{3.572397 \times 10^{7}} = 149.29 > 3.946 = F_{0.05,\,1,\,91} \]

p-value ≈ 0.000 < 0.05 = α

⇒ Reject H0, so HP is useful
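
The F ratio can be reproduced from the two mean squares in the output. This sketch uses the MSR, MSE, and critical value quoted on the slides, plus the slope t statistic reported in the course output (12.218). The function name `f_statistic` is illustrative. It also checks the known property that, with a single predictor, F equals the square of the slope's t statistic.

```python
def f_statistic(msr, mse):
    """Overall model F ratio: mean square regression over mean square error."""
    return msr / mse

msr = 5.3331e9       # mean square regression, from the output
mse = 35723966       # mean square error, from the output
f_crit = 3.946       # F(0.05; 1, 91) as quoted on the slide

f_value = f_statistic(msr, mse)
reject_h0 = f_value > f_crit
print(f"F = {f_value:.2f}, reject H0: {reject_h0}")

# With one predictor, F should match t squared (up to rounding in the inputs).
t_slope = 12.218
print(f"t^2 = {t_slope ** 2:.2f}")
```

The small gap between F and t² here comes only from the rounded inputs, which is why the overall test and the slope test always agree in simple linear regression.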
Slope Significance: Standard Error (\( s_{b_1} \))
Describes the possible sample-to-sample variability of b1.

\[ s_{b_1} = \frac{RMSE}{\sqrt{n-1}} \times \frac{1}{s_x} \]

• As RMSE increases, so does \( s_{b_1} \)
• As n increases, \( s_{b_1} \) decreases
• As \( s_x \) (std deviation of X) increases, \( s_{b_1} \) decreases
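
A quick numerical sketch of the three bullets follows. RMSE (5977) and n (93) come from the course output. The value `s_x = 52.37` is NOT on the slides: it is an assumed value, back-calculated so that the formula reproduces the reported standard error of about 11.898. The function name `slope_std_error` is illustrative.

```python
from math import sqrt

def slope_std_error(rmse, n, s_x):
    """s_b1 = RMSE / (sqrt(n - 1) * s_x)."""
    return rmse / (sqrt(n - 1) * s_x)

# rmse and n from the course output; s_x is an assumed, back-calculated value.
rmse, n, s_x = 5977.0, 93, 52.37

base = slope_std_error(rmse, n, s_x)
print(f"s_b1 = {base:.3f}")

# Each bullet on the slide, checked by changing one input at a time.
print("larger RMSE -> larger s_b1:", slope_std_error(2 * rmse, n, s_x) > base)
print("larger n    -> smaller s_b1:", slope_std_error(rmse, 4 * n, s_x) < base)
print("larger s_x  -> smaller s_b1:", slope_std_error(rmse, n, 2 * s_x) < base)
```

The practical reading is that a wider spread of X values and more observations both pin the slope down more tightly, while noisier data loosens it.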
Slope Significance Test
If regression assumptions hold, we can reject \( H_0: \beta_1 = 0 \) in favor of \( H_a: \beta_1 \neq 0 \) at the α level of significance if and only if the corresponding p-value < α (usually 0.05).

Test Statistic
\[ t = \frac{b_1 - \beta_1}{s_{b_1}} \quad (\beta_1 = 0 \text{ under } H_0) \]

95% Confidence Interval for β1
\[ b_1 \pm t_{0.025,\,df}\; s_{b_1} \]

* tα, tα/2, and p-values are based on n–k–1 degrees of freedom, found as df Residual on Excel output
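
Both formulas can be evaluated with the numbers reported in the course output: b1 = 145.371, s_b1 = 11.898, and the critical value t(0.025, 91) = 1.986. The helper name `slope_t_and_ci` is illustrative, and the critical value is passed in as given rather than computed.

```python
def slope_t_and_ci(b1, se_b1, t_crit, beta1_null=0.0):
    """Return the t statistic for H0: beta1 = beta1_null and the CI for beta1."""
    t_stat = (b1 - beta1_null) / se_b1
    margin = t_crit * se_b1
    return t_stat, (b1 - margin, b1 + margin)

# Values as reported in the course output.
t_stat, (ci_low, ci_high) = slope_t_and_ci(b1=145.371, se_b1=11.898, t_crit=1.986)

print(f"t = {t_stat:.3f}")
print(f"95% CI for beta1: ({ci_low:.2f}, {ci_high:.2f})")
print("interval excludes 0:", ci_low > 0 or ci_high < 0)
```

An interval that excludes 0 leads to the same conclusion as a p-value below 0.05 in the two-sided test.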
Slope Significance: Excel Output
\[ t = \frac{b_1}{s_{b_1}} = \frac{145.371}{11.898} = 12.218 \]
The slope of HP is > 12 std errors away from being 0 (or worthless).

p-value ≈ 0.000 < 0.05 = α

⇒ Reject H0, so HP is useful
Slope Significance: JMP Output

\[ t = \frac{b_1}{s_{b_1}} = \frac{145.371}{11.898} = 12.22 > 1.986 = t_{0.025,\,91} \]

p-value ≈ 0.000 < 0.05 = α

⇒ Reject H0, so HP is useful
Slope Significance: Python Output

\[ t = \frac{b_1}{s_{b_1}} = \frac{145.371}{11.898} = 12.22 > 1.986 = t_{0.025,\,91} \]

p-value ≈ 0.000 < 0.05 = α

⇒ Reject H0, so HP is useful
Estimation: Prediction Intervals
Prediction (X = x)
\[ \hat{Y} = b_0 + b_1 x \]
Ave$ (HP = 210): \( \hat{Y} = b_0 + b_1 x = -1399 + 145.37(210) = \$29{,}129 \)

A 95% prediction interval for an individual value of Y is

95% PI: \( \hat{Y} \pm t_{0.025,\,df\,Error} \times S_e \) *

A 95% PI for Ave$ when HP = 210:

\[ 29129 \pm 1.986 \times 5977 = (\$17{,}259,\ \$40{,}999) \]

* In JMP and other programs, Se is denoted RMSE, or Root Mean Square Error.
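
The prediction interval arithmetic can be reproduced as follows, using only numbers from the slides. The function name `approx_prediction_interval` is illustrative. Note that the slide's formula is a simplified form. The exact textbook interval multiplies Se by sqrt(1 + 1/n + (x − x̄)² / Σ(xᵢ − x̄)²), so it is somewhat wider, especially for x far from the mean.

```python
def approx_prediction_interval(y_hat, t_crit, s_e):
    """Simplified PI from the slide: y_hat +/- t_crit * S_e."""
    margin = t_crit * s_e
    return y_hat - margin, y_hat + margin

# Rounded coefficients and inputs as used on the slide.
b0, b1, hp = -1399, 145.37, 210
y_hat = round(b0 + b1 * hp)          # point prediction in dollars

pi_low, pi_high = approx_prediction_interval(y_hat, t_crit=1.986, s_e=5977)
print(f"prediction: ${y_hat:,}")
print(f"95% PI: (${pi_low:,.0f}, ${pi_high:,.0f})")
```

The interval is wide because it has to cover a single car's price, which varies around the line by roughly one RMSE.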
Estimation: Confidence Intervals
Prediction (X = x)
\[ \hat{Y} = b_0 + b_1 x \]
Ave$ (HP = 210): \( \hat{Y} = b_0 + b_1 x = -1399 + 145.37(210) = \$29{,}129 \)

A 95% confidence interval for the mean value of Y is

95% CI: \( \hat{Y} \pm t_{0.025,\,df\,Error} \dfrac{S_e}{\sqrt{n}} \) *

A 95% CI for Ave$ when HP = 210:

\[ 29129 \pm 1.986 \times \frac{5977}{\sqrt{93}} = (\$27{,}898,\ \$30{,}360) \]

* In JMP and other programs, Se is denoted RMSE, or Root Mean Square Error.
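
The confidence interval for the mean uses the same ingredients, with Se divided by √n. The sketch below reproduces the slide's numbers and compares the width with the simplified prediction interval. The function name `approx_mean_ci` is illustrative. As with the prediction interval, this is the slide's simplified form. The exact textbook version uses Se × sqrt(1/n + (x − x̄)² / Σ(xᵢ − x̄)²).

```python
from math import sqrt

def approx_mean_ci(y_hat, t_crit, s_e, n):
    """Simplified CI for the mean of Y from the slide: y_hat +/- t_crit * S_e / sqrt(n)."""
    margin = t_crit * s_e / sqrt(n)
    return y_hat - margin, y_hat + margin

# Inputs as used on the slide.
y_hat, t_crit, s_e, n = 29129, 1.986, 5977, 93

ci_low, ci_high = approx_mean_ci(y_hat, t_crit, s_e, n)
print(f"95% CI for mean Ave$: (${ci_low:,.0f}, ${ci_high:,.0f})")

# Width comparison against the simplified prediction interval (t_crit * S_e each side).
ci_width = ci_high - ci_low
pi_width = 2 * t_crit * s_e
print(f"CI width = {ci_width:,.0f}, PI width = {pi_width:,.0f}")
```

The confidence interval is much narrower because it describes the average price of all cars with HP = 210, not the price of one particular car.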
