LGT2425 Lecture 3 Part II (Notes)

This document provides an overview of multiple linear regression analysis. It discusses estimating a multiple regression model using the least squares method to minimize the sum of squared errors. It also covers calculating R-squared, adjusted R-squared, and using Excel's regression tool to estimate a multiple regression model using data from Butler Trucking Company on miles traveled, deliveries, and travel time. Finally, it discusses checking the assumptions of a multiple regression model by examining residual plots.


LGT2425 INTRODUCTION

TO BUSINESS ANALYTICS
Lecture 3: Linear Regression (Part II)

1
Multiple regression model
■ Multiple regression model

y = β0 + β1x1 + β2x2 + … + βqxq + ε
■ y = Dependent variable
■ x1, x2,…,xq = Independent variables
■ β0, β1,…, βq = Parameters (βi represents the change in the mean value of
the dependent variable y that corresponds to a one-unit increase in
the independent variable xi, holding the values of all other
independent variables constant)
■ ε = Error term (accounts for the variability in y that cannot be
explained by the linear effect of the q independent variables)
2
Estimated multiple regression model

■ Estimated multiple regression model

ŷ = b0 + b1x1 + b2x2 + … + bqxq
3
Least squares method and multiple regression

■ The least squares method is used to develop the estimated multiple
regression equation: find the values b0, b1, b2, …, bq that satisfy

min Σ (yi − ŷi)² = min Σ ei²   (sums over i = 1, …, n)

4
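The least squares fit can be sketched in a few lines of Python with NumPy. This is a minimal illustration using only the first five of the 300 Butler Trucking observations shown later in these notes, so the coefficients will not match the slides' full-sample output.

```python
import numpy as np

# First five Butler Trucking observations (miles x1, deliveries x2, time y)
X = np.array([[100.0, 4.0],
              [ 50.0, 3.0],
              [100.0, 4.0],
              [100.0, 2.0],
              [ 50.0, 2.0]])
y = np.array([9.3, 4.8, 8.9, 6.5, 4.2])

# Prepend a column of ones for the intercept b0, then solve
# min sum (y_i - yhat_i)^2 for b = (b0, b1, b2)
A = np.column_stack([np.ones(len(y)), X])
b, *_ = np.linalg.lstsq(A, y, rcond=None)

e = y - A @ b        # residuals e_i = y_i - yhat_i
sse = float(e @ e)   # the sum of squared errors the method minimizes
```

With an intercept in the model, the least squares residuals always sum to zero, which is a quick sanity check on the fit.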
Multiple regression model

5
Extension of Butler Trucking Company
■ Butler Trucking Company
– The estimated simple linear regression equation is
ŷi =1.2739+0.0678xi
– The linear effect of the number of miles traveled explains
66.41% of the variability in travel time in the sample data
(r2=0.6641)
– 33.59% of the variability in sample travel times remains
unexplained
– The managers want to consider adding one or more
independent variables, such as number of deliveries, to
the model to explain some of the remaining variability in
the dependent variable
– 300 observations are used this time
6
Assignment Miles (x1) Deliveries (x2) Time (y)
1 100.0 4.0 9.3
2 50.0 3.0 4.8
3 100.0 4.0 8.9
4 100.0 2.0 6.5
5 50.0 2.0 4.2

290 85.0 2.0 7.8


291 75.0 2.0 6.5
292 70.0 2.0 6.1
293 75.0 4.0 7.2
294 70.0 6.0 8.9
295 95.0 6.0 10.9
296 50.0 4.0 7.2
297 50.0 1.0 3.5
298 85.0 2.0 8.0
299 100.0 2.0 7.8
300 65.0 6.0 10.0

7
Estimated multiple regression model

■ Estimated multiple linear regression with two independent variables

ŷ = b0 + b1x1 + b2x2

■ ŷ = estimated mean travel time


■ x1=distance travelled
■ x2=number of deliveries
■ SST, SSR, SSE and r2 are computed

8
Excel’s regression tool

9
Excel’s regression tool

10
Excel’s regression tool

ŷ = 0.1273 + 0.0672x1 + 0.6900x2 (r2 = SSR/SST = 915.5161/1120.1032 = 81.73%)

11
Calculate SSR, SST and r2
Assignment Miles (x1) Deliveries (x2) Time (y) Predicted y Mean y (y − ȳ)² (SST term) (ŷ − ȳ)² (SSR term)
1 100.0 4.0 9.3 9.6055 7.2840 4.0643 5.3894
2 50.0 3.0 4.8 5.5564 7.2840 6.1703 2.9845
3 100.0 4.0 8.9 9.6055 7.2840 2.6115 5.3894
4 100.0 2.0 6.5 8.2255 7.2840 0.6147 0.8864
5 50.0 2.0 4.2 4.8664 7.2840 9.5111 5.8447

290 85.0 2.0 7.8 7.2178 7.2840 0.2663 0.0044


291 75.0 2.0 6.5 6.5460 7.2840 0.6147 0.5447
292 70.0 2.0 6.1 6.2101 7.2840 1.4019 1.1534
293 75.0 4.0 7.2 7.9260 7.2840 0.0071 0.4121
294 70.0 6.0 8.9 8.9700 7.2840 2.6115 2.8428
295 95.0 6.0 10.9 10.6496 7.2840 13.0755 11.3272
296 50.0 4.0 7.2 6.2464 7.2840 0.0071 1.0766
297 50.0 1.0 3.5 4.1764 7.2840 14.3187 9.6570
298 85.0 2.0 8.0 7.2178 7.2840 0.5127 0.0044
299 100.0 2.0 7.8 8.2255 7.2840 0.2663 0.8864
300 65.0 6.0 10.0 8.6341 7.2840 7.3767 1.8229
Mean y 7.2840 Total 1120.1032 915.5161

12
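The r² reported above is just the ratio of the two column totals in the table; a quick arithmetic check:

```python
# Column totals from the table above
sst = 1120.1032   # total sum of squares
ssr = 915.5161    # regression sum of squares

r2 = ssr / sst    # coefficient of determination, ~0.8173
sse = sst - ssr   # unexplained variation left over
```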
Adjusted r2

■ r2 never decreases when a new x variable is added to the model


■ This can be a disadvantage when comparing models
■ What is the net effect of adding a new variable?
– We lose a degree of freedom when a new x variable is added
– Did the new x variable add enough explanatory power to offset the loss of one degree
of freedom?

r²adj = 1 − (1 − r²)(n − 1)/(n − k − 1)

(where n = sample size, k = number of independent variables)
– Interpreted as the percentage of the total sum of squares that can be explained by
using the estimated regression equation adjusted for the number of x variables used
– Smaller than r2
– Useful in comparing among models
13
r²adj = 1 − (1 − r²)(n − 1)/(n − k − 1)

Adjusted r² = 1 − [(1 − 0.8173)(300 − 1)/(300 − 2 − 1)] = 0.8161 = 81.61%

14
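The worked example above translates directly into code:

```python
n, k = 300, 2   # sample size and number of independent variables
r2 = 0.8173

# Adjusted r-squared penalizes the lost degree of freedom per added variable
r2_adj = 1 - (1 - r2) * (n - 1) / (n - k - 1)
```

As expected, the adjusted value (0.8161) is slightly smaller than r² itself.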
Estimated multiple regression model

[Figure: estimated multiple regression model plotted against the independent variables X1 and X2]

15
Inference and regression

Simple linear regression model Multiple regression model


16
Inference and regression
■ Statistical inference
– Process of making estimates and drawing conclusions about one
or more characteristics of a population (the value of one or more
parameters) through the analysis of sample data drawn from the
population
■ Inference is commonly used to estimate and draw conclusions on
– The regression parameters β0, β1,…, βq
– The mean value and/or the predicted value of the dependent
variable y for specific values of the independent variables x1,
x2,…,xq
■ Consider both hypothesis testing and interval estimation
17
Inference and regression
■ Three regression conditions
– The population of potential error terms ε is normally distributed with
a mean of 0
– The population of potential error terms ε has a constant variance
– The values of ε are statistically independent
■ The errors must satisfy these conditions in order for inferences about the regression parameters to be valid
■ How to check?
– Residual plots to check for violations of regression conditions
– Residuals vs. ŷi
– Residuals vs. Xi

18
■ When using a normal probability plot, normally distributed errors will fall approximately along a straight line

[Normal probability plot: percent vs. residual, with points lying close to a straight line]

19
[Residual histograms: symmetrically distributed around 0 vs. not symmetrically distributed around 0]

20
[Residual plots against x: non-constant variance vs. constant variance]

21
[Residual plots against x: not independent vs. independent]

Independent errors show no trend: the residuals contain no remaining pattern that the model could have explained.

22
When residuals do not meet conditions

■ An important independent variable has been omitted


■ The functional form of the model is inadequate to explain the
relationship between the independent variables and the dependent
variable

23
Excel’s regression tool

24
Scatter chart of residuals (e) and values of the
independent variable (xi)

25
Excel’s regression tool

26
27
Scatter chart of residuals (e) and predicted
values of the dependent variable (ŷ)

28
Inference and regression
■ Testing individual regression parameters
– T-test
– To determine whether statistically significant relationships exist
between the dependent variable y and each of the independent
variables
– If βj=0, there is no linear relationship between the dependent
variable y and the independent variable xj
– If βj≠0, there is a linear relationship between y and xj

29
Inference and regression
■ Use a t test to test the hypothesis that a regression parameter equals zero
– Sbj is the estimated standard deviation of bj
– As the magnitude of t increases (as t deviates from zero in either
direction), we are more likely to reject the hypothesis that the
regression parameter βj=0

tSTAT = (bj − 0)/Sbj   (df = n − k − 1)

30
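Using the Miles coefficient and standard error from the Excel output shown in these notes, the test statistic is a one-line computation:

```python
# Coefficient and standard error for Miles from the Excel output
b1  = 0.06718172
sb1 = 0.002454979

# Tests H0: beta_1 = 0 with df = n - k - 1 = 297
t_stat = (b1 - 0) / sb1   # matches the 27.3655 reported on the slides
```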
[Excel output: sample size n, coefficients bj, and standard errors Sbj]

31
32
H0: βj = 0;  H1: βj ≠ 0
d.f. = 300 − 2 − 1 = 297, α = 0.05
Critical value (Excel): =T.INV.2T(0.05,297) → tα/2 = 1.97

From the Excel output:
For Miles, tSTAT = 27.3655, with p-value < 0.0001
For Deliveries, tSTAT = 23.3731, with p-value < 0.0001

The test statistic for each variable falls in the rejection region (|tSTAT| > 1.97; p-values < 0.05).
Decision: reject H0 for each variable.
Conclusion: there is evidence that both Miles and Deliveries affect travel time at α = 0.05.

33
34
35
H0: βj = 0;  H1: βj ≠ 0
d.f. = 300 − 2 − 1 = 297, α = 0.01
Critical value (Excel): =T.INV.2T(0.01,297) → tα/2 = 2.59

From the Excel output:
For Miles, tSTAT = 27.3655, with p-value < 0.0001
For Deliveries, tSTAT = 23.3731, with p-value < 0.0001

The test statistic for each variable falls in the rejection region (|tSTAT| > 2.59; p-values < 0.01).
Decision: reject H0 for each variable.
Conclusion: there is evidence that both Miles and Deliveries affect travel time at α = 0.01.

36
P-value
■ P-value
– The probability of obtaining a test statistic equal to or more
extreme (< or >) than the observed sample value given H0 is true
– H0 is there is no linear relationship between the dependent
variable y and the independent variable
– The p-value is also called the observed level of significance
– Smallest value of α for which H0 can be rejected
■ Compare the p-value with α
– If p-value < α, reject H0
– If p-value ≥ α, do not reject H0
– If the p-value is low then H0 must go

37
Excel: =T.DIST.2T(D18,297) returns the two-sided p-value for tSTAT = 27.3655; the statistic lies far beyond ±tα/2, deep in the rejection region.

38
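Excel's T.DIST.2T gives the exact two-sided p-value from the t distribution. With 297 degrees of freedom the t distribution is very close to the standard normal, so a sketch with Python's standard library can approximate it (an approximation, not Excel's exact computation):

```python
from statistics import NormalDist

t_stat = 27.3655

# Two-sided p-value: probability of a statistic at least this extreme
# under H0, approximated with the standard normal (df = 297 is large)
p_approx = 2 * (1 - NormalDist().cdf(abs(t_stat)))
```

The result is effectively zero, consistent with the "p-value < 0.0001" reported on the slides.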
Inference and regression
■ Confidence interval
– An estimate of a population parameter that provides an interval
believed to contain the value of the parameter at some level of
confidence

bj ± tα/2 Sbj

■ Confidence level
– Indicates how frequently interval estimates based on samples of
the same size taken from the same population using identical
sampling techniques will contain the true value of the parameter
we are estimating
– 1 - α (level of significance)
39
[Excel output: coefficients bj and standard errors Sbj]

40
[Excel output: n, k, and the confidence level used]

41
For Miles, upper 95%
=0.06718172+1.968*0.002454979=0.0720

Lower 95%
=0.06718172-1.968*0.002454979=0.0624

bj ± tα/2 Sbj  →  0.0624 ≤ β1 ≤ 0.0720
You have 95% confidence that this interval correctly estimates the
relationship between these variables.

From a hypothesis-testing viewpoint, because this confidence interval does
not include 0, you can conclude that the regression coefficient (β1) has a
significant effect.
42
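The 95% interval for the Miles coefficient worked above can be reproduced directly:

```python
# Miles coefficient, its standard error, and t_{alpha/2} for 95% confidence
b1, sb1 = 0.06718172, 0.002454979
t_crit = 1.968   # =T.INV.2T(0.05, 297) in Excel

lower = b1 - t_crit * sb1   # ~0.0624
upper = b1 + t_crit * sb1   # ~0.0720
```

Because zero lies outside [lower, upper], the interval leads to the same conclusion as the t test at α = 0.05.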
Inference and regression
■ Testing for an overall regression relationship
■ Use an F test based on the F probability distribution
– H0: β1=β2=…= βq=0 (no linear relationship)
– H1: at least one βi ≠ 0 (at least one independent variable affects y)

FSTAT = MSR/MSE = (SSR/k) / (SSE/(n − k − 1))
43
P-value for the F Test

FSTAT = MSR/MSE = (SSR/k) / (SSE/(n − k − 1))

FSTAT = [915.5160626/2]/[204.5871374/(300-2-1)]=457.7580313/0.68884558=664.5292419

44
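The FSTAT arithmetic above, step by step:

```python
# Sums of squares from the regression output
ssr, sse = 915.5160626, 204.5871374
n, k = 300, 2

msr = ssr / k             # mean square regression = 457.758...
mse = sse / (n - k - 1)   # mean square error = 0.6888...
f_stat = msr / mse        # ~664.53, as on the slide
```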
H0: β1 = β2 = 0
H1: β1 and β2 not both zero
α = 0.05 and α = 0.01;  df1 = 2, df2 = 297

Critical values (Excel):
=F.INV.RT(0.05,2,297) → F0.05 = 3.03
=F.INV.RT(0.01,2,297) → F0.01 = 4.68

Since the FSTAT test statistic falls in the rejection region at both levels, reject H0. There is evidence that at least one independent variable affects y.
45
46
47
Inference and regression
■ Non-significant independent variables
– If practical experience dictates that the non-significant independent variable
has a relationship with the dependent variable, the independent variable
should be kept in the model
– If the model sufficiently explains the dependent variable without the non-significant
independent variable, then consider rerunning the regression
without the non-significant independent variable (results may change)
– The appropriate treatment of the inclusion or exclusion of the y-intercept when
b0 is not statistically significant may require special consideration
– Regression through the origin should not be forced unless there are strong a
priori reasons for believing that the dependent variable is equal to zero when
the values of all independent variables in the model are equal to zero

48
Categorical independent variables

■ Butler Trucking Company and Rush Hour


– Dependent variable: travel time (y)
– Independent variables: miles traveled (x1) and number of
deliveries (x2)
– Categorical variable/dummy variable: rush hour (x3)
■ x3=0 if an assignment did not include travel on the congested
segment of highway during afternoon rush hour
■ x3=1 if an assignment included travel on the congested
segment of highway during afternoon rush hour

49
Categorical independent variables

ei = yi − ŷi (a positive residual means the actual value is larger than the predicted value)

50
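The residual definition above is a direct subtraction; using the actual and predicted travel times for the first assignment from the table on slide 12:

```python
# Actual and predicted travel times for assignment 1 (from slide 12)
y_actual = 9.3
y_hat = 9.6055

e = y_actual - y_hat   # negative: the model over-predicted this trip
```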
Categorical independent variables

ŷ = −0.3302 + 0.0672x1 + 0.6735x2 + 0.9980x3


51
Categorical independent variables
■ The model estimates that travel time increases by
– 0.0672 hour for every increase of 1 mile traveled, holding constant the number
of deliveries and whether the driving assignment route requires the driver to
travel on the congested segment of a highway during the afternoon rush hour
period
– 0.6735 hour for every delivery, holding constant the number of miles traveled
and whether the driving assignment route requires the driver to travel on the
congested segment of a highway during the afternoon rush hour period
– 0.9980 hour if the driving assignment route requires the driver to travel on the
congested segment of a highway during the afternoon rush hour period, holding
constant the number of miles traveled and the number of deliveries
■ r2=0.8838 indicates that the regression model explains approximately 88.38% of the
variability in travel time for the driving assignments in the sample

52
Categorical independent variables

■ When x3 = 0: ŷ = −0.3302 + 0.0672x1 + 0.6735x2

■ When x3 = 1: ŷ = (−0.3302 + 0.9980) + 0.0672x1 + 0.6735x2 = 0.6678 + 0.0672x1 + 0.6735x2
53
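The dummy variable's effect is easy to see in code: the two cases above differ only by the constant b3. A sketch using the notes' coefficients and a hypothetical 100-mile, 4-delivery assignment:

```python
# Coefficients from the notes' rush-hour model
b0, b1, b2, b3 = -0.3302, 0.0672, 0.6735, 0.9980

def predict(miles, deliveries, rush_hour):
    """Estimated mean travel time; rush_hour is the 0/1 dummy x3."""
    return b0 + b1 * miles + b2 * deliveries + b3 * rush_hour

# Hypothetical assignment, with and without rush-hour travel
off_peak = predict(100, 4, 0)
rush     = predict(100, 4, 1)
```

The two predictions differ by exactly b3 = 0.9980 hours, which is how a dummy variable shifts the intercept without changing the slopes.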
Categorical independent variables

■ If a categorical variable has k levels, k-1 dummy variables are required


■ Suppose a manufacturer of vending machines organized the sales
territories for a particular state into three regions: A, B, and C
■ Suppose the managers believe sales region is one of the important
factors in predicting the number of units sold

54
Categorical independent variables

ŷ = b0 + b1x1 + b2x2
Region A: ŷ = b0 + b1(0) + b2(0) = b0
Region B: ŷ = b0 + b1(1) + b2(0) = b0 + b1
Region C: ŷ = b0 + b1(0) + b2(1) = b0 + b2

55
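The three-region coding can be sketched as a lookup of k − 1 = 2 dummies, with Region A as the baseline (all coefficient values below are hypothetical, just to exercise the coding):

```python
# Dummy coding for a 3-level categorical variable: Region A is the
# baseline, so both dummies are zero for it
REGION_DUMMIES = {"A": (0, 0), "B": (1, 0), "C": (0, 1)}

def predict_units(b0, b1, b2, region):
    """Estimated mean units sold for a territory in the given region."""
    x1, x2 = REGION_DUMMIES[region]
    return b0 + b1 * x1 + b2 * x2
```

For example, with hypothetical estimates b0 = 10, b1 = 3, b2 = −2, the predictions are b0 for Region A, b0 + b1 for Region B, and b0 + b2 for Region C, matching the table above.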