Multiple Regression
Multiple Regression
STATISTICAL METHODS II
Slide 1
Topic 6B
Multiple Regression
Multiple Regression Model
Least Squares Method
Multiple Coefficient of Determination
Model Assumptions
Testing for Significance
Using the Estimated Regression Equation
for Estimation and Prediction
Slide 2
Multiple Regression
Slide 3
Multiple Regression Model
where:
β0, β1, β2, . . . , βp are the parameters, and
ε is a random variable called the error term
Slide 4
Multiple Regression Equation
Slide 5
Estimated Multiple Regression Equation
Slide 6
Estimation Process
Estimated Multiple
b0, b1, b2, . . . , bp Regression Equation
provide estimates of
β0, β1, β2, . . . , βp Sample statistics are
b0, b1, b2, . . . , bp
Slide 7
Least Squares Method
Slide 8
Least Squares Method
Slide 9
Multiple Regression Model
Slide 10
Multiple Regression Model
y = β0 + β1x1 + β2x2 + ε
where
y = annual salary ($000)
x1 = years of experience
x2 = score on programmer aptitude test
Slide 12
Solving for the Estimates of β0, β1, β2
Least Squares
Input Data Output
x1 x2 y Computer b0 =
Package b1 =
4 78 24
for Solving b2 =
7 100 43
. . . Multiple
R2 =
. . . Regression
3 89 30 Problems etc.
Slide 13
Solving for the Estimates of β0, β1, β2
Slide 14
Estimated Regression Equation
Slide 15
Interpreting the Coefficients
Slide 16
Interpreting the Coefficients
b1 = 1.404
Slide 17
Interpreting the Coefficients
b2 = 0.251
Slide 18
Multiple Coefficient of Determination
= +
where:
SST = total sum of squares
SSR = sum of squares due to regression
SSE = sum of squares due to error
Slide 19
Multiple Coefficient of Determination
ANOVA Output
Analysis of Variance
SOURCE DF SS MS F p
Regression 2 500.3285 250.164 42.76 0.000
Residual Error 17 99.45697 5.850
Total 19 599.7855
SSR
SST
Slide 20
Multiple Coefficient of Determination
R2 = SSR/SST
R2 = 500.3285/599.7855 = .83418
Slide 21
Adjusted Multiple Coefficient
of Determination
Adding independent variables, even ones that are
not statistically significant, causes the prediction
errors to become smaller, thus reducing the sum of
squares due to error, SSE.
Because SSR = SST – SSE, when SSE becomes smaller,
SSR becomes larger, causing R2 = SSR/SST to
increase.
The adjusted multiple coefficient of determination
compensates for the number of independent
variables in the model.
Slide 22
Adjusted Multiple Coefficient
of Determination
Slide 23
Assumptions About the Error Term ε
Slide 24
Testing for Significance
Slide 25
Testing for Significance: F Test
Slide 26
Testing for Significance: t Test
Slide 27
Testing for Significance: F Test
Hypotheses H0: β1 = β2 = . . . = βp = 0
Ha: One or more of the parameters
is not equal to zero.
Slide 28
F Test for Overall Significance
Hypotheses H0: β1 = β2 = 0
Ha: One or both of the parameters
is not equal to zero.
Slide 29
F Test for Overall Significance
ANOVA Output
Analysis of Variance
SOURCE DF SS MS F p
Regression 2 500.3285 250.164 42.76 0.000
Residual Error 17 99.45697 5.850
Total 19 599.7855
Slide 30
F Test for Overall Significance
Slide 31
Testing for Significance: t Test
Hypotheses
Test Statistics
Slide 32
t Test for Significance
of Individual Parameters
Hypotheses
Slide 33
t Test for Significance
of Individual Parameters
Regression Equation Output
Slide 34
t Test for Significance
of Individual Parameters
Test Statistics
Slide 35
End of Topic 6, Part B
Slide 36