Multiple Regression
Multiple Regression
Introduction
X + 2X 2 + X2
+ 1 1
+ X
1 1
2
Y = Y = 0 X1
0
H0: 1 = 2 = … = k = 0
H1: At least one i is not equal to zero.
ANOVA
df SS MS F Significance F
Regression k = 6 3123.8 520.6 17.14 0.0000
Residual n–k–1 = 93 2825.6 30.4
Total n-1 = 99 5949.5
SSR MSR=SSR/k
SSE MSE=SSE/(n-k-1)
F,k,n-k-1 = F0.05,6,100-6-1=2.17
F = 17.14 > 2.17
Conclusion:
Conclusion:There Thereisissufficient
sufficientevidence
evidencetotoreject
reject
the
thenullnullhypothesis
hypothesisininfavor favorofofthe
thealternative
alternativehypothesis:
hypothesis:
atatleast
leastone thei isi isnot
oneofofthe notequal
equaltotozero.
zero.Thus,
Thus,atatleast
least
one
oneindependent
independentvariable
variableisislinearly
linearlyrelated
relatedtotoY.Y.
This
Thislinear
linearregression
regressionmodel modelisisvalid
valid
Interpreting the Coefficients
• b0 = 38.14. This is the intercept, the value of Y when all the
variables take the value zero. Since the data range of all the
independent variables do not cover the value zero, do not interpret
the intercept.
• b1 = – 0.0076. In this model, for each additional room within 3 mile
of the La Quinta inn, the operating margin decreases on average
by .0076% (assuming the other variables are held constant).
• b2 = 1.65. In this model, for each additional mile that the nearest
competitor is to a La Quinta inn, the operating margin increases
on average by 1.65%, when the other variables are held constant.
• b3 = 0.020. For each additional 1000 sq-ft of office space, the
operating margin will increase on average by .02%, when the
other variables are held constant.
• b4 = 0.21. For each additional thousand students, the operating
margin increases on average by .21%, when the other variables
are held constant.
• b5 = 0.41. For each increment of $1000 in median
household income, the operating margin would increase
on average by .41%, when the other variables remain
constant.
• b6 = -0.23. For each additional mile to the downtown
center, the operating margin decreases on average
by .23%, when the other variables are held constant.
Testing Individual Coefficients
• The hypothesis for each i is:
H0 : i 0 Test statistic
H1 : i 0 b i
t i d.f. = n - k -1
sb i
• Excel output:
Insufficient Evidence
La Quinta Inns, Point Estimate
Xm18-01
• Predict the average operating margin of an inn at a site
with the following characteristics:
– 3815 rooms within 3 miles,
– Closet competitor .9 miles away,
– 476,000 sq-ft of office space,
– 24,500 college students,
– $35,000 median household income,
– 11.2 miles distance to downtown center.
MARGIN = 38.14 - 0.0076 (3815) +1.65 (.9) + 0.020 (476)
+0.21 (24.5) + 0.41 (35) - 0.23 (11.2) = 37.1%
Regression Diagnostics
• The conditions required for the model
assessment to apply must be checked.
– Is the error variable normally
distributed? Draw a histogram of the residuals
– Is the error variance constant? Plot the residuals versus the
predicted values of Y
– Are the errors independent? Plot the residuals versus the
time periods
– Can we identify outlier?
– Is multicolinearity (correlation between the Xi’s) a problem?