
HW6_solution

Yao Song

12/9/2020

Q 9.2

# import dataset
dat1 <- read.csv("data-table-B21.csv", header = T, sep = ",") # 13 obs. of 6 var.
dat1 <- dat1[, -1]

a From the matrix of correlations between the regressors, would you suspect that multicollinearity is present?
corr <- cor(dat1[, 2:5])
corr

##            x_1        x_2        x_3        x_4
## x_1 1.0000000 0.2285795 -0.8241338 -0.2454451
## x_2 0.2285795 1.0000000 -0.1392424 -0.9729550
## x_3 -0.8241338 -0.1392424 1.0000000 0.0295370
## x_4 -0.2454451 -0.9729550 0.0295370 1.0000000
From the correlation matrix of the regressors x1, x2, x3, x4, we see high correlations for the pairs (x2, x4) and (x1, x3). Thus the correlation matrix indicates near-linear dependencies in the Hald cement data, i.e., multicollinearity appears to be present.

b Calculate the variance inflation factors.

$$\mathrm{VIF}_j = C_{jj} = (1 - R_j^2)^{-1},$$

where C = (X'X)^{-1} with X'X in correlation form, so the VIFs are the diagonal elements of the inverse of the correlation matrix. The variance inflation factor (VIF) for each regression coefficient is calculated below.
diag(solve(corr))

##       x_1       x_2       x_3       x_4
## 38.49621 254.42317 46.86839 282.51286
Since all four VIFs are greater than 10, this indicates that the associated regression coefficients are poorly estimated because of multicollinearity.
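As a cross-check (a minimal sketch, assuming dat1 keeps the column names x_1 through x_4 shown in the output above), the same values can be recovered directly from the definition VIF_j = (1 − R_j^2)^{−1} by regressing each regressor on the remaining ones:
# Sketch: recompute each VIF from the R^2 of regressing x_j on the other
# regressors; column names are assumed from the correlation output above.
xnames <- c("x_1", "x_2", "x_3", "x_4")
sapply(xnames, function(v) {
  r2 <- summary(lm(reformulate(setdiff(xnames, v), response = v), data = dat1))$r.squared
  1 / (1 - r2)  # should match diag(solve(corr))
})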

c Find the eigenvalues of X'X.
By using eigen function in R to do eigen decomposition, we get eigenvalues as the following.
eigendom <- eigen(corr)
value <- eigendom$values
value

## [1] 2.235704035 1.576066070 0.186606149 0.001623746

d Find the condition number of X'X.
k <- max(value)/min(value)

The condition number of X'X is

$$\kappa = \frac{\lambda_{\max}}{\lambda_{\min}},$$

where λmax is the largest eigenvalue of X'X and λmin is the smallest eigenvalue of X'X. The condition number of X'X is 1376.8806. Because κ is larger than 1000, severe multicollinearity is indicated.
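As an optional cross-check (a sketch), base R's kappa() with exact = TRUE returns the ratio of the largest to smallest singular value, which for this symmetric positive-definite correlation matrix equals λmax/λmin:
# Sketch: singular values equal eigenvalues for the symmetric correlation
# matrix, so this should reproduce k computed above.
kappa(corr, exact = TRUE)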

Q 9.7

# import dataset
dat2 <- read.csv("data-table-B3.csv", header = T, sep = ",") # 32 obs. of 12 var.
dat2 <- dat2[-which(is.na(dat2$x3)), ]

a Does the correlation matrix give any indication of multicollinearity?


corr <- cor(dat2[, 2:12])
corr

## x1 x2 x3 x4 x5 x6
## x1 1.0000000 0.9408473 0.9891628 -0.34697246 -0.6720903 0.64279836
## x2 0.9408473 1.0000000 0.9643592 -0.28989951 -0.5509642 0.76141897
## x3 0.9891628 0.9643592 1.0000000 -0.32599915 -0.6728661 0.65312630
## x4 -0.3469725 -0.2898995 -0.3259992 1.00000000 0.4137808 0.03748643
## x5 -0.6720903 -0.5509642 -0.6728661 0.41378081 1.0000000 -0.21952829
## x6 0.6427984 0.7614190 0.6531263 0.03748643 -0.2195283 1.00000000
## x7 -0.7719151 -0.6259445 -0.7461800 0.55823570 0.8717662 -0.27563863
## x8 0.8623681 0.8027387 0.8641224 -0.30415026 -0.5613315 0.42206800
## x9 0.7974811 0.7105117 0.7881284 -0.37817358 -0.4534470 0.30038618
## x10 0.9515520 0.8878810 0.9434871 -0.35845879 -0.5798617 0.52036693
## x11 0.8244446 0.7086735 0.8012765 -0.44054570 -0.7546650 0.39548928
## x7 x8 x9 x10 x11
## x1 -0.7719151 0.8623681 0.7974811 0.9515520 0.8244446
## x2 -0.6259445 0.8027387 0.7105117 0.8878810 0.7086735
## x3 -0.7461800 0.8641224 0.7881284 0.9434871 0.8012765
## x4 0.5582357 -0.3041503 -0.3781736 -0.3584588 -0.4405457
## x5 0.8717662 -0.5613315 -0.4534470 -0.5798617 -0.7546650
## x6 -0.2756386 0.4220680 0.3003862 0.5203669 0.3954893
## x7 1.0000000 -0.6552065 -0.6551300 -0.7058126 -0.8506963
## x8 -0.6552065 1.0000000 0.8831512 0.9554541 0.6824919
## x9 -0.6551300 0.8831512 1.0000000 0.8994711 0.6326677
## x10 -0.7058126 0.9554541 0.8994711 1.0000000 0.7530353
## x11 -0.8506963 0.6824919 0.6326677 0.7530353 1.0000000
From the correlation matrix of the regressors, we find that many off-diagonal elements are close to 1 in absolute value, which indicates that there may be several near-linear dependencies in the gasoline mileage data.
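To make these dependencies explicit, here is a small sketch (the 0.9 cutoff is an arbitrary illustrative choice) that lists the regressor pairs with |r| > 0.9:
# Sketch: extract the regressor pairs whose absolute correlation exceeds 0.9.
idx <- which(abs(corr) > 0.9 & upper.tri(corr), arr.ind = TRUE)
data.frame(pair = paste(rownames(corr)[idx[, 1]], colnames(corr)[idx[, 2]], sep = " & "),
           r = round(corr[idx], 3))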

b Calculate the variance inflation factors and the condition number of X'X. Is there any evidence of multicollinearity?
The variance inflation factors are shown in the following.
diag(solve(corr))

## x1 x2 x3 x4 x5 x6 x7
## 119.487804 42.800811 149.234409 2.060036 7.729187 5.324730 11.761341
## x8 x9 x10 x11
## 20.917632 9.397108 85.744344 5.145052
Since the VIFs for x1, x2, x3, x7, x8, and x10 are greater than 10, there is strong evidence of multicollinearity.
The condition number of X'X is shown below.
eigendom <- eigen(corr)
value <- eigendom$values
k <- max(value)/min(value)

Because the condition number κ is 2025.2393, which is larger than 1000, severe multicollinearity is indicated.
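Beyond the single condition number, the condition indices λmax/λj can also be examined (a sketch; by the same rule of thumb, the number of indices above 1000 suggests how many near-linear dependencies are present):
# Sketch: condition indices lambda_max / lambda_j for the correlation matrix.
round(max(value) / value, 1)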

Q 10.5

a Use the all-possible-regressions approach to find an appropriate regression model.


library(leaps)  # regsubsets() for all-possible-regressions
best <- regsubsets(y ~ ., data = dat2)
sumbest <- summary(best)
sumbest

## Subset selection object
## Call: regsubsets.formula(y ~ ., data = dat2)
## 11 Variables (and intercept)
## Forced in Forced out
## x1 FALSE FALSE
## x2 FALSE FALSE
## x3 FALSE FALSE
## x4 FALSE FALSE
## x5 FALSE FALSE
## x6 FALSE FALSE
## x7 FALSE FALSE
## x8 FALSE FALSE
## x9 FALSE FALSE
## x10 FALSE FALSE
## x11 FALSE FALSE
## 1 subsets of each size up to 8
## Selection Algorithm: exhaustive
## x1 x2 x3 x4 x5 x6 x7 x8 x9 x10 x11
## 1 ( 1 ) "*" " " " " " " " " " " " " " " " " " " " "
## 2 ( 1 ) "*" " " " " "*" " " " " " " " " " " " " " "
## 3 ( 1 ) " " " " " " " " "*" " " " " "*" " " "*" " "
## 4 ( 1 ) " " " " " " " " "*" " " " " "*" "*" "*" " "
## 5 ( 1 ) " " " " " " " " "*" " " "*" "*" "*" "*" " "
## 6 ( 1 ) " " " " " " "*" "*" " " "*" "*" "*" "*" " "
## 7 ( 1 ) "*" " " "*" " " "*" " " "*" "*" "*" "*" " "
## 8 ( 1 ) "*" "*" "*" " " "*" " " "*" "*" "*" "*" " "

plot(sumbest$adjr2, main = "AdjR2 vs. p", xlab = "p")
abline(h = max(sumbest$adjr2))

[Figure: AdjR2 vs. p — plot of sumbest$adjr2 against subset size p, with a horizontal reference line at the maximum adjusted R².]
From the plot of R²_Adj,p versus p, we can conclude that the model involving x5, x8, and x10 may be an appropriate model, with the largest R²_Adj,p = 0.781.
plot(sumbest$cp, main = "Cp vs. p", xlab = "p")
abline(h = min(sumbest$cp))

[Figure: Cp vs. p — plot of sumbest$cp against subset size p, with a horizontal reference line at the minimum Cp.]
The plot of Cp versus p also indicates that it may be appropriate to choose the model involving x5, x8, and x10, because it has the smallest Cp = −0.502.
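For reference, a small sketch tabulating both criteria for the best subset of each size, using the summary object already computed above:
# Sketch: adjusted R^2 and Cp for the best subset of each size p.
data.frame(size = seq_along(sumbest$cp),
           adjr2 = round(sumbest$adjr2, 3),
           cp = round(sumbest$cp, 3))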
fit3 <- lm(y ~ x5 + x8 + x10, data = dat2)
summary(fit3)

##
## Call:
## lm(formula = y ~ x5 + x8 + x10, data = dat2)
##
## Residuals:
## Min 1Q Median 3Q Max
## -4.6101 -1.9868 -0.6613 2.0369 5.8811
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 4.590404 11.771925 0.390 0.6998
## x5 2.597240 1.264562 2.054 0.0502 .
## x8 0.217814 0.087817 2.480 0.0199 *
## x10 -0.009485 0.001994 -4.757 6.38e-05 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 2.934 on 26 degrees of freedom
## Multiple R-squared: 0.8035, Adjusted R-squared: 0.7808
## F-statistic: 35.44 on 3 and 26 DF, p-value: 2.462e-09
Thus, the appropriate model relating y to x5 , x8 and x10 is

ŷ = 4.5904 + 2.5972x5 + 0.2178x8 − 0.0095x10 .

b Use stepwise regression to specify a subset regression model. Does this lead to the same model found in
part a?
fit31 <- lm(y ~ ., data = dat2)
select <- step(fit31, direction="both", trace = 0)
summary(select)

##
## Call:
## lm(formula = y ~ x5 + x8 + x10, data = dat2)
##
## Residuals:
## Min 1Q Median 3Q Max
## -4.6101 -1.9868 -0.6613 2.0369 5.8811
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 4.590404 11.771925 0.390 0.6998
## x5 2.597240 1.264562 2.054 0.0502 .
## x8 0.217814 0.087817 2.480 0.0199 *
## x10 -0.009485 0.001994 -4.757 6.38e-05 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 2.934 on 26 degrees of freedom
## Multiple R-squared: 0.8035, Adjusted R-squared: 0.7808
## F-statistic: 35.44 on 3 and 26 DF, p-value: 2.462e-09
The model from stepwise variable selection is the same as the one found by the all-possible-regressions approach:

ŷ = 4.5904 + 2.5972x5 + 0.2178x8 − 0.0095x10 .

Therefore, there is strong evidence that the model relating y to x5, x8, and x10 is an appropriate model for the gasoline mileage performance data.
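As an optional robustness check (a sketch, not part of the original solution), forward selection from the intercept-only model can be run over the same scope; with the default AIC criterion it is expected, though not guaranteed, to arrive at a similar subset:
# Sketch: forward selection starting from the intercept-only model.
fit0 <- lm(y ~ 1, data = dat2)
forward <- step(fit0, scope = formula(fit31), direction = "forward", trace = 0)
formula(forward)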

Q 10.8

Use the all-possible-regressions method to select a subset regression model for the Belle Ayr liquefaction data
in Table B.5. Evaluate the subset models using the Cp criterion. Justify your choice of final model using the
standard checks for model adequacy.
# import dataset
dat4 <- read.csv("data-table-B5.csv", header = T, sep = ",") # 27 obs. of 8 var.

best <- regsubsets(y ~ ., data = dat4)


sumbest <- summary(best)
sumbest

## Subset selection object
## Call: regsubsets.formula(y ~ ., data = dat4)
## 7 Variables (and intercept)
## Forced in Forced out
## x1 FALSE FALSE
## x2 FALSE FALSE
## x3 FALSE FALSE
## x4 FALSE FALSE

## x5 FALSE FALSE
## x6 FALSE FALSE
## x7 FALSE FALSE
## 1 subsets of each size up to 7
## Selection Algorithm: exhaustive
## x1 x2 x3 x4 x5 x6 x7
## 1 ( 1 ) " " " " " " " " " " "*" " "
## 2 ( 1 ) " " " " " " " " " " "*" "*"
## 3 ( 1 ) " " " " " " "*" " " "*" "*"
## 4 ( 1 ) " " " " "*" "*" " " "*" "*"
## 5 ( 1 ) " " "*" "*" "*" " " "*" "*"
## 6 ( 1 ) "*" "*" "*" "*" " " "*" "*"
## 7 ( 1 ) "*" "*" "*" "*" "*" "*" "*"
plot(sumbest$adjr2, main = "AdjR2 vs. p", xlab = "p")
abline(h = max(sumbest$adjr2))

[Figure: AdjR2 vs. p — plot of sumbest$adjr2 against subset size p, with a horizontal reference line at the maximum adjusted R².]
From the plot of R²_Adj,p versus p, we can conclude that the model involving x6 and x7 may be an appropriate model, with the largest R²_Adj,p = 0.675.
plot(sumbest$cp, main = "Cp vs. p", xlab = "p")
abline(h = min(sumbest$cp))

[Figure: Cp vs. p — plot of sumbest$cp against subset size p, with a horizontal reference line at the minimum Cp.]
The plot of Cp versus p also indicates that it may be appropriate to choose the model involving x6 and x7, because it has the smallest Cp = −0.021.
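The coefficients of the Cp-best subset can also be pulled directly from the regsubsets object (a sketch using the coef method provided by leaps):
# Sketch: coefficients of the subset with the smallest Cp.
coef(best, id = which.min(sumbest$cp))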
fit4 <- lm(y ~ x6 + x7, data = dat4)
summary(fit4)

##
## Call:
## lm(formula = y ~ x6 + x7, data = dat4)
##
## Residuals:
## Min 1Q Median 3Q Max
## -23.2035 -4.3713 0.2513 4.9339 21.9682
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 2.526460 3.610055 0.700 0.4908
## x6 0.018522 0.002747 6.742 5.66e-07 ***
## x7 2.185753 0.972696 2.247 0.0341 *
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 9.924 on 24 degrees of freedom
## Multiple R-squared: 0.6996, Adjusted R-squared: 0.6746
## F-statistic: 27.95 on 2 and 24 DF, p-value: 5.391e-07
The linear model relating y to x6 and x7 is

ŷ = 2.5265 + 0.0185x6 + 2.1858x7 .

Since the F statistic is 27.95 with a p-value of 5.391e-07, which is far below the 0.05 significance level, we reject H0 : β6 = β7 = 0 and conclude that there is a linear relationship between y and at least one of the regressors x6, x7. The R² of the model is 0.6996, indicating that 69.96% of the total variability in y is explained by this model; the adjusted R² is 0.6746.
fit4_std <- rstandard(fit4)
qqnorm(fit4_std, ylab="Standardized Residuals", xlab="Normal Scores")
qqline(fit4_std, col = 2)

[Figure: Normal Q-Q plot of the standardized residuals (Normal Scores on the x-axis, Standardized Residuals on the y-axis) with a reference line.]
From the QQ plot, although there are some deviations from normality in the tails, the standardized residuals are approximately consistent with the normality assumption.
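A formal test can complement the visual check (a sketch; the Shapiro-Wilk test is only one of several options):
# Sketch: Shapiro-Wilk test on the standardized residuals; a large p-value
# would be consistent with the normality assumption.
shapiro.test(fit4_std)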
plot(fit4$fitted.values, fit4$residuals, ylab = 'Residuals',
     xlab = 'Fitted Values', main = 'Residuals vs Fitted')
abline(0, 0)

[Figure: Residuals vs Fitted — residuals of fit4 plotted against fitted values, with a horizontal line at zero.]
There is no obvious pattern in the residuals versus fitted values plot, which suggests that the linearity and constant-variance assumptions are reasonable. Therefore, these results indicate that the model with the two regressors x6 and x7 is adequate.
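As an additional adequacy check (a sketch, not part of the original solution), the PRESS statistic and the corresponding prediction R² can be computed from the hat values:
# Sketch: PRESS statistic and prediction R^2 for the chosen model.
press <- sum((residuals(fit4) / (1 - lm.influence(fit4)$hat))^2)
sst <- sum((dat4$y - mean(dat4$y))^2)
c(PRESS = press, R2_prediction = 1 - press / sst)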

