
MS3252 Midterm Practice Questions

I. True or False (2pts each)


Note: If a statement is not always true, it is regarded as a “false” statement.
1. A useful tool for assessing the appropriateness of model assumptions is a residuals
versus fitted values plot; if the model assumptions hold, this should resemble a null
plot (i.e. the data scatters around the zero line randomly).
TRUE

2. To study the relationship between cholesterol and patient height and weight, researchers
consider a regression model E(Y | X) = β0 + β1 X, where Y = LDL cholesterol in mg/dL,
and X = BMI = weight in kg / (height in m)²; in the terminology of this course, BMI is
the predictor, and height and weight are the two independent variables.
FALSE

3. If the coefficient of determination for the regression y∼x1 is 0.60, so 60% of the
variation in y is explained by its linear association with x1, then the coefficient of
determination for the regression y∼x1+x2 will be at least 0.60.
TRUE

4. Comparing the two regression models y∼x1 and y∼x1+x2, their SST are different.
FALSE

5. The sum of squares due to regression, SSR = SST − SSE = Σᵢ₌₁ⁿ (Ŷi − Ȳ)², represents
the variation in y that remains unexplained after accounting for the relationship
between y and x hypothesized by the regression model.
FALSE

6. Consider a regression model E(Y | X1, X2, X3) = β0 + β1 X1 + β2 X2 + β3 X3. If all of
r12, r13 and r23 are not that high, where rij denotes the pairwise correlation between
Xi and Xj, then we can conclude that no multicollinearity exists.
FALSE

7. Consider testing the hypothesis H0 : Reduced Model versus H1 : Full Model based
on the test statistic F = [(SSEreduced − SSEfull)/(dfreduced − dffull)] / [SSEfull/dffull].
The denominator degrees of freedom is the residual degrees of freedom for the full model,
while the numerator degrees of freedom is the number of additional parameters in the full
model over the reduced model.
TRUE

8. Consider a regression model of y∼x1, testing whether or not the slope coefficient
is zero is equivalent to testing whether or not the correlation coefficient between y
and x1 is zero.
TRUE

9. In a linear regression model, the variance function is assumed to be a constant
function of the response.
FALSE

10. For linear regression, the adjusted R2 is non-decreasing when more predictors are
added to the model, because more variation of the response is being explained.
FALSE

II. Problem Solving
Note: Show your steps! You may lose points for not justifying your answers. If the desired
degree of freedom is not available in the t or F tables, round it off to the closest available.
1. In an experiment, a metal ball is released from rest from different heights near the
ground surface and allowed to free fall in vacuum. The durations (in s) taken for the
ball to reach the ground (time) and its speeds (in m/s) when it reaches the ground
(speed) are recorded. The sample variance of speed is 62.45004 and that of time
is 0.6507599, while the correlation between them is 0.9930025. The R output of the
linear regression model speed∼time is given below, but some figures are missing.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 0.3645 0.5389 0.676 0.5
time ___A__ ___B__ ___C__ ______

Residual standard error: 0.9379 on 99 degrees of freedom


Multiple R-squared: ___D__, Adjusted R-squared: ___E__
F-statistic: ___F__ on 1 and 99 DF, p-value: ______
(a) What is the sample size of this data?
n = 101.

(b) Find the values of A, B, and C.



A = b1 = r (SY / SX) = 0.9930025 √(62.45004 / 0.6507599) = 9.727614.

B = Sb1 = Se / √((n − 1) SX²) = 0.9379 / √(100 × 0.6507599) = 0.1162642.

C = b1 / Sb1 = 9.727614 / 0.1162642 = 83.66818.
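
These figures are easy to check numerically; the following is a minimal R sketch that reproduces A, B and C using only the sample statistics quoted in the question (nothing else is assumed).

n  <- 101
r  <- 0.9930025
vY <- 62.45004      # sample variance of speed
vX <- 0.6507599     # sample variance of time
se <- 0.9379        # residual standard error

A <- r * sqrt(vY / vX)         # slope estimate b1, about 9.727614
B <- se / sqrt((n - 1) * vX)   # standard error of b1, about 0.1162642
C <- A / B                     # t value, about 83.66818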

(c) Interpret carefully the meaning of A in the model.


The estimated slope coefficient A equals 9.727614, which means that, on average,
the speed of the ball when it reaches the ground increases by 9.727614 m/s for
each additional 1 s of fall time.

(d) Find the values of D and E.


D = R² = r² = 0.9930025² = 0.986054.

E = adjusted R² = 1 − (1 − R²)(100/99) = 0.9859131.
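
The same two figures follow directly in R from r and the degrees of freedom (a quick sketch, again using only values stated above).

r <- 0.9930025
n <- 101
D <- r^2                               # 0.986054
E <- 1 - (1 - D) * (n - 1) / (n - 2)   # 0.9859131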

(e) Find the F-statistic of the model, i.e. the value of F. State the corresponding
null and alternative hypotheses, degrees of freedom, and conclude the test at
5% significance level.
• H0 : βtime = 0 vs H1 : βtime ≠ 0.
• The F-statistic is F = C² = 7000.364.
• With d.f. 1 and 99, the critical value is between 3.92 and 4.00.

• Reject the null, i.e. time is influential on speed.
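
For reference, the exact critical value can be obtained in R rather than bracketed from the table (a sketch; the bracket 3.92 to 4.00 is all the question requires).

Fstat <- 83.66818^2                   # C^2 = 7000.364
crit  <- qf(0.95, df1 = 1, df2 = 99)  # about 3.94, between 3.92 and 4.00
Fstat > crit                          # TRUE, so reject H0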

(f) Construct a 95% confidence interval for the slope coefficient of time.

b1 ± t0.025,99 Sb1 = 9.727614 ± 1.9842(0.1162642) = [9.496923, 9.958305].
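
A short R sketch of the same interval, using the exact t quantile in place of the tabulated 1.9842:

b1  <- 9.727614
sb1 <- 0.1162642
b1 + c(-1, 1) * qt(0.975, df = 99) * sb1   # roughly [9.4969, 9.9583]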

(g) Give a point estimate of the mean speed when reaching the ground for a journey
taking 5 s.
Ŷ = 0.3645 + 9.727614(5) = 49.00257(m/s).

(h) Given the sample mean of time is 4.564794, compute Sm² for time = 5. Hence,
find the 95% confidence interval for the above point estimate.

Sm² = Se² [1/n + (5 − X̄)² / ((n − 1) SX²)] = 0.01126972.
C.I. = Ŷ ± t0.025,99 Sm = [48.79193, 49.21321].
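
The variance of the estimated mean response and the interval can be verified in R as follows (a sketch using only the quantities given in the question).

n    <- 101
se   <- 0.9379
vX   <- 0.6507599
xbar <- 4.564794
yhat <- 0.3645 + 9.727614 * 5                           # 49.00257
Sm2  <- se^2 * (1 / n + (5 - xbar)^2 / ((n - 1) * vX))  # 0.01126972
yhat + c(-1, 1) * qt(0.975, df = 99) * sqrt(Sm2)        # roughly [48.792, 49.213]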

2. The R output of the linear regression model y∼x1 is given below.


Estimate Std. Error t value Pr(>|t|)
(Intercept) 0.9781 0.1405 6.963 1.43e-07
x1 -0.3204 0.1042 -3.076 0.00465

Residual standard error: 0.7691 on 28 degrees of freedom


Multiple R-squared: 0.2525, Adjusted R-squared: 0.2259
F-statistic: ____ on 1 and 28 DF, p-value: ________
Reproduce the following ANOVA table, i.e. find the values of c1 to c8.
Df Sum Sq Mean Sq F value Pr(>F)
x1 c1 c2 c3 c4 c5
Residuals c6 c7 c8

c1 = 1
c6 = 28
c5 = 0.00465
c4 = (−3.076)² = 9.461776
c8 = 0.7691² = 0.5915148
c7 = c6 × c8 = 16.56241
c3 = c4 × c8 = 5.596781
c2 = c1 × c3 = 5.596781
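
As a check, the whole ANOVA table can be rebuilt in R from the printed coefficient output alone (a sketch; only figures shown in the output are used).

t_x1 <- -3.076
se_resid <- 0.7691
df_model <- 1; df_resid <- 28

c4 <- t_x1^2            # F value             = 9.461776
c8 <- se_resid^2        # residual mean sq    = 0.5915148
c7 <- df_resid * c8     # residual sum of sq  = 16.56241
c3 <- c4 * c8           # regression mean sq  = 5.596781
c2 <- df_model * c3     # regression sum of sq = 5.596781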

3. The R output of the linear regression model y∼x1+x2 is given below.
Estimate Std. Error t value Pr(>|t|)
(Intercept) 0.98498 0.13363 7.371 6.28e-08
x1 -0.32333 0.09907 -3.264 0.00298
x2 0.39379 0.19797 1.989 0.05690

Residual standard error: 0.7315 on 27 degrees of freedom


Multiple R-squared: 0.3481, Adjusted R-squared: 0.2998
F-statistic: 7.208 on 2 and 27 DF, p-value: 0.003102
(a) What is the sample size?
n = 30.

(b) Write down the estimated regression equation.


ŷ = 0.98498 − 0.32333x1 + 0.39379x2.

(c) Interpret the estimated slope coefficient of x1 carefully.


We expect y to decrease by 0.32333 unit on average for each unit increase in
x1 while x2 remains unchanged.

(d) Test whether or not the slope coefficient of x1 is zero. State your hypotheses,
test statistic, degree of freedom, and conclusion at 5% significance level.
• H0 : β1 = 0 vs H1 : β1 ≠ 0, where β1 is the true slope coefficient of x1.
• The t-statistic is −3.264.
• The d.f. is 27, C.V. is 2.0518, and p-value is 0.00298.
• Reject the null, i.e. x1 is influential on y in the presence of x2.

(e) Construct a 95% confidence interval for the slope coefficient of x1.

b1 ± t0.025,27 Sb1 = −0.32333 ± 2.0518(0.09907) = [−0.5266018, −0.1200582].

(f) Give a point estimate of y for a new observation with x1= 1 and x2= −1.
Also, find the corresponding 95% confidence interval if Sp² = 0.6005.

ŷ = 0.98498 − 0.32333(1) + 0.39379(−1) = 0.26786.

Prediction Interval = ŷ ± t0.025,27 Sp = [−1.3221, 1.8578].
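
A short R sketch of the same prediction interval, assuming only the given Sp² = 0.6005:

yhat <- 0.98498 - 0.32333 * 1 + 0.39379 * (-1)   # 0.26786
Sp   <- sqrt(0.6005)
yhat + c(-1, 1) * qt(0.975, df = 27) * Sp        # roughly [-1.3221, 1.8578]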

(g) Perform a partial F-test between the models y∼x1 and y∼x1+x2. State your
hypotheses, test statistic, degree of freedom, and conclusion at 5% significance
level.
• H0 : β2 = 0 vs H1 : β2 ≠ 0, where β2 is the true slope coefficient of x2.
• The F-statistic is 1.989² = 3.956121.
• The d.f. are {1, 27}, C.V. is 4.21, and p-value is 0.05690.
• Do not reject the null, i.e. x2 is not influential on y in the presence of x1.
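
The partial F statistic here is simply the squared t statistic of x2, and the exact critical value can be checked in R (with the raw data available, anova() on the two fitted models would give the same test).

Fstat <- 1.989^2                       # 3.956121
crit  <- qf(0.95, df1 = 1, df2 = 27)   # about 4.21
Fstat > crit                           # FALSE, so do not reject H0
# anova(lm(y ~ x1), lm(y ~ x1 + x2))   # equivalent test, given the data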

4. Consider a response y and four candidate predictors x1,x2,x3,x4. The following
table presents some statistics using different sets of variables as predictors.
predictors rsquare adjr cp aic
1 0 0 9.715913 47.22907
x1 0.03243409 -0.002121838 10.492631 48.23992
x2 0.16076178 0.130788991 5.652635 43.97125
x3 0.01834042 -0.016718846 11.024187 48.67375
x4 0.09851366 0.066317717 8.000380 46.11776
x1 x2 0.19107243 0.131151865 6.509442 44.86770
x1 x3 0.05421051 -0.015847975 11.671314 49.55702
x1 x4 0.12791266 0.063313600 8.891570 47.12310
x2 x3 0.16268818 0.100665082 7.579979 45.90231
x2 x4 0.30322294 0.251609826 2.279583 40.39038
x3 x4 0.12804384 0.063454496 8.886622 47.11859
x1 x2 x3 0.19425379 0.101283073 8.389454 46.74948
x1 x2 x4 0.32970944 0.252368225 3.280620 41.22775
x1 x3 x4 0.16140194 0.064640623 9.628491 47.94836
x2 x3 x4 0.30882514 0.229074195 4.068291 42.14820
x1 x2 x3 x4 0.33714981 0.231093776 5.000000 42.89289
(a) Using the all possible regression procedure with R2 as the criterion, which is
the best model and why?
y∼x1+x2+x3+x4, because it has the highest R2.

(b) Consider the forward selection procedure. List all the models that we shall
compare in the first step. Which model will you select to proceed and why?
y∼1, y∼x1, y∼x2, y∼x3, and y∼x4. We choose y∼x2 to proceed because it
has the lowest AIC.

(c) Continued on part (b), list all the models that we shall compare in the second
step. Which model will you select to proceed and why?
y∼x2, y∼x1+x2, y∼x2+x3, and y∼x2+x4. We choose y∼x2+x4 to proceed
because it has the lowest AIC.

(d) Continued on part (c), proceed in a similar manner for the third step and
onwards. Hence, which is the best model using the forward selection procedure
and why?
In the third step, we compare y∼x2+x4, y∼x1+x2+x4, and y∼x2+x3+x4. As
the current model y∼x2+x4 has the lowest AIC, we terminate the procedure
and claim it is the best model.
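
If the raw data were available, say in a hypothetical data frame dat with columns y, x1, x2, x3, x4, the same forward selection could be run automatically in R with step(), which compares candidate models by AIC exactly as in parts (b)-(d).

null_model <- lm(y ~ 1, data = dat)                            # start from the intercept-only model
step(null_model, scope = ~ x1 + x2 + x3 + x4, direction = "forward")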
