
STQS 2234

STATISTICAL MODELLING

TITLE
ASSIGNMENT 5

PREPARED FOR
DR. MARINA BINTI ZAHARI

GROUP 7
NORAINSYIRAH BINTI MOHAMED NORDIN (A151588)
NUR SYAHIDAH BINTI KHALAPIAH (A150105)
NUR NADIRAH BINTI MOHAMAD YOHYI (A151055)
SITI NUR ZAWANIE BINTI MD SOBRI (A149121)
NUR AZILA BINTI BAHARUDDIN (A148328)
NURFARIHA NADHIRAH BINTI AHMAD (A151675)
NOORMARINA BINTI MOKHTAR (A151110)

QUESTION 1:
1. Consider the multiple linear regression model $y = X\beta + \varepsilon$. If $\hat{\beta}$ denotes the least squares estimator of $\beta$, show that $\hat{\beta} = \beta + [(X'X)^{-1}X']\varepsilon$.

The least squares estimator $\hat{\beta}$ minimizes
$$SSE = \sum_{i=1}^{n} e_i^2 = e'e, \qquad \text{where } e = y - \hat{y} \text{ and } \hat{y} = X\hat{\beta}.$$
Expanding,
$$SSE = (y - X\hat{\beta})'(y - X\hat{\beta}) = y'y - 2\hat{\beta}'X'y + \hat{\beta}'X'X\hat{\beta}.$$
Setting the derivative with respect to $\hat{\beta}$ to zero,
$$\frac{\partial SSE}{\partial \hat{\beta}} = -2X'y + 2X'X\hat{\beta} = 0,$$
which gives the normal equations $X'X\hat{\beta} = X'y$, so from the fitted model $\hat{y} = X\hat{\beta}$,
$$\hat{\beta} = (X'X)^{-1}X'y.$$
Substituting the multiple linear regression model $y = X\beta + \varepsilon$,
$$\hat{\beta} = [(X'X)^{-1}X'][X\beta + \varepsilon] = (X'X)^{-1}X'X\beta + (X'X)^{-1}X'\varepsilon = \beta + [(X'X)^{-1}X']\varepsilon.$$
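As a quick numerical check of this identity, the following minimal sketch simulates a regression and verifies that $\hat{\beta} - \beta = (X'X)^{-1}X'\varepsilon$. The design matrix, true coefficients and error scale are illustrative assumptions, not the assignment's data.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 25, 3                        # illustrative sizes, not the assignment's data
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])  # design matrix with intercept
beta = np.array([2.0, -1.0, 0.5])   # assumed true coefficients
eps = rng.normal(scale=0.3, size=n)
y = X @ beta + eps

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ y        # least squares estimator (X'X)^{-1} X'y

# The identity: beta_hat - beta should equal (X'X)^{-1} X' eps
print(np.allclose(beta_hat - beta, XtX_inv @ X.T @ eps))  # True
```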

3(e)
[Figures i-iv: residual plots versus $x_1$, $x_2$, $x_3$ and $x_4$; discussed in part (e) of Question 3 below.]

QUESTION 2:

QUESTION 3
a) $H_0: \beta_j = 0$ for all $j = 1, 2, \ldots, 5$
$H_1$: at least one $\beta_j \neq 0$

Test statistic: $f_0 = 4.81$

Since $f_0 = 4.81 > f_{0.01,5,19} = 4.17$ and p-value $= 0.0052 < 0.01$, we reject $H_0$. We conclude that the response is linearly related to at least one of $x_1$, $x_2$, $x_3$, $x_4$ and $x_5$.
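The critical value and p-value above can be reproduced with SciPy; this is a minimal sketch, assuming the reported F statistic of 4.81 on 5 and 19 degrees of freedom.

```python
from scipy import stats

f0, df1, df2 = 4.81, 5, 19
f_crit = stats.f.ppf(1 - 0.01, df1, df2)   # upper 1% point of F(5, 19), approx. 4.17
p_value = stats.f.sf(f0, df1, df2)         # P(F > f0), approx. 0.005
print(f_crit, p_value, f0 > f_crit)
```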

$H_0: \beta_1 = 0$
$H_1: \beta_1 \neq 0$
Test statistic: $t_0 = 2.47$
Since $|t_0| = 2.47 > t_{0.025,19} = 2.093$ and p-value $= 0.023 < 0.05$, with $\alpha = 0.05$ we reject the null hypothesis. This indicates that the predictor $x_1$ contributes to the model.

$H_0: \beta_2 = 0$
$H_1: \beta_2 \neq 0$
Test statistic: $t_0 = 2.74$
Since $|t_0| = 2.74 > t_{0.025,19} = 2.093$ and p-value $= 0.013 < 0.05$, with $\alpha = 0.05$ we reject the null hypothesis. This indicates that the predictor $x_2$ contributes to the model.

$H_0: \beta_3 = 0$
$H_1: \beta_3 \neq 0$
Test statistic: $t_0 = 2.42$
Since $|t_0| = 2.42 > t_{0.025,19} = 2.093$ and p-value $= 0.026 < 0.05$, with $\alpha = 0.05$ we reject the null hypothesis. This indicates that the predictor $x_3$ contributes to the model.

$H_0: \beta_4 = 0$
$H_1: \beta_4 \neq 0$
Test statistic: $t_0 = 2.79$
Since $|t_0| = 2.79 > t_{0.025,19} = 2.093$ and p-value $= 0.012 < 0.05$, with $\alpha = 0.05$ we reject the null hypothesis. This indicates that the predictor $x_4$ contributes to the model.

$H_0: \beta_5 = 0$
$H_1: \beta_5 \neq 0$
Test statistic: $t_0 = 0.25$
Since $|t_0| = 0.25 < t_{0.025,19} = 2.093$ and p-value $= 0.801 > 0.05$, with $\alpha = 0.05$ we fail to reject the null hypothesis. This indicates that the predictor $x_5$ could be deleted from the model.
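Each of the five t tests above follows the same template; a minimal SciPy sketch, using the reported t statistics as inputs, is:

```python
from scipy import stats

t_stats = {"b1": 2.47, "b2": 2.74, "b3": 2.42, "b4": 2.79, "b5": 0.25}
df = 19
t_crit = stats.t.ppf(1 - 0.05 / 2, df)     # two-sided 5% critical value, approx. 2.093
for name, t0 in t_stats.items():
    p = 2 * stats.t.sf(abs(t0), df)        # two-sided p-value
    print(name, round(p, 3), "reject H0" if abs(t0) > t_crit else "fail to reject H0")
```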

b) When $x_5$ was excluded and the model re-fitted, the resulting model is better than the model that includes $x_5$.

c) Some of the residuals are large in magnitude; these observations are outliers.

d) $R^2_{adj}$ in (a) is the smallest of the three. $R^2_{adj}$ for (b), where $x_5$ is removed and all 25 observations are retained, is higher than in (a) but lower than in (c). $R^2_{adj}$ for (c), where $x_5$ is removed and 24 observations are used, is the highest. This shows that $R^2_{adj}$ increases at each re-fit, because the variables remaining in the model are all useful.
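For reference, the adjusted $R^2$ behind this comparison penalizes SSE by the residual degrees of freedom; a small helper, with SSE, SST, $n$ and $p$ as assumed inputs, is:

```python
def adjusted_r_squared(sse: float, sst: float, n: int, p: int) -> float:
    """R^2_adj = 1 - [SSE / (n - p)] / [SST / (n - 1)],
    where p counts all parameters including the intercept."""
    return 1 - (sse / (n - p)) / (sst / (n - 1))
```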

e)
Residuals versus $x_1$
- The points are randomly scattered, but more of them lie in the negative region at the bottom of the plot, indicating that the model over-predicts those observations. There are also some outliers.

Residuals versus $x_2$
- The points are randomly scattered, with more points in the negative region, again indicating over-prediction, and some outliers. The points at the bottom left are plotted close to one another.

Residuals versus $x_3$
- The points are randomly scattered, with more points in the negative region, indicating over-prediction, and some outliers. The points in the negative region mostly share the same $x$ value.

Residuals versus $x_4$
- The points are randomly scattered and roughly balanced between the positive and negative regions. There are also some outliers.
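A residual-versus-regressor plot of the kind described above can be produced as follows (a minimal matplotlib sketch; `x` and `residuals` stand in for the fitted model's regressor and residual vectors, which are not reproduced here):

```python
import matplotlib.pyplot as plt

def residual_plot(x, residuals, xlabel):
    """Scatter residuals against one regressor; the horizontal line at zero
    makes asymmetry (over- or under-prediction) easy to see."""
    plt.scatter(x, residuals)
    plt.axhline(0, linestyle="--")
    plt.xlabel(xlabel)
    plt.ylabel("Residual")
    plt.show()
```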

f) $H_0: \beta_j = 0$ for all $j = 1, 2, 3, 4$
$H_1$: at least one $\beta_j \neq 0$

Test statistic: $f_0 = 21.79$

Since $f_0 = 21.79 > f_{0.05,4,20} = 2.87$, we reject $H_0$. We conclude that the response is linearly related to at least one of $x_1$, $x_2$, $x_3$ and $x_4$.

$H_0: \beta_1 = 0$
$H_1: \beta_1 \neq 0$
Test statistic: $t_0 = 5.76$
Since $|t_0| = 5.76 > t_{0.025,20} = 2.086$, with $\alpha = 0.05$ we reject the null hypothesis. This indicates that the predictor $x_1$ contributes to the model.

$H_0: \beta_2 = 0$
$H_1: \beta_2 \neq 0$
Test statistic: $t_0 = 5.96$
Since $|t_0| = 5.96 > t_{0.025,20} = 2.086$, with $\alpha = 0.05$ we reject the null hypothesis. This indicates that the predictor $x_2$ contributes to the model.

$H_0: \beta_3 = 0$
$H_1: \beta_3 \neq 0$
Test statistic: $t_0 = 2.90$
Since $|t_0| = 2.90 > t_{0.025,20} = 2.086$, with $\alpha = 0.05$ we reject the null hypothesis. This indicates that the predictor $x_3$ contributes to the model.

$H_0: \beta_4 = 0$
$H_1: \beta_4 \neq 0$
Test statistic: $t_0 = 4.99$
Since $|t_0| = 4.99 > t_{0.025,20} = 2.086$, with $\alpha = 0.05$ we reject the null hypothesis. This indicates that the predictor $x_4$ contributes to the model.

g) The residual plots against $x_1$, $x_2$, $x_3$ and $x_4$ are bounded between -1 and 1, and no pattern is shown in the plots. Each plot has an outlier. The points in all the plots are symmetrically distributed, and most of them are close to zero.

QUESTION 4:

Based on the $R^2$ criterion, the best model is the one with the two predictors PctComp and PctTD, since $R^2$ shows a substantial increase there, jumping from 64.8 to 85.1.

Based on the adjusted $R^2$ and MSE criteria, the best model is the one with the seven predictors Att, PctComp, Yds, YdsperAtt, TD, PctTD and PctInt, as this model has the largest adjusted $R^2$ (100.0) and the smallest MSE (5.1).

Based on the $C_p$ criterion, there are eight possible best models:

i. the model with 6 predictors containing Att, PctComp, Yds, YdsperAtt, PctTD and PctInt;
ii. the model with 6 predictors containing Att, Comp, PctComp, YdsperAtt, PctTD and PctInt;
iii. the model with 7 predictors containing Att, PctComp, Yds, YdsperAtt, TD, PctTD and PctInt;
iv. the model with 7 predictors containing Att, PctComp, Yds, YdsperAtt, PctTD, Int and PctInt;
v. the model with 8 predictors containing Att, PctComp, Yds, YdsperAtt, TD, PctTD, Int and PctInt;
vi. the model with 8 predictors containing Att, Comp, PctComp, Yds, YdsperAtt, TD, PctTD and PctInt;
vii. the model with 9 predictors containing Att, PctComp, Yds, YdsperAtt, TD, PctTD, Lng, Int and PctInt;
viii. and the model with 9 predictors containing Att, Comp, PctComp, Yds, YdsperAtt, TD, PctTD, Int and PctInt.

All of those models are unbiased models, because their $C_p$ values equal (or are below) the number of parameters, $p$.
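Mallows' $C_p$ used here compares each subset model's SSE against the full model's MSE; a minimal sketch of the computation, with SSE_p, MSE_full, $n$ and $p$ as assumed inputs, is:

```python
def mallows_cp(sse_p: float, mse_full: float, n: int, p: int) -> float:
    """C_p = SSE_p / MSE_full - (n - 2p); a model is roughly unbiased
    when C_p is close to p (the parameter count of the subset model)."""
    return sse_p / mse_full - (n - 2 * p)
```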

QUESTION 5:
