0% found this document useful (0 votes)

10 views8 pages

W7 - Assumptions

Uploaded by

z13612909240

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views8 pages

W7 - Assumptions

Uploaded by

z13612909240

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

W7 – Assumptions

Introduction

All the tests we have been doing so far (week 2-3-4): t-test one sample, 2 samples t-test, linear
regression t-test etc… are so called parametric tests. When doing research, If you want to perform
these tests you should check some assumptions first (week 7). If the assumptions are not met, we
should therefore use non-parametric test (week 8). Following there is a summary of the
assumptions, next we will go in detail using the assignments.

When doing regression analysis [ lm(y ~ x, …… ) in R studio] we should check 4 main assumptions,
given the assignments, I see you just focus on 2 of them, so we will only focus on the 2 first ones.

- 1st : Normality distribution of the residuals

- 2nd : Equal variance of the residuals also called HOMO or HETEROSCEDASTICITY
- 3rd : Linearity and 4th : Independence

These assumptions can be checked in three ways:

- Statistically : Using tests and analyzing their P-Values

- Graphically : Using plots
- Using descriptive statistics (var, st.deviations, mean, median etc…)

I – Checking the assumptions Statistically:

1. Normality:
H0: Normal distribution vs HA: Not normal
If P-Value <0.05 we reject normal distribution: NO normal distribution
If P-Value >0.05 we accept normal distribution: YES normal distribution

Example and Interpretation:

Shapiro-Wilk normality test
data: data560$res2
W = 0.96348, p-value = 7.321e-07 <- P-Value <0.05, not normal distribution

2. Equal variance:
H0: Equal variance vs HA: Not equal variance
If P-Value <0.05 we reject equal variance: NO equal variance
If P-Value >0.05 we accept equal variance : YES equal variance

Example and Interpretation:

studentized Breusch-Pagan test
data: model1
BP = 6.053, df = 3, p-value = 0.154 -> P-Value > 0.05, yes equal variance

Levene's Test for Homogeneity of Variance (center = median)

Df F value Pr(>F)
group 2 12.1055 0.00798 -> P-Value < 0.05, no equal variance
42
II – Checking the assumptions Graphically

1. Normality

- Histograms: A “good” histogram should be unimodal (only one peak) and bell shaped. On
the right one can see that the histogram is right skewed (the tail is on the right).

- Normal QQ plots: The first plot mostly shows normal distribution as the dots (residuals) are
following the straight line except for low values. However, the second plot clearly shows
strong deviations form the straight line, suggesting strong violation of the normal
distribution.
2. Equal variance:

- Residuals plots : The residuals (dots) should be equally spread throughout the blue line, you
shouldn’t see any pattern.
o 1st plot: Good, there is equal variance in the residuals.
o 2nd and 3rd not good, there is a violation of the equal variance

- Boxplots: The spread of the residuals should be similar between the groups, for the first
example it’s fine, the second is not !
Example with Assignment 560:

Context: Suppose you interested in the relationship between crime and punishment: you expect
that crime is strongly related to the level of punishment, and that other factors do not play a role.
(the severity of crime is positively associated with the level punishment).

- Y = Punishment (dependent variable)

- X = Crime (independent variable)

Steps to check the assumptions in R:

- 1st : Make your model using lm

- 2nd : Add residuals and predicted values
- 3rd : Graphically check the assumptions
- 4th : Check them also using the statistical tests

Codes in red and interpretations in blue

1st : Creating the model: model1 <- data560 %>% lm(punish ~ crime, . )

2nd : add residuals and predicted

- res <- model1$residuals

- pred <- model1$fitted.values

3rd : Assumptions graphically:

- For normality:
o hist(data560$res)
o plot(model1,2)
- For equal variance:
o plot(model1,1)
o plot(model1,3)

Outputs + : respectively : plot(model,2) ; plot(model1,1) ; plot(model1,3)

Interpretation: From the normal QQ plot, we can see that there are quite a lot of deviations from the
straight dotted line, this is a violation of the normal distribution. From 2nd it seems that the equal
variance is mostly fine. However, from the 3rd plot we can see that the equal variance is not met.
4th : Assumptions statistically

- For normality: shapiro.test(res)

- For equal variance: bptest(model1)
- *Remember to run the libraries first, note that I chose breush pagan because my
independent variable is crime which is a scale variable
Shapiro-Wilk normality test

data: data560$res
W = 0.96967, p-value = 2.87e-06 -> P-Value <0.05 so not normal distribution

studentized Breusch-Pagan test

data: model1
BP = 7.1701, df = 1, p-value = 0.007413 -> P-value <0.05 so not equal variance

Steps more in detail, what did we do in R ?:

1st: Make your model with lm

- At this point you can already:

o Check R squared for quality of the model
o You can check the significance of the variables

2nd : Add residuals and predicted values

- Why? -> In order to check the assumptions

o Equal variance
o Normality

3rd : Graphically check the assumptions

- For equal variance: 2 options :

o 1st : use ggplot formula (See assignment)
 Residuals (Y) against predicted (X)
 Residuals (Y) against each X(independent variable)
o 2nd : use plot() formula: The “easy method”
 Plot(model_name,1) : for equal variance
 Plot(model_name,2) : for normal distribution
- For normality:
o 1st : plot(model_name,2)
o 2nd : hist(residuals)

4th : Test assumptions with statistics

- For normal distribution: Use Shapiro Wilk’s test

- For equal variance, it depends: Levene or Breush pagan?
o Breush pagan if scale independent variable.
o Levene if categorical independent variable.
- If at least one independent variable is scale we choose Breush Pagan (Even if the other
independent variable is categorical)

Assignment 561 : First part (From 1 to question 8)

Context: Suppose we study students, measure their reading abilities (on a scale from 1 to 10) at t1,
and do an experiment with a control group and two different treatments assumed to affect their
‘ability to read’. You first want to understand differences in reading ability at t1. In the first part of
the assignment, you suspect that both income of parents and whether parents read at home to
their children contribute to the reading abilities of the children.

- Model: Y(Reading) = b0 + b1(income) + b2(parents_read_home)

o We have:
o Income is scale
o Read at home is dummy

Checking assumptions Graphically: See Assignment Answers

Checking assumptions Statistically:

Codes for Normality: shapiro.test(data561 $res1)

Output:
Shapiro-Wilk normality test
data: data561$res1
W = 0.93781, p-value = 0.004326

Hypothesis:

- H0: Normal distribution

- HA: Not normal distribution
- Interpretation: P-Value <0.05 so we reject normality

Codes for Equal variance: bptest(model1)

-> Remember we use Breush Pagan because income is a scale variable

Output:
studentized Breusch-Pagan test
data: model1
BP = 23.053, df = 2, p-value = 9.865e-06

Hypothesis + Interpretation:

- H0: Equal variance

- H1: Not equal variance
- P-Value <0.05 so we reject equal variance
Assignment 561 : Second part: The experiment: (From 9 to question 14)

Content: In order to improve reading abilities, a school starts experimenting different teaching
methods. The school randomly assign children to three groups. One group (the control group) gets
an extra hour of class reading by the teacher. Two other groups are approached in a more
personalized way. One gets a reading app, in which famous actors read children stories, another
group gets a volunteer reading to them. Suppose you want to study the effect of one of the three
conditions (one of which is a control group).

- Model: Y(Reading) = B0 + B1(Teaching methods) where

o Teaching methods is a categorical variable

Checking assumptions Graphically: See Assignment Answers

Checking assumptions Statistically:

Codes for Normality: shapiro.test(data561 $res2)

Output:
Shapiro-Wilk normality test
data: data561$res2
W = 0.90059, p-value = 0.0001382

Hypothesis + Interpretation:

- H0: Normal distribution

- HA: Not normal distribution.
- Interpretation: P-Value <0.05 so we reject normality

Codes for Equal variance: leveneTest (model2)

-> Remember we use Levene because teaching methods is a nominal variable with multiple groups

Output:
Levene's Test for Homogeneity of Variance (center = median)
Df F value Pr(>F)
group 2 8.1055 0.000798 ***
57

Hypothesis:

- H0: Equal variance

- H1: Not equal variance
- P-Value <0.05 so we reject equal variance
Summary Assumptions

normality Equal variance

Graphs Histogram -> Residual analysis (residuals against
hist(data_name$res) predicted, and each independent
variable)
normal QQplot -> ggplot() formula
plot(model_name,2)
plot(model_name,1)
plot(model_name,3)

Test Shapiro wilk Levene or Breush-pagan

H0: There is normal H0: There is equal variance
distribution HA: There is not
HA: There is not
Levene if my independent variable
is categorical (if I have multiple
groups)
Breush if at least 1 independent
variable is scale.

Descriptives Skeweness should be St.deviations

between -0.5 and+0.5 for Rule of thumb:
normal distributed data var(biggest) / var(smallest)^2

Kurtosis should be between - -> if < 2: Equal variance assumed

3 and +3 for normal -> if > 2 : not equal variance
distributed data

Student Solutions Manual To Accompany An Introduction To Econometrics A Self Contained Approach 1st Edition Frank Westhoff PDF Download
100% (1)
Student Solutions Manual To Accompany An Introduction To Econometrics A Self Contained Approach 1st Edition Frank Westhoff PDF Download
84 pages
Two Sample T-Test
100% (1)
Two Sample T-Test
95 pages
Econometrics I - Lecture 5 (Wooldridge) Color
No ratings yet
Econometrics I - Lecture 5 (Wooldridge) Color
44 pages
Week5 Assumptions 1
No ratings yet
Week5 Assumptions 1
41 pages
Week 6: Assumptions in Regression Analysis
No ratings yet
Week 6: Assumptions in Regression Analysis
69 pages
EC501 Lecture 03
No ratings yet
EC501 Lecture 03
30 pages
Ecn 306
No ratings yet
Ecn 306
43 pages
Unit 561 Unequal Variance and More With Answers
No ratings yet
Unit 561 Unequal Variance and More With Answers
13 pages
Intro To Traditional and Bayesian M Using R-Guilford 2017
No ratings yet
Intro To Traditional and Bayesian M Using R-Guilford 2017
330 pages
Student Solutions Manual To Accompany An Introduction To Econometrics A Self Contained Approach 1nbsped 9780262317184 9780262525404 - Compress
No ratings yet
Student Solutions Manual To Accompany An Introduction To Econometrics A Self Contained Approach 1nbsped 9780262317184 9780262525404 - Compress
143 pages
Measuring Relationship Via Regression Analysis and Correlation-1
No ratings yet
Measuring Relationship Via Regression Analysis and Correlation-1
18 pages
Analysing Panel Data
No ratings yet
Analysing Panel Data
25 pages
Statistical Hypothesis Testing
No ratings yet
Statistical Hypothesis Testing
20 pages
SW CH 05 Piskula Mod
No ratings yet
SW CH 05 Piskula Mod
39 pages
STAT200 Week6 Homework Solutions
No ratings yet
STAT200 Week6 Homework Solutions
16 pages
1 The Econometrics of The Simple Regression Model: I 1 1i 2 2i K Ki I
No ratings yet
1 The Econometrics of The Simple Regression Model: I 1 1i 2 2i K Ki I
50 pages
Assignment5 - Fall 2024
No ratings yet
Assignment5 - Fall 2024
14 pages
Ed Aaaaaaa
No ratings yet
Ed Aaaaaaa
7 pages
(Ebook PDF) Elementary Statistics 4th Edition Instant Download
100% (3)
(Ebook PDF) Elementary Statistics 4th Edition Instant Download
57 pages
2017aug 02323 02402 Solution en
No ratings yet
2017aug 02323 02402 Solution en
43 pages
Module 5
No ratings yet
Module 5
24 pages
Stat 151 - Final Review
No ratings yet
Stat 151 - Final Review
15 pages
(Ebook PDF) Introduction To Econometrics, 4th Global Edition Instant Download
100% (6)
(Ebook PDF) Introduction To Econometrics, 4th Global Edition Instant Download
57 pages
Suggested Detailed Solutions For Assignment Set 1 - Updated Ex 4 - Nov 13
No ratings yet
Suggested Detailed Solutions For Assignment Set 1 - Updated Ex 4 - Nov 13
9 pages
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
100% (78)
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
6 pages
Programming With R Test 2
50% (2)
Programming With R Test 2
5 pages
(EMPTY) - Practice Test 2.5
No ratings yet
(EMPTY) - Practice Test 2.5
16 pages
Lecture 3 SLR - 2
No ratings yet
Lecture 3 SLR - 2
29 pages
Modern Regression Homework 5-1
No ratings yet
Modern Regression Homework 5-1
8 pages
CHI-SQUARED Test of A Contingency Table
No ratings yet
CHI-SQUARED Test of A Contingency Table
6 pages
Lecture 3 SLR - 2
No ratings yet
Lecture 3 SLR - 2
29 pages
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
100% (51)
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
26 pages
Reasoning With Data An Introduction To Traditional and Bayesian Statistics Using R Full Text Download
100% (12)
Reasoning With Data An Introduction To Traditional and Bayesian Statistics Using R Full Text Download
15 pages
Sta 226
No ratings yet
Sta 226
5 pages
Problems 1
No ratings yet
Problems 1
4 pages
Unit 2 Assignment SKELETON R spr18
No ratings yet
Unit 2 Assignment SKELETON R spr18
12 pages
(Ebook PDF) Introduction To Econometrics 4Th Edition by James H. Stock Install Download
No ratings yet
(Ebook PDF) Introduction To Econometrics 4Th Edition by James H. Stock Install Download
52 pages
Practice Solutions
No ratings yet
Practice Solutions
4 pages
(Ebook PDF) Elementary Statistics 4th Edition Instant Download
100% (1)
(Ebook PDF) Elementary Statistics 4th Edition Instant Download
52 pages
Assignment R New 1
No ratings yet
Assignment R New 1
26 pages
Hints of Assignment5 - Fall 2024
No ratings yet
Hints of Assignment5 - Fall 2024
11 pages
Regression With One Regressor-Hypothesis Tests and Confidence Intervals
100% (1)
Regression With One Regressor-Hypothesis Tests and Confidence Intervals
53 pages
Statistics
No ratings yet
Statistics
30 pages
KK Youth Profiling 2024
100% (6)
KK Youth Profiling 2024
1 page
Weatherwax Weisberg Solutions
No ratings yet
Weatherwax Weisberg Solutions
162 pages
Lecture 6: Classical Normal Linear Regression Model Some Basic Ideas
No ratings yet
Lecture 6: Classical Normal Linear Regression Model Some Basic Ideas
9 pages
Procurement Planning Linkage With Budgeting: Your Company Information
100% (3)
Procurement Planning Linkage With Budgeting: Your Company Information
205 pages
Normality, T-Test, ANOVA, Chi Square, Correlation
No ratings yet
Normality, T-Test, ANOVA, Chi Square, Correlation
31 pages
Value Chain Development
100% (2)
Value Chain Development
75 pages
Exam Topics Overview ECON1203
No ratings yet
Exam Topics Overview ECON1203
1 page
AMS 315 Final Examination Solution F2019B PDF
No ratings yet
AMS 315 Final Examination Solution F2019B PDF
16 pages
Reasoning With Data An Introduction To Traditional and Bayesian Statistics Using R ISBN 1462530265, 9781462530267 Premium Ebook Download
No ratings yet
Reasoning With Data An Introduction To Traditional and Bayesian Statistics Using R ISBN 1462530265, 9781462530267 Premium Ebook Download
16 pages
App Econ - Week6 PDF
No ratings yet
App Econ - Week6 PDF
6 pages
Introduction To Econometrics Ebook PDF
No ratings yet
Introduction To Econometrics Ebook PDF
89 pages
Modelling in R
No ratings yet
Modelling in R
47 pages
Intro Stat
No ratings yet
Intro Stat
324 pages
Activity 5 Stat
No ratings yet
Activity 5 Stat
6 pages
Wooldridge 6e Ch09 SSM
No ratings yet
Wooldridge 6e Ch09 SSM
8 pages
HMWK 4
No ratings yet
HMWK 4
5 pages
Body-Esteem Scale For Adolescents and Adults: Journal of Personality Assessment
No ratings yet
Body-Esteem Scale For Adolescents and Adults: Journal of Personality Assessment
18 pages
Group 1
100% (1)
Group 1
48 pages
The Strategy Process
No ratings yet
The Strategy Process
34 pages
Zhang Et Al. (2023) .The Relationship Between Trait Mindfulness and Resilience - A Meta Analysis
No ratings yet
Zhang Et Al. (2023) .The Relationship Between Trait Mindfulness and Resilience - A Meta Analysis
15 pages
Safety Geremew Tarekegn
No ratings yet
Safety Geremew Tarekegn
110 pages
Discussion1 Solution
No ratings yet
Discussion1 Solution
5 pages
Auditing I Course Outline
No ratings yet
Auditing I Course Outline
3 pages
LLM Research Methodology Notes
No ratings yet
LLM Research Methodology Notes
7 pages
COM351 Week 1 Research A Way of Thinking The Research Process
No ratings yet
COM351 Week 1 Research A Way of Thinking The Research Process
15 pages
Paired T-Test
No ratings yet
Paired T-Test
7 pages
Case Study For Reasons of Software Projects Failure: STUDENT NAMES: Ashenafi Gadisa
No ratings yet
Case Study For Reasons of Software Projects Failure: STUDENT NAMES: Ashenafi Gadisa
21 pages
The Ahmedabad Urban Development Plan-Making Process: A Critical Review
No ratings yet
The Ahmedabad Urban Development Plan-Making Process: A Critical Review
24 pages
Effectiveness Review: Disaster Risk Reduction Programming in Ethiopia's Somali Region
No ratings yet
Effectiveness Review: Disaster Risk Reduction Programming in Ethiopia's Somali Region
58 pages
Polite Team, 3 - I Research Paper Complete Chapter
No ratings yet
Polite Team, 3 - I Research Paper Complete Chapter
71 pages
Mid Komputasi Statistika (Firmina Fenanlampir 18504164
No ratings yet
Mid Komputasi Statistika (Firmina Fenanlampir 18504164
10 pages
The Impact of Forensic Accounting in Fraud Detection and Prevention: Evidence From Nigerian Public Sector
100% (1)
The Impact of Forensic Accounting in Fraud Detection and Prevention: Evidence From Nigerian Public Sector
8 pages
D4L1-Introduction-sep 2023
No ratings yet
D4L1-Introduction-sep 2023
35 pages
PRQ20231065 Consultancy For Repeat Assessment of Transport
No ratings yet
PRQ20231065 Consultancy For Repeat Assessment of Transport
58 pages
46 +Anyaoha+-+African+Journal+of+Social+and+Behavioural+Science+Vol+14,+No +2
No ratings yet
46 +Anyaoha+-+African+Journal+of+Social+and+Behavioural+Science+Vol+14,+No +2
7 pages
Introduction To Population of Nepal PDF
No ratings yet
Introduction To Population of Nepal PDF
99 pages
Farm Risk Management Past Present and Pr20210422-30272-65e4f5
No ratings yet
Farm Risk Management Past Present and Pr20210422-30272-65e4f5
20 pages
Blended Learning
No ratings yet
Blended Learning
17 pages
Forensic Science: The Impact of
No ratings yet
Forensic Science: The Impact of
16 pages
Best Example by Henk - Research Proposal 1
No ratings yet
Best Example by Henk - Research Proposal 1
14 pages
Scoring Rubrics
No ratings yet
Scoring Rubrics
14 pages
Unit 545 Differences Between Two or More Groups Non Parametric With Answers
No ratings yet
Unit 545 Differences Between Two or More Groups Non Parametric With Answers
10 pages
Guiding Principles On IMS
No ratings yet
Guiding Principles On IMS
17 pages
Unit 6 - Assignment With Answers
No ratings yet
Unit 6 - Assignment With Answers
9 pages
Unit 545 Differences Between Two or More Groups Non Parametric Without Answers
No ratings yet
Unit 545 Differences Between Two or More Groups Non Parametric Without Answers
8 pages
Unit 522 Understanding and Visualizing Linear Equations Without Answers
No ratings yet
Unit 522 Understanding and Visualizing Linear Equations Without Answers
8 pages
Unit 6 - Assignment Without Answers
No ratings yet
Unit 6 - Assignment Without Answers
6 pages
W6 - Interaction Equations
No ratings yet
W6 - Interaction Equations
6 pages
Having Less, Giving More: The in Uence of Social Class On Prosocial Behavior
No ratings yet
Having Less, Giving More: The in Uence of Social Class On Prosocial Behavior
16 pages
Assignmentdyads6 - 71455 - 4039886 - Assignment 4 - Method and Results Qualitative Draft-1
No ratings yet
Assignmentdyads6 - 71455 - 4039886 - Assignment 4 - Method and Results Qualitative Draft-1
4 pages
Shoe Tech Chap 1-3
No ratings yet
Shoe Tech Chap 1-3
14 pages
Unit 1 - Assignment With Answers
No ratings yet
Unit 1 - Assignment With Answers
4 pages
Unit 10 - Assignment Without Answers PM
No ratings yet
Unit 10 - Assignment Without Answers PM
3 pages
W3 (Extra) - Data 123 Practice Open Questions With Means
No ratings yet
W3 (Extra) - Data 123 Practice Open Questions With Means
9 pages
Interim Report: Training and Development Need Analysis
No ratings yet
Interim Report: Training and Development Need Analysis
6 pages
Unit 5 - Assignment Without Answers
No ratings yet
Unit 5 - Assignment Without Answers
2 pages
HW3
No ratings yet
HW3
3 pages
Foundations of Elementary Analysis
From Everand
Foundations of Elementary Analysis
Roshan Trivedi
No ratings yet
How to Find Inter-Groups Differences Using Spss/Excel/Web Tools in Common Experimental Designs: Book 1
From Everand
How to Find Inter-Groups Differences Using Spss/Excel/Web Tools in Common Experimental Designs: Book 1
P.Y. Cheng
No ratings yet
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet

W7 - Assumptions

Uploaded by

W7 - Assumptions

Uploaded by

W7 – Assumptions

- 1st : Normality distribution of the residuals

These assumptions can be checked in three ways:

- Statistically : Using tests and analyzing their P-Values

I – Checking the assumptions Statistically:

Example and Interpretation:

Example and Interpretation:

Levene's Test for Homogeneity of Variance (center = median)

- Y = Punishment (dependent variable)

Steps to check the assumptions in R:

- 1st : Make your model using lm

Codes in red and interpretations in blue

2nd : add residuals and predicted

- res <- model1$residuals

3rd : Assumptions graphically:

Outputs + : respectively : plot(model,2) ; plot(model1,1) ; plot(model1,3)

- For normality: shapiro.test(res)

studentized Breusch-Pagan test

Steps more in detail, what did we do in R ?:

1st: Make your model with lm

- At this point you can already:

2nd : Add residuals and predicted values

- Why? -> In order to check the assumptions

3rd : Graphically check the assumptions

- For equal variance: 2 options :

4th : Test assumptions with statistics

- For normal distribution: Use Shapiro Wilk’s test

Assignment 561 : First part (From 1 to question 8)

- Model: Y(Reading) = b0 + b1(income) + b2(parents_read_home)

Checking assumptions Graphically: See Assignment Answers

Checking assumptions Statistically:

Codes for Normality: shapiro.test(data561 $res1)

- H0: Normal distribution

Codes for Equal variance: bptest(model1)

- H0: Equal variance

- Model: Y(Reading) = B0 + B1(Teaching methods) where

Checking assumptions Graphically: See Assignment Answers

Checking assumptions Statistically:

Codes for Normality: shapiro.test(data561 $res2)

- H0: Normal distribution

Codes for Equal variance: leveneTest (model2)

- H0: Equal variance

normality Equal variance

Test Shapiro wilk Levene or Breush-pagan

Descriptives Skeweness should be St.deviations

Kurtosis should be between - -> if < 2: Equal variance assumed

You might also like