0% found this document useful (0 votes)

68 views7 pages

Exercises in Class Normality - SOLUTIONS

The document discusses analyzing the normality of residuals from regression models. 1) A regression model is estimated to explain worker salary based on various variables. Jarque-Bera test rejects normality of residuals, indicating tests on parameters may not be reliable. However, with a large sample size of over 900, central limit theorem implies tests can still use approximate distributions. 2) Another model is discussed where residuals appear bimodal, suggesting a misspecification like omitted structural change. 3) A model of sleep hours is estimated. The Jarque-Bera test strongly rejects normality of residuals based on their skewness and kurtosis. This violates assumptions for properties of the OLS estimator,

Uploaded by

damian camargo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

68 views7 pages

Exercises in Class Normality - SOLUTIONS

Uploaded by

damian camargo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 7

Exercises on Normality:

1. We have a database for a sample of 935 workers, with information on the following variables:
SALARY (monthly salary), EDAT (age), EDUCACIO (years of study), CASAT (dummy variable
that is 1 if the individual is married, 0 otherwise), PERMANENCIA (years spent in the same job),
EXPERIENCIA (years of experience) and IQ (index score of intelligence).

1.a. From this information, we estimated the following model1: (Table 1.1):

Model (1)

Table 1.1

From this estimation (and without considering any other outcome), what can you say about the
individual and global statistical significance of the parameters estimated? And what about the
economic significance? From the value of R2, do you think that it is an appropriate model to explain
the salary? (0.5 points)

Without considering any other result, we can see that the p-value of the parameter associated with
the variable " experiencia " is greater than 0.05. This means that we cannot reject the null
hypothesis that this parameter is zero. This implies that in this model we should eliminate the
variable " experiencia" which is likely an irrelevant variable in the model (maybe also for
1
The data used for this estimate are from Wooldridge, J.M. (2006) "Introduction to Econometrics", 2nd Edition.
Ed: Thomson.
multicollinearity). The p-value associated with the parameter "permanencia " is less than 0.05,
which indicates that this variable is indeed relevant in explaining the salary. With respect to the
global significance (of the model) we see that the p-value associated with the F statistic is less than
0.05. Therefore, we reject the null hypothesis that all the estimated parameters are zero: the model
is globally significant.

Regarding the economic significance, we only have to interpret the coefficient associated with "
permanencia." Here we see that their influence is positive: a one more year spent in the company
implies an increase of salary of 10.82 units. Because the other variable is not significant, it is not
necessary to compare the magnitudes of the parameters.

The R2 obtained is very low: 1.7 %. Therefore, a very low percentage of the variability of the salary
is explained by the years spent in the same job. Therefore, it is likely that we have omitted the
inclusion of one or more relevant variables in the model.

1.b. From the estimation of the previous model, the following results are obtained regarding the
error term:

Table 1.2

1. Which are the characteristics of this distribution of the residuals? i.e comment the skewness
and the kurtosis.

The skewenss is higher than 0, meaning that there is positive asymmetry in the distribution.
Indeed the graph shows that residuals tend to be more concentrated on the left.

The Kurtosis is higher than 3, meaning that the distribution is Leptokurtic (it has a higher
peak with respect to the one of a normal distribution).

2. Run the Jarque-Bera normality test of the residuals, indicating the hypothesis that are
compared, calculate the corresponding statistical test and specifying which is the reached
conclusion and the implications for the estimation of the model. (1 point)

The compared hypotheses are:

H0: The error term follows a normal distribution

HA: the error term does not follow a normal distribution

The Jarque-Bera statistic is calculated as follows, based on the values of the asymmetry b1
(1.307232) and kurtosis b2 (6.007389):

For a significance level of 5%, the value of the statistic is greater than the critical value of chi-
square with two degrees of freedom (5,99), so we conclude that we must reject the hypothesis null.
Therefore, the error term does not follow a normal distribution. As we do not know the distribution
of the error term, in general we do not know the distribution of the estimated coefficients and thus
the tests on the parameters (individual, global significance) are not reliable. Also the asymptotic
efficiency is no longer guaranteed since it is based on the ML estimator that do not coincide
anymore with the OLS estimator. Nonetheless in this case the sample size is of more than 900
observations, we can assume applicability of the central limit theorem, which indicates that the t-
statistics converge to a normal distribution, while the F converges to a chi-squared distribution (and
therefore we know the approximate distribution for our tests). Hence we can make inference on the
model even if in small sample it could not be possible.

2. We have a database for a sample of 935 workers, with information on the following variables:
SALARY (monthly salary), EDAT (age), EDUCACIO (years of study), CASAT (dummy variable
that is 1 if the individual is married, 0 otherwise), PERMANENCIA (years spent in the same job),
EXPERIENCIA (years of experience) and IQ (index score of intelligence).

1.a. From this information, we estimated the following model2:

Model (1)

Table 1 shows the results of the Jarque-Bera test applied to the residuals of the OLS estimation of
model (1).

1. In your opinion which is the reason why the mean and the median of the residuals do not
correspond?

Because the distribution is not symmetric. The fact that the median is negative, while the
mean is equal to 0 (as it has to be) means that half of the observations are found before the
zero threshold, so that the distribution would be asymmetric with a positive asymmetry as
also shown by the skeweness index (higher than 0).

2. Indicates the null and alternative hypothesis of the Jarque-Bera test, interpret the result and
explain the implications of the result on the estimates of the model.

2
The data used for this estimate are from Wooldridge, J.M. (2006) "Introduction to Econometrics", 2nd Edition.
Ed: Thomson.
Table 1

Since the value in a table of a Chi-square distribution with two degrees of freedom for a
significance level (α) of 5% is 5,99, we reject the null hypothesis of normality.

3. We specify and estimate the following model:

Model (3)

Table 1. shows the results of the Jarque-Bera test applied to the residuals of the OLS estimation of
model (3).

1. Which kind of distribution seems the residual to follow? What does it suggests in terms of
misspecification?

It seems the residuals are plot as a bimodal distribution, which could indicate a
misspecification due to not capturing a structural change.

2. Are they satisfying the assumption of normality? Comment also the kurtosis and the
skewness.

The JB test is rejecting the null hypothesis of normality. Indeed the kurtosis points to a
platykurtic distribution (lower peak than for a normal distribution), while the skewness is
slightly positive indicating positive asymmetry (more observations concentrated on the left
part of the distribution).

Table 1.
4. Using data for 706 individuals, we estimate by OLS a regression model that explains the number
of daily hours for sleep (SLEEPD) as a function of the following variable: the number of daily
hours used for working (TOTWORKD), the years of education (EDUC), the age (AGE), a dummy
variable that takes the value of 1 in the case the individual is a man and 0 otherwise (MALE), a
dummy variable that takes on a value of 1 if the individual has kids with less than 3 years old and 0
otherwise (YNGKID) and the interaction between these last two variables (MALE*YNGKDS). The
results of this estimation are shown in table 1, while the graphics 1 and 2 show the histogram, some
descriptive statistics and the result of the JB test for the endogenous variable (graph 1) and the OLS
residuals presented in table1:
1. With this information, do you think there is some problem in this estimation as for what
concern the accomplishment to the basic hypothesis of the disturbance term of the OLS
model? Which will be the properties of the OLS estimator?

The basic hypothesis for the disturbance term of the model is that it follows a normal distribution
with zero expected value and variance-covariance matrix equal to sigma^2*I. This last hypothesis
means that the error term is homoskedastic and has no autocorrelation. By considering that these
data are cross-sectional is very unlikely there could be any problem of autocorrelation and
therefore, with the information we have in the graphs, we need to value whether there could be
problem of heteroskedasticity or absence of normality. Graph 2 shows the histogram of the
residuals for the OLS estimation, as well as information on the asymmetry and curtosis of the
distribution, which allows to compute the JB test of normality. The graph shows a residual
distribution quite symmetric (the coefficient for symmetry is 0.159), but quite peaked (the
coefficient for the kurtosis is 5.23), which suggests that probably that the hypotesis of normality
would not be complied. In fact, the JB test statistics (149.11) is clearly higher than the critical value
for the chi-square statistics with 2 dof. Therefore we reject the null hypothesis and conclude that the
residuals do not follow a normal distribution. In this case, the properties of the estimator would be
the usual ones, while for the test statistics, due to the central limit theorem and the presence of a big
sample size, we can rely to their (proxied) asymptotic distribution (the t converges to the normal
distribution, while the F to the chi-square). If the sample would have had lower observations, it
would have been not possible to perform the test statistics since the distribution of the variables
would have been not known.

2. Which could be the reasons of a not (eventually) compliance to the basic hypothesis?

Also, looking at the result of the regression we can think that the no compliance with the hypothesis
of normality could be related to omission of some relevant variable in the model, since the R2 is
very low showing that the explicative capacity of the model is not good. As for heteroskedasticity,
we do not have enough information to make any inference on that.

2015 - George Seber - The Linear Model and Hypothesis A General Unifying Theory - Springer International Publishing
No ratings yet
2015 - George Seber - The Linear Model and Hypothesis A General Unifying Theory - Springer International Publishing
208 pages
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
100% (51)
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
26 pages
Regresion Al 6:12
No ratings yet
Regresion Al 6:12
58 pages
Regresion Al 27:11
No ratings yet
Regresion Al 27:11
55 pages
Business Statistics
No ratings yet
Business Statistics
105 pages
Linear Regression
100% (2)
Linear Regression
28 pages
Econometrics Notes
No ratings yet
Econometrics Notes
95 pages
EC2020 - Tutorial 4 - MLR Inference
No ratings yet
EC2020 - Tutorial 4 - MLR Inference
18 pages
Efm
No ratings yet
Efm
22 pages
Multiple Choice Sample Questions
No ratings yet
Multiple Choice Sample Questions
3 pages
IST172 Problem Set II-2
No ratings yet
IST172 Problem Set II-2
7 pages
Market Manipulation Rules and IPO Underpricing: Huu - Duong@monash - Edu
No ratings yet
Market Manipulation Rules and IPO Underpricing: Huu - Duong@monash - Edu
54 pages
Econometric Answers For Some
No ratings yet
Econometric Answers For Some
5 pages
Sample Paper Mid 2
No ratings yet
Sample Paper Mid 2
10 pages
Problem Set in Statistics
No ratings yet
Problem Set in Statistics
11 pages
Nonparametric Statistics
No ratings yet
Nonparametric Statistics
32 pages
Problem Set 9 WK30 Questions 2023-24
No ratings yet
Problem Set 9 WK30 Questions 2023-24
2 pages
Activity 3 For Statistics Categorical Predictors in Regression
No ratings yet
Activity 3 For Statistics Categorical Predictors in Regression
9 pages
Solution
No ratings yet
Solution
6 pages
E4man PDF
No ratings yet
E4man PDF
184 pages
Stochastic Frontier Analysis Stata
100% (2)
Stochastic Frontier Analysis Stata
48 pages
Econometrics Revision Work
100% (6)
Econometrics Revision Work
6 pages
Eco Exercise 3answer Ans 1
No ratings yet
Eco Exercise 3answer Ans 1
8 pages
6.3 (I) The Turnaround Point Is Given by
No ratings yet
6.3 (I) The Turnaround Point Is Given by
3 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
MacKinnon Critical Values For Cointegration Tests Qed WP 1227
No ratings yet
MacKinnon Critical Values For Cointegration Tests Qed WP 1227
19 pages
Econ107 Assignment 1 Prep
No ratings yet
Econ107 Assignment 1 Prep
9 pages
Chap5 Chris Brooks
No ratings yet
Chap5 Chris Brooks
8 pages
Chap 5 MCQ
No ratings yet
Chap 5 MCQ
12 pages
GMM Resume PDF
100% (1)
GMM Resume PDF
60 pages
Lecture 2: MRA and Inference: Dr. Yundan Gong
No ratings yet
Lecture 2: MRA and Inference: Dr. Yundan Gong
52 pages
MScFE 610 Econometrics - CompiledVideo - Transcripts - M2
No ratings yet
MScFE 610 Econometrics - CompiledVideo - Transcripts - M2
14 pages
Umair Assignment
No ratings yet
Umair Assignment
19 pages
2018 CFA Level 2 Mock Exam Morning
No ratings yet
2018 CFA Level 2 Mock Exam Morning
40 pages
Seminar Questions
No ratings yet
Seminar Questions
5 pages
Tests For Specification Errors in Classical Linear Least-Squares Regression Analysis (Ramsey)
No ratings yet
Tests For Specification Errors in Classical Linear Least-Squares Regression Analysis (Ramsey)
23 pages
Sustainability and Ethical Banking: A Case Study of Punjab National Bank
No ratings yet
Sustainability and Ethical Banking: A Case Study of Punjab National Bank
13 pages
Unbalanced Panel Data PDF
No ratings yet
Unbalanced Panel Data PDF
51 pages
Business Analytics
No ratings yet
Business Analytics
10 pages
Assignment3 05.01.24
No ratings yet
Assignment3 05.01.24
4 pages
Group 11 SB - Assignment 2
No ratings yet
Group 11 SB - Assignment 2
5 pages
ANOVA Model With One Qualitative Variable
No ratings yet
ANOVA Model With One Qualitative Variable
4 pages
In-Semester Test - Proposed Solutions
No ratings yet
In-Semester Test - Proposed Solutions
6 pages
Term Paper Sample PDF
No ratings yet
Term Paper Sample PDF
10 pages
TT220 5 Apr18 Soln-1
No ratings yet
TT220 5 Apr18 Soln-1
2 pages
Data and Methodology
No ratings yet
Data and Methodology
5 pages
CH 12
No ratings yet
CH 12
30 pages
Fixed Effect and Random Effect
No ratings yet
Fixed Effect and Random Effect
17 pages
Malnutrition in The World
No ratings yet
Malnutrition in The World
11 pages
Exercise Before MID
No ratings yet
Exercise Before MID
5 pages
Questions
No ratings yet
Questions
4 pages
ps1 Build
No ratings yet
ps1 Build
4 pages
Chapter 9 Multiple Regression Analysis: The Problem of Inference
No ratings yet
Chapter 9 Multiple Regression Analysis: The Problem of Inference
10 pages
Syllabus BS 4years Economics PDF
No ratings yet
Syllabus BS 4years Economics PDF
114 pages
Microeconomics Production Theory
No ratings yet
Microeconomics Production Theory
29 pages
PS2
No ratings yet
PS2
2 pages
A Study On Impact of Covid-19 On Indian Stock Market
No ratings yet
A Study On Impact of Covid-19 On Indian Stock Market
40 pages
Ansprac 2
No ratings yet
Ansprac 2
6 pages
Simple Linear Regression: Coefficient of Determination
No ratings yet
Simple Linear Regression: Coefficient of Determination
21 pages
Linear Regression Model Before Estimation
No ratings yet
Linear Regression Model Before Estimation
4 pages
Example 2
No ratings yet
Example 2
7 pages
Latihan Soal Utk UAS
No ratings yet
Latihan Soal Utk UAS
5 pages
Instrumental PDF
No ratings yet
Instrumental PDF
69 pages
Ejc t2 Enge
No ratings yet
Ejc t2 Enge
5 pages
Type It Nicely (Latex or Word With Equation Editor) - Upload The Word or PDF File in Blackboard. Scanned Handwritten Problem Sets Are Not Allowed and Will Not Be Graded
No ratings yet
Type It Nicely (Latex or Word With Equation Editor) - Upload The Word or PDF File in Blackboard. Scanned Handwritten Problem Sets Are Not Allowed and Will Not Be Graded
3 pages
ps8 +fall2013
No ratings yet
ps8 +fall2013
6 pages
Due Monday, October 23
No ratings yet
Due Monday, October 23
3 pages
Exercise 1 Multiple Regression Model
No ratings yet
Exercise 1 Multiple Regression Model
6 pages
Molecular Genetics Exam
No ratings yet
Molecular Genetics Exam
3 pages
Example Econometrics
No ratings yet
Example Econometrics
6 pages
Arima Garch R
No ratings yet
Arima Garch R
9 pages
Econ 453 Final Project
No ratings yet
Econ 453 Final Project
15 pages
Organization of Land Surrounding Airports: The Case of The Aerotropolis
No ratings yet
Organization of Land Surrounding Airports: The Case of The Aerotropolis
31 pages
GMM Estimation PDF
No ratings yet
GMM Estimation PDF
35 pages
MSC in Economics, Ub. Econometrics 2. Control Lessons 4 & 5. Surname: Grehl Name: Miriam
No ratings yet
MSC in Economics, Ub. Econometrics 2. Control Lessons 4 & 5. Surname: Grehl Name: Miriam
4 pages
GMM 2
No ratings yet
GMM 2
30 pages
Econometrics With R
No ratings yet
Econometrics With R
56 pages
GLS Theory
No ratings yet
GLS Theory
34 pages
Econometrics Assignment
No ratings yet
Econometrics Assignment
2 pages
Y = Xβ+U VAR (U) =σ · I VAR β σ X X) : 1. No-Spherical Disturbances
No ratings yet
Y = Xβ+U VAR (U) =σ · I VAR β σ X X) : 1. No-Spherical Disturbances
8 pages
Literature Review On Corporate Governance Structure and Performance in Non-Financial Firms in Bangladesh
No ratings yet
Literature Review On Corporate Governance Structure and Performance in Non-Financial Firms in Bangladesh
9 pages
Advanced Quantitative Methods
No ratings yet
Advanced Quantitative Methods
125 pages
Chapter 5 TEst
No ratings yet
Chapter 5 TEst
18 pages
Exercises For Chapter 6 of Vinod's " Hands-On Intermediate Econometrics Using R"
No ratings yet
Exercises For Chapter 6 of Vinod's " Hands-On Intermediate Econometrics Using R"
25 pages
Theoretical Investigation On Determinants of Government-Linked Companies Capital Structure
No ratings yet
Theoretical Investigation On Determinants of Government-Linked Companies Capital Structure
14 pages
Jemal L - 2019 - Effect of Financial Literacy On Financial Performance of Medium Scale Enterprise - Case Study in Hwassa City, Ethiopia
No ratings yet
Jemal L - 2019 - Effect of Financial Literacy On Financial Performance of Medium Scale Enterprise - Case Study in Hwassa City, Ethiopia
7 pages
The Government Auditor Professionalism Determinant I Gusti Ayu Purnamawati
No ratings yet
The Government Auditor Professionalism Determinant I Gusti Ayu Purnamawati
14 pages
Critical Values For Cointegration Tests: February 1990
No ratings yet
Critical Values For Cointegration Tests: February 1990
20 pages
BMC Public Health: Media Suicide-Reports, Internet Use and The Occurrence of Suicides Between 1987 and 2005 in Japan
No ratings yet
BMC Public Health: Media Suicide-Reports, Internet Use and The Occurrence of Suicides Between 1987 and 2005 in Japan
8 pages
Index Introductory Econometrics For Finance
No ratings yet
Index Introductory Econometrics For Finance
7 pages
Exa Eco II Reevaluacion 2013 Javi ENGLISH DEFINITIVO SOLUZIONI
No ratings yet
Exa Eco II Reevaluacion 2013 Javi ENGLISH DEFINITIVO SOLUZIONI
3 pages
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Econometrics: A Simple Introduction
From Everand
Econometrics: A Simple Introduction
K.H. Erickson
3.5/5 (5)
Gale Researcher Guide for: Econometric Models
From Everand
Gale Researcher Guide for: Econometric Models
Chupp
No ratings yet
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Errors of Regression Models: Bite-Size Machine Learning, #1
From Everand
Errors of Regression Models: Bite-Size Machine Learning, #1
Lee Baker
No ratings yet
Fuzzy Logic: Fundamentals and Applications
From Everand
Fuzzy Logic: Fundamentals and Applications
Fouad Sabry
No ratings yet

Exercises in Class Normality - SOLUTIONS

Uploaded by

Exercises in Class Normality - SOLUTIONS

Uploaded by

Exercises on Normality:

The compared hypotheses are:

H0: The error term follows a normal distribution

1.a. From this information, we estimated the following model2:

3. We specify and estimate the following model:

You might also like