EC229 Part II Answers

1. Explain the nature, causes, and effects of heteroscedasticity, and discuss the remedies for this problem. How does it violate the Gauss-Markov conditions? What are its implications for the parameter estimates obtained by ordinary least squares?

Heteroscedasticity refers to a situation in which the variance of the residuals is not constant over the range of measured values; it arises when the linear regression assumption of homoscedasticity (constant error variance) is violated.

The occurrence of heteroscedasticity can be written mathematically as

$\operatorname{var}(u_i \mid X_i) = \sigma_i^2, \quad i = 1, \dots, n,$

where the error variance $\sigma_i^2$ differs across observations rather than being a single constant $\sigma^2$.

Diagrammatically, this appears as a residual plot in which the spread of the residuals changes systematically with X rather than staying roughly constant (diagram not reproduced here).
Turning to the causes of heteroscedasticity, six are commonly cited. The first is a violation of the classical linear regression assumption that the model is correctly specified: when this assumption is violated and there is omitted variable bias, heteroscedasticity can occur.

A second cause is the presence of outliers. An outlier is an observation that deviates markedly from the rest of the dataset, taking an unusually high or low value. The inclusion or exclusion of such observations, especially when the sample size is small, can alter the results of the regression analysis and produce unequal error variances.

A third cause follows from error-learning models. As people gain experience, for example while conducting research, their errors become smaller and more consistent, so the error variance decreases over time, producing heteroscedasticity.

Furthermore, as data collection techniques improve, the error variance is likely to decrease. For example, banks with advanced data processing equipment are likely to make fewer errors.

A fifth cause is skewness in the data. If the distribution of the dependent variable is asymmetric, heteroscedasticity is likely to be present.

In addition to the causes stated above, heteroscedasticity can arise from incorrect data transformation or an incorrect functional form.

Moving on to the effects of heteroscedasticity, the first point to consider is the Gauss-Markov theorem. This theorem sets out the optimal properties that ordinary least squares estimators possess under the classical assumptions, summarised by the acronym BLUE: an estimator is BLUE when it is linear, unbiased, and has minimum variance, so that it is efficient. Under heteroscedasticity the Gauss-Markov conditions are violated because the estimators are no longer efficient; the error variance is not constant across the data, so the OLS variances are no longer minimal. The estimated parameters do, however, remain unbiased.
In addition to the point above, heteroscedasticity produces biased standard errors. This leads to unreliable confidence intervals and hypothesis tests, which in turn leads to incorrect conclusions about the significance of variables.

Furthermore, because the standard errors are biased, the usual F-tests and t-tests of significance are no longer valid, increasing the risk of Type I and Type II errors (false positives and false negatives).

Heteroscedasticity can also inflate the R-squared, making the model appear to fit the data better than it actually does.

Furthermore, when heteroscedasticity arises because the model is misspecified, the estimates of the regression coefficients will themselves be misleading.

Lastly, heteroscedasticity makes comparisons across datasets difficult, as differences in error variance complicate the comparison of goodness-of-fit measures.

There are a number of methods for resolving heteroscedasticity. The first remedy is White's correction, a simple method developed by Halbert White and used when the population error variances are unknown. White's correction provides heteroscedasticity-robust standard errors for the regression coefficients, which allow valid statistical inference even in the presence of heteroscedasticity.
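As an illustration only (not part of the original answer), the following Python sketch uses the statsmodels library with simulated, hypothetical data to show how White-type robust standard errors are obtained.

```python
# Minimal sketch: White (heteroscedasticity-robust) standard errors in statsmodels.
# The data are simulated purely for illustration.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 200
x = rng.uniform(1, 10, n)
u = rng.normal(0, 0.5 * x)        # error spread grows with x: heteroscedastic errors
y = 2 + 3 * x + u

X = sm.add_constant(x)
ols_res = sm.OLS(y, X).fit()                   # conventional standard errors
robust_res = sm.OLS(y, X).fit(cov_type="HC0")  # White-corrected (robust) standard errors

print(ols_res.bse)     # may misstate the true sampling variability
print(robust_res.bse)  # valid for inference under heteroscedasticity
```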

Weighted least squares is another efficient solution: it assigns each observation a weight inversely proportional to the variance of its error term, which stabilises the variance and improves the efficiency of the estimates. It is applied when the form of the error variance is known.
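The sketch below (again simulated, hypothetical data) shows weighted least squares in statsmodels, assuming for illustration that the error variance is proportional to x², so that weights of 1/x² are appropriate.

```python
# Hedged sketch: weighted least squares with weights inversely proportional to
# the assumed error variance (here taken to be proportional to x**2).
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 200
x = rng.uniform(1, 10, n)
y = 2 + 3 * x + rng.normal(0, 0.5 * x)   # heteroscedastic errors

X = sm.add_constant(x)
wls_res = sm.WLS(y, X, weights=1.0 / x**2).fit()  # down-weight the noisier observations
print(wls_res.bse)   # typically smaller than the OLS standard errors when the weights are appropriate
```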

Generalized least squares (GLS) is a further remedy: it is an extension of ordinary least squares (OLS) that accounts for heteroscedasticity by transforming the model so that the transformed error terms have constant variance.
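To make the transformation concrete, a textbook-style sketch (not from the original answer) for a simple bivariate model with known error variances $\sigma_i^2$ is:

\[
Y_i = \beta_1 + \beta_2 X_i + u_i
\;\Longrightarrow\;
\frac{Y_i}{\sigma_i} = \beta_1 \frac{1}{\sigma_i} + \beta_2 \frac{X_i}{\sigma_i} + \frac{u_i}{\sigma_i},
\qquad
\operatorname{var}\!\left(\frac{u_i}{\sigma_i}\right) = \frac{\sigma_i^2}{\sigma_i^2} = 1,
\]

so OLS applied to the transformed variables (which amounts to weighted least squares with weights $1/\sigma_i^2$) has homoscedastic errors and is efficient.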
Furthermore, since one of the causes of heteroscedasticity is violation of the correct-specification assumption, model misspecification can be remedied by ensuring that the model is correctly specified, including all relevant variables and interactions.

Lastly, when heteroscedasticity is due to differences between subgroups within the data, analysing these subgroups separately can help address the issue.

In summary, these are the causes, effects, and remedies of heteroscedasticity; securing homoscedasticity is crucial for valid regression analysis.
2. Explain the nature, causes, and effects of autocorrelation, and discuss the remedies for this problem. How does it violate the Gauss-Markov conditions? What are its implications for the parameter estimates obtained by ordinary least squares?

Autocorrelation refers to a situation in which the error term in one period is correlated with the error terms in previous periods. It results from violating the assumption of no serial correlation between the disturbances, and it usually occurs with time series data.

Autocorrelation can be stated symbolically as $E(u_i u_j) \neq 0$ for $i \neq j$; in the first-order case the errors follow $u_t = \rho u_{t-1} + \varepsilon_t$ with $\rho \neq 0$.

Causes of Autocorrelation:

(i) Inertia: an important feature of most economic time series is inertia, or sluggishness. As is well known, time series such as GNP, price indexes, production, employment and unemployment exhibit (business) cycles.
(ii) Specification bias (omitted variable bias)
(iii) Lags
(iv) Cobweb phenomenon
(v) Non-stationarity

Effects of Autocorrelation:

- Autocorrelation violates the Gauss-Markov condition of efficiency, as the variance of the estimators is no longer at its minimum value, although the estimators remain unbiased.

- The residual variance underestimates the true error variance. This results in R² being overestimated.

- Because of this inefficiency, the estimators have larger variances and hence larger standard errors. This makes the usual significance tests such as the t-test and F-test invalid and gives misleading results; for example, a computed t-statistic may fall below 1.96, suggesting that a variable is statistically insignificant when this is not true.

Remedies of Autocorrelation

In the case of pure autocorrelation (autocorrelation that is not a result of model misspecification):

(i) Use an appropriate transformation of the original model.
(ii) Newey-West standard errors. These are used with large samples to obtain robust standard errors that correct for autocorrelation and heteroscedasticity, so that valid inferences can be drawn.
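As an illustrative sketch (not in the original answer), Newey-West standard errors can be requested in statsmodels as HAC standard errors; the simulated AR(1)-type errors and the lag length of 4 below are assumptions made only for the example.

```python
# Hedged sketch: Newey-West (HAC) standard errors for a regression with
# autocorrelated errors. Data are simulated for illustration only.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
T = 300
x = rng.normal(size=T)
u = np.zeros(T)
for t in range(1, T):
    u[t] = 0.7 * u[t - 1] + rng.normal()   # AR(1) errors: serially correlated
y = 1 + 2 * x + u

X = sm.add_constant(x)
hac_res = sm.OLS(y, X).fit(cov_type="HAC", cov_kwds={"maxlags": 4})
print(hac_res.bse)   # standard errors robust to autocorrelation (and heteroscedasticity)
```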
3. What is the difference between positive and negative first-order autocorrelation?

First-order autocorrelation refers to the correlation between a variable and its immediate previous
value. Here’s the difference between positive and negative first-order autocorrelation:

Positive First-Order Autocorrelation


Definition: This occurs when positive error terms in one period are likely to be followed by
positive error terms in the next period, and similarly, negative error terms are likely to be
followed by negative error terms. E.g. Positive Autocorrelation: If sales in one month are high,
sales in the next month are also likely to be high (and vice versa).

Pattern: The residuals show long runs of the same sign, rising and falling together over successive periods, indicating a tendency for errors to maintain their direction over time.

Implication: In time series data, this often suggests a momentum effect, where the process
shows persistence or trend-following behavior.

Negative First-Order Autocorrelation:


Definition: This occurs when positive error terms in one period are likely to be followed by
negative error terms in the next period, and vice versa. E.g. Negative Autocorrelation: If a stock's
price goes up one day, it is likely to go down the next day (and vice versa).

Pattern: The residuals alternate signs more frequently, indicating a tendency for errors to
reverse their direction from one period to the next.

Implication: This often suggests a mean-reverting process, where deviations from the mean are
corrected in subsequent periods.

Identifying Positive and Negative Autocorrelation through the Durbin-Watson Test:

- Positive autocorrelation: a Durbin-Watson statistic close to 0 indicates positive autocorrelation.

- Negative autocorrelation: a Durbin-Watson statistic close to 4 indicates negative autocorrelation.

- No autocorrelation: a Durbin-Watson statistic around 2 indicates no autocorrelation.

Understanding the nature of autocorrelation is crucial for selecting appropriate modelling techniques and ensuring the validity of statistical inferences in time series analysis.
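As a small illustration (not from the original answer), the Durbin-Watson statistic can be computed from OLS residuals with statsmodels; the positively autocorrelated series below is simulated and purely hypothetical.

```python
# Hedged sketch: computing the Durbin-Watson statistic from regression residuals.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.stattools import durbin_watson

rng = np.random.default_rng(3)
T = 200
x = rng.normal(size=T)
u = np.zeros(T)
for t in range(1, T):
    u[t] = 0.6 * u[t - 1] + rng.normal()   # positive first-order autocorrelation
y = 0.5 + 1.5 * x + u

res = sm.OLS(y, sm.add_constant(x)).fit()
print(durbin_watson(res.resid))   # a value well below 2 signals positive autocorrelation
```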
4. Explain the difference between the following pairs of terms in the context of binary
choice models: (i) coefficient and marginal effect, (ii) R2 and likelihood ratio index,
(iii) predicted Y and observed Y

i) Coefficient and Marginal Effect

Coefficient
- Definition: In binary choice models (e.g., logistic regression, probit models), the coefficient represents the change in the log-odds (for logistic regression) or in the latent variable (for probit models) for a one-unit change in the predictor variable.
- Interpretation: Coefficients in these models are not directly interpretable in terms of the probability of the binary outcome, because they affect the latent variable or log-odds, not the probability itself.

Marginal Effect
- Definition: The marginal effect measures the change in the probability of the binary outcome for a one-unit change in the predictor variable, holding other variables constant.
- Interpretation: Marginal effects provide a more intuitive understanding of the impact of a predictor on the probability of the outcome. They are often computed at the mean values of the predictors or averaged over the sample.
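To make the distinction concrete, here is a minimal sketch (hypothetical simulated data, not from the original answers) using a statsmodels logit model: the raw coefficients are on the log-odds scale, while get_margeff() reports average marginal effects on the probability scale.

```python
# Hedged sketch: logit coefficients (log-odds scale) versus marginal effects
# (probability scale). Simulated data, for illustration only.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(4)
n = 500
x = rng.normal(size=n)
p_true = 1 / (1 + np.exp(-(-0.5 + 1.2 * x)))   # true probabilities
y = rng.binomial(1, p_true)

X = sm.add_constant(x)
logit_res = sm.Logit(y, X).fit(disp=0)

print(logit_res.params)                   # coefficients: change in log-odds per unit of x
print(logit_res.get_margeff().summary())  # average marginal effects on P(y = 1)
```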

ii) R² and Likelihood Ratio Index

R²
- Definition: In linear regression, R² measures the proportion of the variance in the dependent variable that is predictable from the independent variables.
- Interpretation: Higher values of R² indicate a better fit of the model to the data. However, in binary choice models a direct analogue of R² is not typically used, because the dependent variable is binary.

Likelihood Ratio Index
- Definition: This is a measure used in the context of binary choice models (e.g., logistic regression). It is calculated as 1 - (log-likelihood of the fitted model / log-likelihood of the null model), where the null model contains only an intercept.
- Interpretation: The likelihood ratio index ranges from 0 to 1, with higher values indicating a better fit. Unlike R² in linear regression, it does not directly measure the proportion of variance explained but gives an indication of model fit relative to the null model.
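The sketch below (simulated data, illustrative only) computes the likelihood ratio index directly from the fitted and null log-likelihoods of a statsmodels logit model; statsmodels also reports the same quantity as McFadden's pseudo R-squared.

```python
# Hedged sketch: likelihood ratio index = 1 - lnL(fitted model) / lnL(null model).
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(5)
n = 500
x = rng.normal(size=n)
y = rng.binomial(1, 1 / (1 + np.exp(-(0.3 + x))))

res = sm.Logit(y, sm.add_constant(x)).fit(disp=0)
lri = 1 - res.llf / res.llnull     # fitted vs intercept-only log-likelihood
print(lri, res.prsquared)          # prsquared is the same McFadden measure
```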
iii) Predicted Y and Observed Y

Predicted Y
- Definition: In binary choice models, predicted Y refers to the predicted probability that the binary outcome equals 1 for a given set of predictor values.
- Interpretation: Predicted probabilities range from 0 to 1 and indicate the likelihood of the event occurring (e.g., the likelihood of a customer making a purchase).

Observed Y
- Definition: Observed Y refers to the actual binary outcome observed in the data, typically coded as 0 or 1.
- Interpretation: Observed Y values are the ground truth against which the predictions are compared to assess the accuracy and performance of the model.
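A short sketch (simulated data, illustrative only) of how predicted and observed Y are compared in practice; the 0.5 classification cutoff is an assumption for the example, not a rule from the original answer.

```python
# Hedged sketch: comparing predicted probabilities with observed 0/1 outcomes.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(6)
n = 400
x = rng.normal(size=n)
y = rng.binomial(1, 1 / (1 + np.exp(-(0.2 + 1.5 * x))))   # observed Y (0 or 1)

X = sm.add_constant(x)
res = sm.Logit(y, X).fit(disp=0)

p_hat = res.predict(X)               # predicted Y: probabilities between 0 and 1
y_hat = (p_hat >= 0.5).astype(int)   # classify with an assumed 0.5 cutoff
print((y_hat == y).mean())           # share of observations predicted correctly
```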

Summary

Coefficient vs. Marginal Effect: Coefficients in binary choice models affect the latent variable
or log-odds, while marginal effects describe changes in the probability of the outcome.

R² vs. Likelihood Ratio Index: R² measures variance explained in linear models, while the
likelihood ratio index assesses model fit in binary choice models.

Predicted Y vs. Observed Y: Predicted Y represents model-derived probabilities, while observed Y is the actual outcome in the dataset.
5. Using relevant example(s) show how and explain why the linear probability model is
considered an inappropriate model for the estimation of dummy dependent
variables.

6. A second-year student investigates the factors influencing graduating from high school. She defines a variable GRAD that is equal to 1 for those individuals who graduated, and 0 for those who dropped out, and regresses it on ASVABC, the composite cognitive ability test score. The regression output (not reproduced here) shows the result of fitting this linear probability model.

a) Interpret the regression results


ASVABC (the test score) is scaled so that it has a mean of zero and its units are standard deviations. A one-unit (one standard deviation) increase in the ASVABC score on average increases the probability of graduating by 0.106, that is, by 10.6 percentage points.
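Since the original STATA output is not reproduced here, the following sketch only illustrates how such a linear probability model would be estimated and read; the small data frame is a hypothetical placeholder, although the variable names GRAD and ASVABC come from the question.

```python
# Hedged sketch: estimating the linear probability model GRAD = b1 + b2*ASVABC + u by OLS.
# The data frame below is a hypothetical placeholder, not the student's actual dataset.
import pandas as pd
import statsmodels.formula.api as smf

data = pd.DataFrame({
    "GRAD":   [1, 1, 0, 1, 0, 1, 1, 0, 1, 1],
    "ASVABC": [0.5, 1.2, -0.8, 0.1, -1.5, 0.9, 0.3, -0.4, 1.8, -0.2],
})

# Robust (HC1) standard errors, since LPM errors are heteroscedastic by construction.
lpm = smf.ols("GRAD ~ ASVABC", data=data).fit(cov_type="HC1")
print(lpm.params["ASVABC"])   # slope: estimated change in P(GRAD = 1) per one-SD rise in ASVABC
```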

b) Comment on the significance of the variable estimates and the overall significance.

Based on the results shown in the STATA output, both parameters are significant:

ASVABC is statistically significant because, according to the t-test, the calculated t-statistic exceeds the critical value.

The intercept is also statistically significant, since its calculated t-statistic likewise exceeds the critical value.

The overall significance of the regression model, assessed through the F-test, is confirmed because the calculated F-statistic exceeds the critical value.
