Detecting and Resolving Heteroskedasticity in STATA

The document outlines methods for detecting and resolving heteroscedasticity in a regression model using STATA commands. It includes both informal graphical methods and formal statistical tests (the Breusch-Pagan test, Glejser LM test, Harvey-Godfrey test, Park LM test, and White test), all indicating significant evidence of heteroscedasticity. Solutions for addressing heteroscedasticity, including generalized least squares, taking logarithms of variables, and applying robust standard errors, are also discussed.


Detecting Heteroscedasticity

(STATA Commands)
Econometric model used: price = β1 + β2·rooms + β3·sqfeet + u
Data file: hprice.dta
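Assuming hprice.dta is in the current working directory, the data can be loaded with:
. use hprice.dta, clear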

1) Informal or Graphical Method


To generate the scatter plots between the squared residuals and the estimated price, and between the squared residuals and the independent variables (rooms and sqfeet), we first run the regression and generate the estimated price (priceht), the residuals (ut), and the squared residuals (utsq) as follows:
Note: lines that start with * are comments added for guidance to the STATA commands.
*regress price on rooms and sqfeet
. reg price rooms sqfeet
*predict the estimated values of price (priceht)
. predict priceht
*predict the residuals (ut) of our regression model
. predict ut, residual
*generate (g) the squared residuals (utsq) of our regression model
. g utsq=ut^2
*now generate the scatter diagrams to detect heteroscedasticity
. twoway (scatter utsq priceht) (lfit utsq priceht)
Figure 1
. twoway (scatter utsq rooms) (lfit utsq rooms)

Figure 2
. twoway (scatter utsq sqfeet) (lfit utsq sqfeet)

Figure 3
There is clear evidence of heteroscedasticity in all three figures (the variation of the squared residuals is not constant). The rooms variable is the strongest case of heteroscedasticity, while sqfeet is a relatively weak case.

2) Formal Methods
i) Breusch-Pagan (BP) Test: This test regresses the squared residuals (utsq) on the independent variables (rooms and sqfeet in this case). A significant F-value or LM statistic from this auxiliary regression indicates heteroscedasticity.
. reg utsq rooms sqfeet

Since the F-value is highly significant (> 2.5; p < 0.05), there is strong evidence of heteroscedasticity. Another way is to calculate the LM statistic (which follows a χ2 distribution with k − 1 degrees of freedom) by multiplying the number of observations (n) by the R-squared (R2) of the above auxiliary regression.
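As a quick illustrative check of the critical value used below (assuming k − 1 = 2 degrees of freedom, i.e. the two slope coefficients of this auxiliary regression), the 5% cut-off of the χ2 distribution can be displayed directly:
. display invchi2tail(2, 0.05)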
*The following command generates the LM statistic by multiplying the number of observations (e(N)) by the R-squared (e(r2)).
. scalar nR2=e(N)*e(r2)

*The following command generates the 5% critical value of χ2 distribution.


. scalar chi2critical=invchi2tail(e(df_m), 0.05)

*The following command generates the p-value of χ2 distribution.


. scalar p_value=chi2tail(e(df_m), nR2)
* Now list all the scalar values generated previously with the following command.
. scalar list
Interpretation: Since the calculated LM value (10.58) is greater than the χ2 critical value (5.99), we reject the null hypothesis of no heteroscedasticity.
*The BP test can also be implemented directly with the following command after running the main regression:
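A minimal sketch of such a command, assuming STATA's standard post-estimation BP test (by default it tests against the fitted values; the rhs option uses the regressors instead):
. estat hettest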

Since the chi-square value is highly significant (p < 0.05), we reject the null hypothesis that there is no heteroscedasticity in the model.
ii) Glejser LM Test
This test regresses the absolute residuals on the independent variables. Therefore, we need to generate the absolute residuals (absut) with the following command.
. g absut=abs(ut)
*Now run the auxiliary regression as follows:
. reg absut rooms sqfeet

Since the F-value is highly significant (> 2.5; p < 0.05), there is strong evidence of heteroscedasticity. Another way is to calculate the LM statistic (which follows a χ2 distribution with k − 1 degrees of freedom) by multiplying the number of observations (n) by the R-squared (R2) of the above auxiliary regression.
*The following command generates the LM statistic by multiplying the number of observations (e(N)) by the R-squared (e(r2)).
. scalar nR2=e(N)*e(r2)
*The following command generates the 5% critical value of χ2 distribution.
. scalar chi2critical=invchi2tail(e(df_m), 0.05)
*The following command generates the p-value of χ2 distribution.
. scalar p_value=chi2tail(e(df_m), nR2)
* Now list all the scalar values generated previously with the following command.
. scalar list

Interpretation: Since the calculated LM value (13.13) is greater than the χ2 critical value (5.99), we reject the null hypothesis of no heteroscedasticity.
iii) Harvey-Godfrey Test
This test regresses the log of the squared residuals on the independent variables. Therefore, we need to generate the log squared residuals (lutsq) with the following command.
. g lutsq=log(utsq)
*Now run the auxiliary regression as follows:
. reg lutsq rooms sqfeet
Since the F-value is highly significant (> 2.5; p < 0.05), there is strong evidence of heteroscedasticity. Another way is to calculate the LM statistic (which follows a χ2 distribution with k − 1 degrees of freedom) by multiplying the number of observations (n) by the R-squared (R2) of the above auxiliary regression.
*The following command generates the LM statistic by multiplying the number of observations (e(N)) by the R-squared (e(r2)).
. scalar nR2=e(N)*e(r2)
*The following command generates the 5% critical value of χ2 distribution.
. scalar chi2critical=invchi2tail(e(df_m), 0.05)
*The following command generates the p-value of χ2 distribution.
. scalar p_value=chi2tail(e(df_m), nR2)
* Now list all the scalar values generated previously with the following command.
. scalar list

Interpretation: Since the calculated LM value (8.65) is greater than the χ2 critical value (5.99), we reject the null hypothesis of no heteroscedasticity.
iv) Park LM Test
This test regresses the log of the squared residuals on the logs of the independent variables. Therefore, we need to generate the log independent variables with the following commands.
. g lrooms=log(rooms)
. g lsqfeet=log(sqfeet)
*Now run the auxiliary regression as follows:
. reg lutsq lrooms lsqfeet
Since the F-value is highly significant (> 2.5; p < 0.05), there is strong evidence of heteroscedasticity. Another way is to calculate the LM statistic (which follows a χ2 distribution with k − 1 degrees of freedom) by multiplying the number of observations (n) by the R-squared (R2) of the above auxiliary regression.
*The following command generates the LM statistic by multiplying the number of observations (e(N)) by the R-squared (e(r2)).
. scalar nR2=e(N)*e(r2)
*The following command generates the 5% critical value of χ2 distribution.
. scalar chi2critical=invchi2tail(e(df_m), 0.05)
*The following command generates the p-value of χ2 distribution.
. scalar p_value=chi2tail(e(df_m), nR2)
* Now list all the scalar values generated previously with the following command.
. scalar list

Interpretation: Since the calculated LM value (7.41) is greater than the χ2 critical value (5.99), we reject the null hypothesis of no heteroscedasticity.
v) White Test
*Without cross-products: in this case, we regress the squared residuals on the independent variables (IVs) and their squared (quadratic) terms.
* Generate the quadratic terms of IVs
. g rooms2=rooms^2
. g sqfeet2=sqfeet^2
*Now run the auxiliary regression as follows (the squared terms are included because the error variance may be a nonlinear function of the regressors):
. reg utsq rooms sqfeet rooms2 sqfeet2
Since the F-value is highly significant (> 2.5; p < 0.05), there is strong evidence of heteroscedasticity. Another way is to calculate the LM statistic (which follows a χ2 distribution with k − 1 degrees of freedom) by multiplying the number of observations (n) by the R-squared (R2) of the above auxiliary regression.
*The following command generates the LM statistic by multiplying the number of observations (e(N)) by the R-squared (e(r2)).
. scalar nR2=e(N)*e(r2)
*The following command generates the 5% critical value of χ2 distribution.
. scalar chi2critical=invchi2tail(e(df_m), 0.05)
*The following command generates the p-value of χ2 distribution.
. scalar p_value=chi2tail(e(df_m), nR2)
* Now list all the scalar values generated previously with the following command.
. scalar list

Interpretation: Since the calculated LM value (16.20) is greater than the χ2 critical value (5.99), we reject the null hypothesis of no heteroscedasticity.
*With cross-products: in this case, we regress the squared residuals on the independent variables (IVs), their squared (quadratic) terms, and their interaction (cross-product) terms.
*Generate the cross-product term of the IVs. Note that the quadratic terms have already been generated in the first version of the White test.
. g roomsXsqfeet=rooms*sqfeet
*Now run the auxiliary regression as follows:
. reg utsq rooms sqfeet rooms2 sqfeet2 roomsXsqfeet
Since the F-value is highly significant (> 2.5; p < 0.05), there is strong evidence of heteroscedasticity. Another way is to calculate the LM statistic (which follows a χ2 distribution with k − 1 degrees of freedom) by multiplying the number of observations (n) by the R-squared (R2) of the above auxiliary regression.
*The following command generates the LM statistic by multiplying the number of observations (e(N)) by the R-squared (e(r2)).
. scalar nR2=e(N)*e(r2)
*The following command generates the 5% critical value of χ2 distribution.
. scalar chi2critical=invchi2tail(e(df_m), 0.05)
*The following command generates the p-value of χ2 distribution.
. scalar p_value=chi2tail(e(df_m), nR2)
* Now list all the scalar values generated previously with the following command.
. scalar list

Interpretation: Since the calculated LM value (17.23) is greater than the χ2 critical value (5.99), we reject the null hypothesis of no heteroscedasticity.

*The White test can also be implemented directly with the following command after running the main regression:
. estat imtest, white
Resolving Heteroscedasticity
1) Generalized Least Squares (GLS) / Weighted Least Squares (WLS)
In this method, we divide the regression equation through by the standard deviation (or a function) of the independent variable that is mainly causing the heteroscedasticity problem, so that the transformed error has constant variance. For instance, looking at the scatter diagrams above, the rooms variable is the strongest case of heteroscedasticity.
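As an illustration of the logic (a sketch, assuming for example that Var(u_i) = σ2·rooms_i), dividing the whole equation by √rooms_i gives
price_i/√rooms_i = β1·(1/√rooms_i) + β2·√rooms_i + β3·(sqfeet_i/√rooms_i) + u_i/√rooms_i,
and the transformed error u_i/√rooms_i now has constant variance σ2. In STATA this reweighting is done through the aweight option, whose analytic weights are treated as inversely proportional to the variance of an observation.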
*Run the following regression command, using rooms as the analytic weight (aweight) in the regression equation. The scatter diagrams tell us which regressor is causing the heteroscedasticity, and that is the variable whose weights STATA will use.
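A minimal sketch of such a command (the exact weight specification is an assumption; here rooms is supplied directly as the analytic weight, as described above):
. reg price rooms sqfeet [aweight=rooms]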

*Now re-test for heteroscedasticity with the BP test to check whether the heteroscedasticity has been removed. If the test is insignificant, the heteroscedasticity has been removed. Note that WLS/GLS can be applied only if we know the source of the heteroscedasticity (i.e. which variable is causing it).
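A minimal manual re-check (a sketch using hypothetical variable names ut_w and ut_wsq for the residuals of the weighted regression):
. predict ut_w, residual
. g ut_wsq=ut_w^2
. reg ut_wsq rooms sqfeet
. scalar nR2_w=e(N)*e(r2)
*nR2_w follows a χ2 distribution, as before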
Interpretation: Since the chi-square value is still significant, the heteroscedasticity problem remains in the data.
Note: You can try the same method with the priceht and sqfeet variables.
2) Taking Logs of the Variables
. g lprice=log(price)
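A minimal sketch of the re-estimated model and its BP test, assuming the dependent variable is simply replaced by its log (the log regressors lrooms and lsqfeet generated for the Park test could equally be used):
. reg lprice rooms sqfeet
. estat hettest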

Since the chi-square value of the BP test is now insignificant, the heteroscedasticity problem has been removed from the given model.
3) Applying White's Heteroscedasticity-consistent (Robust) Standard Errors
Remember that this method only corrects the standard errors for the heteroscedasticity problem; it does not remove the heteroscedasticity from the data. It is typically used when nothing else works.
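A minimal sketch, assuming robust standard errors are requested directly in the main regression:
. reg price rooms sqfeet, vce(robust)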
