Lecture 14 - Panel data models

Uploaded by

Gia Bảo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views

Lecture 14 - Panel data models

Uploaded by

Gia Bảo

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 40

PANEL DATA

MODELS
Nguyen Quang
[email protected]
1 - Panel data
2 - Pooled OLS estimator
3 - Fixed effects model
4 - Random effects model
5 - FE vs RE: Hausman test
6 - Between group estimator

COVERED IN
THIS LECTURE
PANEL DATA
PANEL DATA

• Cross-sectional data: observations from MANY units at a SINGLE time point.

𝑦! with 𝑖 = 1, … 𝑁
• Time-series data: observations from a SINGLE unit over MULTIPLE time periods
𝑦" with 𝑡 = 1, … , 𝑇
• Panel data: observations from MANY units over SEVERAL time periods
𝑦!" with 𝑖 = 1, … , 𝑁 and 𝑡 = 1, . . , 𝑇
Advantages

• More observations
• More variability
• Less collinearity between regressors
PANEL • Control of individual heterogeneity
• Reduce biases
DATA
Disadvantages

• Require more efforts collecting data

• Selectivity biases
PANEL DATA MODEL
REQUIRES WITHIN GROUP
VARIATION
• Panel data model (FE) requires variation
within group
• An example where panel data does not
work
𝑦!" = 𝛼 + 𝛽𝑥!" + 𝑢
• 𝑦!" is export volume from VN to country 𝑖
in year 𝑡
• 𝑥!" is the distance from VN to country 𝑖 in
year 𝑡
• As distance from VN to country 𝑖 does not
change from year to year, it can’t be
included in the fixed effect model.
Viet Nam Provincial data on
• rgdp: provincial GDP (mil. VND)
• labfo: number of laborers of provinces (1000
persons)
EXAMPLE
• rinvest: provincial gross investment (mil. VND)
DATA • pci: 100-point scaled composite index measuring
and ranking Vietnam’s provinces based on their
overall economic governance quality
• Data for 58 provinces, 5 years (2007-2011)
province provincecode year rgdp labfo rinvest pci
BRVT 11 2010 1.30E+08 531.1 2.60E+07 60.5507
BRVT 11 2009 1.30E+08 513 2.00E+07 64.2287
BRVT 11 2008 1.70E+08 519 1.80E+07 60.5126
BRVT 11 2011 1.40E+08 553.9 2.30E+07 66.13
BRVT 11 2007 1.20E+08 497.6 1.30E+07 65.6337
EXAMPLE Ca Mau 12 2009 1.90E+07 675.6 8.20E+06 61.0756
DATA Ca Mau
Ca Mau
12
12
2010
2007
2.20E+07
1.40E+07
677.1
625.5
9.70E+06
1.20E+07
53.5729
56.194
Ca Mau 12 2008 1.50E+07 654.1 8.70E+06 58.6385
Ca Mau 12 2011 2.40E+07 684.3 1.20E+07 59.43
Can Tho 13 2010 7.20E+07 680.7 1.60E+07 62.4605
Can Tho 13 2008 5.50E+07 684.4 1.00E+07 56.32
Can Tho 13 2011 9.00E+07 690.7 1.60E+07 62.66
Can Tho 13 2007 4.40E+07 680.6 9.70E+06 61.762
Can Tho 13 2009 6.00E+07 656 1.50E+07 52.3378
SUMMARY STATISTICS
MODEL
SPECIFICATION
• In this lecture we will consider the
specification

𝑦!" = 𝛼 + 𝛽𝑋!" + 𝑢
• 𝑦!" is the logarithm of real GDP of
province 𝑖 in year 𝑡
• 𝑋!" includes
• Logarithm of the labor force
• Logarithm of real investment
• Provincial competitiveness index (PCI)
POOLED OLS ESTIMATOR
POOLED OLS ESTIMATOR
• Data of all groups are pooled together
• No difference between groups

𝑦!" = 𝛼 + 𝛽𝑋!" + 𝑢!"

• Coefficients are identical for all groups.
• Some assumptions:
• The error term is not autocorrelated and homoscedastic
• 𝑋 is nonstochastic and not correlated with 𝑢 (𝑋 is strictly exogenous)
THE
POOLED
OLS IN R
POOLED OLS WITH ROBUST STANDARD ERRORS
CLUSTERED STANDARD ERRORS

• The Pooled OLS estimator (and other panel data models) assumes no
correlation between residuals of the same group (no autocorrelation)
• If we relax the assumption, then
cov 𝑢!" , 𝑢!# ≠ 0
• We then have heteroskedasticity and autocorrelation
• If this happens, the Pooled OLS estimator is still consistent, but the standard
errors are incorrect.
• In this case we may use the clustered robust standard errors.
POOLED OLS WITH
CLUSTERED STANDARD ERRORS
POOLED
OLS
USING
PACKAGE
PLM
FIXED EFFECTS MODEL
Within group estimator
THE FIXED EFFECTS MODEL
• The model
𝑦!" = 𝛼! + 𝛽𝑋!" + 𝑢!"
• The slopes are still identical for all groups.
• But each group has a different intercept.
• These intercepts are called fixed effects, which capture individual heterogeneity.
• Two estimators:
• Fixed effects estimator (within group)
• Least square dummy variable estimator (LSDV)
• Note: these are the two ways of estimating the FE model, not two different models.
WITHIN GROUP FIXED EFFECTS ESTIMATOR
• The model
𝑦!" = 𝛼! + 𝛽𝑋!" + 𝑢!" (1)
• We need to allow for the intercept to vary across groups.
• Now take the average of variables across time, note that the parameters are time-invariant
𝑦1!" = 𝛼! + 𝛽𝑋1!" + 𝑢1 !" (2)
# #
where 𝑦1!" = $ ∑$"%# 𝑦!" and 𝑋1!" = $ ∑$"%# 𝑋!"
• Then subtract (2) from (1)
𝑦!" − 𝑦1!" = 𝛼! − 𝛼! + 𝛽 𝑋!" − 𝑋1!" + 𝑢!" − 𝑢1 !"
• Which results in
𝑦4!" = 𝛽𝑋5!" + 𝑢4 !"
• With this way we can estimate 𝛽 but not the fixed effects.
WITHIN
GROUP FIXED
EFFECTS
ESTIMATOR
WITHIN GROUP FIXED EFFECTS ESTIMATOR
robust standard errors
WITHIN GROUP FIXED EFFECTS ESTIMATOR
clustered standard errors
LEAST SQUARES DUMMY VARIABLE ESTIMATOR

• For the model: 𝑦!" = 𝛼! + 𝛽𝑋!" + 𝑢!"

• We can estimate the fixed effects and 𝛽 by introducing the dummy variables
1 if 𝑗 = 𝑖
𝐷&! =
0 otherwise
• We can then estimate the following model using OLS
'
𝑦!" = C 𝛼& 𝐷&! + 𝛽𝑋!" + 𝑢!"
&%#
• This is the least squares dummy variable (LSDV) estimator.
• The LSDV slope estimates are identical to the within group FE estimates.
• However, LSDV also estimates the fixed effects.
• On the other hand, LSDV is not efficient when 𝑁 is large.
LSDV
FIXED
EFFECTS
ESTIMATOR
some factors omitted
LSDV WITH
ROBUST STANDARD ERRORS
LSDV WITH
CLUSTERED STANDARD ERRORS
• The model now includes time fixed effects

' )
LSDV TWO- 𝑦!" = 0 𝛼$ 𝐷$! + 0 𝛾( 𝐷(" + 𝛽𝑋!" + 𝑢!"
WAY FIXED $%& (%&

EFFECTS Where:
1 if 𝑔 = 𝑡
𝐷(" =
MODEL 0 otherwise
LSDV TWO-
WAY
FIXED
EFFECTS some factors omitted

MODEL
• The random effects model is presented by
𝑦!" = 𝛼 + 𝛽𝑋!" + 𝑢!"
• The error component now includes
𝑢!" = 𝜇! + 𝜖!"
RANDOM
• 𝜇! ~𝑁 0, 𝜎*+ the individual specific random
EFFECTS component
MODEL • 𝜖!" ~𝑁 0, 𝜎,+ the idiosyncratic disturbance
• In the random effects model, regressors can be
time-invariant.
• Estimation method: generalized least squares
RANDOM
EFFECTS
MODEL
RANDOM EFFECT MODEL
clustered standard errors
RANDOM VS.
FIXED EFFECTS
• The main difference is that the individual effects
RANDOM are assumed fixed in FE and random in RE.
• The random effects model is preferred for
VS. FIXED • The fixed effects vary over time.
• It is more efficient (higher degree of
EFFECTS freedom)
• It allows time-invariant regressors
HAUSMAN TEST

• Null hypothesis: both RE and FE estimates are consistent

• Alternative hypothesis: RE estimates are inconsistent
• Test statistics
𝐻 = 𝛽-. − 𝛽/. 0 𝑉 𝛽-. − 𝑉 𝛽/. 1& 𝛽-. − 𝛽/.
which follows 𝜒 + with df = number of regressors.
HAUSMAN
TEST IN R
• We can test only when both models have the
same set of regressors.
• If we include time-invariant regressor in the
RE model (which is not possible in FE
NOTES ON model), then Hausman test fails.
HAUSMAN • Hausman test check whether the two estimates
are equal.
TEST • If we reject the null hypothesis, the FE estimates
are consistent and RE model is mis-specified.
• If any regressor is correlated with the error
term, both estimates are biased.
BETWEEN
ESTIMATOR
• The between estimator analyzes the cross-
sectional variation.
• Suppose we have a data set with 𝑁 units and 𝑇
periods of time.
• Average over time all variables
BETWEEN
ESTIMATOR 𝑦H!" = 𝛼! + 𝛽 𝑋H!" + 𝑢H !"
• Where
#
• 𝑦1!" = $ ∑$"%# 𝑦!"
# $
1
• 𝑋!" = $ ∑"%# 𝑋!"
• We then have a data set of 𝑁 observations.
BETWEEN
ESTIMATOR

Econometric S Cheat Sheet
No ratings yet
Econometric S Cheat Sheet
3 pages
Lecture No 10
0% (1)
Lecture No 10
28 pages
Fixed Effects, Random Effects Model Cheat Sheet
100% (1)
Fixed Effects, Random Effects Model Cheat Sheet
4 pages
slides-6-iu
No ratings yet
slides-6-iu
38 pages
Fere
No ratings yet
Fere
46 pages
Topic 9: Panel Data Models
No ratings yet
Topic 9: Panel Data Models
46 pages
Panel Data Assign
No ratings yet
Panel Data Assign
19 pages
Introduction To Panel Data
No ratings yet
Introduction To Panel Data
20 pages
Fem & Rem
No ratings yet
Fem & Rem
20 pages
Topic 6 - Static Panel Data
No ratings yet
Topic 6 - Static Panel Data
21 pages
Panel Data Models
No ratings yet
Panel Data Models
25 pages
Some Basics For Panel Data Analysis
No ratings yet
Some Basics For Panel Data Analysis
21 pages
Chapter 5
No ratings yet
Chapter 5
25 pages
1170_10045_136696 (2)
No ratings yet
1170_10045_136696 (2)
61 pages
Topic 4 Panel Regression Model Wble
No ratings yet
Topic 4 Panel Regression Model Wble
34 pages
C6 - English
No ratings yet
C6 - English
18 pages
Ch11_slides_PA April 2024 (2)
No ratings yet
Ch11_slides_PA April 2024 (2)
27 pages
AE 2023 Lecture10
No ratings yet
AE 2023 Lecture10
40 pages
Intro Panel Data by Kurt-Univ Basel
No ratings yet
Intro Panel Data by Kurt-Univ Basel
8 pages
Panel Data: Fixed and Random Effects: I1 0 I1 0 I I1
No ratings yet
Panel Data: Fixed and Random Effects: I1 0 I1 0 I I1
8 pages
Fixed and Random Effects: Jos Elkink
No ratings yet
Fixed and Random Effects: Jos Elkink
121 pages
14 Panel Data Models
No ratings yet
14 Panel Data Models
31 pages
PLM
No ratings yet
PLM
51 pages
Week 3-1
No ratings yet
Week 3-1
25 pages
Plm
No ratings yet
Plm
51 pages
Panel Data Analysis
No ratings yet
Panel Data Analysis
61 pages
Chapter 2_Panel Data Regression
No ratings yet
Chapter 2_Panel Data Regression
30 pages
6 panelmf
No ratings yet
6 panelmf
18 pages
Chapter 2 Slides Handout
No ratings yet
Chapter 2 Slides Handout
48 pages
Note On Panel Data
No ratings yet
Note On Panel Data
19 pages
Lectute 2 - Panel Data Regression
No ratings yet
Lectute 2 - Panel Data Regression
30 pages
Panel Ecmiic2
No ratings yet
Panel Ecmiic2
57 pages
Lecture Series 1 Linear Random and Fixed Effect Models and Their (Less) Recent Extensions
No ratings yet
Lecture Series 1 Linear Random and Fixed Effect Models and Their (Less) Recent Extensions
62 pages
panel2up
No ratings yet
panel2up
9 pages
1669594424_72__UE_panelv3
No ratings yet
1669594424_72__UE_panelv3
35 pages
Panel Data Econometrics in R: The PLM Package: Yves Croissant Giovanni Millo
No ratings yet
Panel Data Econometrics in R: The PLM Package: Yves Croissant Giovanni Millo
51 pages
Croissant y Millo, Panel Data Econometrics
100% (1)
Croissant y Millo, Panel Data Econometrics
52 pages
Econometris II - 4
No ratings yet
Econometris II - 4
26 pages
Block 3
No ratings yet
Block 3
105 pages
econometrics-cheat-sheet
No ratings yet
econometrics-cheat-sheet
4 pages
Panel Data Analysi
No ratings yet
Panel Data Analysi
27 pages
2025 Static Panels
No ratings yet
2025 Static Panels
19 pages
Materi Teknik Data Panel
No ratings yet
Materi Teknik Data Panel
30 pages
Panel Cookbook
No ratings yet
Panel Cookbook
98 pages
Panel Data
No ratings yet
Panel Data
9 pages
Chapter_14
No ratings yet
Chapter_14
22 pages
ARM 2nd Mid
No ratings yet
ARM 2nd Mid
13 pages
Panel Data
100% (1)
Panel Data
13 pages
Week 1
No ratings yet
Week 1
48 pages
Panel Data Analysis
No ratings yet
Panel Data Analysis
9 pages
Panel Data Regression Models
100% (1)
Panel Data Regression Models
25 pages
Panel Data For Learing
100% (2)
Panel Data For Learing
34 pages
Panel Guidelines
No ratings yet
Panel Guidelines
3 pages
Fixed and Random Effects
No ratings yet
Fixed and Random Effects
23 pages
Week 2
No ratings yet
Week 2
61 pages
Panel Data Model
No ratings yet
Panel Data Model
18 pages
Panel Data
No ratings yet
Panel Data
105 pages
Panel Data Methods
No ratings yet
Panel Data Methods
17 pages
How to Find Inter-Groups Differences Using Spss/Excel/Web Tools in Common Experimental Designs: Book 1
From Everand
How to Find Inter-Groups Differences Using Spss/Excel/Web Tools in Common Experimental Designs: Book 1
P.Y. Cheng
No ratings yet
10 Minute Guide to Orthogonal Array Test Strategy
From Everand
10 Minute Guide to Orthogonal Array Test Strategy
Rajeev Nair Raman
No ratings yet
Solutions Manual to accompany An Introduction to Numerical Methods and Analysis
From Everand
Solutions Manual to accompany An Introduction to Numerical Methods and Analysis
James F. Epperson
5/5 (1)
Task 2 - TDT1
No ratings yet
Task 2 - TDT1
5 pages
Valvoline Crimson Grease PDS
No ratings yet
Valvoline Crimson Grease PDS
2 pages
This Is S A File About Accounting
No ratings yet
This Is S A File About Accounting
2 pages
Compact Mono Laser Multifunction Printer
No ratings yet
Compact Mono Laser Multifunction Printer
2 pages
Automate and Secure Your Home Using Zigbee Technology
No ratings yet
Automate and Secure Your Home Using Zigbee Technology
4 pages
Activity in Class # 2 Semana 3
No ratings yet
Activity in Class # 2 Semana 3
3 pages
Sudoku Squares and Chromatic Polynomials: Agnes M. Herzberg and M. Ram Murty
100% (1)
Sudoku Squares and Chromatic Polynomials: Agnes M. Herzberg and M. Ram Murty
10 pages
Quartz WP - Performance Management PDF
No ratings yet
Quartz WP - Performance Management PDF
23 pages
Spherical Mirrors Types of Spherical Mirrors
No ratings yet
Spherical Mirrors Types of Spherical Mirrors
7 pages
Canadian Metallurgical Quarterly Volume 11 Issue 2 1972 (Doi 10.1179 - cmq.1972.11.2.413) Wiegel, R.L. - Advances in Mineral Processing Material Balances
No ratings yet
Canadian Metallurgical Quarterly Volume 11 Issue 2 1972 (Doi 10.1179 - cmq.1972.11.2.413) Wiegel, R.L. - Advances in Mineral Processing Material Balances
12 pages
Free Electron Lasers: Giuseppe Dattoli and Alberto Renieri
No ratings yet
Free Electron Lasers: Giuseppe Dattoli and Alberto Renieri
6 pages
IDEO
No ratings yet
IDEO
10 pages
ESPCP General Notes Template
100% (1)
ESPCP General Notes Template
13 pages
Geokettle Readme
No ratings yet
Geokettle Readme
5 pages
Memory - IV: CS220: Introduction To Computer Organization 2011-12 Ist Semester
No ratings yet
Memory - IV: CS220: Introduction To Computer Organization 2011-12 Ist Semester
3 pages
Pendulum: The Second Useless Machine Ever
100% (1)
Pendulum: The Second Useless Machine Ever
4 pages
BASYX TriComm System Operation Manual v21
No ratings yet
BASYX TriComm System Operation Manual v21
58 pages
SF1 - 2020 - Grade 9 (Year III) - GARNET
No ratings yet
SF1 - 2020 - Grade 9 (Year III) - GARNET
8 pages
Emtek: Design Guide
No ratings yet
Emtek: Design Guide
28 pages
Awst 150608S PDF
No ratings yet
Awst 150608S PDF
125 pages
Chapter 4
No ratings yet
Chapter 4
15 pages
5 Keys To Wealth and Success PDF
100% (1)
5 Keys To Wealth and Success PDF
25 pages
Application of Genetic Algorithm in Intrusion Detection System
No ratings yet
Application of Genetic Algorithm in Intrusion Detection System
9 pages
Zoho-C-Code Snippets Practice
No ratings yet
Zoho-C-Code Snippets Practice
6 pages
Cambridge IGCSE™: First Language English 0500/22
No ratings yet
Cambridge IGCSE™: First Language English 0500/22
12 pages
Business and Professional Communication 3rd Edition Beebe Mottet Test Bank download
100% (2)
Business and Professional Communication 3rd Edition Beebe Mottet Test Bank download
65 pages
Share Full (Probability) Tests and Solutions (1 - 11)
No ratings yet
Share Full (Probability) Tests and Solutions (1 - 11)
105 pages
Foundations On Soft Soils For Khulna Medical
No ratings yet
Foundations On Soft Soils For Khulna Medical
6 pages

Lecture 14 - Panel data models

Uploaded by

Lecture 14 - Panel data models

Uploaded by

PANEL DATA

• Cross-sectional data: observations from MANY units at a SINGLE time point.

• Require more efforts collecting data

𝑦!" = 𝛼 + 𝛽𝑋!" + 𝑢!"

• For the model: 𝑦!" = 𝛼! + 𝛽𝑋!" + 𝑢!"

• Null hypothesis: both RE and FE estimates are consistent

You might also like