Chapter 5 Panel Data
Chapter 5 Panel Data
Wondu A (UOG)
5.1 Introduction
Panel data econometrics is the application of econometric
analysis on panel data. Panel data set is data set where the
same sample units /individuals are observed repeatedly
(more than once) at regular time interval. Panel data have
the characteristics of both crossectional and time series data.
It has space as well as time dimensions.
In panel data the same cross-sectional unit (individual, a
country, a region, firm, a household, etc.) is surveyed over a
repeated period of time.
There are other names for panel data, such as:
• Pooled data (pooling of time series and cross-sectional
observations),
• Crossection – time series data
5.1 cont’d…
• Micro - panel data - is a panel for which the time dimension(T)
is less than individual dimension (N). Three years observation of
1000 households
• Macro - panel data - is a panel for which the time dimension (T) is
similar to the individual dimension (N); E.g. 30 year GDP data for 30
countries.
• Longitudinal data (a study over time of a variable or group of
subjects),
• Event history analysis (e.g., studying the movement over time of
subjects through successive states or conditions),
• Cohort data / cohort analysis (e.g., following the career path of
graduates).
Although there are subtle variations, all these names essentially
connote movement over time of cross-sectional units.
5.1 cont’d…
Terminology and notations:
Cross section unit or panel units (individuals or subjects) are entities
such as country, region, state, firm, consumer, individuals, employees,
unemployed, professionals, organizations, etc. which are being
observed with respect some variables.
Uses of two or double subscripts or indices; ‘i’ and ‘t’:
i; - indicating cross-section unit and t - for time.
sigma_ 6.24239
uF test that
97 all u_i =0: F(13, 95) = 1.93 Prob > F =
0.0357estimate
sigma_ 9.50827store fem ( a command to save result)
e 37
rho .301198 (fraction of variance due to u_i)
9
exercise
xtreg liq asset loan rgdp, re
Coefficients ----
(b) (B) (b-B) sqrt(diag(V_b V_B))
fem rem Difference S.E.
Continued….
liq Coef. Std. Err. t P>t [95% Conf. Interval]
Clearly this removes the individual fixed effect, and so we can obtain
consistent estimates of β by estimating the equation in first differences by
OLS.
5.4 cont’d…..
FE or FD which is better?
FE and FD are two alternative ways of removing the fixed
effect. Which method should we use?
• First of all, when T = 2 (i.e. we have only two time
periods), FE and FD are exactly equivalent and so in this
case it does not matter which one we use (try to prove
this).
• But when T > =3, FE and FD are not the same.