Panel Data
Panel Data
y it x it u it
o
o where yit is the dependent variable, is the intercept term, is a k 1 vector of
parameters to be estimated on the explanatory variables, xit; t = 1, , T;
i = 1, , N.
The simplest way to deal with this data would be to estimate a single, pooled regression
on all the observations together. But pooling the data assumes that there is no
heterogeneity i.e. the same relationship holds for all the data.
A panel data set, while having both a cross-sectional and a time series dimension, differs in some
important respects from an independently pooled cross section. To collect panel data
sometimes called longitudinal datawe follow (or attempt to follow) the same individuals,
families, firms, cities, states, or whatever, across time.
Hsiao (2003) and Klevmarken (1989)
Pooled model
The pooled model specifies constant coefficients, the usual assumptions for crosssectional analysis.
This is the most restrictive panel data model and is not used much in the literature.
Individual-specific effects model
We assume that there is unobserved heterogeneity across individuals captured by
Example: unobserved ability of an individual that affects wages.
y it x it i v it
We can think of i as summarizing all of the variables that affect yit cross-sectionally but do
not vary over time for example, the sector that a firm operates in, a person's gender, or the
country where a bank has its headquarters, etc. Thus we would capture the heterogeneity that
is encapsulated in i by a method that allows for different intercepts for each cross sectional
unit.
This model could be estimated using dummy variables, which would be termed the least
squares dummy variable (LSDV) approach.( Introductory Econometrics for Finance Chris
Brooks 2013)
The Random Effects Model
An alternative to the fixed effects model described above is the random effects model,
which is sometimes also known as the error components model.
As with fixed effects, the random effects approach proposes different intercept terms for
each entity and again these intercepts are constant over time, with the relationships
between the explanatory and explained variables assumed to be the same both crosssectionally and temporally.
However, the difference is that under the random effects model, the intercepts for each
cross-sectional unit are assumed to arise from a common intercept (which is the same
for all cross-sectional units and over time), plus a random variable i that varies crosssectionally but is constant over time.
i measures the random deviation of each entitys intercept term from the global
intercept term . We can write the random effects panel model as
y it x it it
it i v it
Unlike the fixed effects model, there are no dummy variables to capture the heterogeneity
(variation) in the cross-sectional dimension.
Instead, this occurs via the i terms.
Fixed or Random Effects?
It is often said that the random effects model is more appropriate when the entities in the
sample can be thought of as having been randomly selected from the population, but a
fixed effect model is more plausible when the entities in the sample effectively constitute
the entire population.
Please remember me and my teachers and family in your prayers.
Superior university Lahore Pakistan
However, the random effects approach has a major drawback which arises from the fact
that it is valid only when the composite error term it is uncorrelated with all of the
explanatory variables.
This can also be viewed as a consideration of whether any unobserved omitted variables
(that were allowed for by having different intercepts for each entity) are uncorrelated
with the included explanatory variables. If they are uncorrelated, a random effects
approach can be used; otherwise the fixed effects model is preferable.
A test for whether this assumption is valid for the random effects estimator is based on a
slightly more complex version of the Hausman test.
If the assumption does not hold, the parameter estimates will be biased and inconsistent.
To see how this arises, suppose that we have only one explanatory variable, x2it that varies
positively with yit, and also with the error term, it. The estimator will ascribe all of any
increase in y to x when in reality some of it arises from the error term, resulting in biased
coefficients
The main question is whether the individual-specific effects are correlated with the Regressor. If
they are correlated, we have the fixed effects model. If they are not correlated, we have the
random effects model.
Random effect
Correlation between the individual, or crosssection specific, error component i and the X
Regressor. i (error component) and the Xs
are correlated, FEM may be appropriate
If T (the number of time series data) is large
and N (the number of cross-sectional units) is
small, FEM may be preferable
If the individual error component i and one or
more Regressor are correlated, then the ECM
estimators are biased
Best of luck. Soon ill update this file and elaborate further this topic, because of still Im in
learning process.
1/24/2016