Block 3
Block 3
Block 3
Equation Models-II
BLOCK 3
PANEL DATA MODELS
147
Advanced Topics
in Regression INTRODUCTION TO BLOCK 3
Analysis
Block 3 is on Panel Data Models. It has two units (Units 8 and 9). Unit 8 is
on Introduction to Panel Data. It first outlines the distinction between pooled
cross section data and panel data. Advantages of panel data over pooled data
is then explained. The unit discusses Chow Test used in situations of Linear
Static Panel Data Models. A discussion on differentiating between ‘fixed
effect models’ and ‘random effect models’, with a reference to Policy
Relevant Inference, is then presented in the unit.
148
UNIT 8 INTRODUCTION TO PANEL DATA* Introduction to
Panel Data
Structure
8.0 Objectives
8.1 Introduction
8.2 Panel Data Models
8.2.1 Pooled Cross Section Data
8.2.2 Panel Data
8.2.3 Advantage of Panel Data Over Pooled Data
8.0 OBJECTIVES
After reading this unit, you will be able to:
*
Dr. Poulomi Roy, Jadavpur University. 149
Panel Data • present a comparative profile of the relative contexts in which the FE or the RE
Models
model is appropriate to adopt; and
• outline, with illustration,’ the ‘policy relevant inference’ that could be drawn
from panel data models.
8.1 INTRODUCTION
In applications, econometricians often use either pure cross sectional data or
time series data. A cross sectional data is one which is collected for different
sample units for a same point of time (e.g. NSSO’s 5-yearly data on
manufacturing firms). A time series data, on the other hand, is collected over
different time points for the same set of sample units (e.g. GDP). Such
sample units, in a pure time series data, could themselves be cross sectional
in nature.
A data set that has both cross sectional and time series dimensions are
nowadays very common in empirical research. Such data sets, often used for
policy analysis, could be pooled to form a panel data set. Note that an
independently pooled cross section can be obtained by a random sampling
of a large population at different points of time (usually, but not necessarily,
for different years). From a statistical standpoint, such data sets have an
important feature i.e. they consist of independent sampled observations. Such
independently sampled observations play a key role in our analysis of cross-
sectional data, where, among other things, it rules out correlation in the error
terms across different observations. An independently pooled cross section
data differs from a single random sample. This is in the sense that sampling
from the population at different points of time likely leads to observations
that are not identically distributed. For instance, distributions of wages and
education have changed over time in most countries.
A panel data, or longitudinal data set, thus consists of time series data for
each cross sectional unit in the data set. Panel data is collected on the same
individuals or firms or geographical units over specified periods of time. The
key difference between the panel data and pooled data is that, in case of panel
data, the same cross sectional units are followed over a given time period. In
case of pooled data, different cross section units are observed for a given time
period. Thus, the main features of the three types of data are:
Panel data obtained by combining both the cross sectional and time series
data capture both the inter-cross-sectional differences as well as the intra
cross sectional dynamics. It has several other advantages over cross sectional
and time series data. For instance, cross sectional data may be viewed as a
panel with T =1 and time series data may be viewed as a cross section with
N = 1. Hence, panel data combining both cross section and time series data
provides more degrees of freedom and more sample variability than either
only the cross sectional or only the time series data. It hence improves the
efficiency of econometric estimates.
It is also frequently argued that the real reason one finds (or does not find)
certain effects is ‘due to ignoring the effects of certain variables in a model
specification which are correlated with the included explanatory variables’.
Panel data contain information on both the inter-temporal dynamics and the
individuality of the entities. This therefore allows for one to control for the
effects of missing or unobserved variables.
By pooling random samples drawn from the same population, but at different
points in time, we can get more precise estimators and test statistics with
higher power. Pooling is helpful in this regard only in-so-far as the
relationship between the dependent variable and at least some of the
independent variables remain constant over time. Using pooled cross sections
raises a statistical complication viz. the two populations could have different
distributions. To reflect for the fact that the populations may have different
distributions in different time periods, we allow the intercept to differ across
periods. This is also accomplished by including dummy variables for all but
one year i.e. for the earliest year in the sample which is usually chosen as the
base year. Sometimes, the pattern of coefficients on the year dummy
variables could itself be of interest.
155
Panel Data Check Your Progress 1 [answer within the space given in about 50-100
Models
words]
y it 1 y it 1 X it 2 ci it (8.4)
You may simply note at this stage that, as with any F test based on sums of
squared residuals, this test is not robust to Heteroscedasticity.
Second, one needs to consider the variability within subjects or cross section
of units. If subjects change little across time, a fixed effects model may not
work very well. This is because, there needs to be within-subject variability if
we are to use subjects as their own controls. If there is little variability within
subjects, then the standard errors from fixed effects model could be too large.
Conversely, random effects models will often have smaller standard errors.
But, the trade-off is that their coefficients are more likely to be biased.
Third, one needs to decide whether one wants to estimate the effect of
variables whose values do not change across time. With fixed effects models,
we do not estimate the effect of variables whose values do not change across
time. Rather, we control for them or ‘partial them out’. This is similar to an
158 experiment with random assignment. Though the RE models estimate the
effect of time-invariant variables, the estimates could be biased because we Introduction to
Panel Data
are not controlling for omitted variables. For a more clearer description, let us
consider a situation where y and x x1 , x2 ,.............., xk are observable
random variables with a linear relationship like:
y = α + xβ + c (8.8)
where ‘c’ the unobservable random variable. We are interested in the partial
effect of the observable explanatory variables xj while holding ‘c’ constant.
Our interest is to estimate the vector β . If ‘c’ is uncorrelated with x, then ‘c’
is just another unobserved factor uncorrelated with the explanatory variables.
If cov x j , c 0 for some j, then we cannot consistently estimate β .
159
Panel Data
Models
8.4.2 Random Effect Model
The random effects (RE) model is useful when we have reason to believe that
the unobserved effect is uncorrelated with all the explanatory variables. In
such a situation, the time constant’s unobserved effect is uncorrelated with
the explanatory variables and the parameters could be consistently estimated
by using a single cross section. There is therefore no need for panel data. But
using a single cross section disregards much useful information in the other
time periods. We can therefore use the data in a pooled OLS procedure i.e.
just run the OLS of dependent variable on the explanatory variables with the
time dummies. This, too, produces consistent estimators of the parameters
under the RE assumption. But it ignores the fact that the existence of
unobserved effect in the error term in each time period is serially correlated
across time. We can use the GLS method to solve for the serial correlation
problem.
To sum up, therefore, if the key explanatory variable is constant over time,
we cannot use FE to estimate its effect on dependent variable. In such
situations, we must rely on the RE (or pooled OLS) estimate. We can
however use the RE approach if we are able to assume that the unobserved
effect is uncorrelated with the explanatory variables. Typically, when one
uses random effects, many time-constant controls are included among the
explanatory variables. However, with the FE approach, it is not necessary to
include such controls. RE is preferred to pooled OLS due to its generally
higher efficiency.
Check Your Progress 2 [answer within space given in about 50-100 words]
1) Time series is data is collected over different time points for the same set
of sample units (e.g. GDP for states). Cross section data is collected over
different sample units for a same point of time (e.g. NSSO’s surveys in
India on 5-yearly basis).
2) In case of panel data, the same cross sectional units are followed up over
different time periods. In case of pooled data, different cross section
units (i.e. two independently selected random samples) are observed for
a given time period.
3) One, it allows for causal inference. Second, it allows us to study the
effect of lags in the behaviour or the result of decision making.
4) The complication is that the two samples might have come from
populations with different distributions. The way it is dealt with is by
allowing for different intercept terms or by using a ‘dummy variable’.
164
UNIT 9 ESTIMATION OF PANEL DATA Estimation of Panel
Data Models
MODELS*
Structure
9.0 Objectives
9.1 Introduction
9.2 Random Effect Estimation Method
9.3 Fixed Effect Estimation Method
9.4 Model Selection: Hausman Test
9.5 Simultaneous Equation Models in Panel Data
9.6 Let Us Sum Up
9.7 Key Words
9.8 Suggested Books for Further Reading
9.9 Answers/Hints to Check Your Progress Exercises
9.0 OBJECTIVES
After reading this unit, you will be able to:
9.1 INTRODUCTION
In a panel data set up, data is collected on each cross section unit ‘i’. It is
done over a period of time ‘T’. Therefore, if X is a vector of ‘k’ exogenous
variables affecting Y (the dependent variable), at any time point ‘t’, the
population model is of the form:
Yit = X it β + c i + u it , i = 1,2,…,N; t = 1,2,….,T (9.1)
where ci is the ‘unobserved effect’ and uit is the ‘random error term’. They
are also called as ‘idiosyncratic error terms’ because they change over ‘time
and cross sections units’ i.e. over i and t. Equation (9.1) is a panel regression
*
Dr. Poulomi Roy, Jadavpur University 165
Panel Data where ‘i’ indicates a cross section unit and ‘t’ is an indicator of time. The
Models
primary motivation behind a panel data analysis is solving for the ‘omitted
variables problem’. In panel data models, we consider ci as ‘time invariant’.
We further assume that the ‘unobserved effects’ are random variables. This is
in the case of a ‘linear static panel data model’. It is a ‘static model’ in the
sense that the explanatory variables are contemporaneous (fixed or pre-
determined) values corresponding to the value of Y in period t. In Equation
(9.1), to enable us to assume that the effect of ci is with a zero mean, we
explicitly add an intercept term. With this, Equation (9.1) takes the form:
i) For the ith cross section unit, at any time point ‘t’, the model is defined
as:
where ci is the ‘unobserved effect’ and uit is the ‘random error term’ (i =
1,2,……,N and t = 1,2,…..,T). βj’s are the parameters to be estimated.
ii) Each explanatory variable changes over time (at least for some i). No
perfect linear relationship exist among the explanatory variables.
iii) For each t, the ‘expected value of the idiosyncratic error terms’ given the
Xi’s [for all (∀) ‘t’ and ‘ci’] is ‘zero’ i.e. E ( u it | X i , c i ) = 0 .
Under the above assumptions, the random effect model (9.3) can be written
as:
where vit = ci + uit is the ‘composite error term’ such that E (vit | xti ) = 0 , t =
1,2,3,….,T. Note that the defining of the composite error term (as vit = ci +
166
uit) implies that the vit’s are ‘serially correlated across time’. Hence, Estimation of Panel
Data Models
�� �
����������� ���� , ��� ) = �� �� �,
t ≠ s.
� �
Therefore, the usual pooled OLS procedure cannot be applied. In such cases,
we have to apply the GLS method of estimation with ‘auto regressive serial
correlation’. Therefore, the model in (9.4) can be written as:
yi = xi β + vi (9.5)
where vi = ci jT + ui (jT being the TX1 vector of ones) and we define the
unconditional variance matrix vi as: Ω = E (vi , vi' ) . We assume Ω to be a
‘positive definite matrix’ which is constant for all cross section units due to
the assumption of random sampling. This means, for the consistency of GLS,
we need the ‘rank condition’ that Rank (E (X i'Ω −1 X i )) = k . Under this
assumption, E (ci , uit ) = 0 ∀ t=1,2,3,…,T and
( ) 2
E v 2 it = E (ci ) + 2 E (ci uit ) + E (u 2it ) = σ c2 + σ u2
E (v it , v is )= E (c i + u it , c i + u is )= ( )
E c i2 = σ 2
c
Therefore:
σ c2 + σ u2 σ c2 σ c2
σ c2 σ c2 + σ u2 σ c2
( '
Ω = E vi , vi = )
σ 2
σ c2 σ c2 + σ u2
c
Now, given that: (i) E (uit | xi , ci ) = 0, (ii) E (ci | xi ) = E (ci ) = 0 and (iii)
ˆ
Rank (E (X i
'
Ω −1
X i )) = k , β RE → β as N → ∞. Deriving the GLS
transformation that eliminates the serial correlation in the errors, requires
167
Panel Data [
sophisticated matrix algebra. For this, let us define: θ = 1 − σ u2 /(σ u2 + Tσ c2 ) ]
Models
which lies between zero and one. Then, the transformed equation becomes:
y it − θ y i = ( x it − θ x i ) β + ( v it − θ v i ) (9.5a)
where the ‘over bar’ denotes the ‘time averages’. This is a very interesting
equation, as it involves ‘quasi-demeaned data’ on each variable. If the model
considered had been of ‘fixed effects’, then the estimator would have
subtracted the time averages from the corresponding variable. Since we are
here considering the random effects transformation, it subtracts a fraction of
that time average, the fraction depending on σ u2 , σ c2 and T. The GLS
estimator is thus simply the pooled OLS estimator of Equation (9.5a). The
transformation in (9.5a) allows for the explanatory variables that are constant
overtime. This is in fact the one advantage of random effects (RE) over ‘fixed
effects’ (or first differencing). This is possible because RE assumes that the
‘unobserved effect’ is uncorrelated with all explanatory variables irrespective
of whether the explanatory variables are fixed over time or not. In many
applications, the whole rationale for using panel data is to allow for the
unobserved effect to be correlated with the explanatory variables.
Check Your Progress 1 [answer within the space given in about 50-100
words]
1) Specify a ‘linear static panel data model’. Why is the model called
‘static’?
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
2) Why is it that in the case of ‘linear static panel data models’, the usual
OLS method cannot be applied? Which method is to be applied in such
cases?
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
168
3) State the assumptions required to be made for the ‘random effect panel Estimation of Panel
Data Models
models’.
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
4) In Equation (9.5a), what is meant by ‘quasi demeaned data’? What is its
significance?
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
The random effect approach to estimate β effectively puts �� into the error
term. This is done under the assumption that �� is orthogonal to xit. It then
accounts for the implied serial correlation in the composite error term:
v it ci u it using a GLS analysis. In other words, the FE analysis
is correlated with xit. The T equations in the model are like:
assumes that c i
y it = x it β + c it jT + u it
(9.6)
where jT is the TX1 vector of one. The assumptions required for the FE model
are the following:
iii) Each explanatory variable changes over time (for at least some i) with
‘no perfect linear relationships’ existing among the explanatory
variables. 169
Panel Data iv) For each t, the expected value of the idiosyncratic error term, given the
Models
explanatory variables in all time periods, and the unobserved effect, is
‘zero’. That is: E (u it | xi , ci ) = 0 .
vii) Conditional on xi and ci, the uit are ‘iid’ i.e. independent and identically
distributed as N (0, σ u2 ).
yi = xi β + ci + ui (9.7)
1 1 1
where yi y u , xi xu , ui uu ,
T T T
yu
where, yu yi ,
xit xit xi , uit uit ui
Note that in (9.8), the ‘time demeaning’ has removed the individual specific
effect ci. The ‘fixed effects transformation’ is also called as the ‘within
transformation’. The important thing about Equation (9.9) is that the
unobserved effect ci, is eliminated. This suggests that we should estimate
(9.9) by pooled OLS. Thus, our ‘fixed effects estimator’ (also called the
‘within estimator’), is a pooled OLS estimator, based on the time-demeaned
variables. Now we proceed to estimate (9.9) by the pooled OLS method as
follows. Since, E ( x'it , uit ) = 0 t = 1,2,….,T, under a strict exogeneity
assumption on the explanatory variables, the fixed effects estimator is
170
unbiased. Alternatively, at least roughly, the idiosyncratic error uit are Estimation of Panel
Data Models
uncorrelated with each explanatory variable across all time periods. Note that
the ‘fixed effects estimator’ allows for the arbitrary correlation between c i
and the explanatory variables in any time period. Because of this, any
explanatory variable that is constant over time for all i gets swept away by
the fixed effects transformation: xit = 0 for all i and t. In other words, we
cannot include variables like gender or a city’s distance from a river. Note
also that the FE estimator is the pooled OLS estimator from the regression of
yit on xit , t = 1,2,….,T; i = 1,2,…,N. In order to ensure that the FE
estimator is well behaved asymptotically, we need a standard rank condition
on the matrix of time demeaned explanatory variables viz.
T
(
)
Rank ∑ E x' it , xit = Rank E x' it , xit [ ( )] = k .
t =1
This implies that the ‘time constant variables’ are not allowed in FE analysis
unless they are interacted with ‘time varying variables’ such as time
dummies. Hence, when analysing individuals, factors such as gender or race
cannot be included among xit. We can now define the FE estimator as:
N 1 N N T 1 N T
ˆ xi'
xi xi'
yi x it'
xit xit
yit (9.10a )
FE
i 1 i 1 i 1i 1 i 1t 1
and cˆ i = y i − x i βˆ FE (9.10b)
In the above wage equation, education is constant over time for each
individual in the sample. However, we can interact education with each year
171
Panel Data dummy to see how the return to education has changed over time. But we
Models
cannot use fixed effects to estimate the return to education in the base period.
This means we cannot estimate the return to education in any period. We can
only see how the return to education in each year differs from that in the base
period.
Check Your Progress 2 [answer within the space given in about 50-100
words]
1) In Equation (9.2), what does ‘ ci is orthogonal to uit ’ mean?
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
2) In what respects, the assumptions made for the FE differ from those
made for the RE effect?
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
3) How is the FE estimate obtained in principle? To what effect?
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
172
4) What is the d.f. in estimating the FE Equation (9.9)? Estimation of Panel
Data Models
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
5) What is a limitation of the FE estimate?
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
Hausman (1978) first proposed such a test and some econometrics packages
routinely compute the Hausman test under the full set of random effects
assumptions. The idea is that one uses the random effects estimates unless the
Hausman test rejects Cov (xit , ci ) = 0 . In practice, a failure to reject means
‘either that the RE and FE estimates are sufficiently close and it does not
matter which of the two methods is used’. A rejection, using the Hausman
test, is taken to mean that the key RE assumption, Cov (xit , ci ) = 0 , is false. In
such cases, the FE estimates are used.
Hausman test can be used to decide whether to use the RE model or the FE
model. It states that: “if δˆ RE denotes the vector of RE estimates (without the
coefficient on the time constant variable or aggregate time variables), and if
δˆFE denotes the corresponding FE estimates, then:
1 1
H ˆ ˆ Avarˆ ˆ FE Avarˆ ˆ RE ˆ ˆ ~ 2
FE RE FE RE M
(9.13)”
Check Your Progress 3 [answer within the space given in about 50-100
words]
The approach to estimating the SEMs with panel data involves two steps: (i)
eliminate the ‘unobserved effects’ from the equations by using the fixed
effects transformation (or first differencing) and (ii) find instrumental
variables for the endogenous variables in the transformed equation. This can
be very challenging because, for a convincing analysis, we need to find
instruments that change over time. Let us now consider an SEM for panel
data like:
y it1 = α 1 y it 2 + z it1 β 1 + ci1 + u it 1 (9.14)
y it 2 = α 1 y it 2 + z it 2 β 1 + ci 2 + u it 2 (9.15)
where i denotes cross section, t denotes time period, and, zit1 and zit2
represents linear functions of a set of exogenous explanatory variables. The
most flexible analysis allows the ‘unobserved effects’, ci1 and ci2, to be
correlated with all explanatory variables, including the elements in z.
However, we assume that the idiosyncratic structural error terms, uit1 and uit2,
are uncorrelated with z in both the equations and across all time periods. This
is in the sense of being truly exogenous. Further, except under very special
circumstance, yit2 is correlated with uit1, and yit1 is correlated with uit2.
175
Panel Data Now, let us consider the Equation (9.14). This cannot be estimated by the
Models
OLS since the composite error term cit1 + uit1 is potentially correlated with all
explanatory variables. Suppose we take their difference over time to remove
the unobserved effect cit1. That is:
∆y it1 = α 1 ∆y it 2 + ∆z it1 β 1 + ∆u it1 (9.16)
Now, with the differencing (or time-demeaning), we can only estimate the
effects of variables that change over time for at least some cross-sectional
units. The error term in the above equation is uncorrelated with ∆z it1 by
assumption but ∆yit 2 is not necessarily uncorrelated with ∆u it1 . Therefore, we
need an ‘instrumental variable (IV)’ for ∆yit 2 . It is possible that such IVs
comes from other equations. For instance, elements of ∆z it 2 that are not also
in ∆z it1 are natural IVs for ∆yit 2 .
and
log(wage) = α 2 hours + β 20 + β 21educ + β 22 exp er
(9.18)
+ β 23 exp er 2 + u 2
The variable age is the woman’s age in years, kidslt6 is the number of
children less than six years old, nwifeinc is the woman’s non-wage income
(like husband’s earnings), and educ and exper are years of education and
prior experience respectively. All variables except hours and log(wage) are
assumed to be exogenous. Note that this is a tenuous assumption since educ
might be correlated with the omitted variable ‘ability’ in either equation. But
for illustrative purposes, we ignore the omitted variable problem here. Note
the functional form in this system (where hours appear in level form but
wage in logarithmic form) which is popularly used in labour economics. This
is for the simple reason that hours of work would change less but changes in
wages could be sharper (or minute) requiring to be captured by taking their
logarithmic values.
The first equation is the ‘supply function’. It satisfies the ‘order condition’
because the two exogenous variables, exper and exper2, are omitted from the
labour supply equation. These exclusions are crucial restrictions in the sense
that it amounts to assuming that ‘once wage, education, age, number of small
children, and other income are controlled for, past experience has no effect
on current labour supply’. The ‘wage equation’ is also identified as at least
one of age, kidslt6, or nwifeinc has a non-zero coefficient in (9.17). After
differencing, the labour supply function becomes:
176
∆hours = β 0 + α 1∆ log( wageit ) + ∆(other factorsit )β1 + ∆uit (9.19) Estimation of Panel
Data Models
We can use Δexpeerit as an instrument for ∆ log( wageit ) . However, because
we are looking at people who work in every time period, Δexpeerit = 1 for all
i and t (since each person gets another year of experience after a year passes).
We cannot therefore use Δexpeerit as an IV, as it takes same value for all i
and t. In a panel data set up, often, participation in an experimental
programme is used to obtain IVs. Let us now consider another example of
‘job training’ and ‘worker productivity’ where we want to estimate the effect
of ‘another hour of job training’ on ‘worker productivity’. For any two years
(e.g. 1987 and 1988), consider the simple panel data model:
where scrapit (the ‘scrap rate’ i.e. number of items out of 100 that must be
scrapped) is regressed on hrsempit. Normally, we would estimate this
equation by OLS. But ∆uit could be correlated with ∆hrsempit . For instance,
a firm might hire more skilled workers, while at the same time reducing the
level of job training. In this case, we need an instrumental variable for
∆hrsempit . Generally, such an IV is hard to find. But we can use the fact that
in ∆hrsemp it , some firms received job training grants in one of the two years
(1988). If we assume that grant designation is uncorrelated with ∆uit
(something that is reasonable because the grants are given at the beginning of
the year), then ∆grant it is a valid IV provided ∆hrsempit and ∆grant it are
correlated. Job training and worker productivity are jointly determined, but
receiving a job training grant is exogenous in Equation (9.20).
Let us consider another example of SEM with panel data. In order to estimate
the causal effect of increase in prison population on crime rates at the state
level, let us consider the ‘instances of prison overcrowding litigation as
instruments for the growth in prison population’. For the equation estimated
in first differences, we can write an underlying ‘fixed effects model’ as:
log(crimeit ) = θ t + α 1 log( prisonit ) + z it1 β1 + ci1 + u it1 (9.21)
where θ t denotes different time intercepts, and crime and prison are measured
per 100,000 people. The prison population variable is measured on the last
day of the previous year. The vector zit1 contains log of ‘police per capita, log
of income per capita, the unemployment rate, proportions of black and those
living in metropolitan areas, and age distribution proportions’. Differencing
(9.21) gives the equation to be estimated as:
∆ log(crimeit ) = ξ t + α 1∆ log( prisonit ) + ∆z it1 β 1 + ∆u it1 (9.22)
Check Your Progress 4 [answer within the space given in about 50-100
words]
1) What are the two factors to be considered in the approach to estimate the
SEMs in panel data regressions?
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
178
2) How is the ‘unobserved effect’ removed from a composite error term like Estimation of Panel
Data Models
cit1 + uit1 in Equation (9.14)? What is a consequent disadvantage of this
approach and how is this dealt with?
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
……………………………………………………………………………
1) A linear static panel data model is like Yit = β0 + Xit βi + ci + uit. Here
Xi’s are fixed or pre-determined (and hence the model called as the
‘static’ model) corresponding to the value of Y in period t. The ci’s stand
for the ‘unobserved effect’ and uit’s are the ‘idiosyncratic error terms’.
The term ‘idiosyncratic’ refers to the random character of the
observation-specific ‘error term’. The zero mean characteristic of ci’s is
achieved by the inclusion of the explicit intercept term β0.
2) The OLS method cannot be applied because of the composite error term
vit = ci + uit which implies that the error terms are serially correlated
across time. In such cases, we have to apply the GLS method of
estimation with ‘auto regressive serial correlation’.
3) The assumptions required for RE model are: (i) the model is defined as
in equation (9.3) with a term for ‘unobserved effect’ (ci ) and ‘random
error term’ (uit), (ii) explanatory variables change over time but not with
perfect collinearity, (iii) the expected value of error terms (uit) given Xi
and ci is ‘zero’, (iv) var (uit) = σ u2 (v) the covariance of uncorrelated
error terms conditional to xi and ci is ‘zero’, (vi) E (ci ) = 0 and (vii)
var(ci | xi ) = σ c2 i.e. the assumption of homoscedasticity hold for ci’s.
4) The term ‘quasi’ means ‘partial’. The term ‘demeaned’ can be split as
‘de-meaned’ which means removing the effect of mean. Hence, the term
‘quasi demeaned data’ refers to the partial adjustment made in each
variable to remove the effects of their means. In effect, it is a partial
cleaning process attained by the estimation process of GLS.
180
2) Firstly, in terms of the effect on the estimated parameters. They are in the Estimation of Panel
Data Models
first stage consistent and in the later stage ‘best’ i.e. BLUE. With the
additional assumption on the normality of uit’s (conditional on xi and ci)
i.e. making the nature of error terms i.i.d., the t and F statistics have the
exact distributions without the need for approximation of large N and
small T.
3) By a process of ‘time demeaning’ transformation for removing the effect
of ‘average’ in Equation (9.6). The transformation eliminates the
‘unobserved effect’ ci as in the transformed Equations (9.8) or (9.9). The
estimation of (9.9) becomes the ‘pooled OLS’ method. A major
difference between the FE and the RE methods is that, FE is consistent
when ci and xit are correlated, whereas the RE estimate is not.
4) N(T –1) – k.
5) The pooled OLS’s standard errors and test statistics are generally invalid,
because, they ignore the serial correlation in the composite errors: vit = ci
+ uit.
181
Panel Data
Models
182