Chapter 2 Panel Data
Lu Liu
Contents
1 Introduction
1.1 Advantages of Panel Data
2 The Models
2.1 The Fixed Effects Model
2.1.1 Within transformation
2.2 Time-fixed Effects Model
2.3 The Random Effects Model
2.4 Fixed Effects or Random Effects?
2.4.1 The Hausman test
1 Introduction
This chapter draws on Chapter 11 of Brooks (2019).
"Panel data" or "longitudinal data" refers to the pooling of observations on
a cross-section of entities such as households, rms and countries over several
time periods. A panel of data embodies information across both time and space.
Importantly, a panel follows the same entities over time. If the data are not on
the same entities measured over time, this would not be panel data but "repeated
cross-section". In this chapter, we will discuss the important features of panel data
and study the techniques used to model such data.
Econometrically, the most basic setup of panel data is

yit = α + βxit + uit ,   i = 1, . . . , N ;  t = 1, . . . , T    (1.1)

where yit is the dependent variable for entity i at time t, α is the intercept, β is a k × 1 vector of parameters to be estimated on the explanatory variables xit, and uit is the error term. The subscript i denotes the cross-sectional dimension and t the time-series dimension.

1.1 Advantages of Panel Data

Compared with pure cross-sectional or pure time-series data, panel data offer several advantages.
1. Panel data give more variability and less collinearity among the variables.
The additional variation introduced by combining cross-sectional data with
time series data can help to mitigate the problems of multicollinearity that
plague time series models. Let's demonstrate with an empirical example.
Baltagi and Levin (1992) model cigarette consumption as a function of
lagged consumption, price and income in the US. In the aggregate time series
for the US, there is high collinearity between price and income. Instead,
Baltagi and Levin (1992) consider cigarette demand across 46 American
states between 1963 and 1988. High collinearity is less likely with a
panel across American states since the cross-section dimension adds more
informative data on price and income, and hence increases variability in the
data.
2. As will be demonstrated later in this chapter, panel data have the ability to
control for unobserved variables that are either time-invariant or cross-sectionally
invariant, whose omission could bias the estimates in a typical cross-section study or
a time-series study. For example, in the cigarette demand study, state-invariant
variables such as national advertising on TV could affect cigarette sales. So could
time-invariant variables such as the distribution of the religious population across
different states. For example, Utah, which has a high percentage of Mormon
population, has per capita cigarette sales of less than one half of the national
average. If these variables are correlated with price and income, omitting them
from the regression will bias the estimated coefficients on price and income. We
will demonstrate that panel data are able to control for such time-invariant or
cross-sectionally invariant variables, while time series or cross-sectional data cannot.
3. Panel data make it possible to model not only why individual units behave
differently (the cross-sectional dimension) but also why a given unit behaves
differently at different time periods (the time dimension). This allows a researcher
to identify certain parameters or answer questions that cannot be addressed using
pure cross-sectional data or pure time-series data. Consider a situation in which the
average consumption level rises by 2% from one year to the next. It might imply
a 2% increase for all individuals, or it might imply an increase of 4% for one half
of the individuals and no change for the other half (or some other combination).
To discriminate between these possibilities, we need to observe changes in
consumption at the individual level, which is exactly what panel data provide.
2 The Models
In this section we discuss two common models for panel data, namely the fixed
effects and the random effects models, and subsequently discuss the choice between
the two.
2.1 The Fixed Effects Model

The entity-fixed effects model decomposes the error term uit in eq(1.1) into an entity-specific, time-invariant component µi and a remainder disturbance vit:

yit = α + βxit + µi + vit    (2.1)

The term µi encapsulates all of the variables that affect yit cross-sectionally but do not vary over time, for example the sector a firm operates in or the country where a firm has its headquarters. In this model, we need to impose the restriction that µi = 0 for one arbitrary entity i to avoid perfect multicollinearity between the individual-fixed effect terms and the intercept term.
Eq(2.1) could be estimated using dummy variables:

yit = α + βxit + µ2 D2i + µ3 D3i + · · · + µN DNi + vit    (2.2)
where D2i is a dummy variable that takes value 1 for all observations on the second entity (e.g. the second firm) in the sample and zero otherwise, D3i is a dummy variable that takes value 1 for all observations on the third entity (e.g. the third firm) in the sample and zero otherwise, and so on. As stated before, because the intercept term α is included, we set the dummy variable for the first entity to zero in order to avoid the "dummy variable trap", in which there is perfect multicollinearity between the dummy variables and the intercept. Setting the dummy for any one arbitrary entity to zero would do the same job. The parameters α, β, µ2, . . . , µN can be estimated by OLS. The implied estimator for β is referred to as the least squares dummy variable (LSDV) estimator.
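As a concrete illustration, the sketch below builds the LSDV regression of eq(2.2) for a small simulated panel and estimates it by OLS. It is a minimal sketch, not code from the chapter; the variable names (firm, y, x), the simulated data-generating process and the seed are all assumptions.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
N, T = 5, 8  # small simulated panel: 5 firms observed over 8 periods

# Simulate a panel with firm-specific intercepts mu_i (true beta = 0.5)
firms = np.repeat(np.arange(N), T)
mu = rng.normal(scale=2.0, size=N)[firms]        # entity-fixed effects
x = rng.normal(size=N * T)
y = 1.0 + 0.5 * x + mu + rng.normal(scale=0.3, size=N * T)
df = pd.DataFrame({"firm": firms, "y": y, "x": x})

# LSDV: intercept, x, and N-1 firm dummies (drop the first firm
# to avoid the dummy variable trap), as in eq (2.2)
dummies = pd.get_dummies(df["firm"], prefix="D", drop_first=True, dtype=float)
X = np.column_stack([np.ones(len(df)), df["x"], dummies])
coef, *_ = np.linalg.lstsq(X, df["y"], rcond=None)
print("alpha:", coef[0], "beta:", coef[1])
```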
An alternative setting for the econometric presentation of the fixed effects model is

yit = βxit + µ1 D1i + µ2 D2i + · · · + µN DNi + vit    (2.3)

where the intercept term α is suppressed but dummy variables for all the entities are included. Notice that equations (2.2) and (2.3) are equivalent. Both have N + k parameters to estimate (k parameters in β plus N dummy coefficients in (2.3); k parameters in β plus N − 1 dummy coefficients and α in (2.2)). The β parameters are identical. However, the interpretation of the µi differs: α in eq(2.2) equals the parameter on the first entity's dummy variable, µ1, in eq(2.3); α + µi in eq(2.2) equals µi in eq(2.3) for i > 1.
2.1.1 Within transformation
It is challenging to estimate N + k parameters when N is large. Often there are observations on hundreds of firms, or tens of thousands of households solicited from surveys. Fortunately, we can estimate β in a simpler way: it can be shown that we obtain exactly the same estimator for β if we transform the data. This transformation, known as the within transformation, involves subtracting the time-mean of each entity from the values of the variables. We then run the regression on the deviations from the entity means. Essentially, this eliminates the individual-fixed effects µi.
Let's define ȳi = (1/T) ∑_{t=1}^{T} yit as the time-mean of the observations on y for entity i, and x̄i = (1/T) ∑_{t=1}^{T} xit as the time-mean of the explanatory variables. Eq(2.1) averaged over time gives

ȳi = α + βx̄i + µi + v̄i    (2.4)

Subtracting eq(2.4) from eq(2.1) yields

yit − ȳi = β(xit − x̄i) + (uit − ūi)    (2.5)

This new regression contains demeaned variables only. Note that uit − ūi = (µi + vit) − (µi + v̄i) = vit − v̄i, because µi is time-invariant, so µi = µ̄i. In addition, α is a constant, so it is also dropped by the transformation.

We could write eq(2.5) as

ÿit = βẍit + v̈it    (2.6)
where the double dots above the variables denote the demeaned values. The implied estimator for β after the within transformation is referred to as the fixed-effects estimator or within estimator. Regression (2.6) can now be routinely estimated using OLS on the pooled sample of demeaned data. µ̂i and α̂ can be recovered from eq(2.4): as µ1 = 0, α̂ = ȳ1 − β̂x̄1, and µ̂i = ȳi − α̂ − β̂x̄i for i > 1.
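The following sketch applies the within transformation to a simulated panel and verifies that OLS on the demeaned data reproduces the LSDV slope estimate. It is a minimal sketch; the names (firm, y, x) and the simulated data are assumptions, not material from the chapter.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
N, T = 5, 8
firms = np.repeat(np.arange(N), T)
mu = rng.normal(scale=2.0, size=N)[firms]        # entity-fixed effects
x = rng.normal(size=N * T)
y = 1.0 + 0.5 * x + mu + rng.normal(scale=0.3, size=N * T)
df = pd.DataFrame({"firm": firms, "y": y, "x": x})

# Within transformation: subtract each firm's time-mean, as in eq (2.5)/(2.6)
dem = df[["y", "x"]] - df.groupby("firm")[["y", "x"]].transform("mean")
beta_within = (dem["x"] @ dem["y"]) / (dem["x"] @ dem["x"])

# LSDV estimate for comparison: intercept, x, and N-1 firm dummies, eq (2.2)
D = pd.get_dummies(df["firm"], drop_first=True, dtype=float)
X = np.column_stack([np.ones(N * T), df["x"], D])
beta_lsdv = np.linalg.lstsq(X, df["y"], rcond=None)[0][1]

print(beta_within, beta_lsdv)   # identical up to floating-point error
```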
1. It is important to recognize that although estimating the regression (2.6) uses only k degrees of freedom from the N T observations, we also used a further N degrees of freedom in constructing the demeaned variables: we lost one degree of freedom for every one of the N individuals for which we estimated a mean. Hence, the number of degrees of freedom that must be used when estimating standard errors in an unbiased way and when conducting hypothesis tests is N T − N − k.
2. The LSDV estimator obtained directly from estimating the regression with dummies and the within estimator obtained from the within transformation have identical estimated values and standard errors (provided the degrees-of-freedom correction above is applied to the within regression).
3. Testing for fixed effects. To test whether fixed effects are necessary, we can test the joint significance of the µi in eq(2.2), i.e. H0 : µ2 = µ3 = · · · = µN = 0, by performing an F-test. This is a simple Chow test with the restricted residual sum of squares (RRSS) being that of OLS on the pooled model and the unrestricted residual sum of squares (URSS) being that of the LSDV regression. If N is large, one can perform the within transformation and use that residual sum of squares as the URSS.
F0 = [(RRSS − URSS)/(N − 1)] / [URSS/(N T − N − k)] ∼ F(N − 1, N T − N − k)    (2.7)

where RRSS = ∑ ûit², with ûit the residual from the pooled regression (1.1), and URSS = ∑ v̂it², the residual sum of squares from the LSDV (or, equivalently, the within) regression.
If the F-test does not reject the null hypothesis of no fixed effects, the fixed-effects (within) estimator and the LSDV estimator are consistent but not efficient, while the pooled OLS estimator in eq(1.1) is consistent and efficient. If, however, the model with fixed effects is the true model, which implies that the F-test rejects the null, the pooled OLS estimator is biased and inconsistent.
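A sketch of how the F-test in eq(2.7) could be computed from the two residual sums of squares follows. The simulated panel, the dimensions and the variable names are assumptions made for illustration only.

```python
import numpy as np
import pandas as pd
from scipy import stats

rng = np.random.default_rng(1)
N, T, k = 30, 10, 1                      # k slope parameters
firms = np.repeat(np.arange(N), T)
mu = rng.normal(scale=1.5, size=N)[firms]
x = rng.normal(size=N * T)
y = 0.5 * x + mu + rng.normal(size=N * T)
df = pd.DataFrame({"firm": firms, "y": y, "x": x})

def rss(X, y):
    """Residual sum of squares from an OLS fit of y on X."""
    resid = y - X @ np.linalg.lstsq(X, y, rcond=None)[0]
    return resid @ resid

# Restricted model: pooled OLS (intercept and x only), eq (1.1)
X_pooled = np.column_stack([np.ones(N * T), df["x"]])
RRSS = rss(X_pooled, df["y"].to_numpy())

# Unrestricted model: LSDV with N-1 firm dummies, eq (2.2)
D = pd.get_dummies(df["firm"], drop_first=True, dtype=float)
X_lsdv = np.column_stack([X_pooled, D])
URSS = rss(X_lsdv, df["y"].to_numpy())

# F statistic of eq (2.7): N-1 restrictions, NT - N - k residual dof
F0 = ((RRSS - URSS) / (N - 1)) / (URSS / (N * T - N - k))
p_value = stats.f.sf(F0, N - 1, N * T - N - k)
print(F0, p_value)   # a small p-value rejects H0 of no fixed effects
```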
For consistency of the fixed-effects estimator and the LSDV estimator, it is required that E[(xit − x̄i)(vit − v̄i)] = 0. Because the transformation involves the time-averages, this requires that x is strictly exogenous: E[xit vis] = 0 for all s = 1, . . . , T.
The disadvantage of the within transformation: the fixed-effects (within) estimator cannot estimate the effect of any time-invariant variable, such as gender, religion or the sector the firm operates in. These time-invariant variables are wiped out by demeaning the variables. Equivalently, in the LSDV formulation, time-invariant variables are spanned by the individual dummies.
For consistency of the between estimator (OLS in the regression of the entity means ȳi on x̄i, which uses only the cross-sectional variation), it is required that E[xit µi] = 0 in addition to E[xit vis] = 0. This additional assumption, that the explanatory variables are uncorrelated with the individual-specific effects, may be unreasonable.
2.2 Time-fixed Effects Model

It is also possible that the relevant omitted variables vary over time but are constant across entities. In that case a time-fixed effects model is appropriate:

yit = α + βxit + λt + vit    (2.8)

where λt captures all of the variables that vary over time but are constant cross-sectionally. In the finance literature, for example, the business cycle affects many variables, such as bank credit supply and household saving. A change in the business cycle may influence the credit supply of all banks in the same way. Another example is the regulatory environment or a tax rate change part-way through a sample period.
As in the entity-fixed effects model, to avoid multicollinearity, we set the fixed effect for one time period to zero. A least squares dummy variable (LSDV) model can then be estimated:

yit = α + βxit + λ2 T2t + λ3 T3t + · · · + λT TTt + vit    (2.9)

where Tjt denotes a dummy variable that takes value 1 for time period j and zero elsewhere.
elsewhere. Similarly as in the entity-xed eects model, we can directly estimate
the parameters in eq(2.8) and obtain LSDV estimator for β . Alternatively, we can
avoid estimating the model containing all the dummies by conducting a within
transformation, which subtracts the cross-sectional averages from each observation
1 PN
where ȳt = yit as the mean of the observations on y across entities for
N i=1
each time period.
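A minimal sketch of the cross-sectional demeaning in eq(2.10) is shown below, again on simulated data with assumed variable names (year, y, x).

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(2)
N, T = 40, 6
years = np.tile(np.arange(T), N)
lam = rng.normal(scale=1.0, size=T)[years]       # time-fixed effects
x = rng.normal(size=N * T)
y = 0.5 * x + lam + rng.normal(size=N * T)
df = pd.DataFrame({"year": years, "y": y, "x": x})

# Subtract the cross-sectional (per-period) mean from each observation, eq (2.10)
dem = df[["y", "x"]] - df.groupby("year")[["y", "x"]].transform("mean")
beta_timefe = (dem["x"] @ dem["y"]) / (dem["x"] @ dem["x"])
print("time-fixed effects (within) estimate of beta:", beta_timefe)
```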
We can test whether time-fixed effects are necessary by testing the joint significance of the λt in eq(2.9), i.e. H0 : λ2 = λ3 = · · · = λT = 0, with an F-test. Now the unrestricted model is the time-fixed effects one; therefore, the URSS and the degrees of freedom of the unrestricted model are those of the time-fixed effects regression, while the RRSS is again that of the pooled regression (1.1):

F0 = [(RRSS − URSS)/(T − 1)] / [URSS/(N T − T − k)] ∼ F(T − 1, N T − T − k)    (2.11)
Finally, it is possible to allow for both entity-fixed effects and time-fixed effects within the same model. Such a model is termed a two-way fixed effects model or two-way error component model, which combines equations (2.1) and (2.8):

yit = α + βxit + µi + λt + vit    (2.12)

The LSDV equivalent model contains both cross-sectional and time dummies:

yit = α + βxit + µ2 D2i + · · · + µN DNi + λ2 T2t + · · · + λT TTt + vit    (2.13)

The within transformation now proceeds in two steps. In step 1, we average eq(2.12) over time for each entity,

ȳi = α + βx̄i + µi + λ̄ + v̄i    (2.14)

and subtract eq(2.14) from eq(2.12), which removes µi:

ÿit = βẍit + λ̈t + v̈it    (2.15)

where the double dots above the variables denote the values after subtracting the time mean (so that, for example, λ̈t = λt − λ̄). Then, in step 2, we can subtract the cross-sectional mean of the variables in eq(2.15) to remove λ̈t. The same estimates will be obtained if step 2 is performed before step 1.
The two-way within transformation is more complicated to implement than the one-way within transformation. Fortunately, many statistical software packages compute the two-way within estimator for us. Alternatively, we can implement the within transformation only for the large dimension of the data and then estimate the regression on the demeaned variables with dummy variables for the small dimension. For example, microdata panels (panels with large N and small T) solicited from household surveys normally contain observations on many households but over only a few periods. In this case, we can first transform the variables by subtracting the time-mean and then run a regression of the demeaned variables that includes dummy variables for the time periods, as in the sketch below. The opposite applies to macrodata panels (panels with small N and large T), such as panels of observations on countries over many time periods.
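The following is a minimal sketch of that strategy for a large-N, small-T panel: entity means are subtracted first, and the small time dimension is handled with dummies. The simulated data and the names (household, year, y, x) are assumptions.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(3)
N, T = 1000, 4                     # many households, few periods
hh = np.repeat(np.arange(N), T)
yr = np.tile(np.arange(T), N)
mu = rng.normal(size=N)[hh]        # household-fixed effects
lam = rng.normal(size=T)[yr]       # time-fixed effects
x = rng.normal(size=N * T)
y = 0.5 * x + mu + lam + rng.normal(size=N * T)
df = pd.DataFrame({"household": hh, "year": yr, "y": y, "x": x})

# Step 1: within transformation over the large dimension (households)
dem = df[["y", "x"]] - df.groupby("household")[["y", "x"]].transform("mean")

# Step 2: add dummies for the small dimension (years) and run OLS
Tdum = pd.get_dummies(df["year"], drop_first=True, dtype=float)
X = np.column_stack([np.ones(N * T), dem["x"], Tdum])
coef = np.linalg.lstsq(X, dem["y"], rcond=None)[0]
print("two-way fixed effects estimate of beta:", coef[1])
```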
We can test the two-way fixed effects model against the entity-fixed effects model, the time-fixed effects model, and the pooled OLS model using an F-test, in the same manner as we tested the one-way fixed effects models. The restricted residual sum of squares and the number of degrees of freedom change with the null hypothesis.
2.3 The Random Effects Model

Under the random effects model, the entity-specific effect µi is treated as a random variable that is assumed to be uncorrelated with the explanatory variables, and the composite error term is uit = µi + vit. The model can be estimated efficiently by generalized least squares (GLS). The transformation involved in this GLS procedure is to subtract a weighted mean of the variables. Define the 'quasi-demeaned' data as yit* = yit − θȳi and xit* = xit − θx̄i, where ȳi and x̄i are the time-means of the observations on yit and xit, respectively. The weight θ is a function of the variance of the observation error term, σv², and of the variance of the entity-specific error term, σµ²:

θ = 1 − σv / √(T σµ² + σv²)    (2.17)
With θ defined in this way, the transformation ensures that the error terms in the transformed model are serially uncorrelated. We can then easily compute the random-effects estimator by estimating the transformed model using OLS:

yit* = (1 − θ)α + βxit* + uit*,   where uit* = uit − θūi
2.4 Fixed Effects or Random Effects?

The random effects model is more appropriate when the entities in the sample can be thought of as having been randomly selected from the population. One way to formalize this is to note that the random effects model specifies

E{yit | xit} = α + βxit,

whereas the fixed effects model estimates

E{yit | xit, µi} = α + βxit + µi.

The β coefficients in these two conditional expectations are the same only if E{µi xit} = 0. Therefore, one may prefer the fixed effects estimator if some interest lies in the µi themselves, which makes sense if the number of units is relatively small and of a specific nature.
The random effects approach is valid only when the error term µi is uncorrelated with all explanatory variables. Random-effects estimators will be biased and inconsistent if µi is correlated with some of the explanatory variables. To see how this arises, suppose that we have only one explanatory variable, x2it, that varies positively with yit and also with the error term, µi. The estimator will ascribe all of any increase in y to x when in reality some of it arises from the error term, resulting in biased coefficients. In contrast, the fixed effects estimator is consistent regardless of the relationship between the explanatory variables and µi. Therefore, even if we are not interested in particular individuals and the sample is randomly selected from the population, the fixed effects estimator may be preferred.
The fixed-effects estimator exploits only the within dimension of the data (differences within individuals). The between dimension of the data (differences between individuals) is lost in the within transformation, because the transformation produces observations in deviation from individual averages and thus removes the cross-sectional variation. In contrast, random-effects estimators use more of the variation in the data (specifically, they also use the cross-sectional/between variation). So, if the assumptions of the random effects model are valid (i.e. the random-effects estimators are consistent), the random-effects estimators will be more efficient (have smaller standard errors) than fixed-effects estimators.
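To make the within/between distinction concrete, the sketch below decomposes the total variation of a simulated panel variable into its within and between components; the data and names are assumptions.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(5)
N, T = 50, 6
firms = np.repeat(np.arange(N), T)
x = rng.normal(size=N)[firms] + 0.5 * rng.normal(size=N * T)
df = pd.DataFrame({"firm": firms, "x": x})

xbar_i = df.groupby("firm")["x"].transform("mean")   # entity time-means
xbar = df["x"].mean()                                 # overall mean

within_ss = ((df["x"] - xbar_i) ** 2).sum()   # variation used by fixed effects
between_ss = ((xbar_i - xbar) ** 2).sum()     # variation discarded by demeaning
total_ss = ((df["x"] - xbar) ** 2).sum()

print(within_ss + between_ss, total_ss)       # the two components add up
```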
An estimator b of a parameter β is unbiased if its sampling distribution has an average equal to the true value β of the parameter. This is formally formulated as

E{b} = β.    (3.1)
Rit − Rft = λ0 + λm β̂i + ui    (4.1)
where the dependent variable is the excess return of entity i at each time t and the independent variable is the estimated beta for i.
If the CAPM holds, λ0 should not be significantly different from zero and λm should approximate the (time-average) equity market risk premium, Rm − Rf. Fama and MacBeth (1973) proposed estimating this second-stage regression separately for each time period and then taking the average of the parameter estimates to conduct hypothesis tests. However, regression (4.1) combines a cross-sectional and a time-series dimension, so we can also achieve a similar objective using a panel approach. For this example, we will use a sample comprising the annual returns and estimated betas for eleven years on 2500 UK firms provided by Brooks (2019): https://www.cambridge.org/as/academic/subjects/economics/finance/introductory-econometrics-finance-4th-edition?format=PB → 'Resources' → 'General Resources' → 'Excel files' → 'panelex.xls'.
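As a rough illustration of the panel approach to (4.1), the sketch below reads the spreadsheet and runs a pooled regression of excess returns on estimated betas. The column names (beta, excess_ret) are hypothetical placeholders and would need to be replaced by the actual headers in panelex.xls.

```python
import numpy as np
import pandas as pd

# Hypothetical column names; check panelex.xls for the actual headers
df = pd.read_excel("panelex.xls")          # long format: one row per firm-year
df = df.rename(columns=str.lower)

# Pooled OLS of excess returns on estimated betas, eq (4.1)
X = np.column_stack([np.ones(len(df)), df["beta"]])
lam = np.linalg.lstsq(X, df["excess_ret"], rcond=None)[0]
print("lambda_0:", lam[0], "lambda_m:", lam[1])
```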
References

Baltagi, B. H. and Levin, D. (1992). Cigarette taxation: raising revenues and reducing consumption. Structural Change and Economic Dynamics, 3(2):321–335.

Brooks, C. (2019). Introductory Econometrics for Finance, 4th edition. Cambridge University Press, Cambridge.

Fama, E. F. and MacBeth, J. D. (1973). Risk, return, and equilibrium: Empirical tests. Journal of Political Economy, 81(3):607–636.

Hausman, J. A. (1978). Specification tests in econometrics. Econometrica, 46(6):1251–1271.