Econometrics II CH-4
Econometrics II CH-4
Department of Economics
Econometrics II
By Desalegn N.
Outlines
Introduction
In time series data, we observe the values of one or more variables over a period of
In cross-section data, values of one or more variables are collected for several
sample units, or entities, at the same point in time (e.g., consumption level for 50
In panel data, the same cross-sectional unit (say a family or a firm or state) is
Thus, panel data is a combination of cross-sectional and time series data. panel
Panel data/longitudinal data study over time about variables or group of subjects.
Advantage of Panel Data
Unobserved Heterogeneity
Individual dynamics
Panel Data take into account unobserved heterogeneity explicitly by allowing for
individual-specific variables.
It can better detect and measure effect that simply cannot be observed in pure
For instance, The effect of innate ability on earnings can be better studied if we
In addition firms' output depends on physical inputs denoted by x (observed in the
Thus, Uit= miβk+1 + εit as result estimating by OLS leads to biased and
inconsistent result.
However, Panel data offer the possibility to control for (eventually eliminate)
For instance, if Xit is correlated with unobserved individual specific factors (αi), it
More generally, estimating the model under the Fixed effect assumption eliminates
αi from the error term and, consequently, eliminates all endogeneity problems
relating to it.
Identification of Individual Dynamics
By studying the repeated cross section of observations, panel data are better suited
to study the dynamics of change. This is because individual histories are observed
Example: Spells of unemployment, job turnover, and labor mobility are better
By combining time series and cross-section observations, panel data give more
informative data, more variability, less collinearity among variables, more degrees
Panel data gives efficient and accurate result even the observation of panel equals
This reasoned by panel data allow explanatory variables to vary over two
In the case of panel data sets consisting time and cross-sectional dimension using
Fixed effect model: is simply a linear regression model in which the intercept terms
where αi is individual intercept, also, we usually assumed that all X’it are
Suppose firm's output (Yit) depends on the amount of labor (X1it) and capital
(X2it) for the period 2001-2022 for four firms: F1, F2, F3, and F4.
Here, there are 4 cross-sectional units and 22 time periods. Thus, the number of
In fixed effect model the intercept assumed to differ across individuals (here the
four firms), each individual's intercept does not vary over time; that is, it is time
invariant.
Since FEM: Estimate β while controlling for all sources of heterogeneity that are
This can be done by: dummy variable technique, particularly, the differential
intercept dummies.
Estimation of Panel Data Regression Model : The Fixed Effects Approach
F4, 0 otherwise.
Here there is no dummy for F1 (the comparison region). i.e, α1 represents the
how much the intercepts of F2, F3, and F4 differ from the intercept of F1.
Estimation of Panel Data Regression Model : The Fixed Effects Approach
(LSDV)/fixed effect.
But, if there exist many cross sectional unit using Least Square Dummy Variable
Using within estimator (FEE), exactly the same estimator for β is obtained if the
Assumption of FEM:
Unbiasedness: All Xit are independent of all εit. E {(Xit − 𝑋𝑖) εit} = 0
Consistent
Estimation of Panel Data Regression Model : Random Effects Model
REM: is a panel model that treat αi as random variable with a mean value of “α1” (no
subscript i here).
Accordingly the intercept value for an individual company can be expressed as:
αi = α1+ Ui
where Ui is a random error term with a mean value of zero and variance of constant
variance.
Estimation of Panel Data Regression Model : Random Effects Model
What we are essentially saying is that the four firms included in our sample are
drawing from a much larger universe. As result, selected firms have a common mean
value for the intercept (= α1) and the individual differences in the intercept values of
To demonstrate the mechanics of the random effects estimator, define the time
. .
means of Yit and Xit as where: 𝑌𝑖 = 𝑡 𝑌𝑖𝑡 /𝑇and 𝑋𝑖 = 𝑡 𝑋𝑖𝑡 /𝑇.
Estimation of Panel Data Regression Model : Random Effects Model
parameters.
If the individual effects αi and the explanatory variables are correlated, the
correlation can be handled by using the FEM, which essentially eliminates the αi
from the model, and thus eliminates any problems that they may cause and FEM is
appropriate.
Fixed or random effects model?: Hausman Test
In contrast, the REM may be preferred if one wants to make inference with respect
to the population characteristics. One way to formalize this is noting the following:
Thus, REM is appropriate when If the regressors and the individual effects are not
correlated.
The test used to choose between the two models is known as the Hausman test.
The null hypothesis of this test states that there is no correlation between
regressors and individual effects. So rejection of the null favors the FEM.