Panel Data Analysis Using Eviews
Panel Data Analysis Using Eviews
User Guide
Panel Data Analysis
With Special Application to Monetary
Policy Transmission Mechanism
Prepared By
Dr Esman Nyamongo
Assistant Director
Research Department
Central Bank of Kenya
Published by
COMESA Monetary Institute (CMI)
First Published 2019 by
All rights reserved. Except for fully acknowledged short citations for
purposes of research and teaching, no part of this publication may be
reproduced or transmitted in any form or by any means without prior
permission from COMESA.
Disclaimer
The views expressed herein are those of the author and do not in any way
represent the official position of COMESA, its Member States, or the
affiliated Institution of the Author.
~ vi ~
4.5 Random Effects Model ..................................................................... 70
4.5.1 Testing the validity of the random effects: Hausman test ............................ 71
References ................................................................................................... 73
~ vii ~
LIST OF FIGURES
Figure 1: Creating a work file .............................................................................. 34
Figure 2: Getting the data into Eviews .............................................................. 35
Figure 3: Estimation in a pool object ................................................................. 36
Figure 4: Estimation result .................................................................................. 37
Figure 5: BLC based on pooled regressions analysis ....................................... 44
Figure 6: GMM Model Specification ................................................................. 93
LIST OF TABLES
Table 1.1: Panel data set: Normalised bank size for 5 banks ........................... 4
Table 3.1: Raw data on bank size and loan ....................................................... 32
Table 3.2: Stacked data on bank size and loan ................................................. 33
Table 3.3: Data on size and growth rate of loan with cross-section
identifiers ............................................................................................. 38
Table 4.1: Dummy variables ................................................................................ 51
Table 4.2: Demeaned data ................................................................................... 61
Table 5.1: Data ...................................................................................................... 77
Table 5.2: transformed data: ................................................................................ 77
Table 6.1: Stacked data on bank size and loan ................................................. 86
Table 6.2: The Estimation results ....................................................................... 94
~ viii ~
LIST OF ACRONYMS
~ ix ~
PREFACE
The preparation of this User’s Guide followed a directive to COMESA Monetary
Institute (CMI) by the 22nd Meeting of the COMESA Committee of Governors of
Central Banks which was held in Bujumbura, Burundi in March 2017. Governors
noted that panel data analysis of monetary policy is an important prerequisite for
implementing a sound monetary policy, as it allows a judgement to be formed as to
the extent and the timing of monetary policy decisions which are appropriate in
order to maintain price stability.
The overall objective of this User’s Guide is to equip Users with skills to undertake
analysis of all aspects of panel data analysis. The User’s Guide demonstrates all
steps in panel data analysis from data organization to results interpretation,
applying bank level data using the EViews software.
Ibrahim A. Zeidy
Chief Executive Officer
~ x ~
ACKNOWLEDGEMENTS
Although this User’s Guide is a sole work of the author, its present form is
a product of inputs from various training workshops organized by
COMESA Monetary Institute (CMI). The Author acknowledges all
participants of various CMI training workshops, who provided comments
that assisted in making the User’s Guide clearer and more User friendly.
The Author also thanks the Director, Mr. Ibrahim Abdullahi Zeidy and the
Senior Economist, Dr. Lucas Njoroge for providing technical and expert
assistance, and all the staff of the Institute for the facilitation and logistical
support towards the completion of the User’s Guide.
~ xi ~
Chapter 1
There are two distinct sets of information that can be derived from cross-
sectional time series data. The cross-sectional component of the data set
reflects the differences observed between the individual subjects or entities
whereas the time series component which reflects the differences observed
for one subject over time. For instance, researchers could focus on the
differences in data between each person in a panel study and/or the
changes in observed phenomena for one person over the course of the
study (e.g., the changes in income over time of person 1 in Panel Data Set).
On the other hand, an undated panel uses group specific default integers as
cell IDs; by default, the cell IDs in each group are usually given by the
default integers (1, 2, ...N).
In the simplest form of balanced panel data, every cross-section follows the
same regular frequency, with the same start and end dates—for example,
data with 20 cross-sections, each with annual data from 1961 to 1970. In
this case, we say that the panel is balanced.
~ 2 ~
Introduction to Panel Data Analysis
“missing” cell IDs, we ensure that all of the cross-sections have the same
set of cell IDs.
Balancing data between the starts and ends involves adding observations
with cell IDs that are not in the given group, but are both observed
elsewhere in the data and lie between the start and end cell ID of the given
group. If, for example, the earliest cell ID for a given group is “2000” and
the latest ID is “2010”, the set of cell IDs to consider adding is taken from
the list of observed cell IDs that lie between these two dates. The effect of
balancing data between starts and ends is to create a panel that is internally
balanced, that is, balanced for observations with cell IDs ranging from the
latest start cell ID to the earliest end cell ID.
Assuming one is interested in analysing data for 5 banks over the period
2000 to 2016. A typical balanced panel dataset may appear as shown on
Table 1:
~ 3 ~
Guidelines on Panel Data Analysis
Table 1.1: Panel data set: Normalised1 bank size for 5 banks
Bank1 Bank2 Bank3 Bank4 Bank5
2000 6.3 4.6 6.7 4.1 3.5
2001 5.8 5.1 6.7 2.9 3.2
2002 4.5 3.4 6.7 1.8 3.3
2003 2.1 2.2 9.3 -1.4 3.3
2004 2.0 2.0 9.8 -2.1 3.6
2005 1.9 1.7 9.8 -1.5 3.6
2006 2.0 1.5 9.8 -1.4 3.6
2007 1.5 -2.2 9.8 -2.4 3.6
2008 0.5 1.6 9.8 -2.2 3.6
2009 0.6 0.1 9.8 -1.4 1.1
2010 1.0 1.2 10.5 -1.3 0.8
2011 6.4 1.8 9.8 -1.8 0.8
2012 1.5 2.0 8.5 -1.1 2.8
2013 2.4 2.4 9.8 -0.7 2.8
2014 2.5 2.5 9.8 0.1 2.7
2015 3.7 1.5 9.8 0.5 2.7
2016 3.3 3.4 9.8 0.4 2.7
1
Bank Size is measured by total assets. In order to control for the trend in size, total assets for
each bank is normalized by subtracting the log of total assets for each bank from the sample
average. The result is a normalized bank size with banks whose size is larger than the sample
average having a positive value and banks with a size smaller than the sample average
having a negative value.
~ 4 ~
Introduction to Panel Data Analysis
~ 5 ~
Guidelines on Panel Data Analysis
sorts of bias. If panel data over this time period are available, it would
allow the possibility of observing the before- and effects on
individuals of decriminalization as well as providing the possibility of
isolating the effects of treatment from other factors affecting the
outcome.
ii. Controlling the impact of omitted variables. It is frequently
argued that the real reason one finds (or does not find) certain effects
is due to ignoring the effects of certain variables in one’s model
specification which are correlated with the included explanatory
variables. Panel data contain information on both the intertemporal
dynamics and the individuality of the entities may allow one to
control the effects of missing or unobserved variables. For instance,
MaCurdy’s (1981) life-cycle labor supply model under certainty
implies that because the logarithm of a worker’s hours worked is a
linear function of the logarithm of her wage rate and the logarithm of
worker’s marginal utility of initial wealth, leaving out the logarithm of
the worker’s marginal utility of initial wealth from the regression of
hours worked on wage rate because it is unobserved can lead to
seriously biased inference on the wage elasticity on hours worked
since initial wealth is likely to be correlated with wage rate. However,
since a worker’s marginal utility of initial wealth stays constant over
time, if time series observations of an individual are available, one can
take the difference of a worker’s labor supply equation over time to
eliminate the effect of marginal utility of initial wealth on hours
worked. The rate of change of an individual’s hours worked now
depends only on the rate of change of her wage rate. It no longer
depends on her marginal utility of initial wealth.
iii. Uncovering dynamic relationships. “Economic behavior is
inherently dynamic so that most econometrically interesting
relationship are explicitly or implicitly dynamic” (Nerlove, 2002).
However, the estimation of time adjustment pattern using time series
data often has to rely on arbitrary prior restrictions such as Koyck or
Almon distributed lag models because time series observations of
~ 6 ~
Introduction to Panel Data Analysis
~ 7 ~
Guidelines on Panel Data Analysis
~ 8 ~
Chapter 2
Data analysis within the context of panel may be conducted using a variety
of software currently available in the market. For the purpose of this User’s
Guide we demonstrate how to use EVIEWS to analyse panel data sets. In
addition, while acknowledging that we need to use bank data we, however,
use fictitious data to demonstrate how to get started in EVIEWS.
Therefore, in this chapter we will data generated purposely for
demonstration.
depending on the version of the software. For you to open and operate
Eviews, take the cursor to this icon and double click the mouse.
Across the bottom is the Current Path for reading data and saving files.
The EViews Help Menu is going to become a close friend. Use it when
you need guidance on how to navigate the software.
~ 10 ~
Getting Started in Eviews Software
Before we get data into Eviews for analysis there is data preparation which
needs to be done in excel. The first step in data preparation in excel is to get
data for all variables and prepare it using the following procedure. We
demonstrate data preparation using data using normalised Bank level data
on Bank size. After this has been demonstrated then any data set may be
prepared in the same version. The first step in data preparation in excel is to
get data for each individual cross sections, putting similar data together, that
is, if one is organising a range of data sets on banks, we require that, for
example, data on ‘Bank size’, be put together while noting the name of the
bank where the bank relates to. In our present case we have 5 cross sections
(five banks) as shown in the table. The second step is to provide cross
section identifiers. In the table you may notice that the variable name is
‘Size’. In order to identify the data with the bank we include an extension.
Here we add an underscore (_) followed by the unique name for each cross
section. In our present case the variable for the five cross sections are as
follows ‘Size_Bank1; Size_Bank2; Size_Bank3; Size_Bank4; Size_Bank5’. In
case one has more than one variable the same procedure is followed. We
will explain this further under the naming conventions in the pool object.
~ 11 ~
Guidelines on Panel Data Analysis
Once you have populated the window as indicated above, click on ‘OK’ to
obtain the following screen. Now save your workfile in your preferred
location by clicking file, save.
~ 12 ~
Getting Started in Eviews Software
Copy your data in excel excluding the year and paste as shown below;
~ 13 ~
Guidelines on Panel Data Analysis
Click ok and close the group. Your workfile now looks as follows;
~ 14 ~
Getting Started in Eviews Software
Pool object. When working with pooled data, note that the series that are
being pooled need not be of the same dimension, so for example we can
have time series for various variables comprising different samples. The
data in our workfiles are balanced, that is, they have a common sample
period.
~ 15 ~
Guidelines on Panel Data Analysis
Step 1: Click object, new object, and the following window emerges;
~ 16 ~
Getting Started in Eviews Software
~ 17 ~
Guidelines on Panel Data Analysis
Step 3: Click ok and the window below asking you to supply cross-sectional
identifiers emerges;
~ 18 ~
Getting Started in Eviews Software
~ 19 ~
Guidelines on Panel Data Analysis
~ 20 ~
Getting Started in Eviews Software
Note the use of the ‘?’ character. The ? tells EViews to apply the
transformation to the cross-sectional series in the Pool. The lags of the
series (-1) must be placed after the ?. One can also define and/or adjust the
sample size under the sample window.
Note that this is just for demonstration on how to conduct data transformations. So we
are not going to perform this procedure. There are many other transformations which can
be done within the pool object, this is just one of them.
~ 21 ~
Guidelines on Panel Data Analysis
PANEL A PANEL B
In terms of viewing the data, we can either plot the series individually or
open them as a Group (note that a group is not the same as a Pool object
for EViews. A Group object is mainly used for working with collections of
series.) If you select all (or some of the) series and right click, you can open
them as a Group. Non-continuous time series are chosen by pressing the
Ctrl key and keeping it pressed while selecting individual time series.
In our case we have already created a group, SIZE. To open this group
double click on it. You will see the window below;
Once that data has been successful imported into Eviews, the next step is to
know how to conduct basic analysis. Plot the series by selecting view,
graph. Select the Multiple graphs option in the Multiple series window
and click ok.
~ 22 ~
Getting Started in Eviews Software
Whenever we begin working with a new data set, it is always a good idea to
take some time to simply examine the data. This will help ensure that there
were no mistakes in the data itself or in the process of reading in the data. It
also provides us with a chance to observe the general (time-series)
~ 23 ~
Guidelines on Panel Data Analysis
behaviour of the series we will be working with. Points to look out for
include the possible existence of time trends or structural breaks in the
underlying data. The first thing one should always do is to plot the data to
make sure that it looks fine.
Once data has been entered into Eviews as indicated above and basic
graphical representations have been done. The next step is to conduct
descriptive statistics. In this case there are two avenues available: (i)
computing descriptive statistics for the group (ii) conducting descriptive
statistics for all the groups in the panel at one.
Step 2: Obtaining the descriptive statistics: Once the data has been
presented on the screen as shown on panel B, you then click ‘view’ on
~ 24 ~
Getting Started in Eviews Software
the tool bar that contains the series. Then you will get a drop down
window which contains a number of options. In our case choose
‘Descriptive stat’. Once this is highlighted you will need to choose
whether you need statistics for individual series in the group or
common statistics. If you click on ‘individual series, you will obtain the
following:
A blank screen shown below pops-up. The next step is to list all the
variables of interest in the space provided. Remember to include ‘?’ after
listing the variable as shown below.
~ 25 ~
Guidelines on Panel Data Analysis
~ 26 ~
Getting Started in Eviews Software
~ 27 ~
Chapter 3
A pooled regression involves obtaining the data for all the cross sections of
interest over time and stacking them and run a simple ordinary least squares
(OLS) to obtain the estimates which are BEST. In this case we visualise the
pooled regression model as follows:
Yi ,t X i ,t i ,t 1
Where is Y, the dependent variable, is observed for all cross sections (i= 1,
2…. N), over time t, (t=1, 2 …T). X is the independent variable while it is
the error term. For a better representation of Equation 1, each of the
variables shown may be presented as for each cross section as follows:
Guidelines on Panel Data Analysis
~ 30 ~
Pooled Regression Analysis
In addition, the pooled model being the most basic of the known methods
of panel data analysis, we will devote time to explain how this approach
works. After that is done then it will be very clear on how data is organised
and model estimated in any panel specification. For convenience we seek to
illustrate the panel data estimation using the model shown in Equation 1,
reproduced here as:
Yi ,t X i ,t i ,t
~ 31 ~
Guidelines on Panel Data Analysis
As shown in Table 1, we observe that data for bank size and loan are
observed for each cross section for the period 2000-2016. For example, in
the case of Bank1, the observed size and loan are 4.4 and 6.3, respectively in
2000. To estimate a panel model using this data we stack this data as shown
in Table 2. Stacking involves getting the data for size and loan for each
cross section and arranging them with size and its corresponding loan
arranged as follows:
~ 32 ~
Pooled Regression Analysis
~ 33 ~
Guidelines on Panel Data Analysis
~ 34 ~
Pooled Regression Analysis
You may notice that in Panel A, a dialog box will emerge requiring you to
create workfile. If you click on the ‘Workfile Structure Type’ you will
obtain 3 options (Dated-regular frequency, Unstructured/Undated,
Balanced panel). However, as noted earlier, this dataset is undated,
therefore you need to choose ‘Unstructured/Undated’ in the drop down
menu named ‘Dated-regular frequency’. In addition, you are required to
choose the ‘Data Range’. In our case we have 85 observations. If you click
on ‘OK’ then you will obtain Panel B in Figure 2.
Step 3: Getting the Data into Eviews: Panel B in Figure 1 has been
configured to accept data with a range of 1-85… 85 obs for each bank).
The data we are interested in getting into Eviews as shown in Table 2 is
in Excel, therefore it needs to be copied from Excel and pasted into
Eviews as follows: In the Eviews workfile, on the main tool bar, click on
‘Quick’. Here you will find several options, but choose ‘empty Group’.
In the empty group, paste the data to obtain the following
~ 35 ~
Guidelines on Panel Data Analysis
You will notice that the workfile has 2 variables size and loan. This shows
that the data has been loaded into Eviews successfully and is ready for
further analysis.
You will notice that the estimation method is required. At the bottom of the
Box you will see a provision for ‘Estimation Setting’ – Method. The Default
method is the LS- Least Squares (NLS and ARMA). In the pooled regression
~ 36 ~
Pooled Regression Analysis
approach, the estimation method is the LS, therefore if you click on ‘OK’,
you will obtain the following output:
~ 37 ~
Guidelines on Panel Data Analysis
Table 3.3: Data on size and growth rate of loan with cross-section identifiers
LOAN BY BANK BANK SIZE
Loan_Bank1 Loan_Bank2 Loan_Bank3 Loan_Bank4 Loan_Bank5 Size_Bank1 Size_Bank2 Size_Bank3 Size_Bank4 Size_Bank5
2000 4.4 5.7 5.3 15.9 4.6 6.3 4.6 6.7 4.1 3.5
2001 2.8 4.6 5.2 4.8 4.0 5.8 5.1 6.7 2.9 3.2
2002 12.9 6.1 6.6 5.3 6.2 4.5 3.4 6.7 1.8 3.3
2003 9.2 6.9 1.3 3.5 6.9 2.1 2.2 9.3 -1.4 3.3
2004 3.5 4.4 6.1 4.9 8.3 2.0 2.0 9.8 -2.1 3.6
2005 3.4 8.0 7.5 1.6 5.5 1.9 1.7 9.8 -1.5 3.6
2006 2.4 7.1 17.1 3.0 5.5 2.0 1.5 9.8 -1.4 3.6
2007 1.3 4.3 13.6 7.7 5.4 1.5 -2.2 9.8 -2.4 3.6
2008 1.2 4.5 6.0 5.5 4.3 0.5 1.6 9.8 -2.2 3.6
2009 10.3 6.0 1.5 4.4 4.9 0.6 0.1 9.8 -1.4 1.1
2010 4.2 5.3 1.7 15.8 7.3 1.0 1.2 10.5 -1.3 0.8
2011 11.9 6.1 8.1 8.7 4.8 6.4 1.8 9.8 -1.8 0.8
2012 14.5 6.4 1.6 3.8 4.8 1.5 2.0 8.5 -1.1 2.8
2013 8.5 8.8 4.0 4.8 7.3 2.4 2.4 9.8 -0.7 2.8
2014 7.8 5.2 5.0 1.6 7.7 2.5 2.5 9.8 0.1 2.7
2015 0.9 4.5 5.7 3.8 5.1 3.7 1.5 9.8 0.5 2.7
2016 1.0 6.1 3.8 4.9 7.5 3.3 3.4 9.8 0.4 2.7
~ 38 ~
Pooled Regression Analysis
Recall the steps in creating a pool workfile and cross section identifiers
from section 2.3.2
Step 3: Estimate model: You should use the Method dropdown menu to
choose between LS - Least Squares (LS and AR), TSLS - Two-Stage Least
Squares (TSLS and AR), and GMM / DPD – Generalized Method of
Moments / Dynamic Panel Data (DPD) techniques. In this case we are
interested in the LS - Least Squares (LS and AR). Click on ‘OK’ to obtain
the following:
Panel A Panel B
So far we have used variables bank size and growth rate of loans by bank to
illustrate how one navigates the Eviews software to generate output.
However, for illustration based on ‘monetary transmission mechanism’ it is
not possible to parade bank specific data. To meet this objective, however,
we present and interpret the results based on a study on Kenya:
~ 39 ~
Guidelines on Panel Data Analysis
bank level data on 39 commercial banks using monthly data for 2001-2015.
To accomplish this task, we adopt a framework similar to that of Kashyap
and Stein (2000) and Walker (2013) which exploits the heterogeneous
nature of commercial banks to establish whether or not the BLC exists in
Kenya.
~ 40 ~
Pooled Regression Analysis
The validity of the bank lending channels requires that commercial banks
which experience a change in their deposits or reserves on account of
monetary policy action should adjust their lending. Suggesting that a
negative relationship is expected between a monetary policy indicator and
commercial banks’ lending. However, as shown in the literature the choice
of a monetary policy variable is controversial. As shown by Kashyap and
Stein (2000), there is no general agreement on the appropriate choice of an
indicator of monetary policy. In the literature a set of possible indicators
have been suggested: change in short term interest rate under the control of
the central bank, the residuals from a vector autoregression (VAR)
representing the reaction function of the central bank (Bernanke and
Mihov, 1998), the narrative approach (Boschen and Mills, 1995). Etc. In
this study we proxy monetary policy by the Central Bank Rate (CBR)
and/or the interbank rate.
~ 41 ~
Guidelines on Panel Data Analysis
log S it
S log S it i 1
Nt ;
Where S the adjusted bank size and N is the number of banks in the
sample.
Following Kashyap and Stein (2000), it is shown that given the same
characteristics of banks, except the level of liquidity, the banks will react
differently to a monetary policy shock. That is assuming two banks which
both face difficulties in raising external finance, and are alike in all respects
except that one has a more liquid balance sheet (as measured by the ratio of
securities to assets) than the other. In the event of a monetary shock, it is
easier for the more liquid bank to protect its loan portfolio, as it can draw
down on its buffer stock of securities. In contrast, the less liquid bank will
have to cut its new loans to a greater extent to prevent its securities holdings
from falling too low. Whereas relatively liquid banks can draw down their
liquid assets to shield their loan portfolio, this is not feasible for less liquid
banks. This therefore suggests that more liquid banks tend to be less
sensitive to monetary policy shock compared with those with low liquidity.
Therefore, maintaining high liquidity levels is not conducive for monetary
policy transmission. How then do we measure liquidity? There are various
ways, in this study, however, we measure as a ratio of liquid assets to total
assets.
N
log L it
L log Lit i 1
Nt
A it Nt
~ 42 ~
Pooled Regression Analysis
Where K the adjusted capitalization of commercial banks, E is the total
equity and A is the total assets.
~ 43 ~
Guidelines on Panel Data Analysis
~ 44 ~
Pooled Regression Analysis
References
Ashcraft, Adam B., 2006. New evidence on the lending channel. Journal of Money,
Credit and Banking, 38, 751-775.
Altunbas, Y., Fazylov, O., and Molyneux, P., (2002). Evidence on the Bank
Lending Channel in Europe. Journal of Banking and Finance. 26:11. 2093-
2110.
Bernanke, B.S. and Gertler, M., (1995). Inside the black box: The credit channel of
monetary policy. Journal of Economic Perspectives. 9:4. 27-48.
De Bondt, Gabe, J., 1999. Credit channels in Europe: Cross-country investigation.
Research Memorandum WO&E no. 569. De Nederlandsche Bank,
February.
Ehrmann, M., Gambacorta, L., Martinez-Pages, J., Sevestre, P., and Worms, A.,
(2001). Financial systems and the role of banks in monetary policy
transmission in the euro area. European Central Bank Working Paper No.
105.
Favero, Carlo A., Giavazzi, Francesco, Flabbi, Luca, 1999. The Transmission
mechanism of monetary policy in Europe: Evidence form banks’ balance
sheets. National Bureau of Economic Research, Working Paper no. 7231.
Gambacorta, Leonardo, 2005. Inside the bank lending channel. European
Economic Review, 49, 1737-1759.
Kashyap, A.K., and Stein, J.C, (2000). What Do a Million Observations on Banks
Say about the Transmission of Monetary Policy? American Economic
Review. 90:3. 407-428.
Kashyap, Anil K., Stein, Jeremy C., 1995. The impact of monetary policy on bank
balance sheets. Carnegie-Rochester Conference Series on Public Policy 42,
151-195.
Kashyap, Anil K., Stein, Jeremy C., 1997. The role of banks in monetary policy: A
survey with implications for the European Monetary Union. Economic
Perspectives, Federal Reserve Bank of Chicago 21, pp. 2–19.
Kishan, Ruby P., Opiela, Timothy P., 2000. Bank size, bank capital, and the bank
lending channel. Journal of Money, Credit and Banking, 32, 121-141.
~ 45 ~
Guidelines on Panel Data Analysis
~ 46 ~
Chapter 4
uit vi t it 2
Guidelines on Panel Data Analysis
uit vi it 3a
uit t it 3b
~ 48 ~
Error Component Model Analysis
yit 0 1 xit t it 4a
Or
yit 0 1 xit vi it 4b
The fixed effects model assumes that ( vi ) and ( t ) are separate parameters.
For illustration purposes we use the case where we estimate the ( vi ). In
estimating the separate parameters ( vi ) use the following two equivalent
methods: the Least Squares Dummy Variable (LSDV) method and the
Within-Q- Estimation method.
yi X i 1T vi i
~ 49 ~
Guidelines on Panel Data Analysis
In this case the parameters for X are estimated. In addition, dummies for
each of the cross sections are estimated. In its non-compact form, it is
represented as follows:
y1 X 1 1T . 0 v1 1
.
. . 0 0 . .
. .
0 0 . .
y X 0 0 0 1T vn i
i n
y X I n 1T v X Dv
To estimate a model using this method the following steps are followed:
Step 1: Organise the data for the key variables size and growth of loan
as shown below:
LOAN BY BANK BANK SIZE
Loan_Bank1 Loan_Bank2 Loan_Bank3 Loan_Bank4 Loan_Bank5 Size_Bank1 Size_Bank2 Size_Bank3 Size_Bank4 Size_Bank5
2000 4.4 5.7 5.3 15.9 4.6 6.3 4.6 6.7 4.1 3.5
2001 2.8 4.6 5.2 4.8 4.0 5.8 5.1 6.7 2.9 3.2
2002 12.9 6.1 6.6 5.3 6.2 4.5 3.4 6.7 1.8 3.3
2003 9.2 6.9 1.3 3.5 6.9 2.1 2.2 9.3 -1.4 3.3
2004 3.5 4.4 6.1 4.9 8.3 2.0 2.0 9.8 -2.1 3.6
2005 3.4 8.0 7.5 1.6 5.5 1.9 1.7 9.8 -1.5 3.6
2006 2.4 7.1 17.1 3.0 5.5 2.0 1.5 9.8 -1.4 3.6
2007 1.3 4.3 13.6 7.7 5.4 1.5 -2.2 9.8 -2.4 3.6
2008 1.2 4.5 6.0 5.5 4.3 0.5 1.6 9.8 -2.2 3.6
2009 10.3 6.0 1.5 4.4 4.9 0.6 0.1 9.8 -1.4 1.1
2010 4.2 5.3 1.7 15.8 7.3 1.0 1.2 10.5 -1.3 0.8
2011 11.9 6.1 8.1 8.7 4.8 6.4 1.8 9.8 -1.8 0.8
2012 14.5 6.4 1.6 3.8 4.8 1.5 2.0 8.5 -1.1 2.8
2013 8.5 8.8 4.0 4.8 7.3 2.4 2.4 9.8 -0.7 2.8
2014 7.8 5.2 5.0 1.6 7.7 2.5 2.5 9.8 0.1 2.7
2015 0.9 4.5 5.7 3.8 5.1 3.7 1.5 9.8 0.5 2.7
2016 1.0 6.1 3.8 4.9 7.5 3.3 3.4 9.8 0.4 2.7
~ 50 ~
Error Component Model Analysis
Step 2: Create cross section dummies. Here the dummies for each cross
section as if they are variables. In this case we have 5 cross-section,
therefore we need 5 dummies as follows:
As shown above we create a dummy for each cross section. For example, in
the case for Bank1, we create a dummy with 1 and zeroes elsewhere, for the
period 2000 to 20016.
~ 51 ~
Guidelines on Panel Data Analysis
Cross-section
Data range is stated identifiers are listed
Step 4: Getting data into Eviews: To get data into Eviews, click on the
‘Quick button’ followed by ‘Empty Group’. Then the following empty
box will appear:
This is the box where data for each individual variable will be entered. In
our case we have 2 variables (loan and bank size) and 5 dummy variables.
Therefore, in total we have seven variables to enter separately. For example,
we may start with bank size, in which case we cut the data on variable bank
size from Excel and paste it in this blank box to give:
~ 52 ~
Error Component Model Analysis
Give variable
name here
The same procedure is followed for the rest of the variables i.e. DLOAN,
D1, D2, D3, D4 and D5. Once all the variables have been pasted in Eviews,
The 7 variables
are entered and
each is identified
as a group
~ 53 ~
Guidelines on Panel Data Analysis
Enter Dependent
variable here
followed by ? Enter independent
variables here each
followed by ?
However, to avoid the Dummy variable trap, we exclude one cross section
dummy from the regression. In this particular regression you may notice we
have excluded D5. There is no rule regarding the dummy variable to
exclude from the regression. The parameter for excluded dummy variable
will be accounted for once the estimation procedure is completed. Once all
the variables have been entered as shown click on the ‘OK’ button and the
following result will show.
~ 54 ~
Error Component Model Analysis
As you may notice, this is a long procedure for implementing the DVE
approach. However, Eviews has a shortcut which delivers on the same
results. In this case, when entering data for the independent variables do
not enter dummy variables in the space provided. Instead enter the constant
‘C’ in the space as shown below:
~ 55 ~
Guidelines on Panel Data Analysis
You may wish to compare the result from the long- and short- procedure
for implementing the DVEM as shown below. For instance, panels A and B
show that results for bank size (SIZE?) are exactly the same whether we use
dummy variable estimation or the short cut via cross-section identifiers.
Panel A Panel B
~ 56 ~
Error Component Model Analysis
In this illustration we
abstract from the basic steps
in the data preparations and
creating an enabling
environment in Eviews for
panel data analysis.
Therefore, for us to show
how the Least Squares
Dummy Variable (LSDV)
estimation method is executed we follow the following steps:
~ 57 ~
Guidelines on Panel Data Analysis
Step 1: Setting up the LSDV. In the LSDV model set up, the important
aspect is the estimation of the individual specific coefficients. To
implement this, we perform only one modification to the estimation
procedure in the pooled regression approach. The dependent variable as
before is “tcad?”. To allow for cross section specific coefficients you
enter a ‘C’ in the space provided for Cross-section specific coefficients.
~ 58 ~
Error Component Model Analysis
y it X it v i it
E X it v i 0 5
E X it it 0
yi X i vi i
1 T 1 T 1 T
y
T t 1
yit ; X X it ; v vi
T t 1 T t 1
~ 59 ~
Guidelines on Panel Data Analysis
Step 2: Demeaning the data: For each variable subtract the mean
obtained from step 1 to obtain the following:
yit yi X it X i vi vi it i
~
y it X it ~ it
~
̂ FE X X X ~
~ ~ 1 ~ n ~ ~ n
~ ~
y Xi Xi X i yi
i 1 i 1
Step 1: Data preparation: Organise the data for the variables loan, and
bank size and also compute the mean value for each of the cross-
sections.
~ 60 ~
Error Component Model Analysis
Step 3: copy the transformed data and paste it in Eviews. But first
open a new file with the same dimensions as before. While following the
steps for creating a panel object, we estimate the equation with
transformed data (SizeT and DloanT) as shown below:
~ 61 ~
Guidelines on Panel Data Analysis
~ 62 ~
Error Component Model Analysis
Click on drop
down and select
‘Fixed’
~ 63 ~
Guidelines on Panel Data Analysis
The LSDV and within-q- methods are equivalent: you may now see that
the LSDV and the Within-q- method are equivalent as shown in panels A
and B where the results for bank size (SIZE?) are exactly the same.
~ 64 ~
Error Component Model Analysis
Panel A Panel B
yit 0 1 xit vi it
The test being implemented here calls seeks to establish whether ( vi ) exists
or not by testing the following hypothesis:
H 0 : v1 v2 .......... ......... v N 1 0
~ 65 ~
Guidelines on Panel Data Analysis
F
RSS URSS / N 1 F
URSS / NT N K
N 1 NT N K
Step 1: Estimate the fixed effects model: In our example above we obtained
the following result based on cross-section effects:
Step 2: Once the result above is obtained, go to the tools bar and click on
‘view’ then choose ‘Fixed/Random Effects testing’ followed by
‘Redundant Fixed Effects—likelihood ratio’ and obtain the following:
~ 66 ~
Error Component Model Analysis
Step 3: The Decision rule: the part of the result which is critical to our
decision is indicated here below. The decision rule is as follows: if the P-
value < 0.05 we reject the null hypothesis, implying that the fixed effects are
not redundant. However, if P-value > 0.05, we fail to reject the null
hypothesis, implying that the fixed effects are redundant and therefore
pooled estimation is valid.
~ 67 ~
Guidelines on Panel Data Analysis
In this case it is observed that the Cross-Section F statistics is 88.2 with (4,
79) degrees of freedom whose associated p-value is 0.000, which is less than
0.05, therefore we reject the null hypothesis. In the present case fixed
effects are critical and need to be considered.
1
n
ln( Li ,t ) 1 ln( Li ,t 1 ) 2 IRt n 3 log(Sizei , t )
n0
In this illustration we abstract from the basic steps in the data preparations
and creating an enabling environment in Eviews for panel data analysis.
Therefore, for us to show how the Least Squares Dummy Variable (LSDV)
estimation method is executed we follow the following steps:
~ 68 ~
Error Component Model Analysis
Step 1: Setting up- the Within Estimation. In the within model set up,
the important aspect is the computation of the individual specific
coefficients. Eviews is configured to implement this approach. To
implement the fixed effects, you check the Estimation Method - the Fixed
and Random effects are automated. The default shows cross-section:
‘None” and Period: ‘None’. First, you should account for individual and
period effects using the Effects specification dropdown menus. By
default, Eviews assumes that there are no effects so that both dropdown
menus are set to None. You may change the default settings to allow for
either Fixed or Random
effects in either the cross-
section or period dimension,
or both. The dependent
variable as before is “tcad?”.
To allow for fixed effects,
you click on Cross-section
and choose ‘Fixed’.
The estimated
coefficient of the
monetary policy variable
(IBR) is negative and
significant at 1 percent
level of testing. The
estimated coefficient is
0.15, implying that a 1
percent change in
monetary policy stance
will result in a 0.15
percent change in the
total credit to the
~ 69 ~
Guidelines on Panel Data Analysis
private sector, in the opposite direction. The fact that the estimated
coefficient is negative and significant suggests that monetary policy is
effective in Kenya.
The estimated coefficient of bank size is found to be positive and
significant at 1 percent as well, suggesting that larger banks lend to
change lending to the private sector more than the small banks.
The other coefficients, mainly those with an extension of ‘_C’ are the
fixed effects. You may notice the fixed effects do not have –statistics
nor standard errors, meaning they are not estimated but computed.
~ 70 ~
Error Component Model Analysis
To estimate the random effects model we follow the same procedure like
the one for the fixed effects model. In this case we follow the following
steps:
Select random
effects option from
the drop-down menu
here
~ 71 ~
Guidelines on Panel Data Analysis
~ 72 ~
Error Component Model Analysis
References
Ashcraft, Adam B., 2006. New evidence on the lending channel. Journal of Money,
Credit and Banking, 38, 751-775.
Altunbas, Y., Fazylov, O., and Molyneux, P., (2002). Evidence on the Bank
Lending Channel in Europe. Journal of Banking and Finance. 26:11. 2093-
2110.
Bernanke, B.S. and Gertler, M., (1995). Inside the black box: The credit channel of
monetary policy. Journal of Economic Perspectives. 9:4. 27-48.
De Bondt, Gabe, J., 1999. Credit channels in Europe: Cross-country investigation.
Research Memorandum WO&E no. 569. De Nederlandsche Bank,
February.
Ehrmann, M., Gambacorta, L., Martinez-Pages, J., Sevestre, P., and Worms, A.,
(2001). Financial systems and the role of banks in monetary policy
transmission in the euro area. European Central Bank Working Paper No.
105.
Favero, Carlo A., Giavazzi, Francesco, Flabbi, Luca, 1999. The Transmission
mechanism of monetary policy in Europe: Evidence form banks’ balance
sheets. National Bureau of Economic Research, Working Paper no. 7231.
Gambacorta, Leonardo, 2005. Inside the bank lending channel. European
Economic Review, 49, 1737-1759.
Kashyap, A.K., and Stein, J.C, (2000). What Do a Million Observations on Banks
Say about the Transmission of Monetary Policy? American Economic
Review. 90:3. 407-428.
Kashyap, Anil K., Stein, Jeremy C., 1995. The impact of monetary policy on bank
balance sheets. Carnegie-Rochester Conference Series on Public Policy 42,
151-195.
Kashyap, Anil K., Stein, Jeremy C., 1997. The role of banks in monetary policy: A
survey with implications for the European Monetary Union. Economic
Perspectives, Federal Reserve Bank of Chicago 21, pp. 2–19.
Kishan, Ruby P., Opiela, Timothy P., 2000. Bank size, bank capital, and the bank
lending channel. Journal of Money, Credit and Banking, 32, 121-141.
~ 73 ~
Guidelines on Panel Data Analysis
~ 74 ~
Chapter 5
y X u 1
uit vi t it 2
Where: v… the cross –section effect (time invariant), t, time effect (cross-
section invariant) and e the error term with the usual properties (details to
be provided). In the case where both vi and ti are non-zero, i.e.:
uit vi t it 3c
This is referred to as a 2-way-error components model.
Guidelines on Panel Data Analysis
5.1.1.1 Two –way – error components model: The least squares dummy
variable estimation method
Step 1: preparation of data: We present the data as shown in Table 5.1 and
also compute time- and cross-section means as shown in the green
columns for each variables Size and Loan. In addition, we compute the
mean of means shown on the year cell in the table.
~ 76 ~
Error Component Model Analysis
Step 2: data transformation: In this case the new data (transformed data) is
obtained by subtracting the mean of cross-sections, mean of time
periods and adding back the mean of means to obtain the following data:
~ 77 ~
Guidelines on Panel Data Analysis
Step 3: Copy and paste the data presented on Table 5.2 and paste it in
Eviews in a manner similar to the one discussed in Chapter 3 and 4. You
follow the steps by first going to the main tool bar and choose
‘Object’….. New object….pool’. Once you choose ‘pool’ you will be
prompted to list the ‘cross-section identifiers’ and you if you had settled
on some specific identifiers you then will be able to see an output similar
to the one below.
Step 4: Estimation: once the cross section identifiers have been listed then
process to the tool bar on that dialog box and click on ‘Estimate’ to obtain
the dialog box ‘pool estimation’ shown above. Then populate it with the
dependent variable ‘spreadt2’and regressors ‘SizeT2’ and a constant ‘C’ as
shown in Step 3. To obtain the estimation result click on ‘OK’ to obtain the
following output:
~ 78 ~
Error Component Model Analysis
The output presented is based on a long method but is a step by step way of
demonstrating how the 2-way –error components model is estimated using
fixed effects. We now turn to a short cut when is implemented by copying
and pasting the raw data from Excel to Eviews (i.e. without transformation
‘Loan’(spread and ‘Size’). Once this is done, follow the process of creating a
pool object as discussed above to obtain the following screen:
~ 79 ~
Guidelines on Panel Data Analysis
Step 5: populating the ‘pool estimation’ dialog box. For one to implement
the 2 way fixed effects model we first need to input the dependent
variable in the space ‘Dependent variable’ in our case the dependent
variable is ‘Spread’ and also the space for regressors’ in our case the
regressors are ‘Size’ and C.
Panel A Panel B
Now you may notice that the result obtained above is similar to the one
obtained earlier. For comparison see the following set of output. Notice
that the coefficient for bank size is the same in both cases as shown below:
is this correct?
~ 80 ~
Error Component Model Analysis
~ 81 ~
Chapter 6
yit yi ,t 1 X it i t it
E it 0
E it js 2
E it js 0
The LSDV estimator is consistent for the static model whether the effects
are fixed or random. Therefore need to show that the LSDV is inconsistent
for a dynamic panel data with individual effects, whether the effects are
fixed or random. The bias of the LSDV estimator in a dynamic model is
generally known as dynamic bias or Nickell’s bias (1981).
y it y it 1 y i 2 y i1 u i 3 u i 2
yi 3 yi 2 yi 2 yi1 ui 3 ui 2
Consider t=4
yi 4 yi 3 yi 3 yi 2 ui 4 ui 3
~ 84 ~
Dynamic Panel Data Analysis
yi1 0 . . 0
0 yi1 , yi 2 . . .
Wi . . . . .
. . . . 0
0 . . 0 yi1 , yi 2 ......... yiT 2
Because the instruments are not correlated with the remaining error term,
then our moment condition is stated as:
E wi ui 0
Estimating this equation by GLS yields the preliminary Arellano and Bond
one-step consistent estimator. In case there are other regressors then:
5 KAPi ,t 6 X t 7 Dit vi i ,t
~ 85 ~
Guidelines on Panel Data Analysis
time invariant error component; i,t is the error term with the usual
properties.
Step 1: Organising the data in Excel- Unlike the procedure we have been
using in the previous cases, here we stark the data as follows:
~ 87 ~
Guidelines on Panel Data Analysis
Panel A Panel B
Step 3: When you click on OK on the screen in Panel B, above you will
obtain the screen shown here. You may notice the screen has two new
elements namely the dateid and crossid. These elements are essential in
ensuring that data on a specific cross section at a particular time can be
identified with ease.
~ 88 ~
Dynamic Panel Data Analysis
Step 4: Getting the data from Excel to Eviews: The procedure for
getting data from Excel to Eviews is as explained in Chapter 1, on working
with ‘stacked data’. For illustration, we are working 3 variables: loans
disbursed by bank (Loans), bank size (Size) and policy rate (IBR). Following
the steps of getting stacked data from excel to Eviews you will obtain the
following (adjust for IBR which is cross section wide variable):
~ 89 ~
Guidelines on Panel Data Analysis
Step 5: the regression: in this case you will estimate the equation where
the dependent variable is ‘Loan’ and the independent variables are ‘size
and IBR’. The procedure for estimating a simple OLS equation applies
here. Once this is followed you will then obtain the screen shown below
with the dependent variable listed first followed by the independent
variables, size and ibr, and then the constant C. To estimate a dynamic
GMM model you will need to click on ‘Estimation settings’ and
choose the ‘GMM/DPD- Generalised Method of Moments/Dynamic
Panel’ method as shown here:
The first screen welcomes you to the dynamic panel data model
wizard. As indicated in the screen, you are informed that the wizard
aids you in specifying a
member of the class of
dynamic panel data models
with fixed effects. You are
cautioned that this class of
models are designed for
panels with a large number
of cross-sections and a short
time series. In addition, you
~ 90 ~
Dynamic Panel Data Analysis
are cautioned not to use the wizard to specify a static panel data
model. After this information set, you can then click on ‘Next’
button.
Step 1: Specify the dependent variable: The wizard then shows
you that there are 6 steps in
estimating a dynamic panel. The first
step, as shown in in the screen shot
is to specify the dependent variable.
As indicated, dynamic panel data
models have the feature that lags of
the dependent variable appear as
regressors. At this step, you are
required to specify the dependent variable. For our case we had
specified ‘loan’ as the dependent variable. As indicated earlier, the
dynamic panel uses lags of the dependent variable as regressors,
therefore you are required to specify the lags you want to use. Eviews
has set the lags at 1, however, if you click on the button provided you
will select the desired number of lags.
Step 2: specify any other
regressors: Ideally, without
specifying any other regressors a
dynamic panel will be estimated
since the lag of the dependent
variable has been included as a
regressor. However, at this step you
are required to include any other
regressors that you may consider necessary in the regression. In our
model we included ‘Size’ and ‘ibr’ as additional regressors as shown in
the screen above. In addition, you
are reminded that in case you need
period dummy variables (period
specific effects), you can click on
the box provided on ‘Include
period dummy variables (period
fixed effects)’.
~ 91 ~
Guidelines on Panel Data Analysis
~ 92 ~
Dynamic Panel Data Analysis
Period. The default set is ‘White Period’. Unless you have reasons to
change the setting, you are advised to work with the default settings,
and (3) computation of standard errors.
After going through all the steps above click on the ‘Next’ button to obtain
screen shown in Panel A below. In this panel you are informed that the
wizard will transfer your specification to the GMM equation estimation
dialog. The finally, click on the button ‘Finish’ to conclude the procedure.
Otherwise, you have an option to go back by clicking on the ‘Back’ button
or abort the process by clicking on the ‘Cancel’ button. If you choose to
proceed by clicking on the ‘Finish’ button, you will obtain the screen
shown in Panel B below. In this last step you are shown the screen which is
similar to the one you started with.
Step 6: viewing the estimation results: To view the estimation results you
click on ‘OK’ to obtain the following results:
~ 93 ~
Guidelines on Panel Data Analysis
The table above shows the estimation results based on a dynamic GMM
procedure. The critical things to check out for in this output are the
following:
~ 94 ~
Dynamic Panel Data Analysis
References
Ahn, S C and Schmidt, P (1995), ‘Efficient estimation of models for dynamic panel
data’, Journal of Econometrics, Vol. 68, No. 1, pages 5-27.
Balestra, P and Nerlove, M (1966), ‘Pooling cross section and time series data in
the estimation of a dynamic model: the demand for natural gas’,
Econometrica, Vol. 34, No. 3, pages 585-612.
Baltagi, B H and Chihwa Kao, C (2000), ‘Nonstationary panels,
cointegration in panels and dynamic panels: a survey’, Chapter 1 in
Baltagi, B H (ed), Advances in econometrics, volume 15: nonstationary
panels, panel cointegration and dynamic panels, Amsterdam, JAI Press,
pages 7-51.
Hausman, J A and Taylor, W E (1981), ‘Panel data and unobservable individual
effects’, Econometrica, Vol. 49, No. 6, pages 1377-98.
Nerlove, M and Balestra, P (1992), ‘Formulation and estimation of econometric
models for panel data’, Chapter 1 in Mátyás, L and Sevestre, P (eds), The
econometrics of panel data: fundamentals and recent developments in theory and practice,
Amsterdam, Kluwer Academic Publishers, pages 3-18.
Pesaran, M H, Shin, Y and Smith, R (1999), ‘Pooled mean group estimation of
dynamic heterogeneous panels’, Journal of the American Statistical Association,
Vol. 94, No. 446, pages 621-34.
~ 95 ~
Chapter 7
What about the cross-sectional data? Are they I(0) or I(1)? To obtain an
answer, select the 5 bank size series and open them as a Group − note that
we have just done that! In the window for the Group, click on View and
select Unit Root Test…. A window will open that provides you with a set
of options emerge as shown below.
Guidelines on Panel Data Analysis
To begin with, we should note that the first three tests are different from
the remaining three. In particular, we can classify panel unit root tests on
the basis of whether there are restrictions on the autoregressive process
across cross-sections or series. To see this, consider an AR(1) process for
panel data:
~ 98 ~
Non-Stationary Panel Analysis
Note the subscript i on the autoregressive term, ρi. There are two natural
assumptions we can make about the ρi. If we assume that ρi = ρ, then we
assume that the persistence parameters are common across cross-sections;
this is the test performed by Levin et al. (2002), Breitung (2000) and Hadri
(2000). Of those three tests, the null hypothesis for the first two is that the
series under investigation is I(1), i.e., non-stationary, whereas the Hadri4 test
is performed under the null of stationarity (no unit root).
The different panel unit root tests available in EViews are summarised in
the Table below;
What is the best test to perform? That really depends on what the overall
aim is, whether you wish to estimate cross-sectional regressions with the
same slope coefficients or with different coefficients accounting for the
4 The Hadri panel unit root test is similar to the KPSS unit root test, and has null hypothesis of
no unit root in any of the series in the panel.
~ 99 ~
Guidelines on Panel Data Analysis
For the time being, click on Summary (which is the default option) and select
the balanced panel option. Alternatively, you may use the drop-down menu
associated with Test type to select an individual panel unit-root test statistic.
Note that, as in the case of the individual series, we can include a constant
and/or a time trend i.e. select individual intercept and trend. You may choose
to exclude the deterministic time trend – but if the trend is included in one
equation, it should be included in all.
Unlike the standard EViews single series unit root tests, where t-statistics
are reported for the significance of the constant and/or the trend, the panel
~ 100 ~
Non-Stationary Panel Analysis
unit tests in EViews report only the actual unit root test. Therefore, we
ought to perform all three tests to see whether we get different answers. We
now report the results from all these three tests, starting with the most
general and ending with the most specific. To start off with, we will assume
a fixed, user-specified lag length of one, so you should enter 1 in the User
specified box for the lag length as shown above. We note in passing that
depending on the specific set-up of the panel unit-root tests, not all six tests
are computed all the time.
For the test with a constant and trend, denoted Individual trend and intercept5,
we should find the results of the panel unit root tests shown in Table below.
As mentioned in the previous paragraph, in this particular case the Hadri
test is not computed.
5 Note that selecting Individual intercept is equivalent to including individual fixed effects,
while Individual trend and intercept is equivalent to both fixed effects and trends.
~ 101 ~
Guidelines on Panel Data Analysis
Note that the panel unit-root tests are internally inconsistent, i.e., the five
tests that test the same null hypothesis (e.g., Levin et al. and Breitung), are in
agreement as to the rejection of the null hypothesis of a unit root in bank
size series.
For the test with a constant only (equivalent to individual fixed effects only), we
proceed as follows;
~ 102 ~
Non-Stationary Panel Analysis
The results of the panel unit root tests are given in the table below.
Similarly, the four available tests (we have lost Breitung’s test) are in
agreement as to the rejection of the null hypothesis of a unit root.
Finally, for the test with no constant and no trend, i.e., no deterministic
regressors, we proceed as follows;
~ 103 ~
Guidelines on Panel Data Analysis
All the unit-root tests suggest that the cross-section series are stationary.
Note that we have allowed for only one lag in the tests. Good empirical
practice calls for some experimentation with alternative lags to determine
whether the results change in any way.
~ 104 ~
Non-Stationary Panel Analysis
Alternative criteria for evaluating the optimal lag length may be selected via
the combo box (Akaike, Schwarz, Hannan-Quinn, Modified Akaike,
Modified Schwarz, Modified Hannan-Quinn), and you may limit the
number of lags to try in automatic selection by entering a number in the
Maximum lags box. This procedure sets the lag length to the value of p
that minimises the respective information criteria. Ng and Perron (2001)
stress that good size and power properties of all unit root tests rely on the
proper choice of the lag length p used for specifying the ADF test
regression. They argue, however, that traditional model selection criteria
such as the AIC and BIC are not well suited for determining p with
integrated data. Instead, they suggest modified information criteria (MIC).
On the basis of a series of simulation experiments, Ng and Perron
recommend selecting the lag length p by minimising the modified AIC
(MAIC) in the univariate context. On the basis of this advice, we will also
use the MAIC in a panel context. For a test results for a model with
constant and trend, denoted Individual trend and intercept, we proceed as
follows;
~ 105 ~
Guidelines on Panel Data Analysis
Make sure you uncheck the Use balanced sample box and we have
chosen 8 as the maximum lag length. The results are shown in the table
below
Consistent with the previous tests, the results assuming a common unit root
procedure indicate the absence of a unit root. Therefore we reject the null
hypothesis that the series are I(1). We, therefore, conclude that the series
under investigation are stationary.
~ 106 ~
References
Ahn, S C and Schmidt, P (1995), ‘Efficient estimation of models for dynamic panel
data’, Journal of Econometrics, Vol. 68, No. 1, pages 5-27.
Arellano, M (1989), ‘A note on the Anderson-Hsiao estimator for panel data’,
Economics Letters, Vol. 31, No. 4, pages 337-41.
Arellano, M (1993), ‘On the testing of correlated effects with panel data’, Journal of
Econometrics, Vol. 59, No. 1-2, pages 87-97.
Arellano, M (2003), Panel data econometrics, Oxford, Oxford University Press.
Arellano, M and Bond, S R (1991), ‘Some tests of specification for panel
data: Monte Carlo evidence and an application to employment
equations’, Review of Economic Studies, Vol. 58, No. 2, pages 277-97.
Arellano, M and Bover, O (1995), ‘Another look at the instrumental variable
estimation of error-components models’, Journal of Econometrics, Vol. 68,
No. 1, pages 29-51.
Balestra, P and Nerlove, M (1966), ‘Pooling cross section and time series data in
the estimation of a dynamic model: the demand for natural gas’,
Econometrica, Vol. 34, No. 3, pages 585-612.
Baltagi, B H and Chihwa Kao, C (2000), ‘Nonstationary panels,
cointegration in panels and dynamic panels: a survey’, Chapter 1 in
Baltagi, B H (ed), Advances in econometrics, volume 15: nonstationary
panels, panel cointegration and dynamic panels, Amsterdam, JAI Press,
pages 7-51.
Breitung, J and Pesaran, M H (2005), ‘Unit roots and cointegration in panels’,
Chapter 9 in Matyas, L and Sevestre, P (eds), The econometrics of panel data:
fundamentals and recent developments in theory and practice, Amsterdam, Kluwer
Academic Publishers, pages 279-322. http://www.ect.uni-
bonn.de/mitarbeiter/joerg-
breitung/documents/breitung_pesaran.pdf/view.
Engle, R F and Granger, C W J (1987), ‘Co-integration and error correction:
representation, estimation and testing’, Econometrica, Vol. 55, No. 2,
pages 251-76.
Guidelines on Panel Data Analysis
Harris, R and Sollis, R (2003), ‘Panel data models and cointegration’, Chapter 7 in
Applied time series modelling and forecasting, Oxford, John Wiley & Sons.
Hausman, J A (1978), ‘Specification tests in econometrics’, Econometrica, Vol. 46,
No. 6, pages 1251-71.
Hausman, J A and Taylor, W E (1981), ‘Panel data and unobservable individual
effects’, Econometrica, Vol. 49, No. 6, pages 1377-98.
Im, K S, Pesaran, M H and Shin, Y (2003), ‘Testing for unit roots in
heterogeneous panels’, Journal of Econometrics, Vol. 115, No. 1, pages
53-74.
Johansen, S (1988), ‘Statistical analysis of cointegrating vectors’, Journal of Economic
Dynamics and Control, Vol. 12, No. 2/3, pages 231-54.
Johansen, S (1991), ‘Estimation and hypothesis testing of cointegration vectors in
Gaussian vector autoregressive models’, Econometrica, Vol. 59, No. 6, pages
1551-80.
Levin, A and Lin, C-F (1993a), ‘Unit root tests in panel data: asymptotic and
finite sample properties’, Department of Economics, University of San
Diego Discussion Paper 93-23.
Levin, A and Lin, C-F (1993b), ‘Unit root tests in panel data: new results’,
Department of Economics, University of San Diego Discussion Paper 93-56.
Levin, A, Lin, C-F and Chu, C-S J (2002), ‘Unit root test in panel data: asymptotic
and finite-sample properties’, Journal of Econometrics, Vol. 108, No. 1, pages
1-24.
Maddala, G S and Wu, S (1999), ‘A comparative study of unit root tests with panel
data and a new simple test’, Oxford Bulletin of Economics and Statistics, Vol.
61, No. S1, pages 631-52.
Mundlak, Y (1978), ‘On the pooling of time series and cross-sectional data’,
Econometrica, Vol. 46, No. 1, pages 69-85.
Nerlove, M and Balestra, P (1992), ‘Formulation and estimation of econometric
models for panel data’, Chapter 1 in Mátyás, L and Sevestre, P (eds), The
econometrics of panel data: fundamentals and recent developments in theory and practice,
Amsterdam, Kluwer Academic Publishers, pages 3-18.
Neyman, J and Scott, E L (1948), ‘Consistent estimates based on partially
consistent observations’, Econometrica, Vol. 16, No. 1, pages 1-32.
~ 108 ~
Non-Stationary Panel Analysis
~ 109 ~