0% found this document useful (0 votes)

69 views11 pages

Journal of Statistical Software: Implementing Panel-Corrected Standard Errors in R: The Pcse Package

This document summarizes an R package called pcse that implements panel-corrected standard errors (PCSEs) for time-series cross-section (TSCS) data. TSCS data have repeated observations over time for different units and often exhibit contemporaneous correlation across units and heteroskedasticity. The pcse package allows users to estimate linear models on TSCS data using ordinary least squares and calculate PCSEs, which provide more accurate standard errors that account for the non-spherical error structure of TSCS data. Key features include handling both balanced and unbalanced panel data, and providing a robust estimator of the covariance matrix that accounts for contemporaneous correlation and heteroskedasticity across units.

Uploaded by

lZbrunoZl

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

69 views11 pages

Journal of Statistical Software: Implementing Panel-Corrected Standard Errors in R: The Pcse Package

Uploaded by

lZbrunoZl

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

JSS Journal of Statistical Software

June 2011, Volume 42, Code Snippet 1. http://www.jstatsoft.org/

Implementing Panel-Corrected Standard Errors

in R: The pcse Package

Delia Bailey Jonathan N. Katz

YouGov Polimetrix California Institute of Technology

Abstract
Time-series–cross-section (TSCS) data are characterized by having repeated observa-
tions over time on some set of units, such as states or nations. TSCS data typically display
both contemporaneous correlation across units and unit level heteroskedasity making in-
ference from standard errors produced by ordinary least squares incorrect. Panel-corrected
standard errors (PCSE) account for these these deviations from spherical errors and allow
for better inference from linear models estimated from TSCS data. In this paper, we
discuss an implementation of them in the R system for statistical computing. The key
computational issue is how to handle unbalanced data.

Keywords: pcse, time-series–cross-section, covariance matrix estimation, contemporaneous

correlation, heteroskedasity, R.

1. Introduction
Time-series–cross-section (TSCS) data are characterized by having repeated observations over
time on some set of units, such as states or nations. TSCS data have become common in
applied studies in the social sciences, particularly in comparative political science applications.
These data often show non-spherical errors because of contemporaneous correlation across the
units and unit level heteroskedasity. When fitting linear models to TSCS data, it is common
to use this non-spherical error structure to improve inference and estimation efficiency by
a feasible generalized least squares (FGLS) estimator suggested by Parks (1967) and made
popular by Kmenta (1986).
However, Beck and Katz (1995) showed that the Parks (1967) model had poor finite sample
properties. In particular, in a simulation study they showed that the estimated standard
errors for this model generated confidence intervals that were significantly too small, often
underestimating variability by 50% or more, and with only minimal gains in efficiency over a
2 pcse: Panel-Corrected Standard Errors in R

simple linear model that ignored the non-spherical errors. Therefore, Beck and Katz (1995)
suggested estimating linear models of TSCS data by ordinary least squares (OLS)1 and they
proposed a sandwich type estimator of the covariance matrix of the estimated parameters,
which they called panel-corrected standard errors (PCSE), that is robust to the possibility of
non-spherical errors.2
Although the PCSE covariance estimator bears some resemblance to heteroskedasity con-
sistent (HC) estimators (see, for example, Huber 1967; White 1980; MacKinnon and White
1985), these other estimators do not explicitly incorporate the known TSCS structure of the
data.3 This leads to important differences in implementation.
This paper describes an implementation of PCSEs in the R system for statistical computing (R
Development Core Team 2011). All of the functions described here are available in the package
pcse that is available from Comprehensive R Archive Network (CRAN) at http://CRAN.
R-project.org/package=pcse. The key computational issue is how to handle unbalanced
data. TSCS data is unbalanced when the number of observations for units vary.
R packages that estimate various models for panel data include plm (Croissant and Millo 2008)
and systemfit (Henningsen and Hamann 2007), that also implement different types of robust
standard errors. Some of these are only robust to unit heteroskedasity and possible serial
correlation. The pcse standard error estimate is robust not only to unit heteroskedacity, but
it also robust against possible contemporaneous correlation across the units that is common
in TSCS data.4 Package plm also provides an implementation of Beck and Katz (1995) PCSE
in the function vcovBK()5 that can be applied to panel models estimated by plm().
The next section fixes notation by briefly reviewing the linear TSCE model and the derivation
of PCSEs. Section 3 considers the computational issues with unbalanced panels. Section 4
illustrates the use of the package pcse. Finally, Section 5 concludes.

2. TSCS data and estimation

The critical assumption of TSCS models is that of “pooling,” that is, all units are characterized
by the same regression equation at all points in time. Given this assumption we can write
the generic TSCS model as:

yi,t = xi,t β + i,t ; i = 1, . . . , N ; t = 1, . . . , T (1)

where xi,t is a vector of one or more (k) exogenous variables and observations are indexed by
both unit (i) and time (t).
TSCS analysts typically put some structure on the assumed error process. In particular,
they usually assume that for any given unit, the error variance is constant, so that the only
source of heteroskedasticity is differing error variances across units. Analysts also assume
1
OLS is implemented in R’s function lm().
2
The estimator is actually rather poorly named as it really used for TSCS data, in which the time dimension
is large enough for serious averaging within units, as opposed to panel data, which typically have short time
dimensions. However, this is the nomenclature used in the literature.
3
These heteroskedastic constituent covariance estimators are available in the R in the sandwich package
(Zeileis 2004)
4
For a discussion of the differences between TSCS and panel data see Beck and Katz (2011).
5
The function vcovBK() was not yet part of plm when the first version of pcse was developed.
Journal of Statistical Software – Code Snippets 3

that all spatial correlation is both contemporary and does not vary with time. The temporal
dependence exhibited by the errors is also assumed to be time invariant, and may also be
invariant across units. We, however, will be ignoring temporal dependence for the remainder
of this paper by assuming that the analyst has controlled for it either by including the lagged
dependent variable, yi,t−1 , in the set of regressors, xi,t , or using some sort of differencing.
Since these assumptions are all based on the panel nature of the data, we call them the “panel
error assumptions.”
As is well known, the correct formula for the sampling variability of the OLS estimates from
Equation 1 is given by the square roots of the diagonal terms of

Cov(β̂) = (X> X)−1 {X> ΩX}(X> X)−1 . (2)

If the errors obey the spherical error assumption — i.e., Ω = σ 2 I, where I is an N T × N T

identity matrix — this simplifies to the usual OLS formula, where the OLS standard errors
are the square roots of the diagonal terms of

c2 (X> X)−1
σ

c2 is the usual OLS estimator of the common error variance, σ 2 . If the errors obey the
where σ
panel structure, then this provides incorrect standard errors. Equation 2, however, can still
be used, in combination with that panel structure of the errors to provide accurate PCSEs.
For panel models with contemporaneously correlated and panel heteroskedastic errors, Ω is
an N T × N T block diagonal matrix with an N × N matrix of contemporaneous covariances,
Σ, along the diagonal. To estimate Equation 2 we need an estimate of Σ. Since the OLS
estimates of Equation 1 are consistent, we can use the OLS residuals from that estimation to
provide a consistent estimate of Σ. Let ei,t be the OLS residual for unit i at time t. We can
estimate a typical element of Σ by
PTi,j
t=1 ei,t ej,t
Σ̂i,j = , (3)
Ti,j

with the estimate Σ̂ being comprised of all these elements. We then use this to form the
estimator Ω̂ by creating a block diagonal matrix with the Σ̂ matrices along the diagonal.
With balanced data where Ti,j = T, ∀i = 1, . . . , N , we can simplify this to

(E> E)
Σ̂ = (4)
T
where E is the T × N matrix of residuals and hence estimate Ω by

Ω̂ = Σ̂ ⊗ IT (5)

where ⊗ is the Kronecker product. PCSEs are thus computed by taking the square root of
the diagonal elements of

PCSE = (X> X)−1 X> Ω̂X(X> X)−1 . (6)

4 pcse: Panel-Corrected Standard Errors in R

3. Computational issues

3.1. Balanced data

The computational issues are fairly straightforward for balanced data. We need only the
vector of residuals from a linear fit, the model matrix (X), and indicators for group and time.
Given the indicators for group and time, we can appropriately reshape the vector of residuals
into an N × T matrix. We can then directly calculate Σ̂ from Equation 4 and, therefore,
the PCSE for the fit. Within R, the function lm returns a lm object that contains, among
other items, the residuals and the model matrix. It does not, however, include indicators for
unit and time and these must be supplied by the user. The package pcse implements the
estimation described above in the function pcse, which takes the following arguments:

pcse(lmobj, groupN, groupT, ...)

The first argument lmobj is a fitted linear model object as returned by lm. The argument
groupN is a vector indication which cross-sectional unit an observation is from and groupT
indicates which time period.

3.2. Unbalanced data

The only interesting computational issue is how to handle unbalanced data sets. With an
unbalanced dataset, Equation 4 is no longer valid. There have been two alternatives estimation
procedures suggested for unbalanced data. The first is to estimate Σ using a balanced subset
of the data. The second alternative is to calculate the elements of Σ pairwise. We will consider
each in turn.
The advantage of using the balanced subset approach is its computation ease. The largest
balanced subset of the data can be found using the following simple R code:

units <- unique(groupN)

N <- length(units)
time <- unique(groupT)
T <- length(time)
brows <- c()
for (i in 1:T) {
br <- which(groupT == time[i])
check <- length(br) == N
if (check) {
brows <- c(brows, br)
}
}

It first computes the unit and time identifiers and their respective number N and T. The index
brows gives all of the balanced rows. We can restrict the calculations of Σ to this balanced
subset of data. This allows us to once again use Equations 4. The downside to this approach
is that we are not using all of the available data to estimate Σ.
Recall that Σ is the contemporaneous correlation between every pair of units in our sample.
The alternative approach then is to use Equation 3 for each pair i, j ∈ N to construct our
Journal of Statistical Software – Code Snippets 5

estimate Σ̂. That is, for each pair of units we determine with temporal observations overlap
between the two. We use this pairwise balanced sample to estimate Σ̂i,j .
We could do this directly by looping over all possible pairs and using Equation 3. However,
for large N this can be a large set to loop over. We can improve on this by instead filling in
the residual vector with zeros for the missing observations needed to balance out the data.
Clearly, these filled in observation do not alter the sum of the product of the residuals, since
they contribute zero if either i or j have been filled in. As long as we divide by the appropriate
Ti,j = min(Ti , Tj ), we will appropriately calculate the correlation between i and j. This is
approach we use in pcse() when the option pairwise = TRUE is used.

4. Example
In this section we demonstrate the use of the package pcse. The data we will use is from
Alvarez, Garrett, and Lange (1991), hereafter AGL, and were reanalyzed using a simple linear
model in Beck, Katz, Alvarez, Garrett, and Lange (1993). The data set is available in the
package as the data frame agl. The data cover basic macro-economic and political variables
from 16 OECD nations from 1970 to 1984. AGL estimated a model relating political and labor
organization variables (and some economic controls) to economic growth, unemployment, and
inflation. The argument was that economic performance in advanced industrialized societies
was superior when labor was both encompassing and had political power or when labor was
weak both in politics and the market. Here we will only look at their model of economic
growth.
First both the package and the data need to be loaded into R with

R> library("pcse")
R> data("agl")

We can then fit their basic model of economic growth with

R> agl.lm <- lm(growth ~ lagg1 + opengdp + openex + openimp +

+ central + leftc + inter + as.factor(year), data = agl)

The model assumes that growth depends on lagged growth (lagg1), vulnerability to OECD
demand (opengdp), OECD export (openex), OECD import (openimp), labor organization
index (central), year fixed effects (as.factor(year)), the fraction of cabinet portfolios held
by “left” parties (leftc), and interaction of central and leftc (inter). The interest focuses
on the last three variables, particularly the interaction.
Here are the fit and standard errors without correcting for the panel structure of the data
(note to save space the estimates for the year effects have been excluded from the printout.):

R> summary(agl.lm)

Call:
lm(formula = growth ~ lagg1 + opengdp + openex + openimp + central +
leftc + inter + as.factor(year), data = agl)
6 pcse: Panel-Corrected Standard Errors in R

Residuals:
Min 1Q Median 3Q Max
-5.837 -1.197 0.067 1.170 4.529

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 5.968890 0.916402 6.51 5.0e-10 ***
lagg1 0.050315 0.139204 0.36 0.71812
opengdp -0.002330 0.001867 -1.25 0.21342
openex 0.002008 0.001208 1.66 0.09786 .
openimp -0.000609 0.001679 -0.36 0.71720
central -0.763563 0.216258 -3.53 0.00051 ***
leftc -0.024712 0.009276 -2.66 0.00829 **
inter 0.012868 0.003614 3.56 0.00045 ***
as.factor(year)1971 -1.205114 0.658282 -1.83 0.06851 .
as.factor(year)1972 0.530824 0.699475 0.76 0.44874
as.factor(year)1973 0.572342 0.702811 0.81 0.41633
as.factor(year)1974 -5.085447 1.100673 -4.62 6.6e-06 ***
as.factor(year)1975 -5.280737 0.953471 -5.54 8.7e-08 ***
as.factor(year)1976 -0.210344 0.885918 -0.24 0.81255
as.factor(year)1977 -2.267336 0.676865 -3.35 0.00095 ***
as.factor(year)1978 -1.258657 0.742950 -1.69 0.09167 .
as.factor(year)1979 -1.632229 0.702948 -2.32 0.02116 *
as.factor(year)1980 -3.902415 0.775032 -5.04 1.0e-06 ***
as.factor(year)1981 -4.652619 0.824901 -5.64 5.2e-08 ***
as.factor(year)1982 -4.532462 0.926328 -4.89 1.9e-06 ***
as.factor(year)1983 -2.775442 0.744784 -3.73 0.00025 ***
as.factor(year)1984 -0.770129 0.807864 -0.95 0.34150
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 1.82 on 218 degrees of freedom

Multiple R-squared: 0.502, Adjusted R-squared: 0.454
F-statistic: 10.5 on 21 and 218 DF, p-value: <2e-16

We can correct the standard errors by using:

R> agl.pcse <- pcse(agl.lm, groupN = agl$country, groupT = agl$year)

Included in the package is a summary function, summary.pcse, that can redisplay the esti-
mates with the PCSE used for inference:

R> summary(agl.pcse)

Results:

Estimate PCSE t value Pr(>|t|)

Journal of Statistical Software – Code Snippets 7

(Intercept) 5.968890 0.89298 6.684 1.91e-10

lagg1 0.050315 0.15188 0.331 7.41e-01
opengdp -0.002330 0.00179 -1.301 1.94e-01
openex 0.002008 0.00114 1.753 8.09e-02
openimp -0.000609 0.00166 -0.368 7.13e-01
central -0.763563 0.26569 -2.874 4.46e-03
leftc -0.024712 0.00668 -3.698 2.75e-04
inter 0.012868 0.00295 4.367 1.95e-05
as.factor(year)1971 -1.205114 0.14328 -8.411 5.34e-15
as.factor(year)1972 0.530824 0.27687 1.917 5.65e-02
as.factor(year)1973 0.572342 0.28987 1.974 4.96e-02
as.factor(year)1974 -5.085447 0.83220 -6.111 4.51e-09
as.factor(year)1975 -5.280737 0.67528 -7.820 2.24e-13
as.factor(year)1976 -0.210344 0.67377 -0.312 7.55e-01
as.factor(year)1977 -2.267336 0.22281 -10.176 3.76e-20
as.factor(year)1978 -1.258657 0.36798 -3.420 7.46e-04
as.factor(year)1979 -1.632229 0.31179 -5.235 3.87e-07
as.factor(year)1980 -3.902415 0.42860 -9.105 5.63e-17
as.factor(year)1981 -4.652619 0.52777 -8.816 3.83e-16
as.factor(year)1982 -4.532462 0.64582 -7.018 2.81e-11
as.factor(year)1983 -2.775442 0.39866 -6.962 3.89e-11
as.factor(year)1984 -0.770129 0.53750 -1.433 1.53e-01

---------------------------------------------

# Valid Obs = 240; # Missing Obs = 0; Degrees of Freedom = 218.

We note that the standard error on central has increased a bit, but the standard errors of
the other two variables of interest, leftc and inter have actually decreased.
We have also included an unbalanced version of the AGL data set, aglUn, that was created
by randomly deleting some observations. This was only done to demonstrate how estimates
vary by casewise and pairwise estimation of the covariance matrix and is not a recommended
modeling strategy. As before, we can estimate the same model by:

R> data("aglUn")
R> aglUn.lm <- lm(growth ~ lagg1 + opengdp + openex + openimp +
+ central + leftc + inter + as.factor(year), data = aglUn)
R> aglUn.pcse1 <- pcse(aglUn.lm, groupN = aglUn$country,
+ groupT = aglUn$year, pairwise = TRUE)
R> summary(aglUn.pcse1)

Results:

Estimate PCSE t value Pr(>|t|)

(Intercept) 6.198165 0.87255 7.1035 1.90e-11
lagg1 -0.007295 0.15069 -0.0484 9.61e-01
opengdp -0.001962 0.00181 -1.0811 2.81e-01
8 pcse: Panel-Corrected Standard Errors in R

openex 0.002215 0.00115 1.9319 5.47e-02

openimp -0.000914 0.00166 -0.5516 5.82e-01
central -0.842650 0.24450 -3.4464 6.87e-04
leftc -0.028387 0.00702 -4.0451 7.37e-05
inter 0.014521 0.00307 4.7286 4.17e-06
as.factor(year)1971 -0.817961 0.16592 -4.9299 1.68e-06
as.factor(year)1972 0.577742 0.26045 2.2182 2.76e-02
as.factor(year)1973 0.628091 0.26656 2.3562 1.94e-02
as.factor(year)1974 -4.988305 0.81934 -6.0882 5.43e-09
as.factor(year)1975 -5.072186 0.66781 -7.5952 1.03e-12
as.factor(year)1976 -0.523931 0.66352 -0.7896 4.31e-01
as.factor(year)1977 -2.115323 0.18978 -11.1464 6.31e-23
as.factor(year)1978 -1.173037 0.34402 -3.4098 7.81e-04
as.factor(year)1979 -1.867747 0.31185 -5.9893 9.15e-09
as.factor(year)1980 -4.083112 0.43107 -9.4721 6.32e-18
as.factor(year)1981 -4.448396 0.50744 -8.7663 6.61e-16
as.factor(year)1982 -4.456624 0.62940 -7.0807 2.17e-11
as.factor(year)1983 -2.831709 0.42371 -6.6831 2.11e-10
as.factor(year)1984 -0.984174 0.57233 -1.7196 8.70e-02

---------------------------------------------

# Valid Obs = 230; # Missing Obs = 10; Degrees of Freedom = 208.

Here we see the estimates of the pairwise version of the PCSE, since the option pairwise =
TRUE was given. The results are close to the original results for the balanced data.
If we preferred the casewise estimate that uses the largest balanced subset to estimate the
contemporaneous correlation matrix, we do that by:

R> aglUn.pcse2 <- pcse(aglUn.lm, groupN = aglUn$country,

+ groupT = aglUn$year, pairwise = FALSE)

Warning message:
In pcse(aglUn.lm, groupN = aglUn$country, groupT = aglUn$year, ...
Caution! The number of CS observations per panel, 7, used to compute
the vcov matrix is less than half theaverage number of obs per panel
in the original data.You should consider using pairwise selection.

R> summary(aglUn.pcse2)

Results:

Estimate PCSE t value Pr(>|t|)

(Intercept) 6.198165 0.721172 8.5946 2.00e-15
lagg1 -0.007295 0.123454 -0.0591 9.53e-01
opengdp -0.001962 0.001243 -1.5784 1.16e-01
Journal of Statistical Software – Code Snippets 9

openex 0.002215 0.000782 2.8327 5.07e-03

openimp -0.000914 0.001191 -0.7674 4.44e-01
central -0.842650 0.264484 -3.1860 1.66e-03
leftc -0.028387 0.006387 -4.4444 1.43e-05
inter 0.014521 0.002829 5.1324 6.55e-07
as.factor(year)1971 -0.817961 0.203861 -4.0124 8.39e-05
as.factor(year)1972 0.577742 0.230489 2.5066 1.30e-02
as.factor(year)1973 0.628091 0.237070 2.6494 8.68e-03
as.factor(year)1974 -4.988305 0.583396 -8.5505 2.66e-15
as.factor(year)1975 -5.072186 0.482912 -10.5033 5.62e-21
as.factor(year)1976 -0.523931 0.546879 -0.9580 3.39e-01
as.factor(year)1977 -2.115323 0.194109 -10.8976 3.61e-22
as.factor(year)1978 -1.173037 0.298877 -3.9248 1.18e-04
as.factor(year)1979 -1.867747 0.283784 -6.5816 3.72e-10
as.factor(year)1980 -4.083112 0.361778 -11.2862 2.36e-23
as.factor(year)1981 -4.448396 0.411130 -10.8199 6.22e-22
as.factor(year)1982 -4.456624 0.504551 -8.8329 4.29e-16
as.factor(year)1983 -2.831709 0.386623 -7.3242 5.19e-12
as.factor(year)1984 -0.984174 0.456290 -2.1569 3.22e-02

---------------------------------------------

# Valid Obs = 230; # Missing Obs = 10; Degrees of Freedom = 208.

Here we see that the software has issued a warning about the calculation of the standard
errors. Although there only 10 missing observations, they are intermingled through out the
data. This means that the largest balanced panel only has seven time points, whereas the
data runs for 14. In this case, it is not clear that PCSEs will be correctly estimated although
in this case they are not that different from the casewise estimate.

5. Summary
This paper briefly reviews estimation of panel-corrected standard errors for time-series–cross-
section (TSCS) data. It discusses an implementation of estimating them in the R system for
statistical computing in the pcse package.

Computational details
The results in this paper were obtained using R 2.13.0 with the package pcse 1.8. R and the
pcse package are available from CRAN at http://CRAN.R-Project.org/.

References

Alvarez RM, Garrett G, Lange P (1991). “Government Partisanship, Labor Organization,

and Macroeconomic Performance.” American Political Science Review, 85, 539–556.
10 pcse: Panel-Corrected Standard Errors in R

Beck N, Katz JN (1995). “What To Do (and Not To Do) with Times-Series–Cross-Section

Data in Comparative Politics.” American Political Science Review, 89(3), 634–647.

Beck N, Katz JN (2011). “Modeling Dynamics in Time-Series–Cross-

Section Political Economy Data.” Annual Review of Political Science.
doi:10.1146/annurev-polisci-071510-103222. Forthcoming.

Beck N, Katz JN, Alvarez RM, Garrett G, Lange P (1993). “Government Partisanship, La-
bor Organization, and Macroeconomic Performance: A Corrigendum.” American Political
Science Review, 87, 945–948.

Croissant Y, Millo G (2008). “Panel Data Econometrics in R: The plm Package.” Journal of
Statistical Software, 27(2), 1–43. URL http://www.jstatsoft.org/v27/i02/.

Henningsen A, Hamann JD (2007). “systemfit: A Package for Estimating Systems of Si-

multaneous Equations in R.” Journal of Statistical Software, 23(4), 1–40. URL http:
//www.jstatsoft.org/v23/i04/.

Huber PJ (1967). “The Behavior of Maximum Likelihood Estimation Under Nonstandard

Conditions.” In LM LeCam, J Neyman (eds.), Proceedings of the Fifth Berkeley Symposium
on Mathematical Statistics and Probability. University of California Press, Berkeley, CA.

Kmenta J (1986). Elements of Econometrics. 2nd edition. Macmillan, New York, NY.

MacKinnon JG, White H (1985). “Some Heteroskedasticity-Consistent Covariance Matrix

Estimators with Improved Finite Sample Properties.” Journal of Econometrics, 29, 305–
325.

Parks R (1967). “Efficient Estimation of a System of Regression Equations When Disturbances

Are Both Serially and Contemporaneously Correlated.” Journal of the American Statistical
Association, 62(318), 500–509.

R Development Core Team (2011). R: A Language and Environment for Statistical Computing.
R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL http:
//www.R-project.org/.

White H (1980). “A Heteroskedasticity-Consistent Covariance Matrix and a Direct Test for

Heteroskedasticity.” Econometrica, 48, 817–838.

Zeileis A (2004). “Econometric Computing with HC and HAC Covariance Matrix Estimators.”
Journal of Statistical Software, 11(10), 1–17. URL http://www.jstatsoft.org/v11/i10/.

Affiliation:
Jonathan N. Katz
California Institute of Technology
DHSS 228-77
Journal of Statistical Software – Code Snippets 11

Pasadena, CA 91125, United States of America

E-mail: [email protected]
URL: http://jkatz.caltech.edu/

Journal of Statistical Software http://www.jstatsoft.org/

published by the American Statistical Association http://www.amstat.org/
Volume 42, Code Snippet 1 Submitted: 2007-01-22
June 2011 Accepted: 2007-10-16

Feasible Generalized Least Squares For Panel Data With Cross-Sectional and Serial Correlations
No ratings yet
Feasible Generalized Least Squares For Panel Data With Cross-Sectional and Serial Correlations
18 pages
Wooldridge 7e Ch14 SM
100% (1)
Wooldridge 7e Ch14 SM
10 pages
Econometrie - Prof - Jula
No ratings yet
Econometrie - Prof - Jula
60 pages
Chapter 2 - Panel Data Regression
No ratings yet
Chapter 2 - Panel Data Regression
30 pages
Dadm Assesment #2: Akshat Bansal
No ratings yet
Dadm Assesment #2: Akshat Bansal
24 pages
(@avid - For - Books) 100 Quotes That Will Change
93% (27)
(@avid - For - Books) 100 Quotes That Will Change
103 pages
Dave Ramsey's Complete Guide To - Dave Ramsey PDF
100% (10)
Dave Ramsey's Complete Guide To - Dave Ramsey PDF
438 pages
Applied Micro Methods
No ratings yet
Applied Micro Methods
64 pages
Extended Character Sheet 5e
No ratings yet
Extended Character Sheet 5e
25 pages
Panel Data Analysis
No ratings yet
Panel Data Analysis
39 pages
CBSE 11th SQP Economics Mind Maps
No ratings yet
CBSE 11th SQP Economics Mind Maps
7 pages
Lec16-Stata PanelData
No ratings yet
Lec16-Stata PanelData
39 pages
Oxf Bull Econ Stat - 2021 - Khan - Assessing Sampling Error in Pseudo Panel Models
No ratings yet
Oxf Bull Econ Stat - 2021 - Khan - Assessing Sampling Error in Pseudo Panel Models
28 pages
Panel Data I
No ratings yet
Panel Data I
40 pages
Applied Micro Methods Dark Mode
No ratings yet
Applied Micro Methods Dark Mode
63 pages
Specification Testing To Spatial Models
No ratings yet
Specification Testing To Spatial Models
47 pages
10.1016@B978 0 12 816797 7.00009 6
No ratings yet
10.1016@B978 0 12 816797 7.00009 6
13 pages
Beck, N., & Katz, J. N. (1995) - What To Do (And Not To Do) With Time-Series Cross-Section Data. American Political Science Review, 89 (3), 634-647
No ratings yet
Beck, N., & Katz, J. N. (1995) - What To Do (And Not To Do) With Time-Series Cross-Section Data. American Political Science Review, 89 (3), 634-647
14 pages
Panel Data Analysis: Fixed & Random Effects (Using Stata 10.x)
0% (1)
Panel Data Analysis: Fixed & Random Effects (Using Stata 10.x)
40 pages
Estimating Dynamic Common Correlated Effects Models in Stata
No ratings yet
Estimating Dynamic Common Correlated Effects Models in Stata
41 pages
Robust Regression
No ratings yet
Robust Regression
7 pages
Panel Data On Eviews
No ratings yet
Panel Data On Eviews
15 pages
What To Do (And NOT To Do) With Time-Series Cross-Section Data
No ratings yet
What To Do (And NOT To Do) With Time-Series Cross-Section Data
14 pages
72 UE Panelv3
No ratings yet
72 UE Panelv3
35 pages
Violations of Classical Linear Regression Assumptions Mis-Specification
No ratings yet
Violations of Classical Linear Regression Assumptions Mis-Specification
7 pages
Christian Heumann, Michael Schomaker Shalabh-Introduction To Statistics and Data Analysis With Exercises, Solutions and Applications in R-Springer (2017)
100% (3)
Christian Heumann, Michael Schomaker Shalabh-Introduction To Statistics and Data Analysis With Exercises, Solutions and Applications in R-Springer (2017)
453 pages
L'Analyse de Données de Panel
No ratings yet
L'Analyse de Données de Panel
40 pages
TCH442E Quantitative Methods For Finance
No ratings yet
TCH442E Quantitative Methods For Finance
21 pages
PRINCIPAL - SSRN Id1658640 Code647971
No ratings yet
PRINCIPAL - SSRN Id1658640 Code647971
14 pages
ECON0019 Lecture10 Slides
No ratings yet
ECON0019 Lecture10 Slides
26 pages
PACOTE PRAIS COM TESTE DE WHITE (vcovHC)
No ratings yet
PACOTE PRAIS COM TESTE DE WHITE (vcovHC)
2 pages
Handout 5 Panel Data
No ratings yet
Handout 5 Panel Data
23 pages
Panel Ecmiic2
No ratings yet
Panel Ecmiic2
57 pages
Robust Regression Modeling With STATA Lecture Notes
No ratings yet
Robust Regression Modeling With STATA Lecture Notes
93 pages
Panel Data Econometrics in R: The PLM Package: Yves Croissant Giovanni Millo
No ratings yet
Panel Data Econometrics in R: The PLM Package: Yves Croissant Giovanni Millo
51 pages
Econometrics I: Problem Set II: Prof. Nicolas Berman November 30, 2018
No ratings yet
Econometrics I: Problem Set II: Prof. Nicolas Berman November 30, 2018
4 pages
Panel Data
No ratings yet
Panel Data
14 pages
Lectute 2 - Panel Data Regression
No ratings yet
Lectute 2 - Panel Data Regression
30 pages
Xtxtpcse
No ratings yet
Xtxtpcse
11 pages
DM Screen A4
No ratings yet
DM Screen A4
3 pages
Data Preprocessing
No ratings yet
Data Preprocessing
3 pages
Lecture Notes On Measurement Error
No ratings yet
Lecture Notes On Measurement Error
15 pages
AP Statistics Problems #8
No ratings yet
AP Statistics Problems #8
7 pages
Big Life Journal, Challenges Kit (Big Life Journal) (Z-Library)
100% (7)
Big Life Journal, Challenges Kit (Big Life Journal) (Z-Library)
45 pages
Problem Set 1: Panel Data
No ratings yet
Problem Set 1: Panel Data
3 pages
Non-Spherical Errors: 1 Efficient OLS
No ratings yet
Non-Spherical Errors: 1 Efficient OLS
14 pages
Grass Roots Comics
No ratings yet
Grass Roots Comics
82 pages
(DK How Things Work) DK - How Business Works - The Facts Visually Explained-DK (2022)
100% (14)
(DK How Things Work) DK - How Business Works - The Facts Visually Explained-DK (2022)
354 pages
Probleme MK
No ratings yet
Probleme MK
4 pages
GCSE Formula Sheet
No ratings yet
GCSE Formula Sheet
1 page
Resilience Kit - Big Life Journal-2
100% (10)
Resilience Kit - Big Life Journal-2
70 pages
Great Games - 175 Games Activities For Families Groups Children - B009NNKZ3W
95% (22)
Great Games - 175 Games Activities For Families Groups Children - B009NNKZ3W
214 pages
How To Be A Math Genius - Your Brilliant Brain and How To Train It
96% (26)
How To Be A Math Genius - Your Brilliant Brain and How To Train It
128 pages
Life Skills by DK
100% (18)
Life Skills by DK
98 pages
04-PROFIL-DE-INVESTITOR-www.adrian.asoltanie.com_
No ratings yet
04-PROFIL-DE-INVESTITOR-www.adrian.asoltanie.com_
2 pages
Orthogonal Array Testing
100% (2)
Orthogonal Array Testing
9 pages
POSITIVITY & CONNECTION KIT UK - Big Life Journal
100% (7)
POSITIVITY & CONNECTION KIT UK - Big Life Journal
69 pages
NAT Reviewer Statistics and Probability For Printing
No ratings yet
NAT Reviewer Statistics and Probability For Printing
6 pages
Emotional Grit
100% (14)
Emotional Grit
150 pages
The Ecology Book PDF
100% (35)
The Ecology Book PDF
354 pages
How Science Works The Facts Visually Explained by DK
93% (14)
How Science Works The Facts Visually Explained by DK
131 pages
Peer Tutoring Approach and Academic Performance of Pupils: An Experimental Study
No ratings yet
Peer Tutoring Approach and Academic Performance of Pupils: An Experimental Study
23 pages
Robust Regression
No ratings yet
Robust Regression
52 pages
Heteroskedasticity
100% (1)
Heteroskedasticity
23 pages
Timelines of History - The Ultimate Visual Guide To The Events That Shaped The World PDF
100% (19)
Timelines of History - The Ultimate Visual Guide To The Events That Shaped The World PDF
514 pages
One Million This A Visual Encyclopedia
100% (19)
One Million This A Visual Encyclopedia
305 pages
Mba Statistics Midterm Review Sheet
No ratings yet
Mba Statistics Midterm Review Sheet
1 page
Regression
No ratings yet
Regression
46 pages
When Something Is Bugging Me - Big Life Journal
100% (3)
When Something Is Bugging Me - Big Life Journal
6 pages
Geography As You'Ve Never Seen It Before
100% (29)
Geography As You'Ve Never Seen It Before
192 pages
SHS Practical Research 2
No ratings yet
SHS Practical Research 2
4 pages
Calculus Better Explained PDF
92% (12)
Calculus Better Explained PDF
76 pages
Science Year by Year
99% (76)
Science Year by Year
402 pages
The ADHD Workbook For Kids
97% (78)
The ADHD Workbook For Kids
185 pages
Math Better Explained PDF
100% (10)
Math Better Explained PDF
99 pages
How To Be A Coder - Learn To Think Like A Coder With Fun Activities
100% (12)
How To Be A Coder - Learn To Think Like A Coder With Fun Activities
146 pages
Handouts Educ 206
No ratings yet
Handouts Educ 206
5 pages
Affirmation Bracelets For Kids - Big Life Journal PDF
100% (3)
Affirmation Bracelets For Kids - Big Life Journal PDF
3 pages
Dissertation Using Multiple Regression
100% (3)
Dissertation Using Multiple Regression
8 pages
RP JEPEM Formatting
No ratings yet
RP JEPEM Formatting
6 pages
10 Ways To Love Me For Me - Big Life Journal
100% (1)
10 Ways To Love Me For Me - Big Life Journal
4 pages
DND Starter Sheet
No ratings yet
DND Starter Sheet
2 pages
Course Outline
No ratings yet
Course Outline
2 pages
When I Feel Worried About School - Big Life Journal
100% (2)
When I Feel Worried About School - Big Life Journal
4 pages
Research in Social Sciences: Prepared By: Mr. Ronald H. Abesamis LPT, CSP, RSP, Maed Instructor Iii
No ratings yet
Research in Social Sciences: Prepared By: Mr. Ronald H. Abesamis LPT, CSP, RSP, Maed Instructor Iii
78 pages
121-124 (Dr. Athar 2)
No ratings yet
121-124 (Dr. Athar 2)
4 pages
Metode Statistika
No ratings yet
Metode Statistika
27 pages
4.10 Descriptive Statistics
No ratings yet
4.10 Descriptive Statistics
18 pages
Gju Study Plan - Bida Business Intelligence and Data Analytics v5
No ratings yet
Gju Study Plan - Bida Business Intelligence and Data Analytics v5
31 pages
Defense
No ratings yet
Defense
26 pages
AI Fellowship Syllabus LATAM
No ratings yet
AI Fellowship Syllabus LATAM
17 pages
Mms - E.pdf 3
No ratings yet
Mms - E.pdf 3
11 pages
Value Education
0% (1)
Value Education
13 pages
Clustering Today
No ratings yet
Clustering Today
52 pages
Lec-9 - Joint Moments and Joint Characteristic Functions of Functions of Two Random Variables
No ratings yet
Lec-9 - Joint Moments and Joint Characteristic Functions of Functions of Two Random Variables
20 pages
Quality Framework
No ratings yet
Quality Framework
10 pages
D&D Character Workshop Presentation
No ratings yet
D&D Character Workshop Presentation
64 pages
Sacred Heart Sibu 2013 M2 (Q)
No ratings yet
Sacred Heart Sibu 2013 M2 (Q)
3 pages
Chapter-5 Introduction To Probability
No ratings yet
Chapter-5 Introduction To Probability
15 pages
My SMART Goal UK - Big Life Journal
100% (2)
My SMART Goal UK - Big Life Journal
6 pages
Ain Shams University
No ratings yet
Ain Shams University
15 pages
01.multiple Linear Regression - Ipynb - Colaboratory
No ratings yet
01.multiple Linear Regression - Ipynb - Colaboratory
10 pages
The 7 Habits of A Positive Parent - Big Life Journal
No ratings yet
The 7 Habits of A Positive Parent - Big Life Journal
3 pages
Material1 Latex
No ratings yet
Material1 Latex
7 pages
Assignments Business Economics
No ratings yet
Assignments Business Economics
2 pages
Assignment Instruction For SRU60204 - M211
No ratings yet
Assignment Instruction For SRU60204 - M211
2 pages
5 Day Kindness Challenge - Big Life Journal
100% (1)
5 Day Kindness Challenge - Big Life Journal
7 pages
Lessons in Bioinformatics - Dot Plots: Lessons in Bioinformatics, #1
From Everand
Lessons in Bioinformatics - Dot Plots: Lessons in Bioinformatics, #1
Björn Olsson
No ratings yet
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
Advanced Mathematics for Engineers and Scientists
From Everand
Advanced Mathematics for Engineers and Scientists
Paul DuChateau
4/5 (2)
Recursive Analysis
From Everand
Recursive Analysis
R. L. Goodstein
No ratings yet
General Stochastic Processes in the Theory of Queues
From Everand
General Stochastic Processes in the Theory of Queues
Vaclav E. Benes
No ratings yet
Nonlinear Transformations of Random Processes
From Everand
Nonlinear Transformations of Random Processes
Ralph Deutsch
No ratings yet
Analytical Methods of Optimization
From Everand
Analytical Methods of Optimization
D. F. Lawden
No ratings yet
Digital Signal Processing (DSP) with Python Programming
From Everand
Digital Signal Processing (DSP) with Python Programming
Maurice Charbit
No ratings yet
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
From Everand
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
Gérard Blanchet
3/5 (1)
Matrix Theory
From Everand
Matrix Theory
Joel N. Franklin
No ratings yet
Geometric Hashing: Efficient Algorithms for Image Recognition and Matching
From Everand
Geometric Hashing: Efficient Algorithms for Image Recognition and Matching
Fouad Sabry
No ratings yet
Level Set Method: Advancing Computer Vision, Exploring the Level Set Method
From Everand
Level Set Method: Advancing Computer Vision, Exploring the Level Set Method
Fouad Sabry
No ratings yet

Journal of Statistical Software: Implementing Panel-Corrected Standard Errors in R: The Pcse Package

Uploaded by

Journal of Statistical Software: Implementing Panel-Corrected Standard Errors in R: The Pcse Package

Uploaded by

JSS Journal of Statistical Software

June 2011, Volume 42, Code Snippet 1. http://www.jstatsoft.org/

Implementing Panel-Corrected Standard Errors

Delia Bailey Jonathan N. Katz

Keywords: pcse, time-series–cross-section, covariance matrix estimation, contemporaneous

2. TSCS data and estimation

yi,t = xi,t β + i,t ; i = 1, . . . , N ; t = 1, . . . , T (1)

Cov(β̂) = (X> X)−1 {X> ΩX}(X> X)−1 . (2)

If the errors obey the spherical error assumption — i.e., Ω = σ 2 I, where I is an N T × N T

PCSE = (X> X)−1 X> Ω̂X(X> X)−1 . (6)

3.1. Balanced data

pcse(lmobj, groupN, groupT, ...)

3.2. Unbalanced data

units <- unique(groupN)

We can then fit their basic model of economic growth with

R> agl.lm <- lm(growth ~ lagg1 + opengdp + openex + openimp +

Residual standard error: 1.82 on 218 degrees of freedom

We can correct the standard errors by using:

R> agl.pcse <- pcse(agl.lm, groupN = agl$country, groupT = agl$year)

Estimate PCSE t value Pr(>|t|)

(Intercept) 5.968890 0.89298 6.684 1.91e-10

# Valid Obs = 240; # Missing Obs = 0; Degrees of Freedom = 218.

Estimate PCSE t value Pr(>|t|)

openex 0.002215 0.00115 1.9319 5.47e-02

# Valid Obs = 230; # Missing Obs = 10; Degrees of Freedom = 208.

R> aglUn.pcse2 <- pcse(aglUn.lm, groupN = aglUn$country,

Estimate PCSE t value Pr(>|t|)

openex 0.002215 0.000782 2.8327 5.07e-03

# Valid Obs = 230; # Missing Obs = 10; Degrees of Freedom = 208.

Alvarez RM, Garrett G, Lange P (1991). “Government Partisanship, Labor Organization,

Beck N, Katz JN (1995). “What To Do (and Not To Do) with Times-Series–Cross-Section

Beck N, Katz JN (2011). “Modeling Dynamics in Time-Series–Cross-

Henningsen A, Hamann JD (2007). “systemfit: A Package for Estimating Systems of Si-

Huber PJ (1967). “The Behavior of Maximum Likelihood Estimation Under Nonstandard

MacKinnon JG, White H (1985). “Some Heteroskedasticity-Consistent Covariance Matrix

Parks R (1967). “Efficient Estimation of a System of Regression Equations When Disturbances

White H (1980). “A Heteroskedasticity-Consistent Covariance Matrix and a Direct Test for

Pasadena, CA 91125, United States of America

Journal of Statistical Software http://www.jstatsoft.org/

You might also like

yi,t = xi,t β + i,t ; i = 1, . . . , N ; t = 1, . . . , T (1)