Stefanski
Stefanski
To cite this article: L. A. Stefanski (2000) Measurement Error Models, Journal of the American Statistical Association, 95:452,
1353-1358
Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained
in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no
representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the
Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and
are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and
should be independently verified with primary sources of information. Taylor and Francis shall not be liable for
any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever
or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of
the Content.
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any
form to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://
www.tandfonline.com/page/terms-and-conditions
Theory and Methods of Statistics 1353
Bozdogan, H. (1994), Proceedings of the First U.S./Japan Conference of Lehmann, E. L. (1983), Theory of Point Estimation, New York Wiley.
the Frontiers of Statistical Modeling, Vols. 1-3, Amsterdam: Kluwer. Lindley, D. V. (1956), “On a Measure of Information Provided by an Ex-
Brockett, P. L. (1991), “Information Theoretic Approach to Actuarial Sci- periment,” Annals of Mathematical Statistics, 27, 986-1005.
ence: A Unification and Extension of Relevant Theory and Applica- (1961), “The Use of Prior Probability Distributions in Statistical
tions,” Transactions of Society of Actuaries, 43, 73-135. Inference and Decision,” in Proceedings of the Fourth Berkeley Sympo-
Brockett, P. L., Charnes, A,, Cooper, W. W., Learner, D., and Phillips, F. Y. sium, 1, Berkeley, C A University of California Press, pp. 436468.
(1995),“Information Theory as a Unifying Statistical Approach for Use Maasoumi, E. (1993), “A Compendium to Information Theory in Eco-
in Marketing Research,” European Journal of Operational Research, 84, nomics and Econometrics,” Econometric Reviews, 12(2), 137-181.
31&329. Pourahmadi, M., and Soofi, E. S. (2000), “Predictive Variance and Infor-
Burnham, K. P., and Anderson, D. R. (1998), Model Selection and Infer- mation Worth of Observations in Time Series,” Journal of Time Series
ence: A Practical Information-TheoreticApproach, New York: Springer- Analysis, 21, 413-434.
Verlag. Rao, C. R. (1973), Linear Statistical Inference and Its Applications (2nd
Cover, T. M., and Thomas, J. A. (1991), Elements of Information Theory, ed.), New York: Wiley.
New York: Wiley. Shannon, C. E. (1948), “A Mathematical Theory of Communication,” Bell
Csiszar, I. (19911, “Why Least Squares and Maximum Entropy? An Ax- System Technical Journal, 27, 379423.
iomatic Approach to Inference in Linear Inverse Problems,” The Annals Shore, J. E., and Johnson, R. W. (1980), “Axiomatic Derivation of the
of Statistics, 19, 2032-66. Principle of Maximum Entropy and the Principle of Minimum Cross-
Fisher, R. A. (1921),“On Mathematical Foundations of Theoretical Statis- Entropy,” IEEE Transactions on Informational Theory, IT-26, 26-37.
tics,” Philosophical Transactions of the Royal Society of London, Ser.
Soofi, E. S. (1994), “Capturing the Intangible Concept of Information,”
A, 222,309-368.
Journal of the American Statistical Association, 89, 1243-1254.
Fomby, T. B., and Hill, R. C. (1997),Advances in Econometrics: Applying
(1997), “Information Theoretic Regression Methods,” in Advances
Maximum Entropy to Econometric Problems, Vol. 12, Greenwich, CT:
in Econometrics:Applying Maximum Entropy to Econometric Problems,
JAI Press.
Gokhale, D. V., and Kullback, S. (1978), The Information in Contingency
12, eds. T. B. Fomby and R. C. Hill, Greenwich, C T JAI Press, pp.
25-83.
Downloaded by [Osaka University] at 21:26 19 November 2014
with error and explains why this vignette shares the same These four examples have in common the feature that for
title. certain statistical analyses, the variable X is preferred over
As for the second comment, the fact that statisticians are W : Methods for estimating animal abundance from line-
moving toward more complex models in general is clear transect data assume that distances are measured without
from reading any current issue of JASA. At the time of the error; for assessing the relationship between heart disease
Toronto meetings, the field of measurement error modeling and blood pressure, long-term average blood pressure is the
was already broadening to include more realistic models meaningful risk factor; personal exposure is the relevant
required for certain applications. In the period since then, risk factor in studies of the effects of NO2 exposure on res-
the breadth and complexity of the models studied under the piratory illness in children; and excess daily mortality due
heading of measurement error models has grown at an ever- to exposure to particulate matter is more directly related
increasing rate. This expansion is straining the terminology, to average personal exposure than to concentrations mea-
notation, and assumptions that were adequate for describing sured at central monitoring stations. These examples also
linear errors-in-variables models, and also is creating new have in common the feature that X is not observable for
problems for research. each observational unit, whereas W is. The problem is to fit
Despite the rapid developments in the field and the in- models described in terms of X (and other observed vari-
creasing use of measurement error models in practice over ables) given data on W (and the other observed variables).
the last 15-20 years, it is likely that many readers are not fa- Other, more detailed descriptions of measurement error
miliar with measurement error problems. The sections that problems have been given by Carroll, Ruppert, and Stefan-
follow provide an introduction to measurement error prob- ski (1995) and Fuller (1987).
Downloaded by [Osaka University] at 21:26 19 November 2014
ysis of the observed data when the purpose of the analysis is equation errors E,, it transpires that pylW = XpYlx + o p ( l ) ,
inference about a model defined in terms of X . Problems of where 1 = s$/(s$ + s;), where s2 denotes the sample
this nature are commonly called measurement error prob- variance of the subscript variable. Because 0 < 1 < 1, a
lems, and the statistical models and methods for analyzing consequence of ignoring measurement error is attenuation
such data are called measurement error models. (bias toward 0) in the slope estimator.
The terminology belies the generality of the class of This simple model illustrates the key features of measure-
problems and models connoted by the phrase “measurement ment error in regression models: The parameters of interest
error.” Its origins lie in the special case in which the substi- ( q p ) appear in the regression model for E ( Y I X ) , which
tute variable W is a measurement (in the usual sense of the depends on the unobserved variable X ; the second com-
word) of the true value of X . For example, the measured ponent of the model relates the observed substitute vari-
perpendicular distance W that an animal lies from a tran- able W to X (in this case via the additive error model
sect line can differ significantly from the true distance X W = X + U , note that in this case E ( W I X ) = X ) ; and the
depending on the method of measurement. Similarly, a per- model fit to the observed data results in biased estimators
son’s measured systolic blood pressure W differs from his of the parameters of interest. A less obvious feature that
or her long-term average systolic blood pressure X because it shares with other measurement error models is a lack of
of significant temporal variation as well as instrument and identifiability. The regression parameters ( a ,p) cannot be
reader error. These cases are fundamentally different from consistently estimated without additional data (e.g., repli-
the situation in which the ambient concentrations of NO2 in cate measurements) or additional information in the form
the bedroom and kitchen of a child’s home W = (Wl, W Z ) of distributional or moment restrictions on the error distri-
are substitutes for the child’s personal exposure to NO;! X butions or the distribution of X . An often-studied, early ver-
(which is directly measurable via a personal monitor but sion of the model known as the classical errors-in-variables
not done for reasons of cost or inconvenience). The latter model (Fuller 1987, sec. 1.3) incorporates the assumption
situation is similar to that in which airborne particulate mat- that the ratio of error variances 7 = a:/& is known. Under
ter concentration measured daily at a central location W is this identifiability assumption,the regression parameters are
used as a substitute for daily average (over a population consistently estimated using the method of orthogonal least
of individuals) personal exposure to particulate matter X squares, formulated and described more than 100 years ago
in time series studies of the relationship between daily air by R. J. Adcock (1877, 1878). However, the historical im-
pollution and daily mortality. portance of the model in which the ratio of error variances
Theory and Methods of Statistics 1355
is assumed known is greater than its practical significance infinitely many nuisance parameters, for which the defi-
(Carroll et al. 1995; Carroll and Ruppert 1996). Finally, it ciencies of maximum likelihood estimation are now well
is important to distinguish between the classical errors-in- known. The pioneering work of Neyman and Scott (1948)
variables model and the classical error model. The latter and Kiefer and Wolfowitz (1956) on estimation in the pres-
phrase refers only to the measurement error component of ence of numerous nuisance parameters had significant im-
a model and describes the additive error model W = X + U , pact not only on the direction of research in the errors-in-
with U and X independent (or uncorrelated). variables literature, but also on the course of research in
statistics in general.
3. LINEAR ERRORS-IN-VARIABLES MODELS
Generalizations and variations of linear error-in-variables 4. MEASUREMENT ERROR IN NONLINEAR MODELS
models were studied extensively in the 100 years following In the 1980s, the research emphasis in measurement er-
Adcock’s description of orthogonal least squares. The ma- ror models shifted to problems in which the model of
jor generalizations include multiple and multivariate multi- interest defined in terms of X is something other than
ple linear regression, in which one or more predictor vari- a linear model. With increasing frequency, articles ap-
ables are measured with error and some predictor variables peared on nonlinear regression models with errors in both
are error free. The major variations depend on assumptions variables, generalized linear models with predictor vari-
about the unobserved predictors and the type of data or dis- ables measured with error, and nonparametric distribu-
tributional (moment) information available, ensuring iden- tion/density/regression estimation in the presence of mea-
tifiability of the regression parameters. surement error. But these are not the first investigations of
Downloaded by [Osaka University] at 21:26 19 November 2014
In functional errors-in-variables models, the unobserved measurement error in nonlinear models. Eddington ( 1913)
X i are modeled as unknown, nonrandom constants (pa- studied distribution estimation, and Trumpler and Weaver
rameters), whereas in structural errors-in-variables mod- (1953, sec. 1.56) described an interesting approach to
els, the observables and unobservables jointly vary in nonlinear regression errors-in-variables problems; however,
repeated sampling. For example, consider modeling the these researches were the exceptions, whereas by the late
relationship between aquatic species diversity Y and acid- 1980s, the focus of measurement error modeling research
neutralizing capacity X , given measurements of (Y,X ) was on nonlinear models.
from each of n lakes. If the only lakes of interest are those The early research in errors-in-variables models was
represented in the sample, then it is appropriate to model driven by applications in the physical sciences, especially
Xi, i = 1, . . . , n, as unknown constants. Alternatively, if the astronomy, and soon thereafter also by econometric appli-
lakes represented in the data are a random sample from a cations. Much of the current research in nonlinear measure-
large population of lakes, then it is appropriate to model ment error models is motivated by applications in the health
Xi,i = 1,.. . ,n, as iid random variables. sciences, this is especially true of research in generalized
Parameters are identified under a wide variety of combi- linear measurement error models. The article by Carroll,
nations of error distributional or moment restrictions and Spiegelman, Lan, Bailey, and Abbott (1984) on measure-
the existence of additional data (e.g., replicate measure- ment error in binary regression marks the shift in emphasis
ments), each giving rise to a unique and interesting infer- from linear to nonlinear measurement error models, and is
ence problem. Discussion of the numerous variations of noteworthy for breaking ground in the application of mea-
the model, the methods of analysis adapted to each, and surement error modeling in the health sciences and in the
the early contributions of many well-known statisticians to study of generalized linear models with measurement error.
the field were provided by Madansky (1959). Sprent (1989) Greater variety in the types of error model (the model re-
provided a more recent overview of the field, including ref- lating the observed W and unobserved X)studied also char-
erences to other key review articles. acterizes the recent research in measurement error model-
The traditional distinction in the literature between func- ing. The so-called Berkson error model, wherein the ob-
tional models and structural models is, for some applica- served values W are fixed in repeated sampling and the X
tions, not as relevant as the distinction between functional values vary (Berkson 1950), which is little studied in linear
modeling and structural modeling as defined by Carroll et models because of the lack of rectifiable problems that it
al. (1995, sec. 1.2), which facilitates classification of estima- causes, plays a more significant role in the study of non-
tion methods based on the strength of the assumptions made linear measurement error models. The distinction between
about the latent variables X I ,Xa, . . .. However, functional Berkson error and classical error models is subtle but im-
models played an important role in the study of measure- portant because of the very different problems associated
ment error models and in statistics more generally. with each (Carroll et al. 1995, sec. 1.3; Fuller 1987, sec.
In a functional errors-in-variables model, the unobserved 1.6.4). A frequently cited example due to Fuller (1987, sec.
latent variables are unknown parameters. Consequently, for 1.6.4) illustrates the key feature of Berkson error. Consider
a functional model with sample size n, and hence measure- an experiment in which the quality of cement, Y, is to be
ments on n latent variables, the unknown parameter vector studied as a function of the amount of water in the mix-
includes ( X I ,X z , . . . , X n ) and so has dimension increas- ture. The amount of water is controlled by setting a me-
ing linearly with sample size. Functional errors-in-variables tered valve to specified values. Because of fluctuations in
models are practical and classical examples of models with water pressure and inaccuracies in the metered valve, the
1356 Journal of the American Statistical Association, December 2000
true amount of water in a mixture X , differs from the pre- that the error variance is known, or that it is independently
scribed amount W . Note that W is controlled as part of estimable, say from replicate measurements or validation
the experimental design and is not random. In replication data. The model is identified more generally, but in this
of the experiment at the same value of W , the true amount case identifiability does not imply the ability to obtain esti-
X will vary. If the valve is correctly calibrated, then a rea- mators with acceptable finite-sample properties (Carroll et
sonable model is X = W + U , where U is a mean-0 error al. 1995, sec. 7.1.1). A number of estimation methods have
regardless of the value of W . Thus for unbiased Berkson been studied for this model and were described by Carroll
error, E ( X I W ) = W , whereas for unbiased classical error, et al. (1995). One such method, chosen because of its link
E(WIX) = x. to the work of Neyman and Scott (19481, is described next.
There are experimental and sampling designs/studies in Consider the functional version of the logistic measure-
which the pure forms of error models (classical and Berk- ment error model with errors, U,, that are normally dis-
son) arise naturally, and hence their importance in ap- tributed with known variance &. In this case, the density
plied and theoretical work. However, there are applications of ( y Z , W,) is
where neither error model is directly applicable (measuring
f Y W ( Y ,WlPO, P z , XZ)
a child's exposure to NO2 and measuring average expo-
sure to particulate matter are two such examples), although
either may be appropriate after transformation of the raw = { H ( a + @ X i > } Y { l - H(a+@xi)}'-y-q3
OlJ
("iUXi),
~
measured at central sites). The fact that measurement er- lihood Estimator in the Presence of Infinitelv Manv Incidental Parame-
ror affects variable selection and model building is not in ters,” Annals of Mathematical Statistics, 27, 887-906.
Lipfert, F. W., and Wyzga, R. E. (1995), “Uncertainties in Identifying Re-
dispute, although the severity of its effect in multivariable sponsible Pollutants in Observational Epidemiology Studies,” Inhalation
models is not known. Model building and variable selec- Toxicology, 7, 671489.
tion in the presence of measurement error is a potentially MacMahon, S., Peto, R., Cutler, J., Collins, R., Sorlie, P., Neaton, J., Ab-
interesting and useful topic of study. bott, R., Godwin, J., Dyer, A,, and Stamler, J. (1990), “Blood Pressure,
Stroke and Coronary Heart Disease: Part 1, Prolonged Differences in
Blood Pressure: Prospective Observational Studies Corrected for the
REFERENCES Regression Dilution Bias,” Lancet, 335, 765-774.
Adcock, R. J. (1877), “Note on the Method of Least Squares,” Analyst, 4, Madansky, A. (1959), “The Fitting of Straight Lines When Both Variables
183-184. are Subject to Error,” Journal of the American Statistical Association,
-(1878), “A Problem in Least Squares,” Analyst, 5, 53-54. 54, 173-205.
Berkson, J. (1950), “Are There Two Regressions?,” Journal ofthe Ameri- Neyman, J., and Scott, E. L. (1948), “Consistent Estimates Based on Par-
can Statistical Association, 45, 164-180. tially Consistent Observations,” Econometrica, 16, 1-32.
Carroll, R. J., and Ruppert, D. (1996), “The Use and Misuse of Orthog- Schwartz, J., Dockery, D. W., and Neas, L. M. (1996), “Is Daily Mortality
onal Regression in Linear Errors-in-Variables Models,” The American Associated Specifically With Fine Particles?,” Journal of the Air and
Statistician, 50, 1 4 . Waste Management Association, 46, 927-939.
Carroll, R. J., Ruppert, D., and Stefanski, L. A. (1995), Measurement Error Sprent, P. (1989), “Some History of Functional and Structural Relation-
in Nonlinear Models, London: Chapman and Hall. ships,” in Statistical Analysis of Measurement Error Models and Appli-
Carroll, R. J., Spiegelman, C., Lan, K. K., Bailey, K. T., and Abbott, cations, eds. P. J. Brown and W. A. Fuller, Providence, RI: American
R. D. (1984), “On Errors-in-Variables for Binary Regression Models,” Mathematical Society.
Biometrika, 71, 19-26. Stefanski, L. A,, and Carroll, R. J. (1985), “Covariate Measurement Error
Cook, J., and Stefanski, L. A. (1994), “A Simulation Extrapolation Method in Logistic Regression,” The Annals of Statistics, 13, 1335-1351.
Downloaded by [Osaka University] at 21:26 19 November 2014
for Parametric Measurement Error Models,” Journal of the American (1987), “Conditional Scores and Optimal Scores in Generalized
Statistical Association, 89, 1314-1328. Linear Measurement Error Models,” Biometrika, 74, 703-716.
Eddington, A. S. (1913), “On a Formula for Correcting Statistics for the Stefanski, L. A., and Cook, J. (1995), “Simulation Extrapolation: The Mea-
Effects of a Known Probable Error of Observation,” Mon. Not. R. As- surement Error Jackknife,” Journal of the American Statistical Associ-
trom. Soc., 73, 359-360. ation, 90, 1247-1256.
Fuller, W. A. (1987), Measurement Error Models, New York: Wiley. Trumpler, R. J., and Weaver, H. F. (19531, Statistical Astronomy, Berkeley,
Kiefer, J., and Wolfowitz, J. (1956), “Consistency of the Maximum Like- CA. University of California Press.
Robert L. Strawderman is Associate Professor, Department of Bio- @ 2000 American Statistical Association
metrics, Cornell University, 434 Warren Hall, Ithaca NY 14853 (E-mail: Journal of the American Statistical Association
rls54 @ cornell.edu). December 2000, Vol. 95, No. 452, Vignettes