Contents
Contents
Principles of
Econometrics
k Fifth Edition k
R. CARTER HILL
Louisiana State University
WILLIAM E. GRIFFITHS
University of Melbourne
GUAY C. LIM
University of Melbourne
k
k
Brief Contents
PREFACE v 10 Endogenous Regressors and
LIST OF EXAMPLES xxi Moment-Based Estimation 481
1 An Introduction to Econometrics 1 11 Simultaneous Equations Models 531
Probability Primer 15
12 Regression with Time-Series Data:
2 The Simple Linear Regression Nonstationary Variables 563
Model 46
13 Vector Error Correction and Vector
3 Interval Estimation and Hypothesis Autoregressive Models 597
Testing 112
14 Time-Varying Volatility and ARCH
4 Prediction, Goodness-of-Fit, and Models 614
Modeling Issues 152
15 Panel Data Models 634
5 The Multiple Regression Model 196
16 Qualitative and Limited Dependent
6 Further Inference in the Multiple Variable Models 681
k Regression Model 260 k
APPENDIX A Mathematical Tools 748
iv
k
k
Table of Contents
PREFACE v P.5.6 Covariance Between Two Random
LIST OF EXAMPLES xxi Variables 27
P.6 Conditioning 29
1 An Introduction to P.6.1 Conditional Expectation 30
P.6.2 Conditional Variance 31
Econometrics 1 P.6.3 Iterated Expectations 32
P.6.4 Variance Decomposition 33
1.1 Why Study Econometrics? 1 P.6.5 Covariance Decomposition 34
1.2 What Is Econometrics About? 2 P.7 The Normal Distribution 34
1.2.1 Some Examples 3 P.7.1 The Bivariate Normal Distribution 37
1.3 The Econometric Model 4 P.8 Exercises 39
1.3.1 Causality and Prediction 5
1.4 How Are Data Generated?
1.4.1 Experimental Data 6
5 2 The Simple Linear Regression
1.4.2 Quasi-Experimental Data 6 Model 46
1.4.3 Nonexperimental Data 7
1.5 Economic Data Types 7 2.1 An Economic Model 47
1.5.1 Time-Series Data 7 2.2 An Econometric Model 49
1.5.2 Cross-Section Data 8 2.2.1 Data Generating Process 51
1.5.3 Panel or Longitudinal Data 9 2.2.2 The Random Error and Strict
k Exogeneity 52
k
1.6 The Research Process 9
2.2.3 The Regression Function 53
1.7 Writing an Empirical Research Paper 11 2.2.4 Random Error Variation 54
1.7.1 Writing a Research Proposal 11 2.2.5 Variation in x 56
1.7.2 A Format for Writing a Research Report 11 2.2.6 Error Normality 56
1.8 Sources of Economic Data 13 2.2.7 Generalizing the Exogeneity
1.8.1 Links to Economic Data on the Internet 13 Assumption 56
1.8.2 Interpreting Economic Data 14 2.2.8 Error Correlation 57
1.8.3 Obtaining the Data 14 2.2.9 Summarizing the Assumptions 58
2.3 Estimating the Regression Parameters 59
Probability Primer 15
2.3.1 The Least Squares Principle 61
P.1 Random Variables 16 2.3.2 Other Economic Models 65
P.2 Probability Distributions 17 2.4 Assessing the Least Squares Estimators 66
P.3 Joint, Marginal, and Conditional 2.4.1 The Estimator b2 67
Probabilities 20 2.4.2 The Expected Values of b1 and b2 68
P.3.1 Marginal Distributions 20 2.4.3 Sampling Variation 69
P.3.2 Conditional Probability 21 2.4.4 The Variances and Covariance of b1
P.3.3 Statistical Independence 21 and b2 69
P.4 A Digression: Summation Notation 22 2.5 The Gauss–Markov Theorem 72
P.5 Properties of Probability Distributions 23 2.6 The Probability Distributions of the Least
P.5.1 Expected Value of a Random Variable 24 Squares Estimators 73
P.5.2 Conditional Expectation 25
2.7 Estimating the Variance of the Error Term 74
P.5.3 Rules for Expected Values 25
2.7.1 Estimating the Variances and Covariance of
P.5.4 Variance of a Random Variable 26
the Least Squares Estimators 74
P.5.5 Expected Values of Several Random
2.7.2 Interpreting the Standard Errors 76
Variables 27
xii
k
k
k
k
4.3.6 Identifying Influential Observations 169 5.5.2 One-Tail Hypothesis Testing for a Single
4.4 Polynomial Models 171 Coefficient 220
4.4.1 Quadratic and Cubic Equations 171 5.5.3 Hypothesis Testing for a Linear Combination
of Coefficients 221
4.5 Log-Linear Models 173
4.5.1 Prediction in the Log-Linear Model 175 5.6 Nonlinear Relationships 222
4.5.2 A Generalized R2 Measure 176 5.7 Large Sample Properties of the Least Squares
4.5.3 Prediction Intervals in the Log-Linear Estimator 227
Model 177 5.7.1 Consistency 227
4.6 Log-Log Models 177 5.7.2 Asymptotic Normality 229
5.7.3 Relaxing Assumptions 230
4.7 Exercises 179 5.7.4 Inference for a Nonlinear Function of
4.7.1 Problems 179
Coefficients 232
4.7.2 Computer Exercises 185
5.8 Exercises 234
Appendix 4A Development of a Prediction 5.8.1 Problems 234
Interval 192 5.8.2 Computer Exercises 240
Appendix 4B The Sum of Squares Appendix 5A Derivation of Least Squares
Decomposition 193 Estimators 247
Appendix 4C Mean Squared Error: Estimation and
Appendix 5B The Delta Method 248
Prediction 193 5B.1 Nonlinear Function of a Single
Parameter 248
5 The Multiple Regression 5B.2 Nonlinear Function of Two Parameters 249
Model 196 Appendix 5C Monte Carlo Simulation 250
5C.1 Least Squares Estimation with Chi-Square
k 5.1 Introduction 197 Errors 250 k
5.1.1 The Economic Model 197 5C.2 Monte Carlo Simulation of the Delta
5.1.2 The Econometric Model 198 Method 252
5.1.3 The General Model 202 Appendix 5D Bootstrapping 254
5.1.4 Assumptions of the Multiple Regression 5D.1 Resampling 255
Model 203 5D.2 Bootstrap Bias Estimate 256
5.2 Estimating the Parameters of the Multiple 5D.3 Bootstrap Standard Error 256
Regression Model 205 5D.4 Bootstrap Percentile Interval Estimate 257
5.2.1 Least Squares Estimation Procedure 205 5D.5 Asymptotic Refinement 258
5.2.2 Estimating the Error Variance σ2 207
5.2.3 Measuring Goodness-of-Fit 208 6 Further Inference in the Multiple
5.2.4 Frisch–Waugh–Lovell (FWL) Theorem 209
Regression Model 260
5.3 Finite Sample Properties of the Least Squares
Estimator 211 6.1 Testing Joint Hypotheses: The F-test 261
5.3.1 The Variances and Covariances of the Least 6.1.1 Testing the Significance of the Model 264
Squares Estimators 212 6.1.2 The Relationship Between t- and
5.3.2 The Distribution of the Least Squares F-Tests 265
Estimators 214 6.1.3 More General F-Tests 267
5.4 Interval Estimation 216 6.1.4 Using Computer Software 268
5.4.1 Interval Estimation for a Single 6.1.5 Large Sample Tests 269
Coefficient 216 6.2 The Use of Nonsample Information 271
5.4.2 Interval Estimation for a Linear Combination
6.3 Model Specification 273
of Coefficients 217
6.3.1 Causality versus Prediction 273
5.5 Hypothesis Testing 218 6.3.2 Omitted Variables 275
5.5.1 Testing the Significance of a Single 6.3.3 Irrelevant Variables 277
Coefficient 219
k
k
Table of Contents xv
k
k
Appendix 8C Properties of the Least Squares 10.1.2 Why Least Squares Estimation
Residuals 410 Fails 484
8C.1 Details of Multiplicative Heteroskedasticity 10.1.3 Proving the Inconsistency
Model 411 of OLS 486
Appendix 8D Alternative Robust Sandwich 10.2 Cases in Which x and e are Contemporaneously
Estimators 411 Correlated 487
10.2.1 Measurement Error 487
Appendix 8E Monte Carlo Evidence: OLS, GLS, and
10.2.2 Simultaneous Equations Bias 488
FGLS 414
10.2.3 Lagged-Dependent Variable Models with
Serial Correlation 489
9 Regression with Time-Series 10.2.4 Omitted Variables 489
Data: Stationary Variables 417 10.3 Estimators Based on the Method
of Moments 490
9.1 Introduction 418 10.3.1 Method of Moments Estimation of a
9.1.1 Modeling Dynamic Relationships 420 Population Mean and Variance 490
9.1.2 Autocorrelations 424 10.3.2 Method of Moments Estimation in the Simple
Regression Model 491
9.2 Stationarity and Weak Dependence 427
10.3.3 Instrumental Variables Estimation in the
9.3 Forecasting 430 Simple Regression Model 492
9.3.1 Forecast Intervals and Standard Errors 433 10.3.4 The Importance of Using Strong
9.3.2 Assumptions for Forecasting 435 Instruments 493
9.3.3 Selecting Lag Lengths 436 10.3.5 Proving the Consistency of the IV
9.3.4 Testing for Granger Causality 437 Estimator 494
9.4 Testing for Serially Correlated Errors 438 10.3.6 IV Estimation Using Two-Stage Least Squares
9.4.1 Checking the Correlogram of the Least (2SLS) 495
k k
Squares Residuals 439 10.3.7 Using Surplus Moment Conditions 496
9.4.2 Lagrange Multiplier Test 440 10.3.8 Instrumental Variables Estimation in the
9.4.3 Durbin–Watson Test 443 Multiple Regression Model 498
10.3.9 Assessing Instrument Strength Using the
9.5 Time-Series Regressions for Policy
First-Stage Model 500
Analysis 443
10.3.10 Instrumental Variables Estimation in a
9.5.1 Finite Distributed Lags 445
General Model 502
9.5.2 HAC Standard Errors 448
10.3.11 Additional Issues When Using IV
9.5.3 Estimation with AR(1) Errors 452
Estimation 504
9.5.4 Infinite Distributed Lags 456
10.4 Specification Tests 505
9.6 Exercises 463
10.4.1 The Hausman Test for Endogeneity 505
9.6.1 Problems 463
10.4.2 The Logic of the Hausman Test 507
9.6.2 Computer Exercises 468
10.4.3 Testing Instrument Validity 508
Appendix 9A The Durbin–Watson Test 476 10.5 Exercises 510
9A.1 The Durbin–Watson Bounds Test 478
10.5.1 Problems 510
Appendix 9B Properties of an AR(1) Error 479 10.5.2 Computer Exercises 516
Appendix 10A Testing for Weak Instruments 520
10 Endogenous Regressors and 10A.1 A Test for Weak Identification 521
10A.2 Testing for Weak Identification:
Moment-Based Estimation 481
Conclusions 525
10.1 Least Squares Estimation with Endogenous Appendix 10B Monte Carlo Simulation 525
Regressors 482 10B.1 Illustrations Using Simulated Data 526
10.1.1 Large Sample Properties of the OLS 10B.2 The Sampling Properties of
Estimator 483 IV/2SLS 528
k
k
12.1 Stationary and Nonstationary Variables 564 14.1 The ARCH Model 615
12.1.1 Trend Stationary Variables 567 14.2 Time-Varying Volatility 616
12.1.2 The First-Order Autoregressive Model 570
12.1.3 Random Walk Models 572
14.3 Testing, Estimating, and Forecasting 620
k
k
15.1.3 Using OLS to Estimate the Panel Data 16.2.9 Binary Choice Models with a Binary
Regression 639 Endogenous Variable 699
15.2 The Fixed Effects Estimator 640 16.2.10 Binary Endogenous Explanatory
15.2.1 The Difference Estimator: T = 2 640 Variables 700
15.2.2 The Within Estimator: T = 2 642 16.2.11 Binary Choice Models and Panel
15.2.3 The Within Estimator: T > 2 643 Data 701
15.2.4 The Least Squares Dummy Variable 16.3 Multinomial Logit 702
Model 644 16.3.1 Multinomial Logit Choice
15.3 Panel Data Regression Error Assumptions 646 Probabilities 703
15.3.1 OLS Estimation with Cluster-Robust 16.3.2 Maximum Likelihood Estimation 703
Standard Errors 648 16.3.3 Multinomial Logit Postestimation
15.3.2 Fixed Effects Estimation with Cluster-Robust Analysis 704
Standard Errors 650 16.4 Conditional Logit 707
15.4 The Random Effects Estimator 651 16.4.1 Conditional Logit Choice
15.4.1 Testing for Random Effects 653 Probabilities 707
15.4.2 A Hausman Test for Endogeneity in the 16.4.2 Conditional Logit Postestimation
Random Effects Model 654 Analysis 708
15.4.3 A Regression-Based Hausman Test 656 16.5 Ordered Choice Models 709
15.4.4 The Hausman–Taylor Estimator 658 16.5.1 Ordinal Probit Choice Probabilities 710
15.4.5 Summarizing Panel Data Assumptions 660 16.5.2 Ordered Probit Estimation and
15.4.6 Summarizing and Extending Panel Data Interpretation 711
Model Estimation 661 16.6 Models for Count Data 713
15.5 Exercises 663 16.6.1 Maximum Likelihood Estimation of the
15.5.1 Problems 663 Poisson Regression Model 713
k 15.5.2 Computer Exercises 670 16.6.2 Interpreting the Poisson Regression k
Appendix 15A Cluster-Robust Standard Errors: Model 714
Some Details 677 16.7 Limited Dependent Variables 717
16.7.1 Maximum Likelihood Estimation
Appendix 15B Estimation of Error
of the Simple Linear Regression
Components 679
Model 717
16.7.2 Truncated Regression 718
16 Qualitative and Limited 16.7.3 Censored Samples and Regression 718
Dependent Variable Models 681 16.7.4 Tobit Model Interpretation 720
16.7.5 Sample Selection 723
16.1 Introducing Models with Binary Dependent 16.8 Exercises 725
Variables 682 16.8.1 Problems 725
16.1.1 The Linear Probability Model 683 16.8.2 Computer Exercises 733
16.2 Modeling Binary Choices 685 Appendix 16A Probit Marginal Effects: Details 739
16.2.1 The Probit Model for Binary 16A.1 Standard Error of Marginal Effect at a Given
Choice 686 Point 739
16.2.2 Interpreting the Probit Model 687 16A.2 Standard Error of Average Marginal
16.2.3 Maximum Likelihood Estimation of the Probit Effect 740
Model 690 Appendix 16B Random Utility Models 741
16.2.4 The Logit Model for Binary Choices 693 16B.1 Binary Choice Model 741
16.2.5 Wald Hypothesis Tests 695 16B.2 Probit or Logit? 742
16.2.6 Likelihood Ratio Hypothesis Tests 696
Appendix 16C Using Latent Variables 743
16.2.7 Robust Inference in Probit and Logit
16C.1 Tobit (Tobit Type I) 743
Models 698
16C.2 Heckit (Tobit Type II) 744
16.2.8 Binary Choice Models with a Continuous
Endogenous Variable 698 Appendix 16D A Tobit Monte Carlo Experiment 745
k
k
k
k
xx Table of Contents
C.6.7 Type I and Type II Errors 833 C.10 Kernel Density Estimator 851
C.6.8 A Relationship Between Hypothesis Testing
C.11 Exercises 854
and Confidence Intervals 833
C.11.1 Problems 854
C.7 Some Other Useful Tests 834 C.11.2 Computer Exercises 857
C.7.1 Testing the Population Variance 834
C.7.2 Testing the Equality of Two Population
Means 834 D Statistical Tables 862
C.7.3 Testing the Ratio of Two Population
Variances 835 TableD.1 Cumulative Probabilities for the Standard
C.7.4 Testing the Normality of a Population 836 Normal Distribution 𝚽(z) = P(Z ≤ z) 862
C.8 Introduction to Maximum Likelihood TableD.2 Percentiles of the t-distribution 863
Estimation 837
TableD.3 Percentiles of the Chi-square
C.8.1 Inference with Maximum Likelihood
Distribution 864
Estimators 840
C.8.2 The Variance of the Maximum Likelihood TableD.4 95th Percentile for the F-distribution 865
Estimator 841 TableD.5 99th Percentile for the F-distribution 866
C.8.3 The Distribution of the Sample
TableD.6 Standard Normal pdf Values 𝛟(z) 867
Proportion 842
C.8.4 Asymptotic Test Procedures 843
C.9 Algebraic Supplements 848 INDEX 869
C.9.1 Derivation of Least Squares Estimator 848
C.9.2 Best Linear Unbiased Estimation 849
k k