Econometric Methods
Econometric Methods
Econometric Methods
Econometric Methods
Max Marks: 20 Max Time: 1.5 Hrs
1. Consider the following regressions of average hourly earnings on gender and education and other
characteristics. (12 Marks)
Table 1
Regressor (1) (2) (3)
College(X1 ) 5.46 5.48 5.44
(0.21) (0.21) (0.21)
Female(X2 ) −2.46 −2.62 −2.62
(0.2) (0.2) (0.2)
Age(X3 ) 0.29 0.29
(0.04) (0.04)
North(X4 ) 0.69
(0.3)
West(X5 ) 0.60
(0.28)
South(X6 ) −0.27
(0.26)
Intercept 12.69 4.40 3.75
(0.14) (1.05) (1.06)
where (dependent variable) - average hourly earnings (in 1998 dollars); College = binary vari-
able (1 if college, 0 if high school); Female = binary variable (1 if female, 0 if male); Age = age
(in years); North = binary variable (1 if Region = East, 0 otherwise) West = binary variable (1
if Region = West, 0 otherwise) South - binary variable (1 if Region = South. 0 otherwise) East -
binary variable (1 if Region = East, 0 otherwise)
a) Using the regression results in column (1):
(i) Is the college-high school earnings difference estimated from this regression statistically
significant at the 5% level? Construct a 95% confidence interval of the difference.
For the two-tailed test
H0 : β 1 = 0
H1 : β 1 6= 0
The t-statistic is 5.46/0.21 = 26.0 > 1.96 (tcrit ). So, the coefficient is statistically
significant at the 5% level. The 95% CI is 5.46±1.96×0.21.
(ii) Is the male-female earnings difference estimated from this regression statistically signif-
icant at the 5% level? Construct a 95% confidence interval for the difference.
For the two-tailed test
H0 : β 2 = 0
H1 : β 2 6= 0
The t-statistic is −2.64/0.20 = −13.2 and |t|= 13.2 > 1.96. So, the coefficient is
statistically significant at the 5% level. The 95% CI is -2.64±1.96×0.20.
b) Using the regression results in column (2):
(i) Is age an important determinant of earnings? Use an appropriate statistical test and/or
confidence interval to explain your answer.
0.29
Yes, age is an important determinant of earnings. Using a t-test, the t-statistic is 0.04 =
−13
7.25, with a p-value of 4.2×10 , implying that the coefficient on age is statistically
1
2
Y = b0 + b1 χ + u
and
X =χ+ν
with the assumptions χ = N (χ̄, σx2 ), u = N (0, σu2 ), ν = N (0, σν2 ), E(χu) = 0, E(χν) = 0,
E(uν) = 0. That is errors are normally and independently distributed of χ and of each other
3
with zero means and constant variances. Under these assumptions show that: Var(X)=σχ2 + σν2 ,
Var(Y)=b21 σχ2 + σu2 .
Proof. This is the case of error in the explanatory variable. Here we can treat χ as the true (but
unobservable) value of the explanatory variable. Instead what is observed is X which is related to χ in
the following manner:
X =χ+ν
where ν is the random variable representing the errors of measurement in X. Let us derive formally the
mean and variance of the distributions of X and Y .
(a) Var(X )=σχ2 + σν2
Because X = χ + ν, we have E(X) = E(χ + ν) = E(χ) + E(ν) = χ̄ + 0 = χ̄.
Now Var(X) = E(X − E(X))2 . However, we just saw that E(χ) = χ̄. Hence,
H0 : β1 = 0, β2 = 0, β3 = 0, β4 = 0
H1 : At least one is not zero
We can test this out by constructing an F -test using the R2 value for the given (unrestricted)
model (since R2 value for the restricted model - with the restrictions hypothesized in the H0
- is zero).
R2 /q 0.773/4 0.1933
F4,n−4−1 = = = = 70.68 > 2.5
(1 − R2 )/(n − k − 1) (1 − 0.773)/(88 − 4 − 1) 0.002735
Hence we reject the null hypothesis at 5% level of significance suggesting that the parameters
jointly are important. However, the proposed test here is not relevant from the question’s
standpoint.
b) To test if the assessed housing price is a rational valuation (That is, to see if the price what
was assessed earlier is a perfect predictor of the price at which the house was actually sold,
i.e., elasticity is 1). In addition, lotsize, sqrft, bdrms should not help to explain log(price),
once the assessed value has been controlled for. Then the hypothesis to be tested becomes
H0 : β1 = 1, β2 = 0, β3 = 0, β4 = 0
H1 : At least one is not zero
If we impose these restrictions in the original model, we get
\ = β0 + 1.log(assess)
log(price)
However, this is a model altogether with a different dependent variable than the unrestricted
model given to us. To test the generalized hypothesis, thus we would need information, on
the SSR of the restricted model (information on the R2 from the restricted model is of no use
because the dependent variable in both models are not the same).
Hence, given the limitated information provided in the question, the only test that is possible
for us to perform is to check if the elasticity is 1, that is, H0 : β1 = 1 against the alternative
H1 : β1 6= 1 (or with an one-sided alternative). Hence the t-statistic will be
β̂1 − β1 1.043 − 1
= = 0.284 < 1.96
SE(β̂1 ) 0.151
Hence, we fail to reject the null hypothesis, which suggests that there is limited evidence
against the alternative hypothesis of not having a rational valuation.