Tutorials2016s1 Week7 Answers-3
Tutorials2016s1 Week7 Answers-3
Readings
Read Chapters 5.1-5.2 (excluding 5.2a) thoroughly. Chapters 5.2a and 5.3 will not be
covered and are not examinable.
Read Chapter 6 (excluding 6.1a). Chapter 6.1a will not be covered and is not examinable.
Make sure that you know the meanings of the Key Terms at the end of Chapters 5 & 6.
Now we use equation (5.5), where 1 > 0: plim(1 ) = 1 + 21 > 1, so 1 has a positive
inconsistency (asymptotic bias).
This makes sense: if we omit risktol from the regression and it is positively correlated with
funds, some of the estimated effect of funds is actually due to the effect of risktol.
Q2. Wooldridge Ch6 Q4 (Also use Stata to confirm the results in part (ii)). (See Ch6_4.do)
(ii) We use the values pareduc = 32 and pareduc = 24 to interpret the coefficient on
educpareduc. The difference in the estimated return to education is .00078(32 24) = .0062, or
about .62 percentage points. (Percentage points are changes in percentages.)
(iii) When we add pareduc by itself, the coefficient on the interaction term is negative. The t
statistic on educpareduc is about 1.33, which is not significant at the 10% level against a two-
sided alternative. Note that the coefficient on pareduc is significant at the 5% level against a
two-sided alternative. This provides a good example of how omitting a level effect (pareduc in
this case) can lead to biased estimation of the interaction effect.
Confirming the results in part (ii) provides an opportunity to illustrate some further Stata
features. In Ch6_4.do we illustrate:
the tabulate command (tab abbreviation works)
the generate command (gen abbreviation works)
the if qualifier used to limit analysis to observations satisfying some condition
Statas treatment of missing observations
------------------------------------------------------------------------------
lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .0467522 .0104767 4.46 0.000 .0261835 .067321
ed_ped | .000775 .0002107 3.68 0.000 .0003612 .0011887
exper | .018871 .0039429 4.79 0.000 .0111299 .026612
tenure | .0102166 .0029938 3.41 0.001 .0043391 .0160942
_cons | 5.646519 .1295593 43.58 0.000 5.392158 5.90088
------------------------------------------------------------------------------
.
. * While this if command may be useful it is actually not required here
. * Using complete cases is what Stata does by default
. reg lwage educ ed_ped exper tenure
------------------------------------------------------------------------------
lwage | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .0467522 .0104767 4.46 0.000 .0261835 .067321
ed_ped | .000775 .0002107 3.68 0.000 .0003612 .0011887
exper | .018871 .0039429 4.79 0.000 .0111299 .026612
tenure | .0102166 .0029938 3.41 0.001 .0043391 .0160942
_cons | 5.646519 .1295593 43.58 0.000 5.392158 5.90088
------------------------------------------------------------------------------
Q3. Wooldridge Ch6 QC8 (See Ch6_C8.do)
(i) The estimated equation (where price is in dollars) is
= 21,770.3 + 2.068 lotsize + 122.78 sqrft + 13,852.5 bdrms
(29,475.0) (0.642) (13.24) (9,010.1)
n = 88, R2 = .672, adjusted-R2 = .661, = 59,833.
------------------------------------------------------------------------------
price | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
lotsize | .0020677 .0006421 3.22 0.002 .0007908 .0033446
sqrft | .1227782 .0132374 9.28 0.000 .0964541 .1491022
bdrms | 13.85252 9.010145 1.54 0.128 -4.065141 31.77018
_cons | -21.77031 29.47504 -0.74 0.462 -80.38466 36.84405
------------------------------------------------------------------------------
.
. * price is in $'000. What happens if price was in $?
. gen pricenew=1000*price
------------------------------------------------------------------------------
pricenew | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
lotsize | 2.067707 .6421258 3.22 0.002 .790769 3.344644
sqrft | 122.7782 13.23741 9.28 0.000 96.45415 149.1022
bdrms | 13852.52 9010.145 1.54 0.128 -4065.14 31770.18
_cons | -21770.31 29475.04 -0.74 0.462 -80384.66 36844.04
------------------------------------------------------------------------------
There is simply a rescaling of the coefficients leaving t-statistics, p-values and R-squared values
the same. This is the same regression just with interpretation of effects now expressed in
dollars rather than thousands of dollars.
The predicted price at lotsize = 10,000, sqrft = 2,300, and bdrms = 4 can be simply obtained from
the regression of pricenew on (lotsize 10,000), (sqrft 2,300), and (bdrms 4) to yield about
$336,707.
. * For (i)-(iii) need transformed explanatory variables
. gen lotsize0=lotsize-10000
. gen sqrft0=sqrft-2300
. gen bdrms0=bdrms-4
------------------------------------------------------------------------------
pricenew | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
lotsize0 | 2.067707 .6421258 3.22 0.002 .790769 3.344644
sqrft0 | 122.7782 13.23741 9.28 0.000 96.45415 149.1022
bdrms0 | 13852.52 9010.145 1.54 0.128 -4065.14 31770.18
_cons | 336706.7 7374.466 45.66 0.000 322041.7 351371.6
------------------------------------------------------------------------------
(ii) We want the intercept estimate and the associated 95% CI from this regression. The CI is
approximately 336,706.7 14,665, or about $322,042 to $351,372 when rounded to the nearest
dollar.
(iii) We must use equation (6.36) to obtain the standard error of and then use equation (6.37)
(assuming that price is normally distributed). But from the regression in part (ii), se( 0 )
7,374.5 and 59,833. Therefore, se( 0 ) [(7,374.5)2 + (59,833) 2]1/2 60,285.8. Using 1.99
as the approximate 97.5th percentile in the t84 distribution gives the 95% CI for price0, at the
given values of the explanatory variables, as 336,706.7 1.99(60,285.8) or, rounded to the
nearest dollar, $216,738 to $456,675. This is a fairly wide prediction interval. But we have not
used many factors to explain housing price. If we had more, we could, presumably, reduce the
error standard deviation, and therefore , to obtain a tighter prediction interval.