0% found this document useful (0 votes)
3 views

Assignment 1

Uploaded by

2182064arvita
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Assignment 1

Uploaded by

2182064arvita
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Assignment 1

EC 003: Introductory Econometrics


Dept. of Economics, Delhi School of Economics, University of Delhi
Semester I, 2024-2025

Course Instructors: Part 3


Devesh Birwal [DB] : (Contact email: [email protected])

1. What is the role of the stochastic error term ui in regression analysis? What is
the difference between the stochastic error term and the residual ûi ?

2. Are the following models linear regression models? Why or why not? a. Yi =
eβ1 +β2 Xi +ui
1
b. Yi = 1+eβ1 +β2 Xi +ui

c. . lnYi = β1 + β2 ( X1i ) + ui
d. Yi = β1 + (0.75 − β1 )e−β2 (Xi −2) + ui
e. Yi = β1 + β23 Xi + ui

3. Consider the following nonstochastic models (i.e., models without the stochastic
error term). Are they linear regression models? If not, is it possible, by suitable
algebraic manipulations, to convert them into linear models?
1
a. Yi = β1 +β2 Xi
Xi
b. Yi = β1 +β2 Xi
1
c. Yi = 1+e−β1 −β2 Xi

4. Given the assumptions in column 1 of the table, show that the assumptions in
column 2 are equivalent to them.
Assumptions of the Classical Model
(1) (2)
E(ui |Xi ) = 0 E(Yi |Xi ) = β2 + β2 X
cov(ui , uj ) = 0 i ̸= j cov(Yi , Yj ) = 0 i ̸= j
2
var(ui |Xi ) = σ var(Yi |Xi ) = σ 2

1
5. Consider the following formulations of the two-variable PRF:

Model I: Yi = β1 + β2 Xi + ui
Model II: Yi = α1 + α2 (Xi − X) + ui

a. Find the estimators of β1 and α1 . Are they identical? Are their variances
identical?
b. Find the estimators of β2 and α2 . Are they identical? Are their variances
identical?
c. What is the advantage, if any, of model II over model I?

6. Suppose you run the following regression:

yi = β̂1 + β̂2 xi + ui

where, as usual, yi and xi are deviations from their P


respective mean values. What
xy
will be the value of β̂1 ? Why? Will β̂2 be equal to P xi 2 i ? Why?
i

7. If r, the coefficient of correlation between n pairs of values (Xi , Yi ), is positive,


then determine whether each of the following statements is true or false:
a. r between (−Xi , −Yi ) is also positive.
b. r between (−Xi , Yi ) and that between (Xi , −Yi )can be either positive or nega-
tive.
c. Both the slope coefficients βyx and βxy are positive, where βyx = slope coefficient
in the regression of Y on X and βxy = slope coefficient in the regression of X on
Y.

8. If X1 , X2 , and X3 are uncorrelated variables each having the same standard de-
viation, show that the coefficient of correlation between X1 + X2 and X2 + X3 is
1
equal to 2
. Why is the correlation coefficient not zero?

2
9. Regression without any regressor. Suppose you are given the model: Yi = β1 + ui .
Use OLS to find the estimator of β1 . What is its variance and the RSS? Does
the estimated β1 make intuitive sense? Now consider the two-variable model Yi =
β1 + β2 Xi + ui . Is it worth adding Xi to the model? If not, why bother with
regression analysis?

10. The relationship between nominal exchange rate and relative prices. From annual
observations from 1985 to 2005, the following regression results were obtained,
where Y = exchange rate of the Canadian dollar to the U.S. dollar ( CD
$
) and X =
ratio of the U.S. consumer price index to the Canadian consumer price index; that
is, X represents the relative prices in the two countries:

Yt = −0.912 + 2.250Xt r2 = 0.440

se = 0.096
a. Interpret this regression. How would you interpret r2 ?
b. Does the positive value of Xt make economic sense? What is the underlying
economic theory?
c. Suppose we were to redefine X as the ratio of the Canadian CPI to the U.S.
CPI. Would that change the sign of X? Why?

11. Following table gives data on nominal and real gross domestic product (GDP) for
the United States for the years 1959–2005.
a. Plot the GDP data in current and constant (i.e., 2000) dollars against time.
b. Letting Y denote GDP and X time (measured chronologically starting with 1
for 1959, 2 for 1960, through 47 for 2005), see if the following model fits the GDP
data:

Yt = β1 + β2 Xt + ut

Estimate this model for both current and constant-dollar GDP.

3
c. How would you interpret β2 ?
d. If there is a difference between β2 estimated for current-dollar GDP and that
estimated for constant-dollar GDP, what explains the difference?
e. From your results what can you say about the nature of inflation in the United
States over the sample period?

Year NGDP RGDP Year NGDP RGDP


1959 506.6 2,441.3 1983 3,536.7 5,423.8
1960 526.4 2,501.8 1984 3,933.2 5,813.6
1961 544.7 2,560.0 1985 4,220.3 6,053.7
1962 585.6 2,715.2 1986 4,462.8 6,263.6
1963 617.7 2,834.0 1987 4,739.5 6,475.1
1964 663.6 2,998.6 1988 5,103.8 6,742.7
1965 719.1 3,191.1 1989 5,484.4 6,981.4
1966 787.8 3,399.1 1990 5,803.1 7,112.5
1967 832.6 3,484.6 1991 5,995.9 7,100.5
1968 910.0 3,652.7 1992 6,337.7 7,336.6
1969 984.6 3,765.4 1993 6,657.4 7,532.7
1970 1,038.5 3,771.9 1994 7,072.2 7,835.5
1971 1,127.1 3,898.6 1995 7,397.7 8,031.7
1972 1,238.3 4,105.0 1996 7,816.9 8,328.9
1973 1,382.7 4,341.5 1997 8,304.3 8,703.5
1974 1,500.0 4,319.6 1998 8,747.0 9,066.9
1975 1,638.3 4,311.2 1999 9,268.4 9,470.3
1976 1,825.3 4,540.9 2000 9,817.0 9,817.0
1977 2,030.9 4,750.5 2001 10,128.0 9,890.7
1978 2,294.7 5,015.0 2002 10,469.6 10,048.8
1979 2,563.3 5,173.4 2003 10,960.8 10,301.0
1980 2,789.5 5,161.7 2004 11,712.5 10,703.5
1981 3,128.4 5,291.7 2005 12,455.8 11,048.6
1982 3,255.0 5,189.3

4
(billions of dollars, except as noted; quarterly data at seasonally adjusted annual
rates; RGDP in billions of chained [2000] dollars)
Source: Economic Report of the President, 2007. Table B-1 and B-2.

12. Let kids denote the number of children ever born to a woman, and let educ denote
years of education for the woman. A simple model relating fertility to years of
education is

kids = β0 + β1 educ + u,

where u is the unobserved error.


a. What kinds of factors are contained in u? Are these likely to be correlated with
level of education?
b. Will a simple regression analysis uncover the ceteris paribus effect of education
on fertility? Explain

13. In the simple linear regression model y = β0 + β1 x + u, suppose that E(u) ̸= 0.


Letting α0 = E(u), show that the model can always be rewritten with the same
slope, but a new intercept and error, where the new error has a zero expected
value.

14. The data set BWGHT.RAW contains data on births to women in the United
States. Two variables of interest are the dependent variable, infant birth weight
in ounces (bwght), and an explanatory variable, average number of cigarettes the
mother smoked per day during pregnancy (cigs). The following simple regression
was estimated using data on n = 1,388 births:

ˆ = 119.77 − 0.514cigs
bwght

a. What is the predicted birth weight when cigs = 0? What about when cigs = 20
(one pack per day)? Comment on the difference.
b. Does this simple regression necessarily capture a causal relationship between
the child’s birth weight and the mother’s smoking habits? Explain.

5
c. To predict a birth weight of 125 ounces, what would cigs have to be? Comment.
d, The proportion of women in the sample who do not smoke while pregnant is
about 0.85. Does this help reconcile your finding from part c.

15. Using data from 1988 for houses sold in Andover, Massachusetts, from Kiel and
McClain (1995), the following equation relates housing price (price) to the distance
from a recently built garbage incinerator (dist):

ˆ
log(price) = 9.40 + 0.312log(dist)
n = 135 R2 = 0.162.

a. Interpret the coefficient on log(dist). Is the sign of this estimate what you
expect it to be?
b. Do you think simple regression provides an unbiased estimator of the ceteris
paribus elasticity of price with respect to dist? (Think about the city’s decision
on where to put the incinerator.)
c. What other factors about a house affect its price? Might these be correlated
with distance from the incinerator?

16. Consider the savings function



sav = β0 + β1 inc + u u= ince

where e is a random variable with E(e) = 0and V ar(e) = σe2 . Assume that e is
independent of inc.
a. Show that E(u|inc) = 0, so that the key zero conditional mean assumption
(Assumption SLR.4) is satisfied. [Hint: If e is independent of inc, then E(e|inc) =
E(e).]
b. Show that V ar(u|inc) = σe2 inc, so that the homoskedasticity Assumption
SLR.5 is violated. In particular, the variance of sav increases with inc. [Hint:
V ar(e|inc) = V ar(e), if e and inc are independent.]

6
c. Provide a discussion that supports the assumption that the variance of savings
increases with family income.

17. Consider the standard simple regression model y = β0 + β1 x + u under the Gauss-
Markov Assumptions SLR.1 through SLR.5. The usual OLS estimators βˆ0 andβˆ1
are unbiased for their respective population parameters. Let β˜0 be the estimator
of β1 obtained by assuming the intercept is zero (see Section 2.6).
a. Find E(β˜1 ) in terms of the xi ,β0 , and β1 . Verify that β˜1 is unbiased for β1 when
the population intercept (β0 ) is zero. Are there other cases whereβ˜1 is unbiased?
b. Find the variance of β˜1 . (Hint: The variance does not depend on β0 .)
c. Show that V ar(β˜1 ) ≤ V ar(βˆ1 ). [Hint: For any sample of data, i=1,n x2i ≥
P

− x̄)2 , with strict inequality unless x̄ = 0.]


P
i=1,n (xi

d. Comment on the trade off between bias and variance when choosing between
βˆ1 and β˜1 .

You might also like