0% found this document useful (0 votes)

24 views104 pages

Economterics Final 2024 10

Uploaded by

J O

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views104 pages

Economterics Final 2024 10

Uploaded by

J O

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 104

www.rsgclasses.

com
Rahul Sir( SRCC Graduate, DSE Alumni)

RSGCLASSES

ECONOMICS (H) SEM-4

INTRODUCTORY ECONOMETRICS

BY RAHUL SIR
(SRCC GRADUATE , DSE ALUMNI)
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

INDEX
CHAPTERS No. CHAPTER NAME Page
1. Simple linear regression 3-18

2. Multiple Linear regression 19-33

3. Functional form of regression 34-48

4. Dummy Variable 49-62

5. Multicollinearity 63- 73

6. Heteroscedasticity 74- 85

7. Autocorrelation 86-98

8. Model Selection Criteria 99-104

nOTE- IF YOU HAVE FIND ANY MISTAKE IN

QUESTIONS OR ANSWERS PLS CONTACT RAHUL
SIR AT 9810148860.
THANKYOU………..
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Chapter- 1
SIMPLE LINEAR REGRESSION ANALYSIS

OBJECTIVE TYPE QUESTION

Choose the Best alternative for each question

1. Regression analysis is concerned with estimating
a. The mean value of the dependent value
b. The mean value of the explanatory variable
c. The mean value of the correlation coefficient
d. The mean value of the fixed variable
2. The locus of the conditional means of Y for the fixed values of X is the
a. Conditional expectation function
b. Intercept line
c. Population regression line
d. Linear regression line

3. E(Y|Xi) = f(Xi) is referred to as

a. Conditional expectation function
b. Intercept line
c. Population regression line
d. Linear regression line
4. Liner regression model is
a. Linear in explanatory variables but may not be linear in parameters
b. Nonlinear in parameters and must be linear on variables
c. Linear in parameters and must be linear in variables
d. Linear in parameters and may or not be linear in variables

5. In Yi = β1+β2Xi +ui, ui can take values that are

a. Only positive
b. Only negative
c. Only zero
d. Positive, negative or zero
6. In Yi = E (Y|Xi) + ui the deterministic component is given by
a. yi
b. E (Y|Xi)
c. ui
d. E(Y|Xi) + ui

7. The sample Regression line is at best an approximation of the population

regression. The statement
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

a. Is always true
b. Is always false
c. May sometimes be true sometimes false
d. Nonsense statement
8. Yi =  1 +  2Xi + ui , represents
a. Sample regression function
b. Population regression function
c. Nonlinear regression function
d. Estimate of regression function
9. Yi = β̂1 +β̂2Xi+ ûi ,represents
a. Sample regression function
b. Population regression function.
c. Nonlinear regression function
d. Estimate of regression function
10. In Yi = β̂1 + β̂2Xi + ûi’ ̂βi and β̂2 represent.
a. Fixed component
b. Residual component
c. Estimates
d. Estimators
11. In Yi = β̂1 + β̂2Xi = ûi , ûi represent.
a. Fixed component
b. Residual component estimated
c. Estimates
d. Estimators

Yi + β̂2Xi
12. In sample regression function, the observed Yi can be expressed as Yi = ̂
+ ûi. The statement is
a. True
b. False
c. Depend on 𝛽̂ 2
d. Depends on 𝑌̂i

13. In Yi = β̂1 + β̂2Xi + ûi , ’ûi gives the difference between

a. The actual and estimated Y values
b. The actual and estimated X values
c. The actual and estimated beta values
d. The actual and estimated u values
14. Under the least square procedure, larger the 𝑢̂i (in absolute terms), the larger the
a. Standard error
b. Regression error
c. Squared sum of residuals
d. Difference between true parameter and estimated parameter

15. The method of least squares provide with unique estimates of β̂1 and β̂2 that give
the smallest possible value of
a. 𝑢̂i
b.  𝑢̂i
c.   𝑢̂i
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

̂𝒊 𝟐
d.  𝒖

16. The least square estimators are

a. Period estimators
b. Point estimators
c. Population estimators
d. Popular estimators
17. The mean value of the estimated (𝑌̂) is
a. Equal to the mean value of actual Y .
b. Not equal to mean value of actual Y (𝑌̂).
c. Equal to the mean value of actual X(𝑋).
d. Not Equal to mean value of actual X(𝑋).

18. The mean value of ui conditional upon the given Xi is

a. Positive values
b. Negative values
c. Equal to zero
d. Any of the above

19. In classical linear regression model, Xi and ui are

a. Positively correlated
b. Negatively correlated
c. Highly correlated
d. Not correlated

20. Homoscedastic refers to the error terms having

a. Zero mean
b. Positive variance
c. Constance variance
d. Positive mean
21. One of the assumptions of CRLM is that the values of the explanatory variable X
must
a. All be positive
b. Not all be the same
c. All be negative
d. Average to zero
22. In statistics standard error measures the
a. Precision of an estimate
b. Correlation between Y and X
c. Specification error of the model
d. Autocorrelation in the regression model
23. In a two variable linear regression model the slope coefficient measures
a. The mean value of Y
b. The change in Y which the model predicts for a unit change in X
c. The change in which the model predicts for a unit change in Y
d. The value of Y for any given value of X
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

24. The fitted regression of equation is given by 𝑌̂𝑖 = 12 + 0.5 X. What is the value of
the residual at the point X=50, Y=70 ?
a. 57
b. -57
c. 0
d. 33

25. What is the number of degrees of freedom for a simple bivariate linear regression
with 100 observations?
a. 100
b. 97
c. 98
d. 2

26. Given the assumption of the CRLM, the least squares estimates possess some
optimum properties given by Gauss-Markov theorem. Which of these statements
is NOT part of the theorem
a. The estimators of 𝛽̂ 2 is a linear function of a random variable
b. The average value of the estimator 𝛽̂ 2 is equal to zero
c. The estimator 𝛽̂ 2has minimum variance
d. The estimator 𝛽̂ 2 is unbiased estimator

27. Coefficient of correlation

a. Lies between -1 and +1
b. Is always equal to zero
c. Is a measure of nonlinear dependence of two variables
d. Implies causation in a relationship

28. For coefficient of determination r2 for a regression model

a. r2= Y
b. 0 <r2 <1
c. r2 <1
d. r2 = 0
29. When the estimated slope coefficient in the simple regression model is zero, then
a. r2 = Y
b. 0 <r2 <1
c. r2 = 1
d. r2 = 0

30. Zero correlation does not necessarily imply independence between the two
variables. The statement is
a. False
b. True
c. Depends on the mean value of X and Y
d. None

31. The r2 measures the percentage of the total variation in

a. Y explained by betas
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

b. Y explained by 𝑢̂i X explained by Y

c. Y explained by the regression model
d. None

̂𝑖 = Yi for each I in a regression model then the value of r2 would be

32. When 𝑌
a. r2 = Y
b. 0 <r2 <1
c. r2 = 1
d. r2 = 0

TRUE FALSE
State whether the following statements are true false , or uncertain, Give your reasons.
Be precise.
i) The stochastic error term 𝑢𝑖 and the residual term 𝑒𝑖 mean the same thing.

ii) The PRF gives the value of the dependent variable corresponding to each value of the
independent variable.

iii) A linear regression model linear in the variables.

iv) In the linear regression model the explanatory variable is the cause and the
dependent variable is the effect.

v) The conditional and unconditional mean of a random variable are the same thing.

vi) In practice, the two- variable regression model is useless because the behavior of a
dependent variable can never be explained by a single explanatory variable.

vii) The sum of the deviation of a random variable from its mean value is always equal to
zero.

viii) OLS is an estimating procedure that minimizes the sum of the errors squared,∑𝑒𝑖 2

ix) The coefficient of correlation, r, has the same sign as the slope coefficient b2.

x) r² is the ration of TSS/ESS.

xi) In simple regression model 𝑌𝑖 = 𝛽1 + 𝛽2 𝑋𝑖 + 𝑢𝑖 , the OLS estimator 𝛽̂1 and 𝛽̂2 each
follow normal distribution only if 𝑢𝑖 follows normal distribution.[Eco(h)2019]

xii) If the estimate of slope coefficient in a bivariate regression is zero, the measure of
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

coefficient of determination is also zero. [Eco(h)2019]

xiii) If you choose a higher level of significance, a regression coefficient is more likely
to be significant. [Eco(h)2013]
xiv) In the regression modal Yt = B1 + B2Xi + ui, suppose we obtain a 95% confidence
interval for B2 as (0.1934, 1.8499), we can say the probability is,95% that this interval
includes the B2. [Eco(h)2014]

xv) In a two-variable PRF, if the slope coefficient 𝛽2 is zero; the intercept 𝛽1 is estimated
by the sample mean. [Eco(h)2015]

xvi) All Actual 𝑌𝑖 cannot lie above the sample linear regression line. [Eco(h)2017]

xvii) Consider a simple regression model estimated using OLS. It is known that the
Explained Sum of Squares is 75% higher than the Residual Sum of Squares. This
implies that more than 75% of the total variation in the dependent variable is
explained by the variation in the explanatory variable. [Eco(h)2023]

xviii) In a simple regression model estimated using OLS, the residuals (ei) are such that
𝑒̅ = 0 and 𝑒̅ 2 = 0. [Eco(h)2023]

xix) The OLS estimate of slope coefficient of regressing Y on X is same as that of

regressing X on Y. [Eco(h)2023]

xx) In a linear regression In Y = 𝛽1 + 𝛽2 X, + 𝑢𝑖 ; the measure of goodness of fit R2 was

estimated as 0.70. The p - value of the slope coefficient is 0.578. The coefficient is
statistically significant since X explains 70% of variation in Y. [Eco(h)2023]

xxi) If X and Y are related to each other by the equation: Y = 2 + 0.5 X, the correlation
coefficient between them is 0.5 [Eco(h)2023]

xxii) In linear regression models, 𝑟 2 value is invariant to changes in the unit of

measurement, as it is dimensionless. [Eco(h)2022]

xxiii) The correlation coefficient between 𝑈 = 3 X + 2 and 𝑉 = −4𝑌 + 5 is the same as

between X and Y. [Eco(h)2022]

Proofs
1. Prove that ̅
Y = b1 + b2̅
X
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

2. Prove that Σ 𝑒𝑖 =0

3. Prove that Σ𝑒𝑖 𝑥𝑖 =0 where ei is the residual term and 𝑥𝑖 is the deviation
of Xi from mean.

4. ̂𝑖 𝑢𝑖 =0 where ui is the residual term and 𝑌

Prove that Σ𝑌 ̂𝑖 is the
estimated value of X.

5. Prove that residual term is uncorrelated with independent variable.

6. Prove that residual term is uncorrelated with the predicted value.

7. Prove that the mean of predicted value of Yi is always equal to actual mean, i.e.,
𝑌̂ =𝑌

8. Prove that Σ𝑥𝑖 𝑦𝑖 = Σ𝑋𝑖 𝑦𝑖 = Σ𝑥𝑖 𝑌𝑖

9. Prove that the least square estimator b2 is linear, unbiased and consistent.

10. In CLRM, shows that OLS estimator for the slope coefficient is linear and unbiased.

11. Show that the OLS, estimators have the property of being linear and unbiased.

12. Prove that the least square estimators have the minimum variance amongst the
class of estimators.

13. Prove that the OLS estimators are best linear Unbiased Estimators (BLUE).

14. Derive the numerical properties of the OLS estimators and the regression line.

15. Show that. Cov( 𝛽̂1 , 𝛽̂2 ) =-𝑋 Var(𝛽̂2 )

16. Show that :

17. Prove that :

18. If we have two regression model Y on X and X on Y then show that product of two
regression slope coefficients of X on Y and Y on X is coefficient of determination.

19. If the estimation of slope coefficient in a bivariate regression is zero. The measure
of coefficient of determination is also zero.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

20. Consider the following regression 𝑌𝑖 = 𝛽1 𝑋𝑖 + 𝑢𝑖 where 𝛽̂1 is the OLS estimator of
𝛽1 .
i) Find the value of 𝛽̂1
ii) Find V(𝛽̂1 )
iii) Verify that 𝛽̂1 is unbiased.

21. For the model 𝑌𝑖 =𝛽1 + 𝑢𝑖 , given that all the CLRM Assumption are satisfied , use
OLS to find the estimator of 𝛽1 . Show that this estimator can be decomposed into
the true value plus a linear combination of the disturbance term in the sample. Also
demonstrate that this estimator is an unbiased estimator of 𝛽1 .[ Eco(h) 2015]

LINEAR IN PARAMETER ,LINEAR IN VARIABLE

1. State whether the following models are linear regression models:

1
(a) Yi = β1 + β2 {𝑋} + ui (b)Yi = β1 + β2 In(Xi) +ui
(c) In Yi = β1 + β2 Xi +ui (d) Yi = e β1 + β2 In(Xi) +ui
(e) Yi = β1 –𝛽2 3 Xi +ui

Ans: (a) LIP, (b) LIP, (c) LIP, (d) LIP (e) LIV
2. Determine whether the following models are linear in the parameters, or the
variables, or both. Which of these models are linear regression models?
1
(i) InYi = β1 + β2 In(Xi) + ui, (ii) Yi = β1 + 𝛽2 Xi + ui
1
(iii) Yi = β1 + 𝛽2 2 Xi + ui, (iv) InYi = β1 - β2 {𝑋 } + ui
2
(v) Y = 𝑒 𝛽1 +𝛽2 𝑋𝑖 +𝑢𝑖 (vi) Yi = β1 – β32 Xi +ui
Ans. (i) LIP (ii) LIV (iii) LIV (iv) LIP (v) Neither (vi) LIV

3. Determine whether the following models are linear in parameters or variables or

both. Which of these models are linear regression models:
1
(a) Yi = β1 + β2 {𝑋} + ui (b) Yi = β1 + β2 In(Xi) +ui
(c) InYi = β1 + β2 Xi +ui (d) In Yi = In β1 + β2 InXi +ui

Calculate the elasticity for all the cases:

Ans: (a) LIP (b) LIP (c) LIP (d) none

ESTIMATION OF REGRESSION LINE , HYPOTHESIS TESTING & CONFIDENCE

INTERVAL OF REGRESSION COFFICIENT , ANOVA TABLE

1. You are given the following data on X and Y.

X 1 2 3 4 5

Y 3 5 7 14 11
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(i)Obtain the estimated regression equation using ordinary least squares when Y
is regressed on X with in an intercept term.
(ii) Prepare the ANOVA table for this data.

2. From the following hypothetical data on weekly family consumption expenditure

(Y) and weekly Family income (X) fit a two variable linear regression model
Y = β1 + β2 Xi +ui

Yi 70 65 90 95 110 115 120 140 155 160

Xi 80 100 120 140 160 180 200 220 240 260

Also find standard errors of β1, β2 are coefficients of determination

3. Fit the linear regression Y = β1 + β2 Xi for the following data:

X -4 -3 -2 -1 0 1 2 3 4

Y 10 20 30 40 50 60 70 80 90

Also find the variance and standard variance errors of intercept and slope
coefficients.

4. Given below is the data for 10 years from the economic survey of india:
Year Private Final Consumption Expenditure GDP
(PFCE) (in Rs. ‘0000 cr.)
(in Rs. ‘0000cr.)

1985-86 43 54

1986-87 43 55

1987-88 45 56

1988-89 48 62

1989-90 51 67

1990-91 53 69

1991-92 54 70

1992-93 55 74

1993-94 57 78

1994-95 61 86
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

We take PFCE as dependent variable Y and GDP independent variable X.

Find:
(i) Marginal propensity to consume,
(ii) ESS
(iii) RSS
(iv) Coefficient of determination,
(v) Test the null hypothesis Ho: MPC ≥0.5 at 5% level of significance.
(vi) Construct the ANOVA table for the above Data and find the F–statistic.

5. You have the following data based on 50 observations:

(i) Estimate the linear regression of Y on X,

(ii) Interpret the slope coefficient,
(iii) If construct the ANOVA table and calculate R2.

6. For a simple linear regression model , Y i =B1 + B2Xi + ui the following data are
given for 22 observations:

(i) Compute the least squares estimates of the slope and intercept parameters.
(ii) Prepare an ANOVA table for the above results
(iii) Test the hypothesis that B2 = 1 at 5% level of significance. How would your
testing procedure change if you were given the true value of the error
variance?

7. For a sample of 10 observations of the following results are obtained:

𝝨X=1700 , 𝝨Y=1110 , 𝝨XY=205500 , 𝝨X²=322000 , 𝝨Y²=132100

(i) Find the regression coefficients and regression line.

(ii) Test whether regression coefficients are statistically significant at 5% level of
significance.
(iii) Calculate and interpret coefficient determination.

8. Given the following summary results for 6 pairs of observations on the dependent
variable Y and the independent variable X, calculate the 95% confidence interval for
the true regression coefficient β1.

Σ Xi = 90; Σ Yi =10.5; Σ X2i = 1694; ΣY2i = 20.29; Σ XiYi = 181.1

9. Using the following data:

n = 10, ΣYi = 5070, Σ Xi = 5,60,000 , ΣYiXi = 30,55,50,000, ΣX2i = 47,60,00,00,000,
ΣY2i = 26,07,100

(i) Fit the linear regression Yi = β1 + β 2Xi ,

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(ii) Find S.E.( 𝛽̂1 ) and S.E.( 𝛽̂2 ),

(iii) Find 95% confidence intervals of slope and intercept coefficients,
(iv) Test the significance of slope coefficients at 5%.

10. You have the following information:

∑ 𝑋 = 1680, ∑ 𝑌 = 1110, ∑ 𝑋𝑌 = 204200, ∑ 𝑋 2 = 315400, ∑ 𝑌 2 = 133300, 𝑛 = 10.

Assume all assumptions of CLRM are fulfilled. Obtain.

i. 𝛽̂1 and 𝛽̂2

ii. Establish 95% interval for the population slope coefficient 𝛽1
iii. 𝑅2 [Eco(h) 2019]

11. For the regression model answer the questions that follow:

(i) Interpret the regression model.

(ii) Find 95% confidence intervals of slope and intercept coefficients,
(iii) Test the significance of slope coefficient at 5%.

12. Consider the following regression:

̂i = -66.1058 + 0.0650 Xi
Y
se (10.7509) ( )
t ( ) (18.73)
n = 20, r = 0.946
2

Fill in the missing numbers. Would you reject the hypothesis that true B 2 is zero at α
= 0.05? Tell whether you are using a one tailed or two tailed test and why?

13. Given the following regression between retails sales of passenger cars (Si) and real
disposable income (Xi)
Ŝi = 5807 + 3.24Xi
SE = (1.634)
R = 0.22, n = 30
2

(i) Interpret the regression coefficients of Xi.

(ii) Establish a 95% confidence interval for coefficients of Xi.
(iii) Compute t-value under zero null hypothesis and test at 5% level of significance.
Which t-test would you use one tail or two tail and why?

14. A regression was run between per capita savings (S0 and per capita income were
obtained):
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Ŝi = 450.03+ 0.67Yi

SE (151.105) (0.011) n=20
(i) What is the economic interpretation of regression coefficients?
(ii) What do you think about the sign of constant term? What can be the possible reason
behind
it?
(iii) Say something about goodness of fit. Also carry out, ‘t’ test for slope coefficient at
1%.
(iv) Reform the above model by stating this is per 100 rupees. What do you think would
be impact on slope intercept?
(iv) Prepare 99% confidence intervals.

15. A regression was run between personal consumption expenditure (was run between
personal consumption expenditure (Y) and gross domestic product (X) all measured
in billions of dollars for the years 1982 to 1996 and the following results were
obtained:
̂i = -184.0780 + 0.7064Xi
Y
Se = (46.2619) (0.007827)
R2 = 0.22
(i) What is the economic interpretation of regression coefficient?
(ii) What is MPC?
(iii) Interpret r2.
(iv) Prepare 95% confidence intervals of regression coefficient.
(v) Test the significance of β1 and β2 writing the hypothesis.

16. The rational expectation hypothesis claim that expectation are un biased, i.e., the
average predicted value is equal to the actual values of the variable under
investigation. A researcher wished to see the validity of this claim with reference to
the interest rates on 3 months US treasury bills for 30 quarterly observations. The
results of the regression of actual interest (ri) on the predicted interest rates (r*i)
were as follows:
r̂i = 0.0240 + 0.9400 r*i
se (0.86) (0.14)
Carry out the tests to see the validity of the rational expectation hypothesis (choose
α=5%). Assume all basic assumption of the classical linear regression model are
satisfied.

MEAN FORECASTING
1. Using cross- sectional data on total sales and profits for 27 German companies in
1995, the following model is estimated:
Profit= B1+ B2 Salesi + ui
Where
Profits: Total profits in millions of dollars
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Sales: Total sales in billions of dollars

The regression results are given below:
Estimates of Coefficients Standard errors

Constant 83.5753 118.131

Sales 18.4338 4.4463

r2=0.4074
(a) Construct a 95% confidence interval for the slope coefficient. What can you say about
its statistical significance?
(b) Prove that in a simple regression model with an intercept, the F statistic for goodness
of fit of the model is equal to the square of the t statistic for a two sided t test on the
slope coefficient. Verify this statement for the regression results given in these
questions.
(c) Find the forecasted mean profits if annual sales are 25 billion dollars. Explain the
concept of a confidence band for true mean profits.

2. The following regression equation was estimated for 10 observations on X and Y.

̂i = 24-0.5Xt
Y mean = 170. Σx2 = 33.000. σ2 = 42
Establish a 95% confidence interval for E (Y/X=100)

3. Based on the data collected on a particular Monday for 13 B.A (H) Economics,
second year students we want to estimate the following population regression
Equation: 𝑌𝑖 = 𝛽1 + 𝛽2 𝑋𝑖 + 𝑢𝑖
Where:
𝑌𝑖 : Travelling time (in hrs) for the ith student from her home to college.
𝑋𝑖 : distance from home to college for ith student in km.

The sample gave the following values:

∑ 𝑋𝑖 = 195, ∑ 𝑌𝑖 = 26, ∑ 𝑋𝑖2 = 3050, ∑ 𝑌𝑖2 = 53 , ∑ 𝑋𝑖 𝑌𝑖 = 400

Using the above data and assuming that all the CLRM assumptions are satisfied, using a
95% confidence interval for the predicted mean travelling time when the distance
between college and a student’s house is 11Km

Relation between F and 𝒕𝟐

1. Given the following regression results (t statistics are reported in parentheses)
̂
Yi = 16,899 – 2978.5Xi
t (8.51) (-4.72) R2 = 0.6149
Use the relationship between R 2, F and t to find out the underlying sample size.

2. Given the following regression results (t statistics are reported in parentheses)

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

̂i = 4.3863 + 1.08132Xi
Y
t (4.42) (13.99) R2 = 0.938
Use the relationship between R 2, F and t to find out the underlying sample size.

JARQUE BERA TEST

1. Explain the steps involved in the Jarque-Bera test for testing the validity of the
normality assumption in an empirical exercise. Perform the test for a JB test statistic
value equal to 0.8153 at 5% level of significance.

2. A researcher computers Jarque-Bera statistic, for a large sample as 7.378. Does it

provide evidence in favour of normality of the error term. Use 5% level of
significance.

3. Test the normality of residuals using the following data:

Skewness 1.50555
Kurtosis 6.432967
No. of observation 379

4. Information was collected on daily changes in rupee (distribution A) and daily

return on nifty (distribution B) for six month (150 day) and following are the
summarized results:
Distribution A Distribution B
Mean 39.29 10.53
Standard Deviation 8.17 8.024
Skewness 0.38 1.78
Kurtosis 2.61 6.24
Determine which of the above distribution is normally distribution clearly specifying the
test

EXAM STYLE QUESTION

1. Suppose that you are considering opening a restaurant at a location where average
traffic volume is 1000 care per day. To help you decide whether to open the restaurant
or not, you collect data on daily sales (in thousands of rupees) and average traffic
volume (in hundreds of cars per day) for a random sample of 22 restaurants. You set
up your model as:

Salesi = B1 + B2 AVtraffici + ui

You know that ∑ 𝑋𝑖 𝑌𝑖 = 17170, ∑ 𝑋𝑖2 = 13055, 𝑌̅ = 32, 𝑋̅ = 22.5

i. Obtain the ordinary least square estimator of the slope, coefficient and interpret
it
ii. Estimate the average sales for your potential restaurant location.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

iii. Will the values of the coefficient of determination change if you want to change
the unit of sales from thousands of rupees, leaving units of traffic volume
unchanged?
Explain your answer. [Eco(h) 2014]

2. Using data on sales of cameras (SALES) and its price (PRICE in thousands of rupees)
for 17 brands, the effect of price on sales is given by:

SALESt = 𝛼 + 𝛽 PRICEi + ui

This is tested using OLS method The results obtained are as follows (t-ratios are
mentioned within parentheses). Assume all assumptions for classical linear regression
model hold good.

̂ i= 112.85 – 2.375 PRICEi

SALES

a) Interpret the slope coefficient

b) Construct 95% confidence interval for the slope coefficient.
c) Interpret R2. [Eco(h) 2016]

3. Let the population regression function be:

𝑦𝑖 = 𝐵1 + 𝐵2𝑥𝑖 + 𝜇𝑖

Where 𝑦𝑖 and 𝑥𝑖 are deviations from their respective mean values.

i) What will be the estimated value of 𝐵1? Why?

ii) Derive the estimate of B2 and show that it is identical to the one obtained
from a regression of Y on X. Explain why it is so.
iii) How would you test the hypothesis that the error term in a two variable simple
regression model is normally distributed?
iv) Derive an expression for the 95% confidence intervals for the mean prediction for the
two variable simple linear regression model. [Eco(h) 2020]

4. Following regression output is based on a sample of 30 farms where Y = output of

rice per acre in tonnes and X = quantity of manure applied per acre in kgs.
𝑌̂𝑖 = 384.105 + 3.67𝑋𝑖
𝑠𝑒 = (151.54) (1.00)
𝑅𝑆𝑆 = 6776
Construct a 95% confidence interval for mean output when 8kg of manure is applied
given that the sample average of manure applied per acre is 5kgs. [Eco(h) 2022]
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

5. How do you test for normality of error terms in the PRF using Jarque Bera test ?
What happen to least square estimates if the errors are not normally distributed?
What are its consequences for the Gauss Markov theorem?[Eco(h) 2021]

6. Consider the following formulations of the two variable PRF:

Model I- 𝑌𝑖 =𝛽1 + 𝛽2 𝑋𝑖 +𝑢𝑖

Model II- 𝑌𝑖 =𝛼1 + 𝛼2 (𝑋𝑖 -𝑋 )+𝑢𝑖

a. Find the estimators of 𝛽1 & 𝛼1 . Are they identical ? Are their variances identical ?
b. Find the estimators of 𝛽2 & 𝛼2 . Are they identical ? Are their variances identical ?
c. What is the advantage , if any , of the model II over model I ?

7. Suppose that the regression model 𝑌𝑖 = 𝐵1 + 𝐵2 𝑋𝑖 + 𝑢𝑖 , is estimated using the least

squares method as 𝑌̂𝑖 = 𝑏1 + 𝑏2 𝑋𝑖 . If 𝑌 is related: to 𝑍 through the equation, 𝑍𝑖 = 𝐴1 +
𝐴2 𝑋𝑖 + 𝑢𝑖 and another is estimated using the method of least squares as 𝑍̂𝑖 = 𝑎1 +
𝑎2 𝑋𝑖 .
(i) Are the slope coefficients of the two estimated regression equations the same, i.e.,
is 𝑎2 = 𝑏2 ?
(ii) How will the t statistics of 𝑎2 be related to the t statistics of 𝑏2 ?[Eco(h) 2018]

8. Based on a sample of size 20, the following regression line was estimated using the
least-squares method,

𝑌̂𝑖 = 5 + 3𝑋𝑖

In addition 𝑋̅ = 2 ∑(𝑋𝑖 − 𝑋̅ )2 and the standard error of regression was estimated to be

equal to 1.

Construct a 95% confidence interval estimate of the true population mean of 𝑌 for 𝑋0 =
15. Do you expect the confidence interval to be wider if a similar interval is estimated for
𝑋0 = 2? Explain your answer. [Eco(h) 2018]
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

CHAPTER-2
Multiple Linear Regression

Choose the Best alternative for each question

1. The simplest possible multiple regression model is a
a. One variable model
b. Two variable model
c. Three variable model
d. Multi-variable model
2. Multiple linear regression models
a. Are linear in parameter and linear in variables
b. Are linear in parameter and may not be linear in variables
c. May not be linear in parameter but are linear in variables
d. May not be linear in parameter and variables
3. In Yi = 𝛽 1X1i + 𝛽 2X2i +𝛽 3X3i+ui Where Xii= 1 for all i. This is an example of
a. Three variable model
b. X variable model
c. Four variable model
d. Three beta model
4. In Yi = 𝛽 1X1i + 𝛽 2X2i +𝛽 3X3i+ui, the partial regression coefficients are given by
a. 𝛽 2 and 𝛽 2
b. 𝛽 2 and 𝛽 3
c. 𝛽 1 and 𝛽 3
d. 𝛽 1 and 𝑢i
5. In classical linear regression model, Var(ui) =σ2 refers to the assumption of
a. Zero mean value of disturbance term
b. Homoscedasticity
c. No autocorrelation
d. No multicollinearity
6. In classical linear regression model, λ2X2i + λ3X3=0 with λ3=λ3 = O refers to the
ASSUMPTION of
a. Zero mean value of disturbance term
b. Homoscedasticity
c. No autocorrelation
d. No multicollinearity
7. In classical linear regression model, Cov (ui, uj)=0, i≠jrefers to the assumption of
a. Zero mean value of disturbance term
b. Homoscedasticity
c. No autocorrelation
d. No multicollinearity
8. The assumption of perfect multicollinearity means that
a. There should be no correlation among the regressors
b. There should be no linear relationship among the regressors
c. There should be no nonlinear relationship among the rergressors
d. There should be no relationship among the regressors
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

9. Given Yi = 𝛽 1X1i + 𝛽 2X2i +𝛽 3X3i+ui, state which of the following statement is true
a. 𝛽 2 measures the change in the mean value of Y per unit change in X2 ,holding
the value of X3 constant
b. 𝛽 3 gives the net effect of a unit change in X3, on the mean value of Y, net of
any effect that X2 may have on mean Y
c. Both a and b are true
d. Neither a nor b is true
10. The measure of proportion or percentage of a variation in Y explained by the
explanatory variables (X2, X3, …) jointly is given by
a. r2
b. R2
c. R
d. None
11. Multiple coefficient of determination measures the
a. Goodness of fit of multiple regression model
b. Homoscedasticity of multiple regression model
c. Heteroscedasticity of multiple regression model
d. Multicollinearity of multiple regression model
12. When R2 = 1; 𝑅̅2 would be equal to
a. 0
b. +1
c. -1
d. Less than 1
̅
13. 𝑅 can take values
2

a. Between 0 and 1
b. Between -1 and 1
c. Between -1 and 0
d. Less than equal to +1

14. The Values of 𝑅̅2is always less than R 2. This statement is

a. Incorrect
b. Correct
c. Depends of k value
d. Depends on n value
15. In comparing two models on the basis of goodness of fit
a. The sample size must be the same
b. The dependent variable must be the same
c. The independent variables must be the same
d. Both a and b above
16. Quadratic function is represented by
a. Yi = 𝛽 0 +𝛽 1 X2i+ui
b. Yi = 𝛽 0 +𝛽 1 Xi+𝛽 2 X2i+ui
c. Yi = 𝛽 0 +𝛽 1 Xi+𝛽 2 X2i+𝛽 3 X3i+ui
d. Yi = 𝛽 0 + 𝛽 1 X3i +ui
17. Given the regression model Yi=𝛽 1+𝛽 2X2i +𝛽 3X3i+ui,, how would you state the null
hypothesis to test that X2 has no influence on Y with X3 held constant.
a. H0: 𝛽 1 = 0
b. H0: 𝛽 2 = 0
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

c. H0: 𝛽 3 = 0
d. H0: 𝛽 2 = 0 given 𝛽 3 = 0
18. In hypothesis testing using t statistics, when the computed t value is found to
exceed the critical t value at the chosen level of significance, then
a. We reject the null hypothesis
b. We do not reject the null hypothesis
c. It depends on alternate hypothesis
d. It depends on F value
19. A hypothesis such as H0: 𝛽 2 = 𝛽 3 = 0, can be tested using
a. t-test
b. Chi-square test
c. ANOVA test
d. F-test
20. In regression model Yi=𝛽 1+𝛽 2X2i +𝛽 3X3i+ui,,testing the overall significance of the
model using F-test, degrees of freedom used (k-1), (n-1), where k is equal to
a. 2
b. 1
c. 3
d. Sample size
21. When 𝑅2 for a regression model is equal to zero, the F value is equal to
a. Infinity
b. High positive value
c. Low positive value
d. Zero
22. In the multiple regression model, the adjusted R2
a. Cannot be negative
b. Will never be greater than the regression R2
c. Equals to square of correlation coefficient r
d. Cannot decrease when an additional explanatory variable is added

TRUE/ FALSE
Stats whether the following statement is True or False. Give reasons for your answer:

1. Two or more models cannot be comparable on the basis of 𝑅2 ?

2. In Multiple linear regression analysis degree of freedom corresponding to total sum
of square is n-1?
3. If coefficient of correlation is -1 then residual sum of square is negative ?
4. In regression model 𝑌1 = 𝐵1 + 𝐵1 𝑋2𝑖 + 𝐵3 𝑋3𝑖 + 𝑢1 , if all vales of 𝑋3 are identical, then
the variance of ordinary least squares estimators of the slope coefficients is not
defined ?
5. The value of 𝑅̅2 is always greater than 𝑅2 .
6. 𝑌𝑖 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋3𝑖 + 𝜖𝑖 is estimated as 𝑌̂𝑖 = 𝛽̂1 + 𝛽̂2 𝑋2𝑖 + 𝛽̂3 𝑋3𝑖 using OLS. Here
𝑋2 and 𝛽1 are random variables and 𝛽̂3 is unknown.
7. If the regression model : 𝑌1 = 𝐵1 + 𝐵1 𝑋2𝑖 + 𝐵2 𝑋2𝑖 + 𝐵3 𝑋3𝑖 + 𝑢1, is estimated using the
method of ordinary least squares, the sum of the estimated residuals (𝑒𝑖 ) is zero.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

8. An increase in the number of explanatory variables in a multiple regression model

will necessarily increase adjusted R squared.
9. An addition of a variable in a regression model with 30 observations and 4 variables,
would always lead to a rise in R2 and adjusted R2, given that the additional variable is
statistically significantly different from zero at a = 20%.
10. In a multiple regression model 𝑌𝑖 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋3𝑖 + 𝑢𝑖 , testing a joint
restriction 𝐻0 : 𝛽2 = 𝛽3 = 0 is same as testing for 𝐻0 : 𝛽2 = 0 and 𝐻0 : 𝛽3 = 0.

PROOFS

1. Show that arithmetic mean of residual 𝑒𝑖 is always equal to zero.

2. Show that 𝑒𝑖 would be uncorrelated with estimated Y values.
3. Show that 𝑒𝑖 would be uncorrelated with 𝑥𝑖 values, where 𝑥𝑖 = 𝑋𝑖 -𝑋.
4. In Multiple linear regression analysis mean value of dependent variable is always
equal to mean predicted value of dependent variable.

Practical Question

1. You are given the following data:

Y 1 3 8

X2 1 2 3

X3 2 1 -3

Obtain the estimated regression equation using ordinary least squares if Y is regressed
on X2 and X3 with an intercept term.

2. An econometric analyst is estimating the following production function from annual

data on a firm in India:
Q = β0 + β1 L + β 2 K
Where L = Rupees of Labor
K = Rupees of Capital
The analyst knows that the firm always budget Rs. 12 Lakhs a year of labor and
capital together. The other relevant data are provided:
2 2
Σ𝑋2𝑖 = 14588, Σ𝑋3𝑖 = 2725, Σ𝑌𝑖2 = 47921, Σ𝑋2𝑖 Yi = 7454,
Σ𝑋3𝑖 Yi = 4554 ̅ ̅
Σ𝑋2𝑖 𝑋3𝑖 = 4796, 𝑋2 = 5802.25, 𝑋3 = 18, 𝑌̅ = 67,
N = 14

Can you estimate the regression coefficients in this model? Explain your answers.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

3. The following results were obtained from a sample of 12 firms on their output (Y),
labour input (X2) and capital input (X3), measured in arbitrary units:

4. The following tables contains the scales price of 5 holiday cottages in Ushered,
Denmark, together with the age and the livable area of each cottage.

Price (in$) Age (in Years) Area (in m2)

Yi X2i X3i

745 36 66

895 37 68

442 47 64

440 32 53

1598 10 101

Suppose it is thought that the price obtained for a cottage depends primarily on the age
and livable area. A possible model for the data might be th linear regression model
Yi = β1+β2X2i+β3X3i+ ui

where the random errors ui are independent, normally distributed random variables
with the zero mean and constant variances. Fit the model and obtain the parameters
and their respective standard errors.

5. You are given the following data based on a simple regression estimated for the
relationship between price (X2) and quantity of oranges sold (Y) in a super market
and also on the amount spent on advertising the product (X 3), for 12 consecutive
days.

(ii) Test the statistical significance of each estimated regression coefficient using α =
5%

6. You are given the following data based on 15 observations:

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(i) Estimate the three multiple regression coefficients and their standards
error .
(ii) Obtain R2 and
(iii) Test the statistical significance of each estimated regression coefficient
using α = 5%

7. Consider the following estimate regression estimated equation:

̂
Yi = 1336.049 + 12.7413X2i +85.7640X3i
se = (175.2725) (0.9123 ) (8.8019)
t = (-7.6226). (13.9653) (9.7437)
R2 = 0.8906, F = 118.0585, n = 32

Where, Y = Auction price of antique clock

X2 = Age of clock
X3 = Number of bidders

(i) Interpret all the three coefficients of the equation.

(ii) What do you understand by the concept of standard error of an estimate?
How would you calculate it?
(iii) Test the whether the age of clock has any significant contribution in
explaining the variation in auction price of antique clock.
(iv) Would you say that this regression equation is a god fit on the data? Explain
the basis of your answers.
(v) Test the overall significance of this equation i.e., test the joint hypothesis that
X2 and X3 are in significant in explaining the variance in Y.
(vi) What is the relationship between F and R2? Establish this for the regression
results presented above.
8. Consider the following regression for an imaginary country, say Utopia, for a period
of 15 years variables are: IMP = imports, GNP = Gross National Product and CPI =
Consumer Price Index.
̂ = -108.20 + 0.045 GNP2t -0.931CPI3t
IMP
t = (3.45) (1.23 ) (1.844)
R2 = 0.9894
(i) Test whether, individually, the partial slope coefficients for GNP and CPI are
statistically significant at the 5% level of significance.
(ii) Test the whether GNP and CPI jointly have nay statistical significance in
explaining variations in exports. Cary out this test at 5% level of significance.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

9. Consider the following model relating the gain in salary due to an MBA degree to a
number of its determinants.

Where,
SLRYGAIN = Post salary MBA minus pre MBA salary, in thousands of dollars.
TUTION = annual tuition coast, in thousands of dollars.
Z1 = MBA skills in being in analysts, graded by recruiters.
Z2 = MBA skills in being team players, grade by recruiters.
Z3 = Curriculum evaluation by MBA’s.
Using data for top 25 business schools, the coefficients were estimated as follows,
standard errors in parenthesis.

B^1 60.899 (2.513)

B^2 0.314 (0.750)
B^3 -0.3948 (2.756)
B^4 -2.016 (2.165)
B^5 -5.325 (3.773)

(i) Carry out individual two tail tests at 10% level of significance for the slope
coefficients.
(ii) Test the model for overall significance at the 10% level if R2 = 0.461 was
obtained for the model.
10. For the multiple regression model for Y = mental impairment, X 1 = life events, and
X2 = SES.
E(Y) = α + β1X1 + β2X2
Following table contains the required results:
Coff. Std.Error t
(Constant) 28.230 2.174 12.984
LIFE .103 .032 3.177
SES -.097 .029 -3.351
n = 40, R = 0.9542
2

(i) Interpret the regression model.

(ii) Test the significance of partial slope coefficients.
(iii) Construct the 95% confidence interval for partial slope of coefficients
(iv) Construct the ANOVA Table and Test whether the model is significant.

11. The grades points average (GPA) of a random sample of 427 students in a college
were regressed on verbal SAT scores (VSAT) and mathematicians SAT scores
(MSAT) and the following regression model was estimated. (Standard errors are
reported in parentheses)
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

̂
𝐺𝑃𝐴 = 0.423 + 0.398VSATi + 0.001MSATi
SE (0.220) (0.061) (0.00029)

(i) The analyst found the unadjusted R2 = 0.22 and concluded that the VSAT and
MSAT scores are not good predictors of GPA. Do you agree with him? Write
down all the steps to test his claim and check it at 5% level of significance.
(ii) Suppose a student’s VSAT and MSAT scores increased by 100 points each.
How much increase in GPA can be expected?
(iii) As a result of the college policy if all the GPA scores were increased by 10%
what impact would it have on the regression coefficients and coefficient of
determination R2.

12. Using time series data for 1979 to 2009 for a certain economy, the following model of
demand for money was estimated:

𝑀𝐷𝑖 = 𝐵1 + 𝐵2 𝑌𝑖 , +𝐵3 𝐼𝑁𝑇𝑅𝐴𝑇𝐸𝑖 + 𝑢𝑖

Where

MD = Quantity of money demanded, measured in billions of rupees.

Y = National income, measured in billions of rupees

INTRATE = Interest rate in percent on 3 month treasury bills.

The table below has estimates of the coefficients and their standard errors

Variable Estimate of coefficients Standard errors

CONTACT 0.003 0.009

Y 0.530 0.112

INTRATE -0.0261 0.101

a) Interpret the slope coefficients.

b) Test the overall significance of the model, at 5% level of significance, if coefficient
of determination reported for the model is 0.519.

13. A relationship was established between demands for housing (H). Gross National
Product (GDP), interest rate (INT) prevailing in the economy. The following results
were obtained:
̂ = 678.89 + 0.905GNP – 169.65INT
H
t = (1.80) (3.64) (-3.87)
R = 0.432, R = 0.375, df = 20
2 2
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(i) Calculate the F value from the data?

(ii) What conclusion do you draw from the F-value?

14. Consider the following simple regression model

Price = 𝛽0 + 𝛽1 assess + 𝑢

Where, Price is the housing price

Assess is the assessment of housing price.

The estimated equation is

̂ = −14.47 + 0.976 𝐴𝑠𝑠𝑒𝑠𝑠

𝑃𝑟𝑖𝑐𝑒

𝑡 = (16.27) (0.049)

𝑛 = 88, 𝑆𝑆𝑅 = 165644.51, 𝑟 2 = 0.820

i. How will you test the constraints 𝛽1 = 1 and 𝛽0 = 0 in the above regression if you
are given the SSR in the restricted model as 209448.99? Conduct the necessary
test(s) at 1% level of significance and give your conclusion?
ii. Suppose now that the estimated model is
Price = 𝛽0 + 𝛽1 Assess +𝐿𝑜𝑡𝑠𝑖𝑧𝑒 + 𝛽3 𝑆𝑞𝑟𝑓𝑡 + 𝛽4 𝐵𝑑𝑟𝑚𝑠 + 𝑢
Where
Lotsize = the size of the lot
Sqrft = the square footage
Bdrms = the number of bedrooms
The R2 = from estimating this model using the same 88 houses is 0.829. Test at
1% level of significance that all partial slope coefficients are equal to zero.

15. Based on the data for 1965 – IQ to 1983 – IVQ (n = 76), the following results were
obtained in the regression model to explain the personal consumption expenditure:
̂
Yi = -10.96 + 0.93 X2i – 2.09X3i
t = (-3.33) (249.06) (-3.09) R2 = 0.996
where, Y = PCE in billion rupees
X2 = the disposable income in billion rupees
X3 = the prime rate (%) charged by banks
(a) What is the marginal propensity to consume (MPC) the amount of additional
consumption expenditure?
(b) Is the MPC, statistically different from 1? Show the appropriate testing
procedure.
(c) What is the rational for inclusion of prime rate variable in the model? A priori,
would you expect a negative sign for this variable?
(d) Is b3 statistically different from zero?
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(e) Test the hypothesis that R2 = zero?

(f) Compute the standard error for each coefficient.

16. The monthly salary (wage, n hundred s of rupees), age (AGE, in years), number of
years of experience (EXP, in years), number of years of education (EDU) were
obtained for 49 persons in a certain office. The estimator regression of wage on the
characteristics of a person were obtained as follows (with a statistic in parenthesis):
Wage = 632.244 + 142.510EDU + 43.225 EXP - 1.913 AGE
(1.493) (4.008) (3.022) (- 0.22)
(i) The value of adjusted R , = 0.277. Using this information, test the model for
2

overall significance.
(ii) Test the coefficient of EDU and EXP for statistical significance at 1% level and
coefficients for age at 10% level.

17. Using quarterly data for 10 years (n= 40) for the U.S. economy, the following model
of demand for new cars were estimated:
NUMCARSi = B1 +B2 PRICEi + B3 INCOMEi + B4 INTRATEi +ui
Where
NUMCARS: Number of new car sales per thousand people
PRICE: New car price index
INCOME: Per capita real disposal income (in dollars)
The table below gives estimates of the coefficients and their standard errors:

ESTIMATES OF COFF STD ERROR

CONSTANT -7.4534 13.5782

PRICE -.0714 .0032

INCOME .0032 .0017

INTRATE -.1537 .0491

(i) A priori, what are the expected signs of the partial slope coefficients? Are
the results in accordance with these expectations?
(ii) Interpret the various slope coefficients and test whether they are
individually statistically different from zero. Use 10% level of significance.
(iii) The adjusted R squared reported for this model is 0.758. Test the Model
for overall goodness of fit at 5% level of significance.
18. A multiple regression analysis between yearly income (Y in $1.000s), college grade
point average (X1) age of the individuals (X2), and the gender of the individual (X3,
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

zero representing male) was performed on a sample of 10 people, and the following
results were obtained.

Coefficients Standard errors

Constant 4.0928 1.4400
X1 10.0230 1.6512
X2 0.1020 0.1225
X3 -4.4811. 1.4400

Analysis of variance
Source of Degrees Sum of Mean
Variation of Freedom Squares Square
Regression 360.59
Error 23.91

(i) Write the regression equation for the above.

(ii) Interpret the meaning of the coefficients of X 3.
(iii) Compute the coefficient of determination.
(iv) Is the coefficient of X1 significant? Use α = 0.05
(v) Is the coefficient of X2 significant? Use α = 0.05.
(vi) Is the coefficient of X3 significant? Use α = 0.05.
(vii) Complete the ANOVA table
(viii) Perform an F test and determine whether or not the model is significant.

19. A three variable regression gave the following results:

Source of Variation Sum of squares d.f. Mean sum of squares
Due to regression (ESS) 65,965 - -
Due to residual (RSS) - - -
Total (TSS) 66,042 14
(i) What is the sample size?
(ii) What is the value of RSS?
(iii) What are the d.f. of ESS and RSS?
(iv) What is R2 and adj. R2?
(v) Test the hypothesis that X2 and X3 have zero influence on Y. Which test do you
use and why?
(vi) From the preceding information can you determine the individual
contribution of X2 and X3 toward Y?
(vii) Recast the ANOVA table in terms of R2.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

20. Child Mortality Rate (CMR) for 25 countries was regressed on Female Literacy Rate
(FLR) and per capita GDP (PCG). The following results were obtained:

̂ = 263.64 – 0.0056PCG-2.2316FLRi
𝐶𝑀𝑅

se = (11.59) (0.0019) (0.2099)

R2 = 0.7077, ADJ.R2 = 0.6981

(i) Interpret the regression results.

(ii) Are the coefficients of regression significant independently and jointly?
(iii) If by adding another explanatory variable R2 increases to 0.77. Will this imply
that this inclusion is justifiable?
21. You are given the following regression models, compute adjusted R 2 for each of the
model and hence decide which of these a better fit is:

Model Dependent Intercept Age No. of R2

Variable Term Bidders

A Auction Price 1328.094 - - 0.00

B Auction Price -191.6662 10.4856 - 0.5325

C Auction Price 807.9501 - 54.5724 0.1549

D Auction Price -1336.049 12.7413 85.7640 0.8905

n = 32 for each model. Also compare model B and D using method of restricted least
squares.
22. Based on a sample of 38 countries the following regression was obtained:
̂i = 414.4583 + 0.0523X1i – 50.0476X2i
Y
se = (266.4583) (0.0018) (9.9581)
t = (1.1538) (28.2742) (-5.0257)
R = 0.916,
2 Adj R = 0.9594 F= 439.22
2

Where, Y = expenditure on education (billions of Rupees)

X1 = GDP (billions of Rupees)
X2 = Population (billions of people)
(i) Test whether the partial slope coefficients of GDP and population are
individually statistically significant at 5% level of significance.
(ii) Test whether jointly both GDP and population significantly explain variation
in the dependent variable. Use α = 5%.
(iii) Now if we impose the restriction that slope coefficient on population is Zero.
We obtain the following regression:
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Yi = 386.482 + 0.0732X1i
se = (268.421) (0.0049)
t = (1.4398) (14.9397)
R2 = 0.8978, ADJR2 = 0.8823 F = 436.81

Test whether this restricted is statistically regression, if dependent variables

do not have the same form, which alternative test do you use?

23. How the regression coefficients , TSS , RSS, ESS , Coefficients of determination
affected in case of change of origin and change of scale .

EXAM STYLE QUESTIONS

1. Let X2 be the hours spent on mathematics coaching during a week . let X 3 be the time
spent on other subjects and Y be the scores obtained in mathematics final exam. The
following summations for 23 students were obtained as belows.
𝑋2= 10. 𝑋3= 5. 𝑌 =12 , n=23
2 2
Σ𝑥2𝑖 =12 , Σ𝑥2𝑖 𝑥3𝑖 =8 Σ𝑥3𝑖 =12. Σ𝑥2𝑖 𝑦𝑖 =10. Σ𝑥3𝑖 𝑦𝑖 =8. Σ𝑦𝑖2 =10.

𝑥2 , 𝑥3 ,and y are variables measured the deviation form.

i) Estimate the following regression coefficient Yi = β1+β2X2i+β3X3i+ ui

ii) Estimate the standard errors of the slope coefficients
iii) Obtain 𝑅2 of the regression.
iv) Interpret the slope coefficients and comment on their statistical significance .
[Eco(h) 2022]

2. Demographic data from 126 countries is obtained for the year 2017. It is hypothesized
that life expectancy (Y) is dependent on number of under five deaths (X2), polio
immunization coverage (D), Per capita Govt. Exp. on Health Care (X3) (in Rs crores),
Per Capita GNI (in Rs crores) (X4) and Average number of years of Schooling (X). Polio
immunization coverage = 1 if yes and 0 otherwise.

Following regressions were estimated:

MODEL 1:

𝑌̂𝑡 = 0.903 − 0.561𝑋2𝑖 + 2.008𝑋3𝑖 + 0.553𝑋4𝑖 + 0.778𝑋5𝑖 + 3.638𝐷

𝑠𝑒 = (1.280)(0.405)(0.765)(0.712)(0.491)

𝑅2 = 0.787 𝑅𝑆𝑆 = 1339.8

MODEL 2:
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

𝑌̂𝑡 = 1.379 + 0.594𝑋3𝑖 + 2.139𝐷

𝑠𝑒 = (0.406)(0.465)

𝑅2 = 0.677 𝑅𝑆𝑆 = 1567.28

i. Is it a time series or a cross sectional data

ii. Show model 2 is a restricted version of model 1 and what is the restriction?
iii. Test for the statistical significance of the restriction at 5% level.
iv. Construct a 95% confidence interval for true per capita government health
expenditure in model Il and check whether it is statistically
significant.[Eco(h)2021]

3. The estimated equation for sales of TV is given as below :

𝑆𝑎𝑙𝑒𝑠 = 118.91 − 7.908 𝑃𝑟𝑖𝑐𝑒 + 1.863 𝐴𝑑𝑣𝑒𝑟𝑡

(𝑠𝑒) (6.35) (1.096) (0.953) 𝑅2 = 0.448, 𝑛 = 30

Where Price of TV measured in Rs.

Sales is sale revenue and Advert is advertising expenditure. Both Sales and Advert are
measured in terms of thousands of rupees.

i. Is the slope coefficient of price statistically different from 1? Test at 𝛼 = 2%.

ii. Calculate the elasticity of sales revenue with respect to price if average sales
revenue is 300 and average price is 100?
iii. How would you test that an increase in advertising expenditure will bring an
increase in sales revenue that is sufficient to cover the increased advertising
expenditure? Clearly state the Null and alternative hypothesis. Test at 𝛼 =5%.
iv. Estimate the sales revenue for a price of Rs. 6 and an advertising expenditure of
Rs. 1200. [Eco(h)2022]

4. Consider the following data on hourly wage rates (Y), Labour productivity (𝑋1 ) and
literacy rate (𝑋2 ) in a country ABV:

𝑌 90 72 54 42 30 12

𝑋1 3 5 6 8 12 14

𝑋2 16 10 7 4 3 2

i. Calculate the estimators of the regression 𝑌𝑖 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋3𝑖 + 𝑢𝑖

ii. Test the hypothesis 𝛽2 = 0 against the alternative 𝛽2 > 0 at 5% level of significance.
iii. Calculate R2 and 𝑅̅2 and comment on them.
iv. Construct an ANOVA table and check for the significance of the regression at 5%
level of significance.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

v. Do you think that Cov (u, x) will be non-zero in the model which has low R2?
Explain. [Eco(h)2021]

5. Using time series data for 1979 to 2009 for a certain economy, the following model
of demand for money was estimated.

𝑀𝐷𝑖 = 𝐵1 + 𝐵2 𝑌𝑖 + 𝐵3 𝐼𝑁𝑇𝑅𝐴𝑇𝐸𝑖 + 𝑢𝑖

Where

MD = Quantity of money demanded, measured in billions of rupees.

Y = National income, measured in billions of rupees

INTRATE = Interest rate in percent on 3 month treasury bills

The table below has estimates of the coefficients and their standard errors

Variable Estimates of coefficients Standard errors

CONSTANT 0.003 0.009

Y 0.530 0.112

INTRATE -0.0261 0.101

a) Interpret the slope coefficients.

b) Test the overall significance of the model. at 5% level of significance, if coefficient
of determination reported for the model is 0.519. [Eco(h) 2016]
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

CHAPTER-3

FUNCTIONAL FORM OF REGRESSION MODEL

Multiple Choice Questions

Choose the Best alternative for each question
1. For a regression through the origin, the intercept is equal to
a. 1
b. 2
c. 0
d. -1
2. If in Yi =  1+  2Xi + ui, both Y and X are standardized variables. The intercept
term will be be
a. Positve
b. Negative
c. Between -1 and +1
d. Equal to zero
3. In double log regression model, the regression slope gives
a. The relative change in Y for an absolute change in X
b. The percentage change in Y for a given percentage change in X
c. The absolute change in Y for a percent change in Y
d. By how many units Y changes for a unit change in X
4. In Log-Lin regression model, the slope coefficient gives
a. The relative change in Y for an absolute change in X
b. The percentage change in Y for a given percentage change in X
c. The absolute change in Y for a percent change in Y
d. By how many units Y changes for a unit change in X
5. In Lin-Log regression model, the slope coefficient gives
a. The relative change in Y for an absolute change in X
b. The percentage change in Y for a given percentage change in X
c. The absolute change in Y for a percent change in X
d. By how many units Y changes for a unit change in X
6. In double log model, elasticity of Y with respect to X is given by
a. 𝛽 2
b. 𝛽 2(X/Y)
c. 𝛽 2X
d. 𝛽 2(1/Y)
7. In Log-Lin model, elasticity of Y with respect to X is given by
a. 𝛽 2
b. 𝛽 2(X/Y)
c. 𝛽 2X
d. 𝛽 2(1/Y)
8. In Lin-Log model, elasticity of Y with respect to X is given by
a. 𝛽 2
b. 𝛽 2(X/Y)
c. 𝛽 2X
d. 𝛽 2(1/Y)
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

9. In linear model, elasticity of Y with respect to X is given by

a. 𝛽 2
b. 𝛽 2(X/Y)
c. 𝛽 2X
d. 𝛽 2(1/Y)
10. When comparing r2 of two regression models, the models should have the same
a. X variables
b. Y variables
c. Error term
d. Beta coefficients
TRUE/FALSE

1. In regression through origin models, the conventionally computed R2 may not be

meaningful?
2. In a double log model In Y i = A + B In Xi + vi the slope coefficients are different from
elasticity coefficients?
3. In log-liner regression models, the magnitude of the estimated slope coefficients is
invariant to the units in which the explanatory variables are measured, unlike linear
models?
4. Log-Log Model is also called growth rate model?
5. Cob-Douglas production function in Log-lin Model?
6. In Double log model a, elasticity of a function is constant while in Linear regression
model slope of function is constant ?
7. Log-lin models and Lin- log models are comparable on the basis of 𝑅2 ?
8. In Double log model , regression line is passess through the mean of X and mean of Y?
2
̂0)=𝑋0 2 𝜎 2 ?
9. Under regression through origin model V(Y Σ𝑋 𝑖

Practical Questions

Double Log Models

1. The OLS Regression based on the log-linear data gave the following results:
̂ = 4.8877 + 0.1258 InXt
InYt
se = (0.1573) (0.0148).
t = (31.0740) (8.5095)
p = (1.25 x 10-9) (2.79 x 10-5) r2 = 0.9005, n =10

Where Y= Math’s Score

X = Family Income
(i) Interpret the intercept and slope term.
(ii) Interpret the coefficient of determination.
(iii) Test the significance of regression coefficients.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

2. Based on 11 annual observations the following results were obtained:

Model A:
̂
Yt = 2.6911-0.4795 Xt
Se = (0.1216) (0.1140) r2 = 0.6628

Model B :-
̂ = 0.7774 – 0.2530 InX t
InYt
se = (0.0152) (0.0494) r2 = 0.7448
Where Y= cups of coffee consumed per person per day
X= the price of coffee in rupees per cup.

(a) Interpret the slope coefficient in two models.

(b) You are told that Y̅ =2.43 and X̅= 1.11. At these mean values, estimate the price
elasticity for the model A.
(c) What is the price elasticity for the model B?
(d) From the estimated elasticities, can you say that the demand for coffee is price
inelastic?
(e) How would you interpret the intercept in the model B?
(f) Since r2 of Model B is larger than that of model A, Model B is preferable to
Model A. Comment on this statement.
3. Using 21 annual observations, the following equation for demand for a good was
estimated using OLS:
̂ t = 1.71 – 0.35 InX1t + 0.47 InX2t
InY R2 = 0.876, Ṝ2 = 0.843
se = (0.059) (0.083) (0.083)

Where,
Y = No of units demanded
X1 = Price of goods ( Rs. Per unit)
X2 = Consumer’s income
(i) Test at α =5% whether the good has unit income elasticity against the
alternative that the demand for the good is income inelastic.
(ii) Test the overall significance of the regression.
4. For the data for 46 states in USA for 1992following regression result was obtained:
̂ = 4.30 + 1.34 InP + 0.17 InY
InC
se = (0.91) (0.32) (0.20) Ṝ2 = 0.27

Where C = cigarette consumption packs per year

P= Real price per pack
Y= Real disposable income per capita
(i) What is the elasticity of demand of cigarettes with respect to price and
income? Are they statistically significant if not then why?
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(ii) How would we obtain R2 from Ṝ2 given above? Then test for overall
significance of regression.
5. You are given the following Cobb Douglas Production function:
̂ i= -1.65 + 0.34 In Li + 0.85 In K
InY
t = (-2.73) (1.83) (9.06) R2 = 0.995 n = 22

(i) Interpret the partial regression coefficients

(ii) Find the returns to scale.
(iii) Test for the significance of partial regression coefficients. Will you use a one
tail or to tail test?
(iv) What can you about the overall significance of the regression model.
6. From the following regression function:
̂ i= 1.5195 + 0.9972 In X2i – 0.3315 In X3t
InY
se = (0.903) (0.0191) (0.0243)R 2 = 0.994 n = 23

Where Y = final demand

X2 = Real GDP
X3= Real energy price
(i) Interpret the partial regression coefficient
(ii) Test for the significance of partial regression coefficient. Will you use a one
tail or two tail test?
(iii) What can you about the overall significance of the regression model.
(iv) Compute the value of adj. R2 for the above model.

7. Consider the Cobb-Douglas production function in its logarithmic form as follows:

̂ i = B1 + B2 In Li + B3In Ki + Ui
InY
where, Y = Output
L = Labor input
K = Capital input
Suppose the following production function is estimated
𝑌 𝐾
In (𝐿 ) = B1 + B3 In ( 𝐿 ) + vi

(i) What restriction has been imposed on the Cobb-Douglas production function
to obtain this estimated production function?
(ii) How will you test the validity of this restriction?

Semi Log Models

1. From the data based on population of USA (millions of people) for the years 1975 to
2007 the following regression model was obtained:
̂ i = 5.3593 + 0.0107 t
InY
t = (3321.13) (129.779) R2= 0.9982

Where Y = population of USA (millions of people)

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

t = time period (in year)

(i) Interpret the Intercept term
(ii) Interpret the slope term.
(iii) Find the instantaneous growth rate as well as the compound growth rate.
(iv) Test the significance of regression coefficients.
(v) Interpret r2.
2. Consider the following equation:
̂ i = -5.10 + 0.100EDUi + 0.T10EXPi
InSal
se = (0.025) (0.050)
R = 0.48,
2 n = 28
Where In (Sal)i = log of salary of ith worker
EDi = Years of education of ith worker
EXPi = Years of experience of ith worker

(i) Interpret the equation. Make appropriate hypothesis for signs of coefficient
and test your hypothesis.
(ii) What are the elasticity of salary with respect to education and experience?
(iii) If we run a linear regression instead of log-linear regression then how would
the interpretation change?
3. To determine how expenditure on service (Y) behaves if total personal expenditure
(X) rises by a certain percentage, the following regression model was obtained:
̂t = -12564.8 + 1844.22 In Xt
Y
se = (916.351) (114.32) r2 = 0.881 n=20

(i) Interpret the intercept term

(ii) Intercept the slope term
(iii) Test the significance of regression coefficient.
(iv) Interpret r2.

4. Consider the following regression for cross sectional data for 55 rural households in
India. The regress and in this equation is expenditure on food and the regress or is
total expenditure (a proxy for income)
̂ t = 1283.912 + 257.27 In (TEXP)
𝐹𝐸𝑋𝐵
t = (-4.3848)* (5.6625)* r2 = 0.3769
Note: *denotes an extremely small p-value.

(i) What is the interpretation of coefficient of In (TEXP)?

(ii) Would you say that Engel’s Law is validated for this sample? Explain.
5. Consider the following population regression function
In (Div)t = β1 + β2 In (PRFT)t +β3 Time + ui
Here, Div. = Corporate Dividends Paid
PRFT = Corporate Profits
In = Natural Logarithms
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

The estimated sample regression results for an economy for 244 quarterly
observation are presented below:

Coeff. Standard Errors t-statistic Prob-value

Intercept 0.4357 0.1921 2.2674 0.0243
In (PRFT) 0.4245 0.0777 5.4614 0.0000
Time 0.0126 0.0014 8.93 0.0000
R = 0.9914,
2 adj. R = 0.9913
2

Sum of Squared Residuals = 4.2657, F – Statistic = 13930.73

SE of Regression = 0.133 Prob (F-statistic) = 0.0000
Durbin – Watson Statistic = 0.0201

(i) What are the economic interpretations of β2^and β3^?

(ii) On what counts would a researcher be satisfied with these result s at a first
glance? Verify your conjectures using formal tests. For tables take the
closest value of n.

RECIPROCAL MODEL
1. Based on annual percentage change in wage rates, Y and the unemployment rate, X
for kingdom for the period 1950-1966 the following results were obtained:
̂i = -1.4282 + 8.02743 1
Y 𝑋𝑖
Se = (2.0675) (2.8478) r2 = 0.3849,
(i) What is the interpretation of 8.02743?
(ii) Test the hypothesis that the estimated slope coefficient is not different from
zero. Which test will you use?
(iii) How would you use the F test to test the preceding hypothesis.
(iv) Given that Y = 4.8 percent and X = 10.5 percent, what is the rate of change of
Y at these mean values?
(v) What is the elasticity of Y with respect to X at these mean values.
(vi) How would you test the hypothesis, is that true r 2 =0?

2. The percentage change in the index of hourly earnings (Y) and the civilian
unemployment rate (X) for the United States for the year 1958 to 1969 gives the
following regression model:
1
̂
Yi = -0.2594 + 20.5880 𝑋𝑖
t = (-0.2572) (4.3996) r2 = 0.6594
(i) What is the wage floor?
(ii) Interpret the slope term.
(iii) Test the significance of regression coefficients.
(iv) Interpret r2.
(v) The linear model for the same data is
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

̂i = 8.0147 – 0.7883 Xt
Y
t = (6.4625) (-3.2605) r2 = 0.5153
(a) Is positive slope in the reciprocal model analogous to negative slope in
the reciprocal model.
(b) Compare the slope terms of two models.
(c) Compare r2 for two models.

Polynomial Regression
1. The following regression considers the relationship between lung cancer and
smoking for 43 states in India:
Yi = β1 + β2Xi + β3X2i + ui
Where, Y = number of deaths from lung cancer.
X = number of cigarettes smoked.
Results are as follows:
Predictor Coeff. Std. error t p
Constant -6.910 6.193 -1.12 0.271
X 1.5765 0.4560 3.46 0.001
X 2 -0.019 0.008 -2.35 0.024
R = 0.564,
2 ADJ. R = 0.543
2

F P
Residual sum of squares 311.69 26.56 0.00
Sum of squares regression 403.89
(i) Interpret the above regression
(ii) Test the individual significance of regression coefficients. Which test do
you and why? (Use α = 5%)
(iii) Construct an ANOVA table for the problem and test for the overall
significance of the model. (Use α =5%)
2.The OLS regression results based on the Cost (Y) and Output (X) are as follows:
̂
Yi = 141.7667 + 63.4776Xi – 12.9615X2i + 0.9396X3i
se = (6.3753) (4.7786) (0.9857) (0.0591)
R = 0.9983,
2 n = 10
(i) Does this model represent the cost function; explain by testing the
coefficient in the model.
(ii) Test the significance of the regression coefficient.
(iii) Construct an ANOVA table for the problem and test for the overall
significance of the model. (Use α =5%)
(iv) Find the average and marginal cost curves.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Regression Through Origin

1. Based on monthly data from January 1978 to December 1987, the following
regression results were obtained:
Model 1 : ̂
Yt = 0.00681 + 0.758Xt r2 = 0.4406
t = (0.262) (2.80)
p = (0.798) (0.0186)
Model 2 : ̂
Yt = 0.762Xt r2 = 0.4368
t = (2.954)
p = (0.0131)
Where, Y = monthly rate of return on Texaco common stick in %.
X = monthly market rate of return in %
(i) What is the difference between two regression models?
(ii) Would you retain the intercept term in model 1? Why or why not?
(iii) How would you interpret the slope term in the two models?

2. The following two models are based on the returns on a future fund (Y) and the
term on the market portfolio(X) for the period 1971-1980:
Model A:
̂
Yi = 1.2797 + 1.0691 Xi
se = (7.6886) (0.2383) r2 = 0.7115
Model B:
̂
Yi = 1.0899Xt
se = (0.1916) raw r2 = 0.7825

(i) Test the significance of incept term in the model A. Does this justify the
model B.
(ii) If the intercept term is absent then the slope term can be estimated by far
greater precision. Explain with the help of above models.
(iii) Can we compare the r2 of two models?

3. Consider the regression through the origin model:

Yi = B2Xi + ui
a) Write the normal equation and use it to derive the ordinary least square
estimator b2 of B2?
b) Show that b2 is a linear and unbiased estimator of B2.
c) Explain why the sum of the estimated residual, Σei need not be zero in this
regression model.
4. Consider the following regression:
Yi = β2Xi +ei
(i) How would you go about estimating the unknowns?
(ii) Will ∑ei = 0 for this model? Why or why not?
(iii) Will ∑eiXi = 0 for this model? Explain.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(iv) Will ? Explain.

EXAM STYLE QUESTIONS

1. The relationship between infant mortality rate (IMR) and the expenditure on
immunization programmes for children (IMMUN) in lakhs of rupees for 63 districts
of India is postulated by the following two alternate models :

Model A: IMRt = 𝛼1 + 𝛼2 IMMUNi + ui

Model B: IMRt = 𝛽1 + 𝛽2 IMMUNi + 𝛽3 IMMUNi2 + vi

The R2 for Model A and Model B are obtained as 0.6152 and 0.8254 respectively. Use a
suitable test at 5% significance level to decide which model would you prefer-restricted
or unrestricted. State the null and alternate hypothesis clearly.

2. Consider the following Cobb Douglas production function estimated for Taiwan for
the period: 1965-1974.

̂ t =1.6624 + 0.3397In Lt + 0.8460 ln Kt

InGDP

t= (-2.725) (1.8296) (9.0626)

R2 = 0.9951

RSSUR = 0.0136

where GDPt = GDP at time t,

Lt = labour at time t,

Kt = capital at time t

In = natural logarithms:

i. Interpret the coefficients of the regression and comment on their individual

significance.
ii. Comment on the returns to scale experienced by the Taiwanese economy.
iii. By imposing the restriction of constant returns to scale, the following regression
was obtained:
𝐺𝐷𝑃 𝐾
In ( ) = −0.4947 + 1.0153 In ( 𝐿 )
𝐿 𝑡 𝑡
t = (-4.0612). (28.1056)

𝑅2 = 0.977 𝑅𝑆𝑆𝑅 = 0.0166

Interpret the above regression.

Use a test statistic to see whether the economy is characterized by constant
return to scale. [Eco(h) 2015 ]
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

3. Consider the following model of monthly rents paid on rental units in industrial hub
cities of an economy:

in (rent) = B0 + B1 In (pop) + B2 In (avinc) + B3 (socind) + u

where

rent = average monthly rent paid in rupees

pop city population

avinc = average city income in rupees

socind = index of social infrastructure

i. How will you test the hypothesis that city population and social infrastructure
have no significant joint effects on monthly rents? Explain the steps involved in
the test with reference to the above model.
ii. Suppose b1 is estimated.as. 0.066. What is wrong with the statement: "A 10%
increase in population is associated with a 6.6% increase in monthly rent".
[Eco(H) 2014]
4. find the slope and elasticity of Y with respect to X for the following functional formal:
a) In Y = B1 – B2 (1/X)
b) Y = B1 + B2 In X. [Eco(h) 2013]

5. Consider the following models:

Model 1- In 𝑌𝑖∗ = 𝛼1 + 𝛼2 𝐼𝑛𝑋𝑖∗ + 𝑢𝑖∗

Model 2- In 𝑌𝑖 = 𝛽1 + 𝛽2 𝐼𝑛𝑋𝑖∗ + 𝑢𝑖∗

Where 𝑌𝑖∗ = 𝑤1 𝑌𝑖 and 𝑋𝑖∗ = 𝑤2 𝑋𝑖 , the w’s being constants.

i. Establish the relationships between the two sets of regression coeffcients and
their standard errors.
ii. Is the R2 different between the two models? [Eco(h) 2019]

6. Suppose the CLRM applies to 𝑌𝑖 = 𝛽2 𝑋𝑖 + 𝜀𝑖 .

i) Find the slope coefficient in the regression of Y and X.
ii) Suppose now we have a regression of X on Y, 𝑋𝑖 = 𝛾2 𝑌𝑖 + v𝑖 . In slope coefficient
of regression on X on Y an inverse of slope of regression of Y on X. [Eco(h) 2019]

7. Two models for Engel expenditure function are estimated.

Model 1 : 𝑌𝑡 = 1087.930 + 0.077𝑋𝑡
t = (25.58) (21.64) R2 = 0.350 F = 468.645
Model 2 : 𝑌𝑡 = 4005.077 + 0.3381/𝑋𝑡
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

t = (19.259) (-20.816) R2 = 0.333 F = 433.310

where Yt = expenditure on food in rupees = total expenditure in rupees
i. Interpret all coefficient value of the two models.
ii. Are the sign of the coefficients in the two models contradictory?
iii. Can we compare the results of the two models?
iv. Diagrammatically show the sample regression function in the above model.
[Eco(h) 2019]

8. Data is available on per unit cost (Y in Rs) of a manufacturing firm over a 20-year
period, and index of its output (X). Following results were obtained:

𝑌̂𝑡 = 10.522 - 0.175 𝑋𝑡 + 0.000895 𝑋𝑡2

t = (14.3) (-9.7) (7.8)

R2 = 0.978 TSS= 5700

i. Interpret the signs of the two slope coefficients in the above regression.
ii. At what level of output will the average cost function be minimum?
iii. Compute adjusted R' Is adjusted R' always less than R?? Justify your answer.
iv. Test that the variance of per unit cost (ox) over this 20 year period=20 against
not equal to 20. Use 5% level of significance.
v. Would your answer remain the same if a 95% confidence interval is constructed
to test the same hypothesis? Construct the interval and justify your answer.

[Eco(h) 2023]
9. Consider the model

𝑌𝑖 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋3𝑖 + 𝑢𝑖

Where,

𝑌𝑖 is the long term consumption measured in Rs thousands

𝑋2𝑖 is the income measured in Rs thousands

𝑋3𝑖 is the age measured in years

a) How will the estimated intercept and slope coefficients change if the unit of
measurement of income is changed to Rs lakhs.
b) Suppose the researcher thinks that usually consumption increases with income
but at a decreasing rate and consumption increases with age. How would he
modify the model to see whether the data supports his hypothesis?
c) Suppose the researcher wants to assess the relative importance of age and
income on long term consumption, what model should he estimate? Explain.
[Eco(h) 2021]

10. Following is the demand schedule for commodity 𝑥

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

𝐷𝑥 = 𝑓(𝑃𝑥 , 𝑃𝑦 , 𝑌)
Where 𝐷𝑥 is the demand for commodity 𝑥, 𝑃𝑥 is its price, 𝑃𝑦 is the price of related
commodity y andY is the income of the consumer. How do you measure the elasticity
of demand with respect to own price and price of related commodity Y if you use (i)
double log model, (i) linear model. [Eco(h) 2017]

11. Consider the Cobb-Douglas production function: [Eco. (H) III Sem. 2017(ER)]
𝛽 𝛾
𝑄𝑡 = 𝑒 𝛼 𝐾1 𝐿𝑡 𝑒 𝑢
Where, 𝑄 denotes output, K denotes capital input and L denotes labour input and e =
2.71828.
(a) Formulate a model that can be used to estimate the parameters a, 𝛽 and 𝛾 using
ordinary least squares.
(b) Show that this model implies a constant partial elasticity of output with respect to
labour but a variable marginal effect of labour on output. [Eco(h) 2020]

12. The following regression model was estimated using annual time-series data for the
period 1990-2012 for a certain country:

̂ 𝑡 = 𝑏1 + 𝑏2 𝐼𝑛𝑋2𝑡 + 𝑏3 𝐼𝑛𝑋3𝑡
𝐼𝑛𝑌

Where 𝑌1 = demand for cheese (in kg.)

𝑋2 = disposable income (in Rs. '000)

𝑋3 = price of cheese (in Rs. per kg.)

The results are summarized in the following table:

Coefficient Standard error

Intercept 2.03 0.116

𝑋2 0.45 0.025

𝑋3 -0.377 0.063

i) Interpret the partial slope coefficients. [Eco(h) 2017]

ii) If the calculated F statistic for the estimated model is 492.513, what is its R 2?

13. Consider the following regression model:

𝐼𝑛𝑌 = 𝐵0 + 𝐵1 𝐼𝑛(𝑋1 ) + 𝐵2 𝐼𝑛 (𝑋2 ) + 𝐵3 𝐼𝑛 (𝑋3 ) + 𝐵4 𝐼𝑛 (𝑋4 ) + 𝑢𝑖

Where
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

𝑌 = per capita consumption of potatoes in kg.

𝑋1 = per capita income in Rs. '000.

𝑋2 = price of potatoes in Rs. per kg.

𝑋3 = price of cauliflower in Rs. per kg.

𝑋4 = price of cabbage in Rs. per kg.

(i) How will you test the joint hypothesis that potato consumption is not affected
by the prices of cabbage and cauliflower ? Explain the steps involved in the test
with reference to the above model.
(ii) If the estimated value of b, is 200, it means "a 1% increase in income is
associated with a 200% increase in per capita consumption of potatoes;
everything else kept constant." Is the above interpretation correct ? Explain.

14. Based on the data on GNP and money supply for the period 1965-2006 for India. Ma
the following regression results were obtained by regressing GNP (in billions of
Rupees) on money supply (in billions of Rupees) for alternate models :

Model Intercept Slope Coefficient 𝑅2

Coefficient

Log-linear 0.8726 0.7839 0.927

(11.40) (108.93)

Log-lin 6.2392 0.0002 0.852

(75.85) (12.07)

Lin-log 14299 2383.4 0.879

(14.45) (16.84)

Linear 603.28 0.3718 0.921

(7.04) (55.58)

Where the figures in parentheses are t-ratios

(i) For each model. Interpret the slope coefficient.

(ii) For each model, estimate the elasticity of GNP with respect to money supply
(sample means of the GNP and money supply are 5113.65 and 9347.53
respectively.
(iii) Are all 𝑅2 values comparable? If not, which ones are? [Eco(h) 2015]
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

15. Using annual time-series data for the company 'Pure Juice' for the period 2000 - 2016,
the following equation was obtained :

̂ 𝑡 = 1.2028 + 0.0214𝑡
𝐼𝑛𝑌

𝑆𝑒 = (0.0233) (0.0025)

Where 𝑌𝑡 = revenue of the company in crores at time 𝑡 and 𝐼𝑛 indicates natural log.

(i) Interpret the estimated coefficients.

(ii) Explain how the annual compound growth rate in revenues of the company
during the period can be obtained?
(iii) Using the estimated model, how can the forecast revenue for the year 2017 be
obtained? [Eco(h) 2018]
16. The sales manager of a company believes that the district sales (𝑆𝑡 ) of motor vehicles
has been growing according to the model 𝑆𝑡 = 𝑆0 (1 + 𝑔)𝑡 , where 𝑡 is the time. Average
sales is 50 units and average time is 4 years. He obtains the following OLS regression
results:

̂ 𝑡 = 3.6889 + 0.583𝑡
𝐼𝑛𝑆

(i) What is the estimate of the instantaneous and compound growth rate?
(ii) What is the estimate of 𝑆0 ?
(iii) What will be the elasticity of sales with respect to time?
(iv) Suppose the researcher modifies the above equation and estimates the
following regression: 𝑆̂𝑡 = 5.6731 + 2.7530𝑡 Interpret the model.
(v) Compute elasticity of sales with respect to time for the model in part iv. Compare
your results with the answer obtained in part iii. [Eco(h) 2021]
17. Consider the following functional form :

1
𝑌 = 𝐵1 + 𝐵2 𝑋 + 𝐵3 ( )
𝑋

(i) Derive the expression for the marginal effect of Y with respect to X.
(ii) Derive the expression for elasticity of Y with respect to X and express it in terms
of X only.
(iii) Assume without loss of generality. 𝐵1 = 0 and 𝐵2 > 0, 𝐵3 > 0. For what
value of X will this function attain a minima? Draw a rough sketch for the function
[Eco(h) 2017]
18. In order to test whether the developing economies are catching up with the advanced
economies or not, a researcher regressed the growth rate of GDP of a country on its
relative per capita GDP for 119 developing countries. The relative per capita GDP of a
country is measured as a ratio of the country's per capita GDP to the GDP per capita
of USA. The regression results were obtained as under (standard errors are reported
in parentheses):

𝐺̂ = 0.013 + 0.062 𝑃𝑖 - 0.061𝑃𝑖2

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

s.e. = (0.04) (0.02) (0.033)

R2 = 0.053, adjusted R2 = 0.036

Where, G is the growth rate of GDP (in %)

And, P is the relative per capita GDP (in %)

(i) Interpret the above regression results.

(ii) Find the marginal effect of P on G. [Eco(h) 2013]

19. In each of the following cases suggest a suitable functional form to explain the
relationship between dependent variable and the explanatory variable. Also justify
your choice and interpret the coefficients in each case.
(i) Cobb Douglas production function
(ii) Rate of growth of population in an economy
(iii) Total cost function of a firm
(iv) Engel Expenditure Function
(v) Phillips Curve
(vi) Average salary earned by the employee conditional upon the gender of the
employee. [Eco(h) 2020]

CHAPTER-4
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

DUMMY VARIABLE
Multiple Choice Questions
Choose the Best alternative for each question
1. Dummy variables classify the data into
a. Inclusive categories
b. Mutually exclusive categories
c. Qualitative categories
d. Quantitative categories
2. If a quantitative variable has ‘m’ categories, we can introduce
a. Only ‘m-1’dummy variables
b. Only ‘m’dummy variables
c. Only ‘m+1’dummy variables
d. Only ‘m*2’dummy variables
3. We are trying to estimate the differentials in average annual salary of professors
for three categories in India—those employed at a fully government aided college
(D1i), those employed at partially government aided colleges(D2i) and those
employed at private college(D3i) Which of the following is NOT a correct functional
form?
a. Yi=𝛽 0+𝛽 1D1i +𝛽 2D2i+Ui
b. Yi=𝛽 1D1i +𝛽 2D2i+𝛽 3D3i +Ui
c. Yi=𝛽 0+𝛽 1D1i +𝛽 2D2i+𝛽 3D3i +Ui
d. LnYi=𝛽 0+𝛽 1D1i +𝛽 2D2i+Ui
4. For question (3) above, given Yi=β1+β2D2i +β3D3i+ui, β1 represents the mean
annual salary of professors working in
a. Fully government aided colleges
b. Partially government aided colleges
c. Private colleges
d. All three colleges
5. For question (3) above, mean annual salary of professors working in fully
government aided colleges is given by
a. 𝛽 1
b. 𝛽 1 + 𝛽 2
c. 𝛽 1 + 𝛽 3
d. 𝛽 2 + 𝛽 3
6. In trying to test that females earn less than their male counterparts was estimates
the following model: Yi=𝛽 1 +𝛽 2Di, where Y = average earnings per day in Rs. D =
1 for females and 0 otherwise. 𝛽 2 here refers to the
a. Average earnings of male
b. Average earnings of female
c. Differential intercept coefficient for male earnings
d. Differential intercept coefficient for female earnings
7. ANCOVA models include regressors that are
a. Only quantitative variables
b. Only qualitative variables
c. Only categorical variables
d. Both qualitative and quantitative variables
8. ANOVA models is include
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

a. Only quantitative variables

b. Only qualitative variables
c. Only categorical variables
d. Both qualitative and quantitative variables

9. The Process of removing the seasonal component from a time series sample date
is known as
a. Seasonalization
b. Seasonality
c. Deseasonalizstion
d. Seasonal trend testing

TRUE/FALSE

a. Dummy variable are also called stochastic variable?

b. Dummy variable trap situation arises, when there is high multicollinearity
between the explanatory dummy variable ?
c. ANCOVA model is the extension of ANOVA model?
d. Slope coefficient and differential intercept coefficient are same?

Practical Questions

ANOVA models With One Qualitative Variable Having Two Categories

1. Regressing food expenditure on the gender dummy variable, we obtain the
following results:

̂
Yi = 3176.833 -503.1667Di
se = (233.0446) (329.5749)
r2 = 0.1890, n = 12

Where
Yi = Food expenditure (in Rs.)
Di = 1 for female
0 for male
(i) Find the average food expenditure of males and females.
(ii) Is there a significant difference in the average food expenditure of males and
females.
(iii) What is the benchmark category.
2. Consider the following model:
Yt = β1 + β2Dt + ui
Where Dt = 0 for first 20 observations and 1 for next 30 observations
Var (ui) = 300
(a) How would you interpret β1 and β2?
(b) What are the mean values of 2 groups?
(c) Find the Cov(𝛽̂ 1 , 𝛽̂ 2)
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

ANOVA Models With One Qualitative Variable Having more than Two Categories.
1. The data on average salary (in dollars) of public school teachers in 50 states and the
District Of Columbia for the year 1985 was available. These 51 areas are classified
into three geographical regions: (1) Northeast and North Central (21 states in all)
(2) South (17 states in all), and (3) West (13 states in all). The following regressions
model was obtained from the given data:
̂
Yi = 26,158.62 - 1734.473D2i - 3264.615D3i
Se = (1128.523). (1435.953) (1499.615)
t = (23.1759) - (-1.2078) (-2.1776)
(0.0000)* (0.2330)* (0.0349)* R2 = 0.0901

Where, * indicates the p values.

Yi = (average) salary of public school teacher in state i
D2i = 1 if the state is in the Northeast or North Central
= 0 otherwise (i.e., in other regions of the country)
D3i = 1 if the state is in South.
= 0 otherwise (i.e., in other regions of the country)
Find:
(i) Mean salary of public school teachers in the northeast and North Central.
(ii) Mean salary of public school teachers in the South.
(iii) Mean salary of public school teachers in the West.
(iv) The benchmark category.
(v) Is the mean salary of teachers statistically different from each other?

ANOVA Models with Two Qualitative Variables

1. From a sample of 528 persons in May 1985, the following regression results were
obtained:
̂i = 8.8148 + 1.0997D2i – 1.6729D3i
Y
se = (0.4015) (0.4642) (0.4854)
t = (21.9528) (2.3688) (-3.4462)
(0.0000)* (0.0182)* (0.0006)*

Where Y = hourly wage ($)

D2 = married status, 1 = married, 0 = otherwise
D3 = region of residence; 1 = South, 0 = otherwise
And * denotes the p values.
Find:
(i) The benchmark category.
(ii) Interpret the regression model.
(iii) Test the significance of the regression coefficients individually.

Regression with a Mixture of Quantitative and Qualitative Regressors: The ANCOVA

Models
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

1. The following regression results were obtained for 22 individuals, (standard error in
parenthesis)
̂
Yi = 1506.244 – 228.9868Di + 0.0589Xi
(188.0096) (107.0582) (0.0061)
R = 0.9284
2

Where,
Y = expenditure on food ($)
Di = Gender dummy variable = 1 for female
= 0 for male
Xi = after tax income ($)
(i) Holding after tax income constant, what is the difference between mean food
expenditure of males and females at the 5% level of significance? Is the
difference statistically significant? How can you say so?
(ii) What is the marginal propensity of food consumption holding gender
difference constant?
(iii) Write and draw the regression equation for males and females separately.

2. The following regression was estimated using data from a sample of 15 houses
(standard errors are given in brackets) :

̂
Yi = 200.091 + 16.186 Xi + 3.853Di

se = (4.354) (2.578) (1.241)

Yi = assessed value of a house (in lakhs)

Xi = size of the house (in hundreds of square feet)

Di = 0 for house i, if it does not face a park = 1 for house i, if it faces a park.

i. Interpret the estimated coefficient of Di.

ii. Test whether the presence of a park in front of the house increases the assessed
value of the house, using the p-value approach and a 5% level of significance.

3. A person holding two or more jobs, one primary and one or more secondary, is
known as moonlighter. Based on a sample of 318 moonlighters, the following
regression is obtained, with standard errors in parenthesis:
Ŵ m = 37.07 + 0.403W – 90.06race + 75.51urban + 47.33hisch + 113.6region +
2.26age
se (0.06) (24.47) (21.6) (23.42) (27.62) (0.94)
Where,
Wm = moonlighting wage
W = primary wage
Age = age in years
Race = 0, if white, 1 if non – white,
Urban = 0 if non urban, 1 if urban
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Region = 0 if non west, 1 if west

Hisch = 0 if non graduate, 1 if high school graduate
Derive the wage equations for the following type of moonlighters
(i) White, non urban, western resident and high school graduate.
(ii) Non white, urban, non western resident and non high school graduate.
(iii) White, non urban, non western resident, and high school graduate.

4. You are given the following estimated double log model for cigarette consumption in
Turkey.
The results are based on 29 observations, for the period 1960 – 1988. The variables are
described
as follows:
InQ = Logarithm of cigarette consumption per adult (dependent variable)
InY = Logarithm pf per capita GNP in 1968 prices (in Turkish Liras)
InP = Logarithm of real price of cigarettes (in Turkish Liras per kg)
D82 = 1 for 1982 onward 0 before that
D86 = 1 for 1986 onward 0 before that

̂ = -4.997 + 21.793(D82) – 28.29(D86) + 0.732(inY) + 2.602(D82)(InY)

LogQ
+3.928(D86)(InY) – 0.371(InP) + 0.288(D82)(InP)
R2 = 0.921

(i) What is the numerical value of the elasticity of demand for cigarettes with
respect to income for the period 1969 – 81? For the period 1986 – 88?
(ii) What is the numerical value of the elasticity of demand for cigarettes with
respect to price for the period 1982 – 85?

5. Take the following model

Y = 1000 + 25X1 + 10X2 - 30X3 + 15X4

Where,

Y = annual sales dollars generated by an auto parts counter person,

X4 = years of experience.

X1, X2 and X3 are the dummy variables representing the education level. Base case is
primary school. X1 for high school, X2 for higher secondary and X3 for graduate school.

i. If a salesperson has a graduate degree, how much will sales change according to
this model compared to a person with a primary education?
ii. How much in sales will a counter person with 10 years of experience and a high
school educate generate?
iii. Why do we need three dummy variables to use education level" in this regression
equation?

Interaction Dummies
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

1. (i) You are told that monthly wages. W (in rupees) earned by a person depends on his
age
A (in years). Write an appropriate model to study the effect of age on monthly wages.
(ii) Suppose it has been found that wages also depend on
 Area of residence (Urban/ nonurban)
 Level of education (Post graduate/ graduate)
Modify your model in part (i) above to include these qualitative variables.
(iii) Will your answer change if you are told that a person’s area of residence also
determines his level of education? What will be the regression equation for
urban post graduates?
2. Using data for 526 individuals the following model of wage determination was estimated:
LOG (W)I = B0 + B1D1 +B2EDUi + B3(D*EDU)i + ui
Where,
W = Daily wages in rupees
D = Dummy variable for gender, D = 1 for females and 0 for males
EDU = years of education
D*EDU = Interactive dummy

The table below gives estimated regression coefficients and their standard errors:
Estimates of Coefficients Standard errors

CONSTANT 0.3890 0.1190

D -0.2270 0.1680

EDU 0.0820 0.0080

D*EDU -0.0056 0.0131

(a) Write the regression equations relating LOG (W) to EDU for males and females
separately.
(b) The returns to education are measured by the percentage increase in wages due to
an extra year of education, for males and females.
(c) Is the difference between returns to education for males and females statistically
significant at 5% level of significance?

3. To study the rate of growth of population in an economy over the period 1970 – 1992 the
Following models were estimated:
Model I:
̂ t = 4.73 + 0.024t
Inpop
t = (781.25) (54.71)
Model II:
̂ t = 4.77 + 0.015t
Inpop - 0.075Dt + 0.011(Dtt)
t = (2477.92) (34.01) (-17.03) (25.54)
where,
pop = population in millions
t = trend variable
Dt = 1 for 1970 – 1980, 0 otherwise (for 1980 – 1992)
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(i) In model I, what is the rate of growth of population over the sample period.
Differentiate between instantaneous and compound rate of growth.
(ii) Are the population growth rates statistically different pre and post 1980?
(iii) If they are different, then what are growth rates for 1970 – 79 and 1980 – 92?

4. Consider the following model :

Yi = B0 + B1 Xi + B2 D2i + B3 D3i + ui

Where, Y: annual earnings of MBA graduates

X: Years of service

D2 = 1 if Harvard MBA

0 otherwise

D3 = 1 if Wharton MBA

0 otherwise

i. What are the expected signs of the various coefficients?

ii. How would you interpret B2 and B3?
iii. If B2 > B3, what conclusions would you draw?
Now suppose the following model is used:

Yi = B0 + B1 Xi + B2 D2i + B3 D3i + B4 (D2i Xi) + B4 (D3i Xi) + ui

What is the interpretation of B4 and B5. If both of these are statistically significant then
which model will you use and why?

Chow Test
1. For the data of savings and income for the US economy the following model is being
estimated:
Savings = β1 + β2 (Income)
We have the following regression results:
For the time period 1970 – 85
RSS = 1785.032 df = 10
For the time period 1985 – 95
RSS = 10,005.22 df = 12
For the time period: 1970 – 95
RSS = 23,248.30 df = ? (find out)
Has the saving income relationship changed pre 1985 as compared to post 1985? Use
Chow test to find out (Given critical F value for given dof at 1% level of significance =
7.72)

Dummy Variables as Alternative to Chow Test

1. Consider the regression result
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

UN t = 2.7494 + 1.15 Dt – 1.5294 Vt – 0.8511 (DtVt)

t = (26.896) (3.628) (-12.55) (-1.9819) R2 = 0.9128
where,
UN = unemployment rate (%)
V = job vacancy rate (%)
D = 1 for period beginning in 1966 – IV
= 0 for otherwise
The time period starts 1958 – IV till 1971 – II. Time measured in quarters.
(i) Comment upon the statistical significance of the model.
(ii) Interpret the dummy coefficients interactive dummy term.
(iii) Derive equations for the two periods. (Prior 1966 – IV and Post).

2. Suppose we have the following relationship between savings and income form 1970 –
1995.
̂i
Y = 1.0161 + 152.4786Di + 0.0803Xi +.0655(DiXi)
Se = (20.1648) (33.0824) (0.0144) (0.0159)
R2 = 0.8819
Where, Y = savings; X = Income;
D = 1 for observations in 19825 – 1995
= 0 for otherwise (1970 – 1981)
(i) Interpret the above regression.
(ii) Derive the regression was obtained for the Indian savings – income data for
the period 1970 – 1995:
̂
Yi = 1.0161 + 152.4786Di + 0.0803Xi + 0.0655(DiXi)
Se = (0.0504) (4.6090) (5.5413) (- 4.0963)
R2 = 0.8819
Where, Y = savings; X = Income;
D = 1 for observations in 1982 – 1995
= 0 otherwise (1970 – 1981)
(i) Comment on the statistical significance of the above regression. How would
you interpret the dummy coefficient?
(ii) Derive the regressions for two periods, i.e., 1970 - 1981 and 1982 – 1995.
(iii) What are the advantages of the dummy variable technique over the Chow
Test.

EXAM STYLE QUESTION

1. A researcher wants to find out what are the factors which determine the number of
installs (I) of an application (app) from a famous app store. Size in Mbs (S), Reviews
in 000s (Re), Ratings (0 to 5) (Re), Price in 'Rs (P). She ran the following regressions:

log I = 1.329 + 0.2356S + 0.4320 log(Ra) - 0.2678P + 1.928 log(Re)

Se = (0.63) (0.242) (1.29) (0.001) (0.156)

R2= 0.734
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

df = 156

i. Interpret the regression above.

ii. Test for statistical significance of Price in the model. Depending on the result do
you suggest that price is a significant factor affecting app installation?
iii. Suppose the regression is re-estimated where number of installs (I) varies only
with respect to price (P). Average I in sample is 5 and average P is Rs 8.9. Following
regression was estimated:
1
𝐼̂ = 52.351 + 3.139 𝑃

se = (37.39) (0.0187)

df = 156. R2 = 0.806

How would you interpret this model? Explain the shape of the curve.

iv. What would be the slope and elasticity of number of installs with reference to the
equation given in above?
v. How would the equation in (iii) change if we suggest that number of app
installations varies with respect to the kind of cellular phone used by the
customer, that is android or ios phones? [Eco(h) 2021]

2. A regression equation includes a quantitative dependent variable (Y = wages), a

quantitative independent variable (X = years of experience) and two qualitative
variables; Gender and Education Level with two categories each; Male & Female: and
Graduate & Not a Graduate. Assume that the qualitative variables do not interact with
each other.
i) Using intercept dummy variables, write the wage regression model if the impact
of years of experience, gender and education level is to be analyzed on wages (use
Female Graduate as the reference category). Write the estimated equation for
Male Graduate Category.
ii) How could answer in Part (i) be changed if the Education level has three categories
instead namely; Graduate, Post Graduate and Ph.D.
iii) Base on part (ii), write the wage equation for the two specific categories.
(a)Female with Ph. D. (b) Male Post Graduates.

iv) How would the model in part (ii) be modified if the objective is to examine
whether the marginal effect of experience is gender specific?
v) How would be the regression in part (i) be modified if qualitative variable interact
with each other ? [Eco(h) 2022]

3. The purpose of this empirical exercise was to analyze the impact of takeovers on CEO
compensation. The model of interest was:
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

𝐶𝑜𝑚𝑝𝑖 = 𝐵1 + 𝐵2 𝑆𝑀𝑃𝑖 + 𝐵3 𝐷1

Where:

Comp = CEO's compensation in hundreds of rupees

SMP = index of firm's stock market performance

D = Dummy variable defined as 1 if the firm acquires another firm, O otherwise .

The model was estimated from data on 34 firms. The results are summarized in the
following table:

Coefficient Standard Error

Intercept 964.5202 69.1662

SMP 1868.567 288.0425

D 996.8745 111.9876

D*SMP 5157.474 545.9090

i. Using the regression results, interpret the coefficients of Di and Di*SMPi.

ii. Test the hypothesis that compensation's relation with stock market performance
remains the same irrespective of take-overs made by the firm. [Eco(h) 2017 ]

4. The following model was estimated for United States from 1958 to 1977 :

1 1
𝑌̂𝑡 = 10.078 − 10.337𝐷𝑡 − 17.549 ( ) + 38.173𝐷𝑡 ( )
𝑋𝑡 𝑋𝑡

se = (1.4204) (1.6859) (8.3373) (9.399)

R2= 0.8787

Where, Y = year-to-year percentage change in the index of hourly earnings

X = percent unemployment rate

D = 1 for 1958-1969

= 0 if otherwise

i. Show the Phillips curve for two periods separately.

ii. Are differential intercept and slope coefficients statistically significant? What does
this suggest?
iii. Interpret the regression. [Eco(h) 2019]
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

5. Consider the following regression results:

̂ = 3840.83 - 0.163totwork - 11.71educ - 8.70age + 0.128age2 + 87.75D

𝑆𝑙𝑒𝑒𝑝

Se = (235.11) (0.018) (5.86) (11.21) (0.134) (34.33)

N = 706. R2 = 0.123,

̅𝑅̅̅2̅ = 0.117

where sleep is total minutes per week spent sleeping.

tot work = total weekly minutes spent working.

educ is education measured in years and age is age of the individual in years.

D is gender dummy and D = 1 if male, 0 otherwise.

i. Is there any evidence that men sleep more than women? How strong is the
evidence?
ii. Interpreting the coefficients of the age and age squared variables explain what
does the researcher have in mind about the relation between sleep and age.
iii. Is there a statistically significant trade-off between working and sleeping? How
would the regression model have to be modified if there is reason to believe that
this trade off might be gender specific? [Eco(h) 2020 ]

6. Data was collected on 344 corporate executives to find out the effect of MBA degree
and work experience on their salary. The following model was estimated :

Yi = 2.3501 + 3.6306D1i - 26354D2i + 0.8527Xi + 1.634(D1 *X)i

t = (1.263) (2.1805) (- 3.457) (7.605) (2.98)

R2 = 0.8968

Y: Annual Income in Lakhs of Rupees

D1 and D2 are MBA and gender dummies respectively

X: Work experience in years

D1 = 1 if one has MBA degree

= 0 otherwise

D2 = 1 for a female executive

= 0 for a male executive

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

i. Write the regression equations for female MBA executives and male MBA
executives separately.
ii. Find the mean income level for the reference category and interpret it.
iii. Test the statistical significance of differential intercept coefficient between female
MBA executives and Male MBA executives at 5% level of significance.
iv. Interpret the coefficient of D1 * X1.
v. Now suppose out of this sample of 344 executives, 48 are female MBA executives
and 156 are male MBA executives. To find out the relation between income earned
and work experience, we run three regressions and the results obtained are as
follows:
Regression A: 156 male MBA executives, RSSA = 3.701
Regression B: for 48 female MBA executives, RSSB = 4.803
Pooled Regression: with 204 (156male + 48female) executives, RSS = 9.7602

Using the above data. do the Chow test at 10% level of significance to check whether there
is significant improvement in doing a pooled regression as compared to other two
subsample regressions. [Eco(h) 2021]

7. A real estate Company used housing sales data to estimate the effect that the
pandemic lockdown had on demand for sub-urban real estate

̂ 𝑡 = 1.83 + 0.08𝐷𝑡 + 0.91𝐼𝑛𝑋𝑡 + 0.55(𝐷𝑡 𝐼𝑛𝑋𝑡 )

𝐼𝑛𝑌

Where Y= Share of sub-urban housing deals during a month, X= price per square metre
of sub-urban real estate, t = time,

Dt = 1, if it is a lockdown month = 0, if t is not a lockdown month

All estimates are statistically significant at 5% level of significance.

i. Write the regression functions for lockdown months and non- lockdown months.
ii. How would you test the hypothesis that lockdown had no impact on price-
elasticity for sub-urban housing?
iii. Rewrite the regression result if Dummy assignment is switched as below:
Dt=0, if t is a lockdown month

= 1 , if t is not a lockdown month

iv. Another investigator believes that the relationship between the two variables X
and Y is given by Yt = 𝛽1 + 𝛽2 𝑋𝑡 + 𝜀𝑡 . Given a sample of n observations, the
investigator estimates 𝛽2 by calculating it as the average value of Y divided by the
average value of X. Discuss the properties of this estimator. What difference would
it make if it could be assumed that 𝛽1 = 0?
v. What will be the consequence for the Gauss Markov theorem if there are errors in
measuring Y? [Eco(h) 2023]
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

8. Regression results for Morena savings-income data are presented for the period
1920-1975,

𝑌̂𝑡 = 1.0161 + 152.4786𝐷𝑡 + 0.0803𝑋𝑡 − 0.0655(𝐷𝑡 𝑋𝑡 )

𝑡 = (0.0504) (4.6090) (5.5413) (−4.0963)

𝑅2 = 0.8819

Where

𝑌𝑡 = savings

𝑋𝑡 =income

𝐷𝑡 = 1 for observations in 1982-1995

= 0 otherwise

i. Interpret the regression results and obtain the regressions or the two time
periods, that is, 1970-1981 and 1982-1995
ii. What do you infer by the statistical significance of the differential intercept and
the differential slope coefficients? [Eco(h) 2014]

9. i)In the regression model, in Yi = B1 + B2Di + ui where D is a dummy regressor, prove

that the relative change in Y when the dummy changes from 0 to 1 can be obtained as:

(𝑒 𝑏2 − 1 )

where e is the base of natural logarithm and b, is the ordinary least squares estimator of
the slope coefficient.

(ii) Suppose you have quarterly data on air-conditioner sales. Explain how you can obtain
average sales of air-conditioners for the our quarters separately using the method of
dummy variables. [Eco(h) 2013]

10. Using data for 120 individuals, the following model of wage determination was
estimated:

WAGEi = 𝛽1 + 𝛽2 𝐼𝑄2𝑖 + 𝛽3 + 𝛽3 PGRADI3i + ui :

where

WAGE: Hourly wages, in Rupees

IQ: Intelligent Quotient, measured on a scale of 70-130

PGRAD: Dummy variable = 1, if the individual is a postgraduate

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

= 0, if the individual is a undergraduate

The regression results were reported as follows, (standard errors in parentheses):

WAGEi = 224.8488 + 5.07661Q2i + 498.0493 PGRAD3i

(se) = (66.6424) (0.6624) (20.0768)

R2 = 0.4540

(a) Write the estimated regression equation for postgraduates and undergraduates
separately.

(b) Test the statistical significance of dummy variable at 5% level of significance. What
conclusion can you draw from this test?

(c) It PGRAD was defined to take values (0, 2) instead of (0, 1) will the estimated value of
B3 and its standard error change? What about its statistical significance?[Eco(h) 2016]

11. Suppose that earnings of individuals are dependent on whether they are skilled
workers and their work experience over the years. 6

(i) Define dummy variables to capture whether workers are skilled or not. Take workers
being unskilled as the reference category.

(ii) Develop a model which is linear in parameters that shows earnings of an individual
as a function of work experience and whether they are skilled. Interpret your model.

(iii) Now assume that there is an interaction between skill of the workers and their work
experience. How would the model in (ii) change. Interpret the new model. [Eco(h) 2019]
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

CHAPTER -5

MULTICOLLINEARITY
Objective Based questions

1. One of the assumptions of CLRM is that the number of observations in the sample
must be greater than the number of
a) Regressors
b) Regressands
c) Dependent variable
d) Dependent and independent variables
2. Perfect multicollinearity between variables X 1 , X2 and X3 can be expressed using
constants 𝜆1 , 𝜆2 and 𝜆3 such that
a) 𝜆1 𝑋1 + 𝜆2 𝑋2 + 𝜆3 𝑋3 = 0, where 𝜆1 , 𝜆2 and 𝜆3 are all equal to zero
simultaneously
b) 𝜆1 𝑋1 + 𝜆2 𝑋2 + 𝜆3 𝑋3 + 𝑣 = 0 where 𝑣 is the stochastic term and 𝜆1 , 𝜆2 and
𝜆3 are not all equal to zero simultaneously.
c) 𝜆1 𝑋1 + 𝜆2 𝑋2 + 𝜆3 𝑋3 = 0; where 𝜆1 , 𝜆2 and 𝜆3 are not equal to zero
simultaneously.
d) 𝜆1 𝑋1 + 𝜆2 𝑋2 + 𝜆3 𝑋3 + 𝑣 = 0 where 𝑣 is the stochastic term and 𝜆1 , 𝜆2 and
𝜆3 are all equal to zero simultaneously.
3. In a regression model 𝑌𝑖 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋3𝑖 + 𝑢𝑖 , F-test is seen to statistical
significant at less than 5 percent level of significance but the coefficients 𝛽1 and 𝛽2 ,
are seen to be statistically insignificant. This means that the
a) Two coefficients are highly correlated
b) Two variables are highly correlated
c) Two variables are perfectly correlated
d) Two variables are not correlated
4. If for a set of explanatory variables 𝑋2 , and 𝑋3 , the coefficients of correlation is
equal to 1, this means that between 𝑋2 and 𝑋3 there exists
a) No collinearity
b) Low level of collinearity
c) Perfect collinearity
d) Very high collinearity
5. If there exists high multicollinearity, then the regression coefficients are
a) Determinate
b) Indeterminate
c) Infinite values
d) Small negative value
6. If multicollinearity is perfect in a regression model then the regression coefficients
of the explanatory variables are
a) Determinate
b) Indeterminate
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

c) Infinite values
d) Small negative value
7. If multicollinearity is perfect in a regression model the standard errors of the
regression coefficients are
a) Determinate
b) Indeterminate
c) Infinite values
d) Small negative value
8. The coefficients of explanatory variables in a regression model with less than
perfect multicollinearly cannot be estimated with great precision and accuracy.
This statement is
a) Always true
b) Always false
c) Sometimes true
d) Nonsense statement
9. In a regression model with multicollinarity being very high, the estimators
a) Are unbiased
b) Are consistent
c) Standard errors are correctly estimated
d) All of the above
10. Multicollinearity is essentially a
a) Sample phenomenon
b) Population phenomenon
c) Both a and b
d) Either a or b
11. Which of the following statements is NOT TRUE about a regression model in the
presence of multicollinearity
a) t ratio of coefficients tends to be statistically insignificant
b) R2 is high
c) OLS estimators are not BLUE
d) OLS estimators are sensitive to small changes in the data
12. Which of these is NOT a symptom of multicollinearity in a regression model
a) High R2 with few significant t ratios for coefficients
b) High pair-wise correlations among regressors
c) High R2 and all partial correlation among regressors
d) VIF of a variable is below 10
13. A sure way of removing multicollinearity from the model is to
a) Work with panel data
b) Drop variables that cause multicollinearity in the first place
c) Transform the variables by first differencing them
d) Obtaining additional sample data
14. Assumption of 'No multicollinearity' means the correlation between the regresand
and regressor is
a) High
b) Low
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

c) Zero
d) Any of the above
15. An example of a perfect collinear relationship is a quadratic or cubic function. This
statement is
a) True
b) False
c) Depends on the functional form
d) Depends on economic theory
16. Multicollinearity is limited to
a) Cross-section data
b) Time series data
c) Pooled data
d) All of the above
17. Multicollinearity does not hurt is the objective of the estimation is
a) Forecasting only
b) Prediction only
c) Getting reliable estimation of parameters
d) Prediction or forecasting
18. As a remedy to multicollinearity, doing this may lead to specification bias
a) Transforming the variables
b) Adding new data
c) Dropping one of the collinear variables
d) First differencing the successive values of the variable
19. F test in most cases will reject the hypothesis that the partial slope coefficients are
simultaneously equal to zero. This happens when
a) Multicollinearity is present
b) Multicollinearity is absent
c) Multicollinearity may be present OR may not be present
d) Depends on the F-value

TRUE/FALSE
1. Despite perfect multicollinearity , OLS estimators are BLUE .

2. In case of high multicollinearity it is not possible to assess the individual significance

of one or more partial regression coefficients ?
3. If an auxiliary regression shows that a particular. Ri2 is high , there is a definite
evidence of high collinearity .
4. High pairwise correlation does not imply high multicollinearity .
5. Multicollinearity is harmless if objective of analysis is prediction only .
6. Ceteris paribus , the higher the VIF is , the larger the variance of OLS estimators .
7. In the regression of Y on X2 & X3 , Suppose there is a little variability in the values of
X3. This would increase V(β̂3) and if all X3 are same then V(β̂3) is infinite .
8. A regression model with a high R2 may not be judged to be good if one or more
coefficients have the wrong sign.
9. Consider the model: Yi = B1 + B2 X2i + B3 X2i + ui. Given that X2i =10+5X3i, we can
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

uniquely estimate all the parameters of the model.

10. Consider the model: Yi = B1 + B2 Xi+B3Xi2 + B4Xi3 + ui. Since 𝑋 2 and 𝑋 3 are the
function of X , there is a perfect multicollinearity ?

11. Do you think that Model suffer from multicollinearity

InYi =  1+  2InXi + B3InXi2 + ui. ?

12. Multicollinearity always implies that correlation between explanatory variables is
lying between -1 and +1 .
13. Very high multicollinearity always implies that estimators are not BLUE ?

Practical Questions
1. In the regression model

Yi = A1 + A2 X2i + A3 X3i + Ui

Suppose X3i = 10 + 3X2i

Show that we cannot uniquely estimate the original parameters A1, A2 and A3.

2. Let Y be the output. X2 be unskilled labour and X3 be skilled labour in the following
relationship:

Yi = B1 +B2 X2i +B 3 X3i + B4 X4i +ui

Where X4i = X2i + X3i

Can the parameters of the model be uniquely estimated by ordinary least squares?
Explain.

3. Consider the set of hypothetical data given in the following table:

Y -10 -8 -6 0 2 4

X2 1 2 3 4 5 6

X3 1 3 5 7 9 11

Suppose you want to fit the following model:

Yi = B1 + B2 X2i + B3 X3i + ui

(i) Explain, without solving, why you cannot estimate the three unknown
parameters of the model.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(ii) Are any linear functions of these parameters estimable? Show the necessary
derivations.

4. Let Y be output. X2 be unskilled labour and X3 be skilled labour in the following

relationship :
2 2
Yi = B1 + B2 X2i + B3 X3i + B4 (X2i + X3i) + B5 X2i + B6 X 3i + ui

What parameters are estimated by ordinary least squares? Explain

5. In a regression of consumption expenditure Ci on dispoasable incomeYi and wealth

Wi the following results were obtained :
Ci = 24.7747 + 9415Yi – 0.0424Wi
t = (3.6690) (1.0442) (-0.5261)
R2 = 0.9635 degrees of freedom = 17
How can you use these results to detect the presence of multicollinearity ? Suggests
any two methods to reduce the severity of the problem?

6. Consider the regression model :

Yi = Bi + B2 X2i + B3 X3i + Ui

In order to check for presence of multicollinearity. the auxiliary regression is run and the
results are as follows :

𝑋̂2i = 2.456 + 0.7952X3i

（se) = （0.56）（1598） R2 = 0.9

i. Compute variance inflation factor (VIF). Do you find evidence of multicollinearity?

ii. Would multicollinearity necessarily result in high standard errors of the OLS
estimators in the above model?

7. The consumption expenditure of families (c) is regressed upon the income of families
(1) and the wealth of families (W). All variables are measured in Rupees. The following
regression results were obtained for a sample of 10 families.
Variable Coefficient t Statistics
Income 0.94 1.14
Wealth - 0.04 -0.52
Constant 24.77 6.75
df = 7, R = 0.96
2

(i) Based on institution, what signs would you expect for the partial slope
coefficients? Do the observed signs agree with your intuition?
(ii) Every t statistic is insignificant but F statistic is significant. Verify this
statement at 10% level of significance. What are the reasons for this
paradoxical statement?
(iii) Do you expect the estimated coefficients to be unbiased and efficient?
8. Consider the following model relating the gain in salary due to an MBA degree to a
number of its determinants.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

SLRYGAIN = Bt + B2 TUITIONt + B3Z1t + B4Z2t + B5Z3t + ut

where,
SLRYGAIN = Post salary MBA minus pre MBA salary, in thousands of dollars.
TUITION = annual tuition costs, in thousands of dollars
Z1 = MBA skills in being analysts, graded by recruiters.
Z2 = MBA skills in being team p[layers, graded by recruiters.
Z3 = Curriculum evaluation by MBA’s.
Using data for top 25 business schools, the coefficients were estimated as follows,
standard errors in parenthesis.
B1 60.899 (2.513)
B2 0.314 (0.750)
B3 -0.3948 (2.756)
B4 - 2.016 (3.773)
B5 -5.325 (3.773)
(i) Carry out individual two tail tests at 10% level of significance for the slope
coefficients.
(ii) Test the model for overall significance at the 10%level if R2 = 0.461 was
obtained for the model.
(iii) Is there a conflict between your conclusions in (i) and (ii)? If yes can you
suggest a possible explanation?
9. Consider the following regression for an imaginary country, say Utopia, for a period
of 15 years. Variables are: IMP = imports, GNP = Gross National Product and CPI =
Consumer Price Index
Regression 1 :
̂ t= -108.20 + 0.045GNP2t + 0.931CPI3t
IMP
t = (3.45) (1.232) (1.844)
R 2 = 0.9894
(i) Test whether, individually, the partial slope coefficients for GNP and CPI are
statistically significant at the 5% level of significance.
(ii) Test whether GNP and CPI jointly have any statistical significance in
explaining variations in exports. Carry out this test at 5% level of significance.
(iii) Comment on the results obtained in (i) and (ii) above. Do you suspect any
problem?
(iv) Do you expect that OLS coefficients have retained their BLUE property? If no,
explain why. If yes explain why you would still be worried about their quality.
(v) Using a transformation of variables, real imports are regressed on real consequences
income.
Regression 2:
̂t
IMP 𝐺𝑁𝑃𝑡 yes
= -1.39 + 0.202 transformation
𝐶𝑃𝐼 𝐶𝑃𝐼
only one explanatory var.
t = (-5.46) (12.22) R2 = 0.9142 therefore no multicoll.
Would you say that the problem identified above has now been somewhat
solved?

10. In a study of the production function of a firm for the period 1991 to 2011, the
following two regression models were obtained :

Model I
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

̂ = -5.04 + 0.887logK + 0.893logH

logQ

se =(1.40) (0.087) (0.137) R2= 0.878

Model II

̂ = -8.57 + 0.0272t + 0.460logK + 1.28510H

logQ

se = (2.99) (0.0204) (0.333) (0.324) R2 = 0.889

where,

Q is the index of production at constant factor cost,

K is the gross capital stock H is the hours worked, and

t is time trend

i) Interpret both the regressions.

ii) In model I, verify that each partial slope coefficient is statistically significant at the
5% level.
iii) In model Il, verify that the coefficients of t and logK are individually insignificant
at the 5% level. high r2 and insig. expal.
iv) What is the probable reason of insignificance of logK in model II? var.due to high s.e.
v) If you were told that the correlation coefficient between t and logK is 0.98, what
conclusion would you draw?
vi) Even if t and logK are individually insignificant in Model II, would you accept or
reject the hypothesis that in Model Il all partial slopes are simultaneously zero?

11.From the annual data for the US about the manufacturing sector, the results would
be following:

̂ 𝑌 = 2.81 − 0.53𝑙𝑜𝑔𝐾 + 0.91𝑙𝑜𝑔𝐿 + 0.047𝑡

𝑙𝑜𝑔
se = (1.38) (0.34) (0.14) (0.021)

𝑅2 = 0.97, 𝐹 = 189.8

Where Y= index of real output, K= index of real capital input, L= index of

real labour input, t= time or trend.

Using the same data, he also obtained the following regression:

̂ (𝑌) = −0.11 + 0.11𝑙𝑜𝑔(𝐾/𝐿) + 0.006𝑡

𝑙𝑜𝑔 𝐿
se = (0.03) (0.15) (0.006)

𝑅2 = 0.65, 𝐹 = 19.5
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

a) Is there multicollinearity in the regression (1)? How do you know?

b) In regression (1), what is the priori sign of log (k)? Do the results conform to
this expectation? Why or why not?

c) How would you justify the functional form of regression (1)?

d) Interpret regression (1)? What is the role of trend variable in this regression?

e) What is the logic behind estimating the regression (2)?

f) If there was multicollinearity in regression (1), has that been reduced by

regression (2)? How do you know?

g) What restriction has been imposed in regression (2)?

h) Are the 𝑅2 values of the two regressions comparable? Why or why not? How
would you make them comparable, if they are not comparable in the existing
form?

EXAM STYLE QUESTION

1. Consider the three-variable model, Yi = B1 +B2 X2i + B3 X3i + B4 X4i + ui. Let b2 the а
OLS estimator of the slope coefficient B 2.
i. Derive variance of b2, i.e., var(b2), terms of Variance Inflation Factor (VIF).
ii. When X2 is regressed on X3 and X4, 𝑅22 . obtained from this auxillary regression is
0.9217. Does it necessarily imply high variance of b2? Explain. [Eco(h) 2018]

2. Consider the following regression results :

̂ = 9840.83 − 0.163𝑡𝑜𝑤𝑜𝑟𝑘 − 11.71𝑒𝑑𝑢𝑐 − 8.70𝑎𝑔𝑒 + 0.128𝑎𝑔𝑒 2 + 87.75𝐷

𝑆𝑙𝑒𝑒𝑝

𝑆𝑒 = (235.11)(0.018)(5.86) (11.21) (0.134) (34.33)

𝑁 = 706, 𝑅2 = 0.123 ̅𝑅̅̅̅2 = 0.117

where sleep is total minutes per week spent sleeping,

totwork = total weekly minutes spent working,

educ is education measured in years and age is age of the individual in years.

D is gender dummy and D = 1 if male. 0 otherwise.

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

i. Is there any evidence that men sleep more than women? How strong is the
evidence?
ii. Interpreting the coefficients of the age and age squared variables explain what
does the researcher have in mind about the relation between sleep and age
iii. Is there a statistically significant trade-off between working and sleeping? How
would the regression model have to be modified if there is reason to believe that
this trade off might be gender specific?
iv. Do you suspect multicollinearity in the model? Explain your answer.[Eco(h) 2020]

3. In the regression of consumption expenditure (Ci) on disposable income (Yi) and

wealth (Wi). The following results were obtained (the paranthesis contain the t-
ratios)

𝐶̂𝑖 = 24.7747 + 0.9415Yi - 0.0424Wi

(t) (3.669) (1.0442) (-0.5261) R2 =0.9635, df=17

a) Is there any evidence of presence of Multicollinearity? Why or Why not?

b) Give any three ways in which the problem of Multicollinearity can be
remedied.[Eco(h) 2017]

4. Consider the following regression results for 45 countries for the year 2011-2012.
(the /-ratios are given in brackets):

̂𝑡 = 21.045 +0.0545 GDP + 1.864 GOV_INDEX,

𝐹𝐷𝐼

t = (1.232) (0.744) (1.005) R2 = 0.9667

where. FDI = Foreign Direct Investment (billion dollars)

GDP = Gross Domestic Product (billion dollars)

GOV_INDEX = Governance Index (a higher value indicates better on governance)

i. Is there evidence of multicollinearity ? Explain your answer.

ii. Discuss any two methods that can be used to deal with the issue of
multicollinearity ? [Eco(h) 2018]

5. Quarterly data on country XYZ was collected for the period 2005-2019 to estimate the
relation between Foreign Direct Investment (FDI), Trade Openness (TO). Gross
Domestic Product (GDP) and Exchange Rate (E). TO is defined as the ratio of export
plus imports to GDP and t = trend. Following regression was estimated:

̂𝑡 = -0.58 + 0.012E, - 0.025TOt + 0.006GDPt + 0.34t

𝐹𝐷𝐼

Se = (0.097) (0.013) (0.004) (0.015) (0.09)

R2 = 0.904, d = 1.45
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

6. Suppose demand for Brazilian coffee in country Rico is a function of the real price of
Brazilian coffee (Pbc), real price of tea (Pt) and real disposable income (Y d) in Rico.
Suppose following results were obtained by running the implied regression:

̂ = 9.1 + 7.8 𝑃𝑏𝑐 + 2.4𝑃𝑡 + 0.0035𝑌𝑑

𝐶𝑜𝑓𝑓𝑒𝑒

𝑡 = (0.5) (2.0) (3.5)

𝑅̅2 = 0.60 𝑁 = 25

i. Interpret the slope coefficients. Are the signs in accordance with economic theory?

ii. Do you think that the equation suffers from some problem? What could be the
nature of the problem?
iii. What are in general the consequences of problem if any detected in part (ii)? (iv)
Suppose the researcher drops Pbc and run the following regression
̂ = 9.3 + 2.6 𝑃𝑡 + 0.0036𝑌𝑑
𝐶𝑜𝑓𝑓𝑒𝑒

𝑡 = (2.6) (4.0)

𝑅̅2 = 0.61 𝑁 = 25

Has the researcher made the correct decision in dropping 𝑃𝑏𝑐 from the equation? Explain.

iv. Do you think that Brazilian coffee in Rico is price inelastic? Why/Why not?
[Eco(h) 2023]

7. The following function.is known as the transcendental production function (TPF), a

generalization of the well-known Cobb-Douglas production function:

𝑌 = 𝐵1 𝐿𝐵2 𝐾𝐵3 𝑒 (𝐵4𝐿+𝐵5 𝐾)

(a) Perform a suitable logarithmic transformation so that the function is estimable

using ordinary least squares.
(b) For the logarithm TPF to reduce the cob-Douglas production function expressed
in logarithm form. What must be restriction on the value of 𝐵2 and 𝐵3 .what must
be the restriction on the values of 𝐵4 and 𝐵5 ? outline the steps for testing the
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

validity of restriction on 𝐵4 and 𝐵5 in choosing between the TPF function and

cob-Douglas models in logarithm form ? [Eco(h) 2013]

8. In order to test whether the developing economies are catching up with the advanced
economies or not. a researcher regressed the growth rate of GDP of a country on its
relative per capita GDP for 119 developing countries. The relative per capita GDP of a
country is measured as a ratio of the country's per capita GDP to the GDP per capita
of USA. The regression results were obtained as under (standard errors are reported
in parentheses) :

𝐺̂ = 0.013 + 0.062𝑃𝑖 − 0.061𝑃𝑖2

𝑠𝑒 = (0.013) (0.02) (0.033)

𝑅2 = 0.053, adjusted 𝑅2 = 0.036

Where, G is the growth rate of GDP (in %)

And, P is the relative per capita GDP (in %)

i. Interpret the above regression results. (ii) Find the marginal effect of P on G.
ii. If a researcher wishes to estimate the above relationship in logarithmic form and
estimates the following relationship :
InGi = B1 + B2 In Pi + B3 In 𝑃𝑖2 + ui
Do you think he will be able to estimate the model? Give reasons for your answer

[Eco(h) 2013]

CHAPTER-6
Heteroscedasticity

Multiple Choice Questions

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Choose the Best alternative for each question

1. Heteroscedasticity means that
a. All X variables cannot be assumed to be homogeneous
b. The Variance of the error term is not constant
c. The observed units have no relation
d. The X and Y are not correlated

2. Heteroscedasticity is more likely a problem of

a. Cross-section data
b. Time series data
c. Pooled data
d. All of the above

3. The coefficient estimated in the presence of heteroscedasticity are NOT

a. Unbiased estimators
b. Consistent estimators
c. Efficient estimators
d. Linear estimators

4. Estimating the regression model in the presence of heteroscedasticity using this

method leads to BLUE estimators
a. OLS
b. GLS
c. MLE
d. Two-stage regression estimation

5. In the regression model Yi = 𝛽 1 X0i + 𝛽 2X1i +ui, if 𝛽 1 is the intercept coefficient

then the values that X0i can take are
a. All ones
b. All zeros
c. Any value
d. Any positive value

2
6. Under park test in 𝑢̂ = In σ2 + 𝛽 In Xi + vi, is the suggested regression model.
𝑖
Here if we find 𝛽 to be statistically significantly different from zero, this means
that
a. Homoscedasticity assumption is satisfied
b. Homoscedasticity assumption is not satisfied
c. We need further testing
d. Xi has impact on Yi

7. According to Goldfeld and Quandt the problem with Park test is that the
a. Error term is hetroscedastic
b. Expected value of vi is nonzero
c. vi is serially correlated
d. Model is nonlinear in parameter
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

8. The heteroscedasticity test that is sensitive to the normality assumption or error

term is
a. Goldfield-Quandt test
b. Breuseh-Pagan-Godfrey test
c. Whites general heteroscedasticity test
d. Spearman’s rank correlation test

2
9. The following remedial measure for heteroscedasticity is used when σ is known
𝑖
for a regression model
a. Koenker-Bassett method
b. Weighted least square method
c. OLS method
d. White’s procedure

10. Which of the following is NOT considered the assumption about the pattern of
heteroscedasticity

a. The error variance is proportional to Xi

b. The error variance is proportional to Y i
2
c. The error variance is proportional to X
𝑖
d. The error variance is proportional to square of the mean value of Y

11. Even if heteroscedasticity is suspected and detected, it is not easy to correct the
problem. This statement is
a. True
b. False
c. Sometimes true
d. Depends on test statistics used

TRUE AND FALSE

State whether the following statements are true or false. Briefly justify your answer.

a. In the presence of heteroscedasticity OLS estimators are biased as well as

inefficient.
b.If heteroscedasticity is present, the conventional t and F tests are invalid.
c. In the presence of heteroscedasticity the usual OLS method always
overestimates the standard errors of estimators.
d.If residuals estimated from an OLS regression exhibit a systematic pattern, it
means heteroscedasticity is present in the data.
e. There is no general test for heteroscedasticity that is free of any assumption
about which variable the error term is correlated with.
f. If a regression model is mis-specified, the OLS residuals will show a distinct
pattern.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

g. If a regressor that has non constant variance is omitted from a model, the OLS
residuals will be heteroscedastic.
h. If a pattern is observed on plotting residuals against time, it shows presence of
heteroscedasticity.

Theory Questions
1. Suppose Heteroscedasticity is present in a regression model and ordinary least
squares procedure is applied to estimate the parameters of the model? What are
the consequences for the properties of the estimators and the hypothesis testing
procedures?

2. Consider the following model:

yi = β1+ β2X2i + β3X3i + ui
Suppose it is revealed to us that this regression suffers from heteroscedasticity.
How can we transform the model so that there is homoscedasticity if:

(i) Error variances are known,

(ii) Error variances are unknown.

3. For a cross sectional data on 20 countries, consider the following regression

model:
̂ i = b1 + b2GNP2i + b3EDU3i + ui
IMR
Where, IMR = Infant Mortality rate.
GNP = Per Capita Gross National Product
EDU = Enrolment in Primary Education
The data set has 5 lower middle income countries, 5 upper middle, 5 low income
and 5 high income countries (according to the classification used by the world
bank)
(i) For such a diverse sample, do you have any priori reason to suspect that error
variance might violate an important assumption of classical linear regression
model? Explain.
(ii) Suppose you conduct a formal test to verify your conjecture. State the null
hypothesis carefully. You are given the following auxiliary regression result using
the residuals obtained from the above regression:
ûi = -15.76 + 0.3810GNPi – 4.5641EDUi + 0.00000(GNPi)2 + 0.1328(EDUt)2 –
0.005(GNPt) (EDUi) yes it violates homoscedasticity as there is cross sectional data so
R2 = 0.30 ts=chi=nr2 there is heteroscedasticity
What is your conclusion?
ho homo
ha hetero

4. Consider the following model relating profits to sales of a number of firms.

Pt = B1 + B2St= + B3Dt+ ui
Where,
Pt = Annual profits
St = Annual sales
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Dt = 1, if firm is in manufacturing industry

0, otherwise
(i) State the auxiliary equation so that you can carry out White’s test for
heteroscedasticity.
(ii) State the null and alternative hypothesis, the test statistic, its distribution and
degree of freedom, given n observations. Write down the 5% critical value for
the test and describe the decision rule.
(iii) Explain the consequences on interpretation of regression results based on
ordinary least squares.

5. In a two variable population regression function Y i = B1 + B2Xi + ui suppose the

error variance has the following structure: E(u 2i) = σ2Xi4. How will you transform
the model to achieve homoscedastic error variance?

6. Describe the Park’s test for detecting heteroscedasticity.

Practical Questions

1. Based upon the data on research and development (R&D) expenditure. sales. and
profits for 18 industry groupings in the United States. all figures in millions of dollars,
the following model is fitted. Since the cross sectional data presented in used for this
model are quite heterogeneous, in a regression of R&D on sales (or profits).
heteroscedasticity is likely. The regression results were as follows :

̂ 𝐷 = 192.9931 + 0.0319 𝑆𝑎𝑙𝑒𝑠𝑖

𝑅&

𝑠𝑒 = (533.9317)(0.0083) 𝑟 2 = 0.4183

To see if the above model suffers from heteroscedasticity we obtained the residuals 𝑒𝑖 ,
squared them and fitted the following models to conduct formal tests.

i. |𝑒𝑖2 | = −974,469.1 + 86.2321 𝑆𝑎𝑙𝑒𝑠𝑖

𝑠𝑒 = (48,02,343) (40.3625) 𝑟 2 = 0.2219
Use the Park test to determine, if the model suffers from the problem of
heteroscedasticity.
ii. |𝑒𝑖 | = 578.5710 + 0.0119 𝑆𝑎𝑙𝑒𝑠𝑖
𝑠𝑒 = (678.6950)(0.0057) 𝑟 2 = 0.2140
Use the Glejser test to determine, if the model suffers from the problem of
heteroscedasticity.
iii. 𝑒𝑖2 = 62,19,665 + 229.3508𝑆𝑎𝑙𝑒𝑠𝑖 − 0.000537𝑆𝑎𝑙𝑒𝑠𝑖2
𝑠𝑒 = (64,59,809)(126.2197)(0.0004) 𝑟 2 = 0.2895
Use the White's test to determine, if the model suffers from the problem of
heteroscedasticity.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

2. In a regression of housing expenditure in Rupees (Y i) on annual incomes of families

in Rupees (Xi) for a sample of 27 families and the following results were obtained:
Variable Coefficients Standard error
X 0.121 0.009
Constant 3.803 4.570
n = 27 R = 0.776
2

On plotting the residual against Xi, it was found that the variance of the residuals
increased with Xi
(i) What problem does this indicate? Name any one test for its detection.

(ii) What are the consequences of this problem for OLS estimators?

(iii) Which type of dataset is more likely to be characterized by this problem?

(iv) Explain the estimation process of Weighted Least Squares with known error
variances in this context.

3. A regression of salaries of 222 professors from seven universities in the U.S. on their
years of experience since they completed their Ph.D. was performed.

(a) The graph of squared residuals against the fitted values of the dependent
variable, salary is shown – below. What does the graph show? Is there u 2
versus fitted values(with least squares fit)

(b) The test statistic for White’s test for this regression was reported as
19.7.State the null and alternative hypothesis and the test statistic for carrying
out this test. Is the null hypothesis rejected at 5% level of significance?

4. Consider the regression model that postulates relationship between monthly demand
for burgers (Y) and monthly household income (HH_INC, in rupees).

𝑌𝑖 = 𝐴 + 𝐵 𝐻𝐻_𝐼𝑁𝐶𝑖 + 𝑢𝑖 ,

The regression was run for a cross section of 41 observations. Susp heteroscedasticity,
the White's test for heteroscedasticity was chosen following the results were obtained :
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

𝑒𝑖2 = −6219 + 229.35𝐻𝐻_𝐼𝑁𝐶𝑖 + 0.000544𝐻𝐻_𝐼𝑁𝐶𝑖2

𝑅2 = 0.1148

Test for heteroscedasticity at 5% level of significance. State the null and alternative
hypothesis clearly.

5. In a regression of average wages (W,$) on the number of employees (N) for a

random sample of 30 firms, the following regression results were obtained:
𝑊̂ = 7.5 + 0.009N
t = n.a. (16.10) r2 = 0.90

̂ = (0.008) + 7.8(1/N)
W/N
t = (14.43) (76.58) r2 = 0.99

(a) How do you interpret the two regressions?

(b) What is the reason for transforming Eq (1) into Eq. (2).

(d) Has the author successfully removed the problem which Eq. (1) is suffering
from.

(e) Can you relate the slopes and intercepts of the two models?

(f) Can you compare the R2 values of the two models? Why or Why not?

6. For pedagogic purposes Hanushek and Jackson estimate the following model:

𝐶̂𝑡 = 26.19 + 0.6248 𝐺𝑁𝑃𝑡 − 0.4398𝐷𝑡 𝑅2 = 0.999

(2.73) (0.0060) (0.0736)

̂ 𝑡 = 25.92(1/𝐺𝑁𝑃𝑡 ) + 0.6246 − 0.4315(𝐷𝑡 /𝐺𝑁𝑃𝑡 )

𝐶𝑡 /𝐺𝑁𝑃

(2.22) (0.0068) (0.0597) 𝑅2 = 0.875

a) What assumption is made by the authors about the nature of the

heteroscedasticity? Can you justify it?

b) Compare the results of the two regressions. Has the transformation of the
original model improved the results, that is, reduced the estimated standard
errors? Why or why not?

c) Can you compare the two 𝑅2 values? Why or why not?

7. You are given the following data:

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

𝑅𝑆𝑆1 𝑏𝑎𝑠𝑒𝑑 𝑜𝑛 𝑡ℎ𝑒 1𝑠𝑡 30 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 55, 𝑑𝑓 = 25

𝑅𝑆𝑆2 𝑏𝑎𝑠𝑒𝑑 𝑜𝑛 𝑡ℎ𝑒 𝑙𝑎𝑠𝑡 30 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 140, 𝑑𝑓 = 25

Carry out the Goldfled- Quandt test of heteroscedasticity at the 5% level of

significance.

8. The regression results from the model,

Yi = B1 + B2 Xt + ut

are obtained for a cross-section of 30 households, where Y is consumption expenditures

(in Rs. thousands) and X is income (in Rs. thousands). In order to check for the presence
of heteroscedasticity. The observations are arranged in the increasing order of the
magnitude of X. The regression is run separate!) for first 11 (Group I) and last 11
observations (Group 2). The regression results for these two subgroups are reported as
follows (standard errors are reported in the parentheses) :

Group 1: 𝑌̂𝑖 = 1.0533 + 0.876𝑋𝑖

(se) = (0.616) (0.038)

R2 = 0.9851. RSS1 =0.475 x 105

Group 2: 𝑌̂𝑖 = 3.279 + 0.835𝑋𝑖

(se) = (3.443) (0.096)

R2 = 0.9585 RSS2 = 3.154 x 105

i.Perform Goldfeld-Quandt test at 5% level of significance. state the null and

alternate hypotheses clearly. Do you find evidence of heteroscedasticity.
ii. List the underlying assumptions related to the disturbance term made in the
above test.

9. A researcher estimated the following regression :

𝐼𝑛 (𝑆𝑎𝑙𝑎𝑟𝑦)𝑖 = 53.809 + 0.0438 𝑌𝑒𝑎𝑟𝑠𝑖 − 6.237𝑌𝑒𝑎𝑟𝑠𝑖2 + 𝑒𝑖

𝑡 (92.1) (9.088) (-5.19) 𝑅2 = 0.789, 𝑛 = 219

Where

In : natural logarithm
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Salary : Salary of an individual (Rs. '000)

Years : Years of experience

When the White's general test for heteroscedasticity at 5% level of significance was
conducted, the R2 obtained from the regression of 𝑒𝑖2 on a constant, Years, Years2 and
Years3 was 0.45. Is the researcher correct in concluding that Years and Years2 are
individually significant variables in the salary regression? Why? Why not?

EXAM STYLE QUESTION

1. Using data on compensation per employee in thousands of dollars (COMP) and

average productivity in thousands of dollars (PROD) for a cross section of 50 firms for
the year 1958, the following regression results were obtained (t ratios in
parentheses) :

̂ = 1992.35 + 0.233PRODi
COMP

t= (2.1275) (2.333)

R2 = 0.5891

Since the cross sectional data included heterogeneous units, heteroscedasticity was likely
to be present. The Park test was performed and the following results of auxiliary
regression were obtained :

̂2 = 35.817 − 2.8099𝑃𝑅𝑂𝐷
𝑙𝑛𝑒1 𝑖

𝑡 = (0.934) (-0.667) 𝑅2 = 0.0595

(i) Use the result of auxiliary regression to check if the model indeed suffers from
heteroscedasticity, perform the test at 5% level of significance.
(ii) What could be the possible remedies of heteroscedasticity?[Eco(H) 2019]

2. The Home ministry of a country wants 10 lest if petty crimes (minor theis) are higher
in states where poverty rates are high. They obtain data on several variables and ran
the following cross section regression for 35 states in the country.

𝐶̂𝑖 = 6.275 + 0.1147𝑃𝑅𝑖 − 0.0712𝐿𝑅𝑖 + 0.0862𝑆𝐷𝑃𝑖

𝑆𝑒 = (3.125)(0.02713)(0.0361)(0.03834)
𝑛 = 35 𝑅2 = 0.6876

where C = Crimes per lakh of population

PR = Poverty Rates

LR = Literacy Rates
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

SDP = State Domestic Product

The regression equation given in part (a) is modified as follows :

𝐶̂𝑖 = 23.83 − 0.0089𝐿𝑅𝑖

This equation was estimated using 50 cross sectional observations on states. t ordinary
least squares (OLS). To check for heteroscedasticity related to L separate regressions
were run for the 17 states with the lowest LR and the 17 states with the highest LR. The
sum of squared residuals for the low LR states was 270. The sum of squared residuals for
the high-LR states was 90.

i. Compute unbiased estimates of the variance of the error term in the two
subsamples.
ii. Conduct the Goldfeld-Quandt test at 5% level of significance.
iii. Regardless of your conclusion for part (ii), suppose you believe that
heteroscedasticity is indeed present and that the variance of the error term is
inversely proportional to state LR : Var (∈𝑖 ) = Y/LRi, where Y = an unknown
constant. Explain how you would transform the data to satisfy the classical
assumptions. [Eco(h) 2022]

3. In the model 𝑌𝑖 = 𝛽2 𝑋𝑖 + 𝜇𝑖 , 𝑉𝑎𝑟(𝜇𝑖 ) = 𝜎 2 𝑋𝑖2 .

𝜎 2 𝑋𝑖2
i. ̂2 ) =
Show that 𝑉𝑎𝑟(𝛽 2 .
(∑ 𝑋𝑖2 )
ii. How would you use the Bresuch-Pagan-Godfrey test to check for the violation of
homoscedasticity?
iii. How would you transform the model to correct for heteroscedasticity? What
assumptions are being made here in the process? [Eco(h) 2021]

4. Let the population regression function be :

𝑌𝑖 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋3𝑖 + 𝑢𝑖

How will you transform the model to obtain homoscedastic errors under each of the
following cases, assuming other CLRM assumptions for 𝑢𝑖 hold:

i. 𝑢𝑖 = 𝜀𝑖 (𝑋2𝑖 )1/2
ii. 𝑢𝑖 = 𝜀𝑖 𝑍𝑖 (where 𝑍𝑖 is a non-stochastic variable which does not belong to this
model)
iii. 𝐸 (𝑢𝑖2 ) = 𝜎 2 /𝑋3𝑖
It is given that 𝜀𝑖 – N (mean = 0. variance = 𝜎 2 ). [Eco(h) 2015]

5. Let the population regression function be:

𝑌𝑖 = 𝛽1 + 𝛽2 𝐷1𝑖 + 𝛽3 𝐷2𝑖 + 𝛽4 𝑋𝑖 + 𝛽5 (𝐷1 ∗ 𝑋)𝑖 + 𝜇𝑖

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Y : Annual Income in Lakhs of Rupees

Di and D2 are MBA and gender dummies respectively

X : Work experience in years

D1 = 1 if one has MBA degree

= 0 otherwise

D2 = 1 for a female executive

= 0 for a male executive

Suppose that E(𝜇/X, D1, D2) = 0 and V(𝜇/X, D1, D2) = 𝜎 2 𝑋 2 . Transform the original
equation to obtain homoscedastic error term.

6. Based on data on value added in manufacturing, MANU, and gross domestic product
for 28 countries in 2010, all measured in millions of US dollars. The following
regression results were reported (standard errors in parentheses),

̂ 𝑖 = 604 + 0.194𝐺𝐷𝑃𝑖
𝑀𝐴𝑁𝑈

𝑠𝑒 = (533.93) (0.013)

Since the cross sectional data were based on heterogeneous units, heteroscedasticity was
likely to be present. White's test was performed using ordinary least squares residuals,
ei of the above regression and the following results were obtained :

𝑒̂𝑡2 = −62196 + 229.3508𝐺𝐷𝑃𝑖 − 0.000537𝐺𝐷𝑃𝑖2

𝑅2 = 0.5891

i. Use the R2 value reported in the auxiliary regression to test if the model indeed
suffers from heteroscedasticity. Perform the test at 5% level of significance.
ii. In the light of your answer in part (i) what can you say about the regression results
reported above? [Eco(h) 2013]

7. A researcher finds evidence of heteroscedasticity in the regression model.

Yi = A + BXi + ui

How will you modify the original regression in order to deal with the problem of
heteroscedasticity in each of the following cases, if error variance follows the

a) 𝐸 (𝑢𝑖2 ) = 𝜎 2 𝑋𝑖2
b) 𝐸 (𝑢𝑖2 ) = 𝜎 2 𝑋𝑖3
1/3
c) 𝐸 (𝑢𝑖2 ) = 𝜎 2 𝑋𝑖 [Eco(h) 2017]
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

8. A researcher finds evidence of heteroscedasticity in the regression model.

Yi = B1 + B2 Xi + ui

The function is estimated using OLS and the residuals, ei, are found to be heteroscedastic
Transform the above model by applying the weighted least squares (WLS) method to
obtain homoscedastic errors under each of the following. Do the transformed regressions
in each have an intercept term :

i. 𝑢𝑖 =∈𝑖 . 𝑋𝑖 where ∈𝑖 ~𝑁(0, 𝜎 2 )

ii. 𝐸 (𝑢12 ) = 𝜎 2 √𝑋𝑖

9. A researcher postulates that the car density (number of cars per thousand
population), Y, in a city depends on the bus density (number of buses per thousand
population), X. He runs the regression model. Yi = B1 + B2 Xi + ui for a cross-section
of 128 cities in India and finds evidence of heteroscedasticity.
i. How would the model be re-estimated if it is assumed that error variance is
𝜎2
proportional to the reciprocal of Xi , that is 𝐸 (𝑢12 ) = ? Show that the transformed
𝑋
error term is homoscedastic.
ii. Can we compare R2 of the original model and the transformed model? Explain your
answer. [Eco(h) 2018]

10. A researcher obtained the following results for determining the relation between
school dropout rates of a district (% of class V students who drop out of school) in
India and district's per capita income, district's expenditure on education and a
dummy variable D_partyABC =1 if political party ABC was in power, 0 otherwise. 215
districts were included in this study.

Model Intercept Per District’s D_partyABC R2 TSS

# Capita ecxpenditure
(X4)
Income on education
(X2) (X3)

1 1.422 -0.231 -0.379 0.002 0.9452

(.876) (.058) (.14) (.00001)

2 0.442 -0.115 0.8952

(.561) (.045)

P values are reported in the parentheses

i. What are a priori expected sign of the coefficient of district's expenditure on

education and why? What is the p value of this coefficient in model#1?
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

ii. An opposition party XYZ claims that wherever party ABC comes to power, school
dropout rates increase. Is this a valid claim?
iii. Test the hypothesis Ho: B3=0 & B4=0?
iv. Calculate R2 for model #2. Will this be greater than the ̅𝑅̅̅2̅ for model# 1 and why?
To test for heteroscedasticity, the researcher conducted a Glejser test for model
#1 and obtained the p value to be 0.04. What can you conclude about the absence
of heteroscedasticity? [Eco(h) 2023]

11. The amount of loan (Li in lakhs) that is sanctioned by a bank to an applicant is
regressed on Gender Duminy for Male: D_Male=1 if male, 0 otherwise), Credit Score
(Ci higher values indicate good credit history), Income of (Inc; in lakh Rupees) and
education level (Ed; in years) of the applicant for a sample of 45 applicants

In Li = 4.999 - 0.0038D_Malei + 0.043Ci + 1.062 In Inci + 0.998Edi R2=0.6541

i. What are the likely consequences on the results of the Gauss Markov theorem if it
is found that income and education have a high correlation coefficient of 0.88?
ii. Interpret the coefficient of D_Male.
iii. Test for overall goodness of fit of this regression.
iv. The value of the test statistic of the White's General test was found to be 9.69. What
is the distribution of this test statistic? What are the null and alternative
hypotheses of this test? What can you conclude about the presence of
heteroscedasticity based on the above information given squares and cross
products of explanatory variables were included in the auxiliary regression?
v. What could be the possible remedy of the problem if heteroscedasticity is indeed
present? Assume that error variances are unknown. [Eco(h) 2023]

CHAPTER-7
Autocorrelation
Multiple Choice Questions
Choose the Best alternative for each question
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

1. When error terms across time series data are intercorrelated, it is known as
a. Cross correlation
b. Cross autocorrelation
c. Spatial autocorrelation
d. Serial autocorrelation

2. The regression coefficient estimated in the presence of autocorrelation in the

sample date are NOT
a. Unbiased estimators
b. Consistent estimators
c. Efficient estimators
d. Linear estimators

3. Estimating the coefficient of regression model in the presence if autocorrelation

leads to this test being NOT valid
a. t-test
b. F-test
c. Chi-square test
d. All of the above

4. If in our regression model, one of the explanatory variables included is the ;aged
value of the dependent variable, then the model is referred to as
a. Best fit model
b. Dynamic model
c. Autoregressive model
d. First-difference form

5. Regression of Ui on itself lagged one period is referred to as

a. AR(1) model
b. AR(2) model
c. Coefficient of auto-covariance model
d. White noise model

6. In regression model Ut = 𝜌ut-1 + Є1’ -1 <𝜌<+1, 𝜌, 𝜌 is the

a. Coefficient of autocorrelation
b. First-order coefficient of autocorrelation
c. Coefficient of autocorrelation at lag 1
d. All of the above

7. Estimating the regression model in the presence of autocorrelation using this

method leads to BLUE estimators:
a. OLS
b. GLS
c. MLE
d. Two-stage regression estimation

8. The regression model does not include the lagged value(s) of the dependent
variable as one of the explanatory variables. This is an assumption underlying on
eof the following tests of autocorrelation:
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

a. Durbin-Watson d test
b. Runs test
c. Breusch-Godfrey test
d. Graphical method

9. The d-statistics value is limited to

a. 0 to 2
b. 2 to 4
c. 0 to 4
d. 4 ± 2

10. If the durbin-watsond-test statistics is found to be equal to 0, this means that fors-
order autocorrelation is
a. Perfectly positive
b. Perfectly negative
c. Zero
d. Imperfect negative correlation

TRUE AND FALSE

State whether the following statements are true or false. Briefly justify your answer.

(a) When autocorrelation is present, OLS estimators are biased as well as

inefficient.
(b) The Durbin Watson test assumes that the variance of the error term u i is
homoscedastic.
(c) The first differences transformation to eliminate autocorrelation assumes
that the coefficient of autocorrelation 𝜌is -1.
(d) The R2 values of two models, one involving regression in the first difference
form and another in the level form, are not directly comparable
(e) A significant Durbin-Watson d does not necessarily mean there is
autocorrelation of the first order.
(f) In the presence of autocorrelation, the conventionally computed variances
and standard errors of forecast values are inefficient.
(g) The exclusion of an important variable from a regression model may give a
significant d value.

(h) In the regression of the first difference of Y on the first differences of X, if there
is a constant term and a linear trend term, it means in the original model there
is linear as well as a quadratic trend term.
(i) For the two-variable regression model. 𝑌1 + 𝐵1 + 𝐵2𝑋𝑡 + 𝑢𝑡 , if the OLS residuals
(et) are plotted against time (t) and a distinct pattern is observed. then it is an
indication of heteroscedasticity.

THEORY QUESTIONS
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

1. What do you understand by the term autocorrelation? If the coefficient of

autocorrelation, 𝜌, is not known, how can it be estimated from each of the following?
i. Durbin-Watson d-statistic
ii. OLS residuals.

2. If 𝜌 is known to be 0.8. Discuss how the problem of autocorrelation can be remedied

using Generalized Least Squares (GLS) for the following two-variable regression
model,

Yt = B1 + B 2 Xt + ut

Where the disturbance term. u1 follows AR(1) scheme, that is.

ut = 𝜌ut-1 + vt

3. In the two variables regression model, Y t =B1 + B2 Xt + ut, discuss how the problem
of autocorrelation can be remedied using First Difference Method (𝜌 = 1) if the
disturbance term u, follows AR(1) scheme. that is. u t = 𝜌ut-1 + vt.

Practical Questions
1. Given a sample of 50 observations and 4 explanatory variables, what can you say
about autocorrelation if
a) d=1.05, b) d=1.05, c) d=2.50, d) d=3.97

2. Durbin-Watson d-statistic for a regression model is computed as 2.317. There are 5

regressors (excluding the intercept) in this model estimated for 45 observations.
Test for presence of autocorrelation at 5% level.
3. In studying the movement in labour's share in value added in the metal industry for
an economy, based on annual data for 1980-2000, the following linear trend model
was considered

𝑌𝑡 = 𝐵1 + 𝐵2 𝑡 + 𝑢𝑡

Where Y= Labour's share in value added

t = time

The following regression results were obtained, t-ratios in paranthesis :

𝑌̂𝑡 = 0.4529 − 0.0041𝑡

(𝑡)(2.535) (−3.9608)

𝑅2 = 0.5284, Durbin Watson’s d-statistic =0.8252

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(a) Use the Durbin Watson d-statistic test to check if there is autocorrelation in the
model. Give the null and alternate hypothesis clearly.
(b) Give any three reasons that can cause autocorrelation.

4. Let the population regression function be as follows. where errors follow AR(1)
process:

𝑌𝑡 = 𝛽1 + 𝛽2 𝑋𝑡 + 𝜇𝑡

𝜇𝑡 = 𝜌𝜇𝑡−1 + 𝜀𝑡

OLS is used to estimate the function using time-series data for 10 consecutive time
periods.

(i) If errors follow AR(1) how would it affect the least squares estimation?
(ii) The residuals for the 10 consecutive time periods are as follows
Time 1 2 3 4 5 6 7 8 9 10

Period

Residuals -5 -4 -3 -2 -1 +1 +2 +3 +4 +5

Plot the residuals with respect to time. What conclusion can you draw about the pattern
of the residuals over time?

a. Compute the Durbin-Watson d-statistic and interpret it.

b. What are the underlying assumptions of the d' statistic? What alternative tests can
be used if these assumptions are not met?
c. Now suppose that in the regression given above errors are assumed to follow
higher order autoregressive process. It is also given that the auxiliary regression
of estimated residuals on original X and lagged values of estimated residuals gives
an R of 0.7498. Obtain an appropriate test statistic to test for serial correlation.
Outline the steps of the test clearly.

5. A researcher estimated the demand function for money for an economy for 100
quarters using quarterly data for the period @1: 1985-1986 to Q2: 2010-2011. The
regression results are as follows (standard errors are mentioned in the brackets and
In indicates natural log) :
̂𝑡 = 2.6027 − 0.4024𝐼𝑛𝑅𝑡 + 0.59𝐼𝑛𝑌𝑡
𝐼𝑛𝑀
(𝑠𝑒) = (1.24)(0.36)(0.36)
2
𝑅 = 9.2, 𝐷𝑢𝑟𝑏𝑖𝑛 𝑊𝑎𝑡𝑠𝑜𝑛 𝑑 − 𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 = 1.755
Where Mt = real cash balances
Rt = long-term interest rate
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

Yt = aggregate real national income

Use Durbin-Watson d test to check for the presence of first order autocorrelation
at 5% level of significance.

6. In a study of the determination of prices of final output at factor cost in the UK, the
following results were obtained on the basis of the data:

PFt^ = 2.033 + 0.273Wt − 0.521Xt + 0.256Mt + 0.028Mt−1 + 0.121PFt−1

se = (0.992) (0.127) (0.099) (0.024) (0.039) (0.119)

𝑅2 = 0.984, d=2.54

Where PF= prices of final output at factor cost, W= wages and salaries per employee,
X= gross domestic product per person employed, M= import prices, Mt−1 = import
prices lagged 1 year, PFt−1 = prices of final output at factor cost lagged 1 year.

“Since for 18 observations and 5 explanatory variables, the 5% lower & upper d values
are 0.71 and 2.06, the estimated d value of 2.54 indicates that there is no positive
autocorrelation. Comment.

7. Suppose that you estimate the following regression:

∆ ln 𝑜𝑢𝑡𝑝𝑢𝑡 = 𝛽1 + 𝛽2 ∆lnLt + 𝛽3 ∆lnK t + ui

Where Y is output and L is labour input, and K is capital input and∆is the first
difference operator. How would you interpret 𝛽1 in this model? Could it be
regarded as a estimate of technological change? Justify your answer.

8. (i) To study the effect of unemployment rate (u) on the index of variances (VAC i)
in U.S.A. for 24 observations, the following results were obtained:
In VACi = 7.3084 – 1.5375Inui
t= (5.8250) (-21.612)
r2 = 0.9550, d = 0.9108
Is there a problem of autocorrelation indicate in the results. Choose α = 5%.

(ii) Outline the method of estimation that will produce BLUE estimators in the
presence of AR(1) autocorrelation.

9. The following production function was estimate by an economist (standard

errors are reported in parentheses)
logQi = 3.39 + 1.45 logL + 0.384 log K
Se (0.23) (0.08) (0.04)
R = 0.9948 d = 0.88,
2 n = 39,

(i) Test for the presence of autocorrelation using Durbin Watson test at 5% level
of significance. State your hypotheses clearly.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

(ii) What are the limitations of Durbin Watson method?

10. (i)From the given data on the indexes of real compensation per hour (Y) and output
per hour (X) in the business sector of the U.S. economy for the period 1959 to 1998,
the base of the indexes being 1992 = 100. We obtain the following regression model.

Yt = 29.5192 + 0.7136Xt
se = (1.9423) (0.0241)
t = (15.1977) (29.6066)
r2 = 0.9584, d = 0.1229
Using Durbin Watson d test, check does the model suffers from autocorrelation.

(ii) By changing the functional form we obtain the following model:

̂ 𝑡 = 1.5239
InY + 0.6716In Xt
se = (0.0762) (0.0175)
t = (19.9945) (38.2892)
r 2 = 0.9747 d = 0.1542
Does by changing the functional form the model becomes free from
autocorrelation. Comment.

(iii) Since the data underlying regression in part(i) is time series data, it is quite
possible that both wages and productivity exhibit trends. If that is the case,
then we need to include the time or trend, t, variable in the model to see the
relationship between wages and productivity net of the trends in the two
variables.
To test this, we include the trend variable in regression given in part(i) and
obtained the following results

̂
Y𝑡 = 1.4752+ 1.3057Xt -0.9032t
se = (13.18) (0.2765) (0.4203)
t = (0.1119). (4.7230) (-2.1490)
R = 0.9631
2 d = 0.2046

Has the problem of autocorrelation resolved. If not, can we say that the model suffers
from pure autocorrelation?

11. For the Phillips curves for United States from 1958 to 1969 the following regression
was obtained:

̂𝑡 = -0.2594 + 20.5880 1
Y 𝑋 𝑡
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

t = (-0.2572) (4.3996)
R2 = 0.6594, d = 0.6394

(i) Interpret the regression. Is there any evidence of first order autocorrelation in
the residuals?

(ii) If there is autocorrelation, estimate the coefficient of first order autocorrelation.

12. Consider the following population regression function

In (Div)t = β1 + β2 In (PRFT)t + β3 Time + ui

Here, DIV = Corporate Dividends Paid

PRFT = Corporate Profit
In = Natural Logarithms
The estimated sample regression results for an economy for 244 quarterly observations
are presented below:
Coeff. Standard t-statistic Prob-value
Errors
Intercept 0.4357 0.1921 2.2674 0.0243
In(PRFT) 0.4245 0.0777 5.4614 0.0000
Time 0.0126 0.0014 8.93 0.0000
R2 = 0.9914, adj.R2 = 0.9913,
Sum of Regression = 0.133 F-statistic = 13930.73
SE of Regression = 0.133 Prob(F-statistic) = 0.0000
Durbin –Watson – statistic = 0.0201

(i)What are the economic interpretation of β2 and β3?

(ii) On what counts would a researcher be satisfied with these results at a first
glance? Verify your conjectures using formal tests. For tables take the closest
value of n.

(iii) Is there anything in these results that the researcher needs to worry about?
Verify using formal test (s).

13. Consider the following demand for energy model for India for 1945 to1995:
̂ 𝑡 = 1.5495 – 0.9972 InX2t – 0.3315 In X3t + 0.5284 In Yt-1
InY
se = (0.0903) (0.0191) (0.0243) (0.024)
R = 0.6594
2 R = 0.994,
2 d = 1.8

Does the model suffer from first order autocorrelation? Describe the test statistic you use
and why?

14. Consider the following regression results on a model of demand for competitive
imports based on U.K. quarterly data covering 1980(Q1) to 1996(Q4).
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

̂ 𝑡 = -5.5443 + 0.81051nGDPt- 0.0113 In Pt + 0.6178 In Mt-1

InM

Se (.024)

R2 = 0.9897, Durbin Watson, d = 1.8125

Where:

Mt = aggregate imports in units of domestic currency at constant prices

GDPt = gross domestic product at constant prices

Pt = an index of relative price of imports expressed in domestic currency. Apply the

Durbin's h-test to detect the presence of first order autocorrelation. Based on your
results, comment on the regression results reported above.

15. Based on 147 quarterly observations, an aggregate consumption function is

estimated wherein aggregate consumption expenditure C1, is regressed on
disposable income YDt, and one period lagged dependent variable.
The estimated least square equation is as follows (standard errors in parentheses):

Ĉ𝑡 = 1.88 + 0.086YDt + 0.911Ct-1

(-4.49) (0.028) (0.0304)
DW = 1.569, R2 = 0.999

Which test should be used to test the presence of AR(1) error process in this model?
Describe the test and perform this test at 5% level of significance.

16. Complete the following table:

Number of Durbin-watson Evidence of
Sample size explanatory d autocorrelation
variables
25 2 0.83 Yes
30 5 1.24 —
50 8 1.98 —
60 6 3.72 —
200 20 1.61 —

17. For the model 𝑌𝑖 = 𝛽1 + 𝛽2 𝑋2𝑡 + 𝛽3 𝑌𝑡−1 + 𝑢𝑡 , an auxiliary regression found:

𝑢̂𝑡 = 2.3 + 1.539𝑋2𝑡 + 1.32𝑢̂𝑡−1 + 0.892𝑢̂𝑡−2

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

𝑠𝑒 = (0.99)(0.089)(0.051)(0.0058)

𝑅2 = 0.567, 𝑛 = 34

Use the Breusch-Godfrey test to check for the presence of AR(1) scheme of
autocorrelation at 1% level of significance.

18. The following model of consumption is estimated for an economy for the years 1947-
2000 :

In Ct = B1 + B2 InPDIt + B3 INTt + ut

where C= real consumption expenditures in billions of dollars

PDI = real disposable personal income in billions of dollars

INT = real interest rate

and In indicates natural log.

The OLS residuals (et) are then regressed on InPDI, INT, and et-1 as follows:

et = A1 + A2 InPDlt + A3 INTt + A4 et-1 + vt

The above regressions reported to have R 2= 0.0983. Perform Breusch-Godfrey test to

check for the presence of autocorrelation at 5% level of significance.

EXAM STYLE QUESTIONS

1. Consider the following model of Indian imports estimated using data for 40 years for
the period 1945-1985. (Standard errors are given in parentheses)

̂ 𝑡 = 1.5495 + 0.9972 𝐼𝑛 𝑋2𝑡 − 0.3315𝐼𝑛 𝑋3𝑡 + 0.5284 𝐼𝑛 𝑌𝑡−1

InY

𝑠𝑒 = (0.0903)(0.0191)(0.0243)(0.024)

R2 = 0.994, d = 1.8

Where,

Y = imports, X2 = GDP, X3 = CPI

i. Does the model suffer from first order autocorrelation? Which test statistic do you
use and why?
ii. Outline the steps of the test used. Compute the test statistic and test the
hypotheses that the preceding regression does not suffer first order
autocorrelation.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

iii.If the general model is given Yi = B1 + B2 X2i + B3 X3i + ui where errors follow
AR(1) scheme, that is 𝑢1 = 𝜌𝑢𝑡−1 + 𝛿𝑡 , where 5, is a white noise error term. Then
how would you transform the model to correct for the problem of autocorrelation.
[ Eco(h) 2014]
2. Consider the following model :

Ct = 𝛽 1+ 𝛽 2 GNPt + 𝛽 3 GNPt-1 + 𝛽 4 (GNPt – GNPt-1) + ut

where GNPt = GNP at time t,

Ct = aggregate private consumption expenditure in year t.

GNPt-1 = Gross National Product at time (t - 1)

(GNPt – GNPt-1 ) = change in the GNP between time t and time (1 - 1).

i. Assuming you have the data to estimate the preceding model, would it be possible
to estimate all the coefficients of this model? If not. what coefficients can be
estimated? Do you suspect a problem in the regression?
ii. Suppose that the GNP, explanatory variable was absent from the model. Would
your answer to (i) be the same?
iii. What is a possible remedy to the problem detected in (i) above?
iv. Now suppose the model is given as Ct = 𝛽 1 + 𝛽 2 GNP1 + 𝛽 3 Ct-1 + ut and the errors
are assumed to be autocorrelated. How would you test for serial correlation in the
model? Discuss the underlying assumptions of the test if any?
v. Suppose the equation given in (iv) above is transformed and estimated as: C t
/GNPt = 𝛽 1 (1/GNPt) + 𝛽2 + 𝛽3(Ct-1 /GNPt) +ut /GNPt. What could be the possible
reason for the transformation? How would you test for such a problem?

3. What do you understand by the term Autocorrelation? Consider the regression model.
Yt = B1 + B2Xt +ut. How can the problem of autocorrelation be remedied if 𝜌 is
assumed to be 1 ( 𝜌 = 1) and it is assumed that the error term follows the AR (1)
scheme. that is.

ut = 𝜌ut-1 + et, −1 ≤ 𝜌 ≤ 1

4. Quarterly data on country XYZ was collected for the period 2005-2019 to estimate
the relation between Foreign Direct Investment (FDI), Trade Openness (TO). Gross
Domestic Product (GDP) and Exchange Rate (E). TO is defined as the ratio of export
plus imports to GDP and t = trend. Following regression was estimated:

̂𝑡 = −0.58 + 0.012𝐸𝑡 − 0.025𝑇𝑂𝑡 + 0.006𝐺𝐷𝑃𝑡 + 0.34𝑡

𝐹𝐷𝐼

𝑠𝑒 = (0.097)(0.013)(0.004)(0.015)(0.09)

𝑅2 = 0.904, 𝑑 = 1.45
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

i. Interpret the estimated slope coefficients. Do you suspect some problem with the
above regression?
ii. What is the nature of the problem? How do you know? Explain its consequences?
𝐹𝐷𝐼𝑡 𝐸𝑡 𝑇𝑂 𝑡
= 𝛽0 + 𝛽1 𝐺𝐷𝑃 + 𝛽2 𝐺𝐷𝑃𝑡 + 𝛽3 𝐺𝐷𝑃 + 𝑢𝑡
𝐺𝐷𝑃𝑡 𝑡 𝑡 𝑡
Will this transformation solve the problem in (ii) above? How? Can you compare
R of this model with the model above?
iii. Suppose now the regression is estimated as given below
̂𝑡 = −0.74 − 0.042𝑇𝑂𝑡 + 0.41𝑡
𝐹𝐷𝐼
𝑠𝑒 = (0.057)(0.019)(0.364)
𝑅2 = 0.896, 𝑑 = 1.34
Test whether the regression specified above suffers from first order
autocorrelation? Which test will you use and why? (Use a = 5%)
iv. If the errors obtained from regression specified in (iii) above follows higher order
autoregressive process then how would you test for serial correlation? Give the
steps of the test in detail.
v. With reference to the regression specified in part (iii). What will be the remedy
for the problem of autocorrelation if it is detected? Explain.[Eco(h) 2022]

5. In studying the movement in the production workers' share in the value added (i..,
labor's share), the following models were considered by Gujarati :

Model A : 𝑌𝑡 = 𝛽0 + 𝛽1𝑡 + 𝑢𝑡
2
Model B : 𝑌𝑡 = 𝛼0 + 𝛼1𝑡 + 𝛼2𝑡 + 𝑢𝑡

where Y = labor's share and t = time. Based on annual data for 1949 - 1964. the

following results were obtained for the primary metal industry :

Model A : 𝑌̂𝑡 = 0.4589 − 0.0041𝑡

(−3.9608)

Model A : 𝑌̂𝑡 = 0.4786 − 0.0127𝑡 + 0.0005𝑡 2

(−3.2724) (2.7777)

𝑅2 = 0.6629 𝑑 = 1.82

where the figures in the parentheses are t ratios.

(a) Is there serial correlation in model A? In model B?

(b) What accounts for the serial correlation?

(c) How would you distinguish between pure autocorrelation and specification bias?
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

6. The following regression was estimated using quarterly data for 10 years

̂ 𝑡 = −7.453 − 0.0714𝑃𝑡 + 0.00315𝑌𝑡 − 0.1537𝑖𝑡

𝑁𝐶

𝑆𝑒 = (13.58)(0.0347)(0.0017)(0.04919)

̅𝑅̅̅2̅ = 0.758 𝐸𝑆𝑆 == 23.5104 𝑅𝑆𝑆 = 14.1867 𝑑 = 2.04

Where NC = new car sales per 1000 population

P = new car price index

Y = per capita real disposable income in Rs.

i = interest rate

i. Interpret the above regression and comment on the expected and estimated signs
of the coefficients. Also comment on the individual significance of the coefficients.
ii. Construct an ANOVA table and comment on the joint significance of the regression.
iii. Suppose you wish to test the restriction 𝛽3 = 𝛽4 for the above regression. Explain
the two methods that you can use to carry out this test.
iv. Do you suspect autocorrelation in the model? If yes, how would you test for it?
[Eco(h) 2020]

7. A researcher estimated the demand function for money for an economy for 101
quarters using quarterly data for the period Qi: 1986-1987 to Qz: 2011-2012. The
regression results are as follows (standard errors are mentioned in the brackets and
in indicates natural log):

̂ 𝑡 = 2.6027 − 0.424𝐼𝑛𝑅𝑡 + 0.59𝐼𝑛𝑌𝑡 + 0.524𝐼𝑛𝑀𝑡−1

𝐼𝑛𝑀

𝑠𝑒 = (1.24)(0.36)(0.34)(0.02)

R2 = 0.9165

Durbin-Watson d-statistic = 0.650

Mt = real cash balances

Rt = long-term interest rate

Yt = aggregate real national income. [Eco(h) 2018]

i. Use Durbin's h-test to check for the presence of first order autocorrelation at 1%
level of significance.
ii. Can we use Durbin-Watson d-statistic test for the above regression ? Give reasons.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

8. An NGO has performed a regression analysis to determine whether divorce rates

affect suicide rates (Si) in a country. The NGO used data for 40 countries for the year
2010 and obtained the following results using OLS

Si = 22.33 - 0.0237HDI + 532.45nGDP per capita + 0.0056 Divorce Rates

(0.0034) (-019) (0.15) (.05)

Where Si is the number of suicides per million population in a country in the year 2019

HDI is the Human development index ranging from 0 to 1.00

GDP per capita is Gross domestic product per capita (in $)

Divorce Rates is number of divorces per million population in a country in the year 2019

i. Why did not the NGO use only divorce rate as an explanatory variable? What
would be the properties of OLS estimator of the coefficient of divorce rate in such
a regression?
ii. Given GDP has an exact relation with HDI where HDI = (GDP per capita*Literacy
Rates*Life Expectancy)3, will perfect multi-collinearity be a problem in the above
regression?
iii. Interpret the coefficients of In GDP per capita and Divorce rates:
iv. Suppose NGO only examines the impact of divorce rates on suicide rates and run
the following regression: Si = 𝛽1 + 𝛽2 Divorce Rates 𝑠𝑖 + 𝜀2 . Show that 2 is an
efficient estimator.
v. The NGO also ran a time series regression for one specific country for a period of
35 years and obtained the following results.
St = 10.433-.047 HDIt † 343.45 In GDP per capitat + 0002 Divorce Ratest Durbin
Watson d=2.03
What can be inferred about the presence of AR(1) from the results?[Eco(h) 2023]

CHAPTER-8
Model Selection Criteria

Theory Questions

1. Suppose the true model is

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

𝑌𝑖 = 𝛽1 + 𝛽2 𝑋𝑖 + 𝛽2 𝑋𝑖2 + 𝛽3 𝑋𝑖2 + 𝑢𝑖
but you estimate
𝑌𝑖 = 𝛼2 𝑋𝑖 + 𝑣𝑖
If you use observations of Y at X = -3, -2, -1, 0, 1, 2, 3, and estimate the "incorrect" model,
what bias will result in these estimates?

2. For a given model, 𝑌𝑖 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽2 𝑋3𝑖 + 𝛽3 𝑋3𝑖 + 𝑢𝑖 , prove that if we omit X3 (a

relevant variable) then OLS estimator of 𝛽2 is not unbiased. What does the direction
of the bias depend upon?

Practical questions
1. Consider the data in following Table:
Y X2 X3
1 1 2
3 2 1
8 3 -3
Based on these data, estimate the following regressions:
Yi = α1 + α2X2i + u1i
Yi = λ1 + λ3X3i + u2i
Yi = β1 + β2X2i + β3X3i + u3i
Note :Estimate only the coefficients and not the standard errors:
(i) Is α2 = β2? Why or why not?
(ii) Is λ3 = β3? Why or why not?
What important conclusion do you draw from this exercise?

2. The correct regression model is given as under:

̂ i= 263.6416 – 0.0056 PGNPi – 2.2316FLRi
𝐶𝑀
se = (11.5932) (0.0019). (.2099)
r2 = 0.7077,
̂ = 157.4244 + 0.0144 PGNPi
𝐶𝑀
Se. (9.8455). (.0032)
r = 0.1662
2

(i) Interpret and compare the slope terms in two models.

(ii) Interpret and compare the intercept terms in two models.
(iii) If FLR is regressed upon the PGNP the following results are obtained:
̂ i = 47.5971 –
𝐹𝐿𝑅 0.00256PGNPi
Se = (3.5553) - (0.0011)
r2 = 0.0721,
Explain the net effect and gross effect of PGNP on CM.

3. Suppose we estimate an equation for demand for food in India for the period 1922 –
41:
QD = demand for food
PD = food prices
Y = income
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

𝑄̂D = 92.05 – 0.142PD + 0.236Y

se (5.84) (0.067) (0.031)
R2 = 0.9832
(i) Comment on the above regression. Now if we omit the income variable we
get the following regression:
Qd = 89.97 + 0.107PD
se (11.85) (0.118)
(ii) Comment on the new regression with omitted variable. Do you suspect any
problem?
(iii) If the answer to (ii) above is yes, then what is the nature of the problem?
(iv) What are the consequences of such a problem?

4. Using quarterly for 10 years (n = 40) for the U.S. economy, the following model of
demand for new cars was estimated:
NUMCARSi = B1 + B2PRICEi + B3INCOMEi + B4 INTRATEi + ui
Where
NUMCARS: Number of new car sales per thousand people
Price: New car price index
INCOME: Per capita real disposable income (in$)
INTRATE: Interest rate (in percent)
The table below gives estimates of the coefficients and their standard errors:

Estimate of Coefficient Standard errors

CONSTANT -7.4534 13.5782

PRICE -0.0714 0.0032

INCOME 0.0032 0.0017

INTRATE -0.1537 0.0491

(i) A priori, what are the expected signs of the partial slope coefficients? Are the
results in accordance with these expectations?
(ii) Interpret the various slope coefficients and test whether they are individually
statistically different from zero. Use 10% level of significance.
(iii) The adjusted R squared reported for this model is 0.758. Test the model for
overall goodness of fit at 5% level of significance.
(iv) Suppose unemployment rate is an important determinant of demand for new
cars but is not included in the above regression model. What are the
consequences of omitting this variable?

5. The monthly salary (Wage, in hundred of rupees), age (AGE in years), number of
years of experience (EXP, in years), number of years of education (EDU) were
obtained for 49 persons in a certain office. The estimated regression of Wage on the
characteristics of a person were obtained as follows (with t statistics in parenthesis)
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

̂ = 632.244 + 142.510 EDU + 43.225EXP – 1.913AGE

𝑊𝐴𝐺𝐸
(1.493) (4.008) (3.022) (-0.22)
(i) The value of adjusted R , R = 0.277. Using this information, test the model
2 2

for overall significance.

(ii) Test the coefficients of EDU and EXP for statistical significance at 1% level
and coefficients for AGE at 10% level.
(iii) Can you rationalize the negative sign for AGE? If someone suggests that AGE
be eliminated, will you follow the suggestion?

6. Suppose the true model is :

Model A : 𝑌̂𝑖 = 𝛽1 + 𝛽2 𝑋2𝑖 + 𝛽3 𝑋3𝑖 + 𝑢𝑖

The regression results for the model for n = 45 are given below. The figures in
parantheses denote the standard errors.

Model A: 𝑌̂𝑖 = 263.6416 − 0.0112𝑋2𝑖 − 4.4632𝑋3𝑖 .

𝑆𝑒 = (9.5932)(0.0027)(0.2099)

𝑅2 = 0.7897

When 𝑋3𝑖 is regressed on 𝑋2𝑖 , the results obtained are as follows :

𝑋̂3𝑖 = 47.5971 + 0.00512𝑋2𝑖

𝑆𝑒 (0.553)(0.0011) 𝑅2 = 0.0721

If a researcher underfits the model by omitting 𝑋3𝑖 and runs Model B :

𝑌̂𝑖 = 𝛼1 + 𝛼2 𝑋2𝑖 + 𝑣𝑖

What shall be value of the coefficient 𝑋2𝑖 in Model B?

EXAM STYLE QUESTIONS

1. The following are the regression results for Cobb-Douglas production function
estimated for Taiwan for the period 1958-1972 :
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

̂ 𝑡 = 7.8439 + 0.7148 In 𝐿𝑡 + 1.1135 In 𝐾𝑡

𝐼𝑛𝑄

𝑡 = (−0.2011)(4.46642)(3.7214)

Where:

𝑄𝑡 = real gross product, in billion of rupees

𝐿𝑡 = labour input

𝐾𝑡 = capital Input

The slope coefficient in the regression of In 𝐾𝑡 on In 𝐿𝑡 is 0.4875

Suppose the researcher estimates the following mis-specified equation in which capital
input is omitted:

In 𝑄𝑡 = 𝐴1 + 𝐴2 In 𝐿𝑡 + 𝑢𝑡

i. Find the numerical value of E(𝑎2 ) using the information given in the equation,
where 𝑎2 is the OLS estimator of 𝐴2 . Is it biased upward or downward?
ii. What will be the other consequences of estimating this mis-specified equation?
[Eco(h) 2013]

2. A researcher wanted to study the relation between demand for a commodity, 𝒬, in

relation to' its price. P, and disposable income. Y, based on 30 observations. The
following regression result is obtained (figures in parentheses are the standard
errors):

Model 1: 𝒬̂𝑡 = 92.05 − 0.142𝑃𝑖 + 0.236𝑌𝑖

(𝑠𝑒) = (5.84)(0.067)(0.031)

Estimate of the error variance. 𝜎̂𝑡 = 1.952

However, if income, a relevant and important variable, is omitted from the above model,
then the following regression result is obtained:

Model 1: 𝒬̂𝑡 = 89.97 + 0.107𝑃𝑖

(𝑠𝑒) = (11.85)(0.118)

Estimate of the error variance. 𝜎̂ 2 = 8.058

a) In the context of a specification error committed in Model 2. Explain the concept

of omitted variable bias.
b) From the given regression results. obtain an estimate of slope coefficient in the
regression of omitted variable Y on the included variable P.
www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

c) Compare the consequences of including an irrelevant variable in the model and

excluding a relevant variable from the model. [Eco(h) 2016]

3. The Home ministry of a country wants to test if petty crimes (minor thefts) are higher
in states where poverty rates are high. They obtain data on several variables and ran
the following cross section regression for 35 states in the country.

𝐶̂𝑖 = 6.275 + 0.1147𝑃𝑅𝑖 − 0.0712𝐿𝑅𝑖 + 0.0862𝑆𝐷𝑃𝑖

𝑠𝑒 = (3.125)(0.02713)(0.0361)(0.03834)

𝑛 = 35 𝑅2 = 0.6876

where C = Crimes per lakh of population

PR = Poverty Rates

LR = Literacy Rates

SDP = State Domestic Product

i. A priori what signs are expected for the explanatory variables? Explain your
answers.
ii. Test for overall goodness of fit of the regression (Use a = 5%)
iii. Another model was used and following results were obtained:
̂ 𝑖 = 2.142 + 0.01186 In 𝑃𝑅𝑖 − 0.548 In 𝐿𝑅𝑖 + 0.0921 In 𝑆𝐷𝑃𝑖
𝐼𝑛𝐶
𝑆𝑒 = (1.102) (0.0673) (0.0259)(0.0921)
2
𝑛 = 35 𝑅 = 0.7923
Interpret the coefficient of In SDP
iv. How will you conduct MacKinnon-White-Davidson (MWD) test to select which
model is better? Write all the steps clearly. [Eco(h) 2022]

4. An individual is hired to determine the best location for the next branch of a famous
family restaurant chain 'Foodies' The individual decides to build a regression model
to explain the gross sales volume at each of the restaurants in the chain as a function
of various descriptions of the location of that branch. He considers the following
regression (original):

𝑌̂𝑡 = 102.192 − 9075𝑁𝑖 + 0.3547𝑃𝑖 + 1.288𝑙𝑖

𝑆𝑒 = (2053)(0,0727)(0.543)

𝑛 = 22, 𝑅2 = 0.579 𝑅𝑆𝑆 = 384.27

Where, Y = gross sales volume, N = the number of competitive restaurants nearby, P =

the population nearby. and I = the average household income nearby.

i. Interpret the slope coefficients of the regression and 𝑅2

www.rsgclasses.com
Rahul Sir( SRCC Graduate, DSE Alumni)

ii. Suppose we add another variable A to the regression above where A = address of
the restaurant. Consider the modified regression below :
𝑌̂𝑡 = 98.125 − 8975𝑁𝑖 + 0.3607𝑃𝑖 + 1.301𝑙𝑖 + 58.07𝐴𝑖
𝑆𝑒 = (2053)(0,0727)(0.543)(95.21)
𝑛 = 22, 𝑅2 = 0.0695
Do you think adding a new variable A has improved the fit of the equation?
Why/why not?
iii. Do you suspect a problem in Part (ii) above? What is the problem and what could
be the consequences of the problem? How will you correct for the problem?
iv. How do you conduct Ramsey RESET test to check for the likelihood of specification
error in the model?
v. Suppose that the average household income (I) is not measured correctly. What
are the consequences of this on the properties of the OLS estimators.
[Eco(h) 2022]

Introductory Econometrics Sem 4
No ratings yet
Introductory Econometrics Sem 4
131 pages
Ef3450 2021B Mid
No ratings yet
Ef3450 2021B Mid
12 pages
Uttam Linear Regression 17march24
No ratings yet
Uttam Linear Regression 17march24
82 pages
Da Unit-Iii
No ratings yet
Da Unit-Iii
14 pages
MCQs Unit 4 Correlation and Regression
89% (9)
MCQs Unit 4 Correlation and Regression
14 pages
Econometrics For Finance Final Exam Draft
0% (1)
Econometrics For Finance Final Exam Draft
5 pages
MCQS
No ratings yet
MCQS
2 pages
FRM 2010 Part 1 Practice Exam
100% (1)
FRM 2010 Part 1 Practice Exam
59 pages
ECON 6001 Assignment1 2023
100% (1)
ECON 6001 Assignment1 2023
9 pages
Ssss PDF
No ratings yet
Ssss PDF
50 pages
Regression Practice Questions
No ratings yet
Regression Practice Questions
19 pages
Mutliple Regression-Mcqs
No ratings yet
Mutliple Regression-Mcqs
10 pages
Econometrics For Finance (2017-I)
No ratings yet
Econometrics For Finance (2017-I)
6 pages
Data Analytics Unit 3 Notes
100% (3)
Data Analytics Unit 3 Notes
28 pages
Simple and Multiple Regression
No ratings yet
Simple and Multiple Regression
56 pages
Sheet 2
No ratings yet
Sheet 2
7 pages
Daunit 3
No ratings yet
Daunit 3
32 pages
13-52statistics CH 13 2024
No ratings yet
13-52statistics CH 13 2024
14 pages
Chapter Three
No ratings yet
Chapter Three
35 pages
Regression 5th Final
No ratings yet
Regression 5th Final
3 pages
Bana7052 Final
No ratings yet
Bana7052 Final
9 pages
Statistics-17 by Keller
100% (1)
Statistics-17 by Keller
76 pages
Chapter 2 Simple Linear Regression
No ratings yet
Chapter 2 Simple Linear Regression
31 pages
Notes 2
No ratings yet
Notes 2
16 pages
Course 10-Part 1
No ratings yet
Course 10-Part 1
32 pages
BADM 299 Exam 4 Chap 12-Review Questions
0% (1)
BADM 299 Exam 4 Chap 12-Review Questions
7 pages
Chapter4 Regression
No ratings yet
Chapter4 Regression
15 pages
Fds Unit FINAL
No ratings yet
Fds Unit FINAL
27 pages
Midterm
No ratings yet
Midterm
9 pages
Economterics Final 2024.
No ratings yet
Economterics Final 2024.
32 pages
Practice Final
No ratings yet
Practice Final
10 pages
Test1 PDF
No ratings yet
Test1 PDF
10 pages
Mid Sample Ans
No ratings yet
Mid Sample Ans
2 pages
Econometric Mod L
No ratings yet
Econometric Mod L
8 pages
Assignment 5
No ratings yet
Assignment 5
6 pages
Econometrics 2
No ratings yet
Econometrics 2
9 pages
Assigniment Econometrics RVU 2024 Summer
No ratings yet
Assigniment Econometrics RVU 2024 Summer
5 pages
Question 1 (1 Point) : Saved
No ratings yet
Question 1 (1 Point) : Saved
6 pages
QM
No ratings yet
QM
4 pages
Econometrics QP Calicut
No ratings yet
Econometrics QP Calicut
17 pages
CHAPTER 12 Analysis of Variance
No ratings yet
CHAPTER 12 Analysis of Variance
49 pages
Sample Final Solutions
No ratings yet
Sample Final Solutions
12 pages
Mone JM Pre-Test Econometrics Exams
No ratings yet
Mone JM Pre-Test Econometrics Exams
9 pages
Name: . ID No: .. BITS-Pilani Dubai Campus Econ F241 Econometric Methods Semester I, 2018test-1 (Closed Book)
No ratings yet
Name: . ID No: .. BITS-Pilani Dubai Campus Econ F241 Econometric Methods Semester I, 2018test-1 (Closed Book)
6 pages
Regression Kann Ur 14
No ratings yet
Regression Kann Ur 14
43 pages
Arihant Physics
0% (1)
Arihant Physics
22 pages
STAT741 Regression Analysis: Quiz #1 9pm, Wednesday, 1/31: y X y X y X y X
No ratings yet
STAT741 Regression Analysis: Quiz #1 9pm, Wednesday, 1/31: y X y X y X y X
3 pages
12
No ratings yet
12
16 pages
Statistics 17 by Keller
No ratings yet
Statistics 17 by Keller
76 pages
Eco Trix
No ratings yet
Eco Trix
16 pages
CHW 4
No ratings yet
CHW 4
7 pages
Statistics 578 Assignment 5 Homework
100% (6)
Statistics 578 Assignment 5 Homework
13 pages
Review Questions and Key Oct 4 11
No ratings yet
Review Questions and Key Oct 4 11
3 pages
ISOM2500 Regression Practice Questions
No ratings yet
ISOM2500 Regression Practice Questions
16 pages
Correlation and Regression Exam
100% (1)
Correlation and Regression Exam
6 pages
Econ MIdterm 2 Practise
No ratings yet
Econ MIdterm 2 Practise
11 pages
Effect of Presure in Ball Bounce Height 2
No ratings yet
Effect of Presure in Ball Bounce Height 2
18 pages
Ganda Ko
43% (7)
Ganda Ko
15 pages
MGMT E-5070 2nd Examination Solution
100% (1)
MGMT E-5070 2nd Examination Solution
8 pages
Chapter - 8.pdf Filename UTF-8''Chapter 8
No ratings yet
Chapter - 8.pdf Filename UTF-8''Chapter 8
36 pages
BA 182 Regression MC Samplex With Answer
No ratings yet
BA 182 Regression MC Samplex With Answer
4 pages
Alemaya Stat
No ratings yet
Alemaya Stat
153 pages
Ids PDF
No ratings yet
Ids PDF
397 pages
Linear Regression Interview Questions
No ratings yet
Linear Regression Interview Questions
4 pages
OM3 CH 11 Forecasting and Demand Planning
50% (2)
OM3 CH 11 Forecasting and Demand Planning
17 pages
AI and DS Final Autonomy Syllabus
No ratings yet
AI and DS Final Autonomy Syllabus
202 pages
Multi Layer Soil Resisitivity
No ratings yet
Multi Layer Soil Resisitivity
9 pages
Statistics For Business and Economics: Anderson Sweeney Williams
No ratings yet
Statistics For Business and Economics: Anderson Sweeney Williams
25 pages
1-27 Propogation of Error
No ratings yet
1-27 Propogation of Error
22 pages
Granger Causality in Excel
No ratings yet
Granger Causality in Excel
6 pages
ECON6001: Applied Econometrics S&W: Chapter 4: Linear Regression With One Regressor, An Introduction Dr. Gedeon Lim
No ratings yet
ECON6001: Applied Econometrics S&W: Chapter 4: Linear Regression With One Regressor, An Introduction Dr. Gedeon Lim
59 pages
GMDD 7 1525 2014
No ratings yet
GMDD 7 1525 2014
10 pages
LMM Theory 2024
No ratings yet
LMM Theory 2024
51 pages
University of Mumbai: Teacher's Reference Manual
No ratings yet
University of Mumbai: Teacher's Reference Manual
66 pages
Chapter 2
No ratings yet
Chapter 2
15 pages
Esa - QP - Ue19-20cs203 - SDS
No ratings yet
Esa - QP - Ue19-20cs203 - SDS
11 pages
Chapter 11
No ratings yet
Chapter 11
50 pages
Assignment 4 (2) - Engineering Statistics PDF
No ratings yet
Assignment 4 (2) - Engineering Statistics PDF
4 pages
Aff700 1000 230109
No ratings yet
Aff700 1000 230109
9 pages
LN8 - Heteroscedasticity and Multicollinearity
No ratings yet
LN8 - Heteroscedasticity and Multicollinearity
24 pages
ImpactofTaxationonEconomicDevelopmentofNigeria2000 2013
No ratings yet
ImpactofTaxationonEconomicDevelopmentofNigeria2000 2013
20 pages
Virtual University of Pakistan: Statistics and Probability
No ratings yet
Virtual University of Pakistan: Statistics and Probability
5 pages
Proceedings A Pms 2011
No ratings yet
Proceedings A Pms 2011
9 pages
The Informational Content of FOMC Meeting Transcripts
No ratings yet
The Informational Content of FOMC Meeting Transcripts
25 pages
Residual Plots SPSS
No ratings yet
Residual Plots SPSS
14 pages
Homework 3 Answers Updated
No ratings yet
Homework 3 Answers Updated
2 pages
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
From Everand
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
SUJAUL CHOWDHURY
No ratings yet
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
Fundamental Math
From Everand
Fundamental Math
Russell Pead
No ratings yet
Correlation and Regression: Six Sigma Thinking, #8
From Everand
Correlation and Regression: Six Sigma Thinking, #8
Sumeet Savant
5/5 (1)