Worksheet 5. Regression and Experiments: Problems With Regression: Omitted Variables

This document provides instructions and questions for a worksheet on regression analysis and experiments. It includes questions about omitted variable bias in regression, randomised control trials, and using regression to estimate treatment effects. Students are asked to consider threats to internal validity, interpret regression coefficients, and discuss how to estimate an income elasticity from experimental data.

Uploaded by

rithu sayeeram


Worksheet 5.

Regression and experiments

Read: Sections 7–9 of the lecture notes, and the sections of S&W indicated therein. You may
find it helpful to look at the ‘short questions for review’ that accompany this worksheet (in a
separate document), as you progress through this material.
Unless your tutor directs otherwise, attempt these questions: 1, 4, 5, 6, 7.

Problems with regression: omitted variables

1. Consider a regression model with k r.h.s. variables

Y = β0 + β1 X1 + · · · + βk Xk + u

where Eu = 0 and cov(Xl , u) = 0 for l ∈ {1, . . . , k}.

(a) Show that a population linear regression of Y on (X1 , . . . , Xk−1 ) – i.e. omitting Xk –
yields a coefficient on X1 of
γ1 = β1 + βk π1

for π1 the coefficient on X1 in a population linear regression of Xk on (X1 , . . . , Xk−1 ).

[Hint: first use the FWL theorem to write down expressions for γ1 and π1 that
involve the error X̃1 in a population linear regression of X1 on (X2 , . . . , Xk−1 ); see
Section 4.1.1 of the notes.]
(b) Suppose it is known that βk > 0, cov(X1 , Xk ) > 0, and that cov(Xl , Xk ) = 0 for
l ∈ {2, . . . , k − 1}. Explain why γ1 > β1 .
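
The identity in part (a) also holds exactly for sample OLS coefficients, so it can be checked numerically. The following is a minimal sketch, assuming a hypothetical data-generating process with k = 3; the coefficient values and the helper `ols` are illustrative assumptions and are not taken from the lecture notes.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50_000

# Hypothetical data-generating process with k = 3 regressors (assumed values).
x1 = rng.normal(size=n)
x2 = 0.5 * x1 + rng.normal(size=n)
x3 = 0.8 * x1 - 0.3 * x2 + rng.normal(size=n)   # plays the role of X_k
u = rng.normal(size=n)
y = 1.0 + 2.0 * x1 - 1.0 * x2 + 1.5 * x3 + u

def ols(dep, *regressors):
    """Return OLS coefficients (intercept first) from a least-squares fit."""
    X = np.column_stack([np.ones(len(dep)), *regressors])
    return np.linalg.lstsq(X, dep, rcond=None)[0]

b_long = ols(y, x1, x2, x3)   # regression including X_k
gamma = ols(y, x1, x2)        # regression omitting X_k
pi = ols(x3, x1, x2)          # auxiliary regression of X_k on the rest

# Coefficient on X1 in the short regression vs. beta_1 + beta_k * pi_1
print(gamma[1], b_long[1] + b_long[3] * pi[1])
```

The two printed numbers should agree up to floating-point error, whatever values are chosen for the assumed coefficients.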

2. Suppose that in the causal model

Y = β0 + β1 X1 + β2 X2 + u (1)

X1 is exogenous, but X2 is not: i.e. Eu = 0 and EX1 u = 0, but EX2 u = δ ≠ 0.

(a) Suppose we attempt to estimate (1) by linear regression. Derive an expression for
the large-sample limit of the OLS estimator of β1 (i.e. compute the coefficient on X1
in a population linear regression of Y on X1 and X2 ). [Hint: use the FWL theorem.]
(b) Using your answer to part (a), provide a condition under which the OLS estimator is
consistent for β1 (when δ ≠ 0). Interpret this condition.
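
An expression derived for part (a) can be compared against simulated data. The sketch below assumes a hypothetical design in which X2 = ρ X1 + e and u = δ e + w, so that EX1 u = 0 and EX2 u = δ; the parameter values, the function `coef_on_x1` and the parameter `rho` are illustrative assumptions only.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
beta0, beta1, beta2 = 1.0, 2.0, -1.0   # assumed coefficients in model (1)
delta = 1.0                            # assumed value of E[X2 u]

def coef_on_x1(rho):
    """OLS coefficient on X1 in a regression of Y on (X1, X2), for a given rho."""
    x1 = rng.normal(size=n)
    e = rng.normal(size=n)
    x2 = rho * x1 + e                    # X2 is correlated with X1 when rho != 0
    u = delta * e + rng.normal(size=n)   # E[X1 u] = 0, E[X2 u] = delta
    y = beta0 + beta1 * x1 + beta2 * x2 + u
    X = np.column_stack([np.ones(n), x1, x2])
    return np.linalg.lstsq(X, y, rcond=None)[0][1]

b1_correlated = coef_on_x1(rho=0.5)
b1_uncorrelated = coef_on_x1(rho=0.0)
print(b1_correlated, b1_uncorrelated)
```

With a large n, each printed value should be close to the large-sample limit implied by your answer to part (a), evaluated at the corresponding value of ρ.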

3. Consider the causal model

Y = β0 + β1 X1 + β2 X2 + β3 X3 + u

where OR holds, i.e. Eu = 0 and cov(Xl , u) = 0 for l ∈ {1, 2, 3}. Suppose that you observe
X1 and X2 . You do not observe X3 , but instead observe a possible proxy variable W .
Provide conditions under which an OLS regression of Y on (X1 , X2 , W ) will consistently
estimate β1 and β2 . [Hint: adapt the argument given in Section 7.1.2 of the notes.]

Randomised control trials

4. Consider a study, run in 2005–06 in the US, to evaluate the effect on college student grades
of dorm room internet connections. In a large dorm, half the rooms were randomly wired
for high-speed internet connections (the treatment group), and final course grades collected
for all residents at the end of the academic year (in July 2006). Which of the following
would pose threats to the internal validity of the study, and why?

(a) Midway through the year, all the male athletes moved into a fraternity and dropped
out of the study. (Their final grades were not observed.)
(b) Engineering students assigned to the control group put together a local area network
so that they could share a private wireless Internet connection that they paid for
jointly.
(c) The art majors in the treatment group never learned how to access their internet
accounts.
(d) The economics majors in the treatment group provided access to their internet connection
to those in the control group, for a fee.
(e) A major storm in early October 2005 caused damage to the campus network such
that around 20% of all dorm room internet connections failed, and repairs were not
successfully carried out until August 2006.

5. Suppose you have data on an outcome {Yi} and a binary treatment dummy {Di}, for i = 1, . . . , n.
Let β̂1 denote the estimate of the coefficient on D in an OLS regression of Y on D, n1
the number of treated observations (Di = 1) and n0 = n − n1 the number of untreated
observations (Di = 0). Show that

$$\hat{\beta}_1 = \frac{1}{n_1} \sum_{\{i \mid D_i = 1\}} Y_i - \frac{1}{n_0} \sum_{\{i \mid D_i = 0\}} Y_i$$

i.e. that the OLS estimator is equal to the difference in the sample means of the treated
and untreated groups.
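
Before attempting the proof, the claim can be checked on simulated data. This is a minimal sketch, assuming a hypothetical outcome equation; the numbers 2.0 and 0.7 are arbitrary choices and have no significance.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 1_000

d = rng.integers(0, 2, size=n)            # binary treatment dummy D_i
y = 2.0 + 0.7 * d + rng.normal(size=n)    # hypothetical outcome Y_i

# OLS regression of Y on a constant and D
X = np.column_stack([np.ones(n), d])
beta_hat = np.linalg.lstsq(X, y, rcond=None)[0]

# Difference between the treated and untreated sample means
diff_means = y[d == 1].mean() - y[d == 0].mean()

print(beta_hat[1], diff_means)
```

The two printed values should coincide up to floating-point error, and the estimated intercept should equal the sample mean of the untreated group.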

6. An economist has run the following experiment to estimate the income elasticity of food
consumption, using income transfers. 10,000 households were randomly sampled from the
population of a large city in a developing country, to participate in the study: for each of
these the economist has information on the household head (their age, years of completed
education, and height) and the household itself (household size, and household income
and expenditure on food in the week prior to the study). After collecting this data, the
economist used a random number generator to assign each of the participating households
to either:

• a ‘control’ group, which received no income transfer (25% of households)

• ‘treatment’ groups of varying intensity, corresponding to an income transfer of 100
up to 1000 (in units of the local currency), in increments of 10 (75% of households,
with equal proportions of these receiving each of the possible values of the income
transfer).

The economist then recorded each household’s expenditure on food during the week follow-
ing their receipt of the income transfer. The following table gives OLS regression estimates
obtained using the data collected by the economist (all regressions include a constant term,
the estimate of which is not reported):

OLS regression estimates

Dependent variables: (1), (2): food consumption
(3): income transfer
(1) (2) (3)
Income transfer 0.659 0.642 —
(0.123) (0.079) —
Age 0.121 0.000
(0.034) (0.009)
Education (years) 0.041 0.001
(0.008) (0.008)
Household size 0.101 0.002
(0.026) (0.004)
Height (in cm) 0.051 −0.001
(0.019) (0.005)
Food consumption in week prior to study 0.876 0.003
(0.121) (0.018)
Income in week prior to study 0.020
(0.021)
F 1245 1.20
R2 0.14 0.34 0.02

where F denotes the F statistic for a test of the null that all slope coefficients are zero.

(a) What is the purpose of performing regression (3)? What do you infer from the F
statistic?
(b) In the context of (1) and (2), why do you think the economist has regressed food
consumption on the value of the income transfer, rather than on total household
income (inclusive of the transfer)? (Assume that income in the week that the transfer
was received was also recorded.) Or would you not expect the choice of either to make
a fundamental difference to the estimates?
(c) Do you think the estimated coefficient on the income transfer in (1) can be given a
causal interpretation? Construct a 95% confidence interval for the estimated coefficient
on the income transfer in (1), and interpret it.

(d) What is the purpose of including the additional regressors in (2)? Compare the
estimated coefficient on the income transfer, and its standard error, with that in (1),
and give an intuitive explanation for why they differ.
(e) Explain why height might have been included in (2). Is it possible to give its estimated
coefficient a causal interpretation?
(f) Recall that the economist is interested in the income elasticity of food consumption. A
reviewer suggests re-estimating the regression in (1), but this time with the logarithm
of food consumption as the dependent variable, and the logarithm of the income
transfer on the r.h.s. (in place of the level of the income transfer). Do you think this
is a sensible approach? Could you propose an alternative way to estimate the income
elasticity of food consumption?
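
For the arithmetic in part (c), and the comparison of standard errors in part (d), the sketch below computes 95% confidence intervals from the reported estimates and standard errors. It assumes the usual large-sample normal approximation with critical value 1.96; the dictionary `estimates` simply restates the table entries for the income transfer.

```python
# Coefficient and standard error on the income transfer, from columns (1) and (2)
estimates = {"(1)": (0.659, 0.123), "(2)": (0.642, 0.079)}
z = 1.96  # assumed critical value: two-sided 95% interval, normal approximation

intervals = {}
for label, (coef, se) in estimates.items():
    lower, upper = coef - z * se, coef + z * se
    intervals[label] = (lower, upper)
    print(f"{label}: [{lower:.3f}, {upper:.3f}]")
```

This gives roughly [0.418, 0.900] for (1) and [0.487, 0.797] for (2); the interpretation of the intervals is left to the question.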

7. Suppose we are interested in the effect of kindergarten class sizes on outcomes later in
life, in this case on earnings at age 40. We observe a group of individuals who were
randomly assigned to ‘small’ and ‘regularly’ sized classes during kindergarten as part of
an experimental study. Our dataset records the type of class they were assigned to (D = 1
if a small class, 0 otherwise), their earnings at age 40 (Y ), and their total years spent in
education by age 40 (X).

(a) Consider a regression of Y on D alone: what causal interpretation could be given
to the estimated coefficient on D? Would you be concerned about omitted variable
bias, due e.g. to the lack of data on an individual’s family background, and other
characteristics?
(b) Suppose you were to regress Y on D and X: could the coefficient on D be interpreted
as an estimate of the causal effect of kindergarten class size on earnings at age 40,
holding educational attainment constant?

[Hint: in answering the preceding questions, it might be helpful to consider the following
model for the determination of Y and X

Y = β0 + β1 D + β2 X + u
X = δ0 + δ1 D + v

and think about what might be plausibly assumed about D, X, u and v in this setting.]
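
The two-equation model in the hint can be simulated to see how the regressions in (a) and (b) behave. The sketch below is one possible design, not the only plausible one: it assumes D is randomly assigned and that u and v share a hypothetical unobserved component (`ability`). All parameter values and the helper `ols` are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 200_000
beta0, beta1, beta2 = 10.0, 1.0, 2.0   # assumed parameters of the Y equation
delta0, delta1 = 12.0, 0.5             # assumed parameters of the X equation

d = rng.integers(0, 2, size=n).astype(float)   # randomly assigned class type
ability = rng.normal(size=n)                   # hypothetical unobserved factor
v = ability + rng.normal(size=n)               # error in the X equation
u = ability + rng.normal(size=n)               # error in the Y equation
x = delta0 + delta1 * d + v
y = beta0 + beta1 * d + beta2 * x + u

def ols(dep, *regressors):
    """Return OLS coefficients (intercept first) from a least-squares fit."""
    X = np.column_stack([np.ones(len(dep)), *regressors])
    return np.linalg.lstsq(X, dep, rcond=None)[0]

b_short = ols(y, d)      # regression of Y on D alone, as in part (a)
b_long = ols(y, d, x)    # regression of Y on D and X, as in part (b)

print("coefficient on D, Y on D alone:", b_short[1])
print("coefficient on D, Y on D and X:", b_long[1])
```

Comparing the two printed coefficients with β1 and with β1 + β2 δ1 under these assumptions may help in answering both parts; changing how u and v are generated shows which conclusions depend on the assumed correlation between them.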
