
Multiple Regression Model

The Multiple Regression Model

• The multiple regression model takes the form

$$Y_i = \beta_0 + \beta_1 X_{i1} + \beta_2 X_{i2} + \dots + \beta_k X_{ik} + u_i$$

• There are k regressors (explanatory variables) and a constant. Hence there will be k + 1 parameters to estimate.

• Assumption M.1:
We keep the basic least squares assumption: the error term is mean independent of all regressors (loosely speaking, all Xs are uncorrelated with the error term), i.e.

$$E(u_i \mid X_1, X_2, \dots, X_k) = E(u_i \mid X) = 0$$
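A minimal numerical sketch of estimating such a model (Python with numpy; the data is simulated and all parameter values are illustrative, not from the notes):

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 500, 3
X = rng.normal(size=(n, k))                # the k regressors
beta = np.array([1.0, 0.5, -0.3, 2.0])     # [beta0, beta1, beta2, beta3]
u = rng.normal(size=n)                     # error term, mean independent of X (M.1)
y = beta[0] + X @ beta[1:] + u

Xc = np.column_stack([np.ones(n), X])      # add a constant: k + 1 parameters
beta_hat, *_ = np.linalg.lstsq(Xc, y, rcond=None)
print(beta_hat)                            # close to [1.0, 0.5, -0.3, 2.0]
```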



Interpretation of the coefficients

• Since the error term is mean independent of the Xs, varying the Xs does not have an impact on the error term.
• Thus under Assumption M.1 the coefficients in the regression model have the following simple interpretation:

$$\beta_j = \frac{\partial Y_i}{\partial X_{ij}}$$

• Thus each coefficient measures the impact of the corresponding X on Y keeping all other factors (Xs and u) constant: a ceteris paribus effect.



Dummy Variables

• Some of the explanatory variables are not necessarily continuous. Y may also be determined by qualitative factors which are not measured in any units:
– sex, nationality or race;
– type of education (vocational, general);
– type of housing (flat, large house or small house).
• These characteristics are coded into dummy variables, which take only two values, 0 or 1:

D_i = 0 if the individual is male
D_i = 1 if the individual is female
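A small sketch of this coding step (the labels are illustrative):

```python
import numpy as np

sex = np.array(["male", "female", "female", "male"])
D = (sex == "female").astype(float)   # D_i = 1 if female, 0 if male
print(D)                              # [0. 1. 1. 0.]
```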



Dummy Variables: Intercept Specific Relationship

• The dummy variable can be used to build a model with an intercept that varies across the groups coded by the dummy variable:

$$Y_i = \beta_0 + \beta_1 X_i + \beta_2 D_i + u_i$$

[Figure: two parallel lines with slope β1, one with intercept β0 (for D_i = 0) and one with intercept β0 + β2 (for D_i = 1).]

• Interpretation: the observations for which D_i = 1 have on average a Y_i which is β2 units higher.
• Example: WTP, income and sex

Variable       Coefficient   Std. err.
log income     0.22          0.06
sex (1=Male)   0.01          0.09
constant       0.42          0.47
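A simulation sketch of this intercept-shift specification, with coefficient values borrowed from the table purely for illustration:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 1_000
x = rng.normal(size=n)
D = rng.integers(0, 2, size=n).astype(float)   # group indicator
y = 0.42 + 0.22 * x + 0.01 * D + rng.normal(size=n)

Xc = np.column_stack([np.ones(n), x, D])
b0, b1, b2 = np.linalg.lstsq(Xc, y, rcond=None)[0]
print(b0, b1, b2)   # b2 estimates the average shift in Y for the D = 1 group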



Dummy Variables: Slope Specific Relationship

• The dummy variable can also be interacted with a continuous variable, to get a slope specific to each group:

$$Y_i = \beta_0 + \beta_1 X_i + \beta_2 X_i D_i + u_i$$

[Figure: two lines through the same intercept β0, with slope β1 for D = 0 and slope β1 + β2 for D = 1.]

• Interpretation: for observations with D_i = 0, a one unit increase in X_i leads to an increase of β1 units in Y_i. For those with D_i = 1, Y_i increases by β1 + β2 units.
• Example: WTP, income and sex

Variable                  Coefficient   Std. err.
log income                0.23          0.06
sex (1=Male)*log income   0.003         0.01
constant                  0.42          0.47
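The same idea with an interaction term, again with illustrative coefficient values on simulated data:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 1_000
x = rng.normal(size=n)
D = rng.integers(0, 2, size=n).astype(float)
y = 0.42 + 0.23 * x + 0.003 * x * D + rng.normal(size=n)

Xc = np.column_stack([np.ones(n), x, x * D])   # interaction term X_i * D_i
b0, b1, b2 = np.linalg.lstsq(Xc, y, rcond=None)[0]
print(b1, b1 + b2)   # slope for D = 0, slope for D = 1
```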



Least Squares in the Multiple Regression Model

• We maintain the same set of assumptions as in the one variable regression model.
• We modify Assumption A.1 to Assumption M.1 to take into account the existence of many regressors.
• The OLS estimator is chosen to minimise the residual sum of squares exactly as before.
• Thus β0, β1, ..., βk are chosen to minimise

$$S = \sum_{i=1}^{N} u_i^2 = \sum_{i=1}^{N} \left(Y_i - \beta_0 - \beta_1 X_{i1} - \dots - \beta_k X_{ik}\right)^2$$

• Differentiating S with respect to each coefficient in turn, we obtain a set of k + 1 equations constituting the first order conditions for minimising the residual sum of squares S. These equations are called the Normal Equations.
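In matrix form the normal equations are X'Xβ = X'y, which a sketch like the following can solve directly (simulated data, illustrative values):

```python
import numpy as np

rng = np.random.default_rng(3)
n, k = 200, 2
X = rng.normal(size=(n, k))
y = 1.0 + X @ np.array([0.5, -0.3]) + rng.normal(size=n)

Xc = np.column_stack([np.ones(n), X])      # constant + k regressors
b = np.linalg.solve(Xc.T @ Xc, Xc.T @ y)   # solve the k + 1 normal equations
print(b)                                   # close to [1.0, 0.5, -0.3]
```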



A solution for two regressors

• With two regressors this represents a two equation system with two unknowns, β1 and β2.
• The solution for β1 is

$$\hat{\beta}_1 = \frac{\displaystyle\sum_{i=1}^{N}(Y_i - \bar{Y})X_{i1}\sum_{i=1}^{N}(X_{i2} - \bar{X}_2)X_{i2} - \sum_{i=1}^{N}(Y_i - \bar{Y})X_{i2}\sum_{i=1}^{N}(X_{i2} - \bar{X}_2)X_{i1}}{\displaystyle\sum_{i=1}^{N}(X_{i1} - \bar{X}_1)X_{i1}\sum_{i=1}^{N}(X_{i2} - \bar{X}_2)X_{i2} - \sum_{i=1}^{N}(X_{i2} - \bar{X}_2)X_{i1}\sum_{i=1}^{N}(X_{i1} - \bar{X}_1)X_{i2}}$$

• This formula can also be written as

$$\hat{\beta}_1 = \frac{cov(Y, X_1)\,Var(X_2) - cov(X_1, X_2)\,cov(Y, X_2)}{Var(X_1)\,Var(X_2) - cov(X_1, X_2)^2}$$

Similarly we can derive the formula for the other coefficient, β2.
• Note that the formula for β̂1 is now different from the formula we had in the one variable regression model: it takes into account the presence of the other regressor(s).
• The extent to which the two formulae differ depends on the covariance of X1 and X2.
• When this covariance is zero we are back to the formula for the one variable regression model.
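A quick numerical check of the covariance formula against OLS on simulated, deliberately correlated regressors (all values illustrative):

```python
import numpy as np

rng = np.random.default_rng(4)
n = 100_000
x1 = rng.normal(size=n)
x2 = 0.6 * x1 + rng.normal(size=n)          # regressors correlated by construction
y = 1.0 + 0.5 * x1 - 0.3 * x2 + rng.normal(size=n)

c = np.cov(np.vstack([y, x1, x2]))          # rows = variables: Y, X1, X2
num = c[0, 1] * c[2, 2] - c[1, 2] * c[0, 2] # cov(Y,X1)Var(X2) - cov(X1,X2)cov(Y,X2)
den = c[1, 1] * c[2, 2] - c[1, 2] ** 2      # Var(X1)Var(X2) - cov(X1,X2)^2
b_ols = np.linalg.lstsq(np.column_stack([np.ones(n), x1, x2]), y, rcond=None)[0]
print(num / den, b_ols[1])                  # the two agree (approx. 0.5)
```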



The Gauss Markov Theorem

• The Gauss-Markov Theorem is valid for the multiple regression model. We need however to modify Assumption A.4.
• Define the covariance matrix of the regressors X to be

$$cov(X) = \begin{pmatrix} Var(X_1) & cov(X_1, X_2) & \dots & cov(X_1, X_k) \\ cov(X_1, X_2) & Var(X_2) & \dots & cov(X_2, X_k) \\ \vdots & \vdots & \ddots & \vdots \\ cov(X_1, X_k) & cov(X_2, X_k) & \dots & Var(X_k) \end{pmatrix}$$

• Assumption M.4: We assume that cov(X) is positive definite and hence can be inverted.
• Theorem: Under Assumptions M.1, A.2, A.3 and M.4, the Ordinary Least Squares (OLS) estimator is Best in the class of Linear Unbiased Estimators (BLUE).
• As before, this means that OLS provides the estimates that are least sensitive to changes in the data, given the stated assumptions.
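A sketch of how M.4 can fail: with perfectly collinear regressors, cov(X) is singular and cannot be inverted (illustrative simulated data):

```python
import numpy as np

rng = np.random.default_rng(5)
x1 = rng.normal(size=100)
x2 = 2.0 * x1                        # perfectly collinear with x1
covX = np.cov(np.vstack([x1, x2]))
print(np.linalg.eigvalsh(covX))      # smallest eigenvalue ~ 0: cov(X) singular, M.4 fails
```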



An Example

• We investigate the determinants of log willingness to pay (WTP).
• We include as explanatory variables:
– log income;
– education, coded as low, medium and high;
– age of the head of household, in years;
– household size.

Variable                 Coef.    Std. err.   t-stat
log income               0.14     0.07        2.2
medium education         0.47     0.16        2.9
high education           0.58     0.18        3.1
age                      0.0012   0.004       0.3
household size           0.008    0.02        0.4
constant                 0.53     0.55        0.96

number of observations   352
R²                       0.0697
adjusted R²              0.0562

Interpretation:
• When income goes up by 1%, WTP goes up by 0.14%.
• Low education is the reference group (we have omitted this dummy variable). Medium educated individuals have a WTP approximately 47% higher than low educated ones, and high educated ones approximately 58% higher.



Omitted Variable Bias

• Suppose the true regression relationship has the form

$$Y_i = \beta_0 + \beta_1 X_{i1} + \beta_2 X_{i2} + u_i$$

• Instead we decide to estimate:

$$Y_i = \beta_0 + \beta_1 X_{i1} + \nu_i$$

• We will show that in general this omission leads to a biased estimate of the coefficient on X1.

• Suppose we use OLS on the second equation. As we know, we will obtain:

$$\hat{\beta}_1 = \beta_1 + \frac{\sum_{i=1}^{N}(X_{i1} - \bar{X}_1)\nu_i}{\sum_{i=1}^{N}(X_{i1} - \bar{X}_1)^2}$$

• The question is: what is the expected value of the last expression on the right hand side? For an unbiased estimator this would be zero. Here we will show that it is not zero.



Omitted Variable Bias

• First note that according to the true model we have

$$\nu_i = \beta_2 X_{i2} + u_i$$

• We can substitute this into the expression for the OLS estimator to obtain

$$\hat{\beta}_1 = \beta_1 + \frac{\beta_2\sum_{i=1}^{N}(X_{i1} - \bar{X}_1)X_{i2} + \sum_{i=1}^{N}(X_{i1} - \bar{X}_1)u_i}{\sum_{i=1}^{N}(X_{i1} - \bar{X}_1)^2}$$

• Now we can take expectations of this expression:

$$E[\hat{\beta}_1 \mid X] = \beta_1 + \frac{\beta_2\sum_{i=1}^{N}E[(X_{i1} - \bar{X}_1)X_{i2} \mid X] + \sum_{i=1}^{N}E[(X_{i1} - \bar{X}_1)u_i \mid X]}{\sum_{i=1}^{N}(X_{i1} - \bar{X}_1)^2}$$

The last expression in the numerator is zero under the assumption that u is mean independent of X [Assumption M.1].
• This expression can be written more compactly as:

$$E[\hat{\beta}_1 \mid X] = \beta_1 + \beta_2\,\frac{cov(X_1, X_2)}{Var(X_1)}$$
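A simulation sketch of this result: the short regression's slope matches the bias formula (all parameter values illustrative):

```python
import numpy as np

rng = np.random.default_rng(6)
n = 100_000
x1 = rng.normal(size=n)
x2 = 0.8 * x1 + rng.normal(size=n)   # omitted regressor, correlated with x1
y = 1.0 + 0.5 * x1 + 0.7 * x2 + rng.normal(size=n)

# Short regression of y on a constant and x1 only (x2 omitted).
b = np.linalg.lstsq(np.column_stack([np.ones(n), x1]), y, rcond=None)[0]
predicted = 0.5 + 0.7 * np.cov(x1, x2)[0, 1] / np.var(x1, ddof=1)
print(b[1], predicted)               # both near 0.5 + 0.7 * 0.8 = 1.06
```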



Omitted Variable Bias

$$E[\hat{\beta}_1 \mid X] = \beta_1 + \beta_2\,\frac{cov(X_1, X_2)}{Var(X_1)}$$
• The bias will be zero in two cases:
– When the coefficient β2 is zero. In this case the regressor
X2 obviously does not belong to the regression.
– When the covariance between the two regressors X1 and
X2 is zero.
• Thus in general omitting regressors which have an impact on
Y (β2 non-zero) will bias the OLS estimator of the coefficients
on the included regressors unless the omitted regressors are
uncorrelated with the included ones.



Summary of Results

• Omitting a regressor which has an impact on the dependent variable and is correlated with the included regressors leads to "omitted variable bias".
• Including a regressor which has no impact on the dependent variable but is correlated with the included regressors leads to a reduction in the efficiency of estimation of the coefficients on the variables included in the regression.



Measurement Error

• Data is often measured with error:
– reporting errors;
– coding errors.
• The measurement error can affect either the dependent variable or the explanatory variables. The consequences are dramatically different in the two cases.



Measurement Error on Dependent Variable

• Y_i is measured with error. We assume that the measurement error is additive and not correlated with X_i.
• We observe Y̌_i = Y_i + ν_i and regress Y̌_i on X_i:

$$\check{Y}_i = Y_i + \nu_i = \beta_0 + \beta_1 X_i + u_i + \nu_i = \beta_0 + \beta_1 X_i + w_i$$

• The assumptions we have made for OLS to be unbiased and BLUE are not violated. The OLS estimator is unbiased.
• The variance of the slope coefficient is:

$$Var(\hat{\beta}_1) = \frac{1}{N}\frac{Var(w_i)}{Var(X_i)} = \frac{1}{N}\frac{Var(u_i + \nu_i)}{Var(X_i)} = \frac{1}{N}\frac{Var(u_i) + Var(\nu_i)}{Var(X_i)} \;\ge\; \frac{1}{N}\frac{Var(u_i)}{Var(X_i)}$$

where the third equality uses the assumption that u_i and ν_i are uncorrelated.

• The variance of the estimator is therefore larger with measurement error on Y_i.
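A simulation sketch of both points: the slope stays unbiased but its sampling variance grows (illustrative values; with Var(u) = Var(ν) = 1 the variance roughly doubles):

```python
import numpy as np

rng = np.random.default_rng(7)
n, reps = 200, 2_000
slopes_clean, slopes_noisy = [], []
for _ in range(reps):
    x = rng.normal(size=n)
    y = 1.0 + 0.5 * x + rng.normal(size=n)
    y_obs = y + rng.normal(size=n)           # additive measurement error on Y
    Xc = np.column_stack([np.ones(n), x])
    slopes_clean.append(np.linalg.lstsq(Xc, y, rcond=None)[0][1])
    slopes_noisy.append(np.linalg.lstsq(Xc, y_obs, rcond=None)[0][1])

print(np.mean(slopes_noisy))                        # still near 0.5: unbiased
print(np.var(slopes_clean), np.var(slopes_noisy))   # noisy variance is larger
```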



Measurement Error on Explanatory Variables

• X_i is measured with error. We assume that the error is additive and not correlated with X_i.
• We observe X̌_i = X_i + ν_i instead. The regression we perform is Y_i on X̌_i. The estimator of β1 is expressed as:


$$\hat{\beta}_1 = \frac{\sum_{i=1}^{N}(\check{X}_i - \bar{\check{X}})(Y_i - \bar{Y})}{\sum_{i=1}^{N}(\check{X}_i - \bar{\check{X}})^2} = \frac{\sum_{i=1}^{N}(X_i + \nu_i - \bar{X})(\beta_0 + \beta_1 X_i + u_i - \bar{Y})}{\sum_{i=1}^{N}(X_i + \nu_i - \bar{X})^2}$$

Dropping the cross terms, whose expectation is zero, this behaves like

$$\frac{\beta_1\sum_{i=1}^{N}(X_i - \bar{X})^2}{\sum_{i=1}^{N}\left[(X_i - \bar{X})^2 + \nu_i^2\right]}$$

so that

$$E(\hat{\beta}_1) = \frac{\beta_1\,Var(X_i)}{Var(X_i) + Var(\nu_i)} \le \beta_1$$

• Measurement error on X_i leads to a biased OLS estimate: since the factor Var(X_i)/(Var(X_i) + Var(ν_i)) is less than one, the estimate is biased towards zero. This is also called attenuation bias.
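A simulation sketch of attenuation bias, with Var(X) = Var(ν) = 1 chosen for illustration so the attenuation factor is 1/2:

```python
import numpy as np

rng = np.random.default_rng(8)
n = 100_000
x = rng.normal(size=n)               # Var(X) = 1
nu = rng.normal(size=n)              # Var(nu) = 1
y = 1.0 + 0.5 * x + rng.normal(size=n)
x_obs = x + nu                       # regressor observed with additive error

b = np.linalg.lstsq(np.column_stack([np.ones(n), x_obs]), y, rcond=None)[0]
print(b[1])                          # near 0.5 * 1 / (1 + 1) = 0.25: attenuation
```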

