Ecc321 Chapter 3

Multiple Regression Analysis: Estimation


Definition of Multiple Linear Regression
The multiple linear regression model explains a variable y in terms of explanatory variables x_1, x_2, ..., x_k:

y = β_0 + β_1 x_1 + β_2 x_2 + ... + β_k x_k + u

Motivation for Multiple Regression


Incorporate more explanatory factors into the model.
Explicitly hold fixed other factors that otherwise would be in the error term.
Allow for more flexible functional forms.
Example: Wage equation.

Example: Average Test Scores and Per-Student Spending
Per-student spending is likely to be correlated with average family income at a given high school due to school financing. Omitting average family income from the regression would lead to a biased estimate of the effect of spending on average test scores. In a simple regression model, the estimated effect of per-student spending would partly include the effect of family income on test scores.

Example: Family Income and Family Consumption
The model has two explanatory variables: income and income squared.
Consumption is explained as a quadratic function of income. One has to be very
careful when interpreting the coefficients.

Example: CEO Salary, Sales, and CEO Tenure


Model assumes a constant elasticity relationship between CEO salary and the
sales of his or her firm.
Model assumes a quadratic relationship between CEO salary and his or her
tenure with the firm.
The model has to be linear in the parameters (not in the variables).

OLS Estimation of the Multiple Regression Model
Random sample
Regression residuals
Minimize the sum of squared residuals (a sketch follows below)
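
A minimal sketch of how the OLS estimates can be computed from a random sample by minimizing the sum of squared residuals. The variable names and simulated data below are hypothetical and only illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical random sample: y = 1 + 2*x1 - 0.5*x2 + u
n = 500
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)
y = 1 + 2 * x1 - 0.5 * x2 + rng.normal(size=n)

# Design matrix with an intercept column
X = np.column_stack([np.ones(n), x1, x2])

# OLS: the minimizer of the sum of squared residuals solves the
# normal equations X'X beta = X'y
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

residuals = y - X @ beta_hat      # regression residuals
ssr = residuals @ residuals       # minimized sum of squared residuals
print(beta_hat, ssr)
```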

Interpretation of the Multiple Regression Model
The multiple linear regression model manages to hold the values of other
explanatory variables fixed even if, in reality, they are correlated with the explanatory
variable under consideration.

Ceteris paribus interpretation: it still has to be assumed that unobserved factors do not change if the explanatory variables are changed.

Example: Determinants of College GPA


Interpretation: Holding ACT fixed, another point on high school grade point average is associated with .453 additional points of college grade point average.
If we compare two students with the same ACT, but the high school GPA of
student A is one point higher, we predict student A to have a college GPA that
is .453 higher than that of student B.
Holding high school grade point average fixed, another 10 points on ACT are
associated with less than one point on college GPA.

Properties of OLS on Any Sample of Data


Fitted values and residuals
Algebraic properties of OLS regression


Partialling Out Interpretation of Multiple Regression
One can show that the estimated coefficient of an explanatory variable in a multiple
regression can be obtained in two steps:

1. Regress the explanatory variable on all other explanatory variables.


2. Regress y on the residuals from this regression.

The residuals from the first regression are the part of the explanatory variable that is
uncorrelated with the other explanatory variables. The slope coefficient of the second
regression therefore represents the isolated effect of the explanatory variable on the
dependent variable.
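
A small sketch of this two-step (partialling-out) result on hypothetical simulated data; the regressor names are made up for illustration:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 1000
x2 = rng.normal(size=n)
x1 = 0.6 * x2 + rng.normal(size=n)          # x1 correlated with x2
y = 1 + 2 * x1 - 0.5 * x2 + rng.normal(size=n)

X_full = np.column_stack([np.ones(n), x1, x2])
beta_full = np.linalg.lstsq(X_full, y, rcond=None)[0]

# Step 1: regress x1 on all other explanatory variables (constant and x2)
X_other = np.column_stack([np.ones(n), x2])
gamma = np.linalg.lstsq(X_other, x1, rcond=None)[0]
r1 = x1 - X_other @ gamma                   # part of x1 uncorrelated with x2

# Step 2: regress y on these residuals; the slope equals the
# multiple regression coefficient on x1
slope = (r1 @ y) / (r1 @ r1)
print(beta_full[1], slope)                  # the two numbers coincide
```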

Goodness-of-Fit
Decomposition of total variation: SST = SSE + SSR
R² = SSE / SST = 1 − SSR / SST
Alternative expression for R²: the squared correlation between the actual and the fitted values of y (a short computational sketch follows below)
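
A sketch of the decomposition and of both ways of computing R², again on hypothetical simulated data:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 400
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.normal(size=n)

beta = np.linalg.lstsq(X, y, rcond=None)[0]
y_hat = X @ beta
u_hat = y - y_hat

sst = np.sum((y - y.mean()) ** 2)           # total variation
sse = np.sum((y_hat - y_hat.mean()) ** 2)   # explained variation
ssr = np.sum(u_hat ** 2)                    # residual variation

r2 = 1 - ssr / sst                          # R-squared
r2_alt = np.corrcoef(y, y_hat)[0, 1] ** 2   # squared corr(actual, fitted)
print(sst, sse + ssr, r2, r2_alt)           # SST = SSE + SSR; r2 == r2_alt
```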

Example: Explaining Arrest Records


If the proportion prior arrests increases by 0.5, the predicted fall in arrests is 7.5
arrests per 100 men.
If the months in prison increase from 0 to 12, the predicted fall in arrests is
0.408 arrests for a particular man.
If the quarters employed increase by 1, the predicted fall in arrests is 10.4
arrests per 100 men.
When an additional explanatory variable is added (average prior sentence), the additional explanatory power is limited, as R² increases only slightly.
Even if R² is small, the regression may still provide good estimates of ceteris paribus effects.

Standard Assumptions for the Multiple Regression Model


Assumption — Description

MLR.1 (Linear in parameters)
MLR.2 (Random sampling)
MLR.3 (No perfect collinearity): In the sample (and therefore in the population), none of the independent variables is constant and there are no exact linear relationships among the independent variables.
MLR.4 (Zero conditional mean): In a multiple regression model, the zero conditional mean assumption is much more likely to hold because fewer things end up in the error.
MLR.5 (Homoskedasticity)

Remarks on MLR.3
The assumption only rules out perfect collinearity/correlation between
explanatory variables; imperfect correlation is allowed.
If an explanatory variable is a perfect linear combination of other explanatory
variables, it is superfluous and may be eliminated.
Constant variables are also ruled out (collinear with intercept).

Example for Perfect Collinearity


Small sample and relationships between regressors.

Example: Average Test Scores


In a multiple regression model, the zero conditional mean assumption is much more
likely to hold because fewer things end up in the error.

Omitted Variable Bias

Conclusion: if a relevant variable is omitted and is correlated with the included regressors, all estimated coefficients will generally be biased.


Example: Omitting Ability in a Wage Equation
When is there no omitted variable bias? If the omitted variable is irrelevant or uncorrelated with the included explanatory variables.

More General Cases


No general statements possible about direction of bias. Analysis as in simple case if
one regressor uncorrelated with others. Example: Omitting ability in a wage equation.
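
A small simulation sketch of omitted variable bias, assuming a hypothetical data-generating process in which the included regressor is correlated with the omitted one:

```python
import numpy as np

rng = np.random.default_rng(3)
n = 100_000

abil = rng.normal(size=n)                  # omitted variable (e.g. ability)
educ = 0.5 * abil + rng.normal(size=n)     # included regressor, correlated with abil
wage = 1 + 0.8 * educ + 0.4 * abil + rng.normal(size=n)

# Short regression: wage on educ only (ability omitted)
X_short = np.column_stack([np.ones(n), educ])
b_short = np.linalg.lstsq(X_short, wage, rcond=None)[0]

# Long regression: ability included
X_long = np.column_stack([np.ones(n), educ, abil])
b_long = np.linalg.lstsq(X_long, wage, rcond=None)[0]

print(b_short[1])   # biased upward, since abil has a positive effect
                    # and is positively correlated with educ
print(b_long[1])    # close to the true value 0.8
```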

Homoskedasticity
Example: Wage equation

Shorthand notation: Var(u | x_1, x_2, ..., x_k) = σ²

Sampling Variances of the OLS Slope Estimators
Under assumptions MLR.1 – MLR.5:

Var(β̂_j) = σ² / [SST_j (1 − R_j²)],   j = 1, ..., k

Components of OLS Variances


1. The error variance (σ²):
A high error variance increases the sampling variance because there is more noise in the equation.
A large error variance does not necessarily make estimates imprecise.
The error variance does not decrease with the sample size.

2. The total sample variation in the explanatory variable (SST_j):
More sample variation leads to more precise estimates.
Total sample variation automatically increases with the sample size.
Increasing the sample size is thus a way to get more precise estimates.

3. Linear relationships among the independent variables (R_j²):
Regress x_j on all other independent variables (including a constant).
The R² of this regression will be higher when x_j can be better explained by the other independent variables.
The sampling variance of the slope estimator for x_j will therefore be higher when x_j can be better explained by the other independent variables.
Under perfect multicollinearity, the variance of the slope estimator will approach infinity.

A numerical sketch of these components follows below.
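
A sketch evaluating the variance formula and its components for one coefficient on hypothetical simulated data; the true σ² is known here only because the data are simulated:

```python
import numpy as np

rng = np.random.default_rng(4)
n = 2000
sigma2 = 4.0                                  # true error variance

x2 = rng.normal(size=n)
x1 = 0.8 * x2 + rng.normal(size=n)            # correlated regressors
y = 1 + 2 * x1 - 0.5 * x2 + rng.normal(scale=np.sqrt(sigma2), size=n)

# Components for the slope on x1
sst_1 = np.sum((x1 - x1.mean()) ** 2)         # total sample variation in x1

# R_1^2: regress x1 on the other regressors (constant and x2)
X_other = np.column_stack([np.ones(n), x2])
x1_hat = X_other @ np.linalg.lstsq(X_other, x1, rcond=None)[0]
r2_1 = 1 - np.sum((x1 - x1_hat) ** 2) / sst_1

var_beta1 = sigma2 / (sst_1 * (1 - r2_1))     # Var(beta_hat_1) under MLR.1-MLR.5
print(sst_1, r2_1, var_beta1)
```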

Multicollinearity

Example
The different expenditure categories will be strongly correlated because if a school
has a lot of resources it will spend a lot on everything. It will be hard to estimate the
differential effects of different expenditure categories because all expenditures are
either high or low. For precise estimates of the differential effects, one would need
information about situations where expenditure categories change differentially. As a
consequence, sampling variance of the estimated effects will be large.

Discussion


In the above example, it would probably be better to lump all expenditure categories together because effects cannot be disentangled.
In other cases, dropping some independent variables may reduce
multicollinearity (but this may lead to omitted variable bias).
Only the sampling variance of the variables involved in multicollinearity will be
inflated; the estimates of other effects may be very precise.
Multicollinearity is not a violation of MLR.3 in the strict sense.
Multicollinearity may be detected through variance inflation factors, VIF_j = 1 / (1 − R_j²) (a sketch follows below).
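
A sketch of computing variance inflation factors for each regressor, using hypothetical simulated expenditure-style data in which two categories share a common driver:

```python
import numpy as np

def vif(X):
    """Variance inflation factor 1 / (1 - R_j^2) for each column of X
    (X should not include the constant column)."""
    n, k = X.shape
    out = np.empty(k)
    for j in range(k):
        others = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
        fitted = others @ np.linalg.lstsq(others, X[:, j], rcond=None)[0]
        ssr = np.sum((X[:, j] - fitted) ** 2)
        sst = np.sum((X[:, j] - X[:, j].mean()) ** 2)
        out[j] = sst / ssr                    # = 1 / (1 - R_j^2)
    return out

rng = np.random.default_rng(5)
n = 500
resources = rng.normal(size=n)                # common driver of both categories
teachers = resources + 0.3 * rng.normal(size=n)
materials = resources + 0.3 * rng.normal(size=n)
other = rng.normal(size=n)                    # unrelated regressor
X = np.column_stack([teachers, materials, other])
print(vif(X))                                 # first two VIFs are inflated
```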

Variances in Misspecified Models


The choice of whether to include a particular variable in a regression can be made by analyzing the tradeoff between bias and variance. It may be that the omitted variable bias in the misspecified model is more than offset by its smaller variance.

Estimating the Error Variance


An unbiased estimate of the error variance is obtained by dividing the sum of squared residuals by the degrees of freedom, i.e. the number of observations minus the number of estimated parameters (n − k − 1).

The n estimated squared residuals in the sum are not completely independent but
related through the k + 1 equations that define the first-order conditions of the
minimization problem.

Theorem 3.3 (Unbiased estimator of the error variance)

σ̂² = SSR / (n − k − 1)

Estimation of the Sampling Variances of the OLS Estimators
The estimated sampling variance is obtained by replacing σ² with σ̂² in the variance formula: Var̂(β̂_j) = σ̂² / [SST_j (1 − R_j²)]. Note that these formulas are only valid under assumptions MLR.1–MLR.5 (in particular, there has to be homoskedasticity).
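
A sketch combining Theorem 3.3 with the estimated standard errors, continuing the hypothetical simulated-data examples above (valid only under MLR.1–MLR.5):

```python
import numpy as np

rng = np.random.default_rng(6)
n = 800
x1, x2 = rng.normal(size=n), rng.normal(size=n)
y = 1 + 2 * x1 - 0.5 * x2 + rng.normal(size=n)

X = np.column_stack([np.ones(n), x1, x2])
k = X.shape[1] - 1                              # number of slope parameters

beta = np.linalg.lstsq(X, y, rcond=None)[0]
ssr = np.sum((y - X @ beta) ** 2)

sigma2_hat = ssr / (n - k - 1)                  # Theorem 3.3: unbiased estimator

# Standard error of the slope on x1: sqrt(sigma2_hat / (SST_1 * (1 - R_1^2)))
sst_1 = np.sum((x1 - x1.mean()) ** 2)
others = np.column_stack([np.ones(n), x2])
fit = others @ np.linalg.lstsq(others, x1, rcond=None)[0]
r2_1 = 1 - np.sum((x1 - fit) ** 2) / sst_1
se_beta1 = np.sqrt(sigma2_hat / (sst_1 * (1 - r2_1)))
print(sigma2_hat, se_beta1)
```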

Efficiency of OLS: The Gauss-Markov Theorem


Under assumptions MLR.1 – MLR.5, OLS is unbiased. However, under these assumptions there may be many other estimators that are unbiased. Which one is the unbiased estimator with the smallest variance? In order to answer this question, one usually limits oneself to linear estimators, i.e., estimators linear in the dependent variable.

Theorem 3.4 (Gauss-Markov Theorem)

Under assumptions MLR.1 – MLR.5, the OLS estimators are the best linear unbiased estimators (BLUEs) of the regression coefficients. In other words, OLS is only guaranteed to be the best estimator if MLR.1 – MLR.5 hold; if there is heteroskedasticity, for example, there are better estimators.

Several Scenarios for Applying Multiple Regression
Prediction: The best prediction of y will be its conditional expectation.

Efficient markets: Efficient markets theory states that a single variable acts as a sufficient statistic for predicting y. Once we know this sufficient statistic, additional information is not useful in predicting y.

Measuring the tradeoff between two variables: Consider regressing salary on pension compensation and other controls.

Testing for ceteris paribus group differences: Differences in outcomes between groups can be evaluated with dummy variables.

Potential outcomes, treatment effects, and policy analysis: With multiple regression, we can get closer to random assignment by conditioning on observables. Inclusion of the x variables allows us to control for any reasons why there may not be random assignment.

For example, if y is earnings and w is participation in a job training program, then the variables in x would include all of those variables that are likely to be related to both earnings and participation in job training.
