Theme 3 Multivariante Regression Model

The document discusses multivariate linear regression, focusing on assumptions, omitted variable bias, and practical applications. It explains the importance of drawing statistical inferences, the structure of multiple linear regression models, and the significance of goodness of fit measures like R-squared. Additionally, it provides practical examples, including the analysis of testing costs and the application of the Fama-French benchmark factors in assessing excess returns in different industry portfolios.

Uploaded by

mismail10001000

Theme 3: Multivariate linear regression Model

[Multivariate Model - Matrix form - assumptions OLS- Omitted Variable Bias - Redundant
Variables- Goodness of Fit- Adjusted R^2 - The coefficient of Correlation- Practical
Applications]

3.1 Drawing Statistical Inferences from the population:

Ø In regression analysis our objective is not only to obtain the OLS estimators but also
to draw inferences about their true values in the population: we would like to know how
close the estimators are to their population counterparts.

3.2 Multiple Linear Regression:

Ø Suppose we have the following regression model:

$$Y_i = \beta_1 + \beta_2 X_{2i} + \beta_3 X_{3i} + u_i$$

Ø $\beta_1$ is still the intercept.
Ø $\beta_2$ ($\beta_3$) measures, on average, the change in Y with respect to X2 (X3),
holding other factors constant.
Ø u is still the error term (or disturbance).
Ø The regression model is still linear in the parameters.
Ø We still need a zero conditional mean assumption, so we now assume that
E(u | x2, x3) = 0.
3.2.1 Multiple Linear Regression Assumptions:
Ø No autocorrelation: cov(ui, uj) = 0 for i ≠ j.
Ø Homoscedasticity: the error term has a constant variance that does not fluctuate.
Ø Zero covariance between ui and the X's: cov(ui, X2i) = cov(ui, X3i) = 0.
Ø No specification bias (the model is correctly specified).
Ø The number of observations (n) exceeds the number of parameters to be estimated (k).
Ø The model is linear in the parameters, and the values of X are non-stochastic (fixed)
in each sample.
Ø No perfect linear relationship between the explanatory variables. For example,
aX2 + bX3 = 0 with a and b not both zero is a perfect linear relationship.
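
The last assumption can be illustrated numerically. The sketch below uses made-up data (the values of `x2`, `x3_collinear` and `x3_ok` are hypothetical) and checks whether the matrix X'X is singular, which is what happens under a perfect linear relationship and why OLS cannot then separate the two effects.

```python
# Hypothetical data: x3_collinear is an exact multiple of x2,
# so 2*X2 - 1*X3 = 0 holds for every observation.
x2 = [1.0, 2.0, 3.0, 4.0, 5.0]
x3_collinear = [2.0 * v for v in x2]
x3_ok = [1.0, 0.0, 2.0, 1.0, 3.0]
ones = [1.0] * len(x2)  # column for the intercept


def cross_product(columns):
    """Return X'X for a design matrix given as a list of columns."""
    return [[sum(a * b for a, b in zip(ci, cj)) for cj in columns]
            for ci in columns]


def det3(m):
    """Determinant of a 3x3 matrix by cofactor expansion."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


det_collinear = det3(cross_product([ones, x2, x3_collinear]))
det_ok = det3(cross_product([ones, x2, x3_ok]))

print("det(X'X) with perfect collinearity:", det_collinear)
print("det(X'X) without it:", det_ok)
```

A zero determinant means X'X cannot be inverted, so the normal equations have no unique solution.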
3.2.2 Practical Application:
Ø Suppose that students' exam performance is determined not by a single
explanatory variable (funding and expenditure) but also by other factors, such as
average family income, students' educational background, and so forth:

$$\mathit{avgscore} = \beta_0 + \beta_1\,\mathit{expend} + \beta_2\,\mathit{avginc} + \dots + \beta_k x_k + u_i$$

Ø In this example the coefficient of interest is $\beta_1$, the ceteris paribus effect of
expenditure (expend) on the average score (avgscore). A regression on expend alone
would omit variables that also explain why students get higher scores, such as
average family income (avginc): in the simple one-regressor OLS model, family income
is absorbed into the error term u. We can later include further variables, such as
teacher quality and school size.

3.3 Omitted Variable Bias & Redundant Variables: -

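Sign of the bias in the estimated coefficient on x1 when x2 is omitted (β2 is the
coefficient on the omitted variable x2):

            Corr(x1, x2) > 0    Corr(x1, x2) < 0
β2 > 0      Positive bias       Negative bias
β2 < 0      Negative bias       Positive bias

Ø If the correlation between x2 and x1 and the relation between x2 and y have the
same sign, the bias is positive and the coefficient on x1 is overestimated (for
example, wealth; the poverty rate in the example below also falls here, since both
correlations are negative).
Ø If the correlation between x2 and x1 and the relation between x2 and y have
opposite signs, the bias is negative and the coefficient on x1 is underestimated.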

The problem arises when a significant variable that belongs to the true population
model is omitted, which leaves the model underspecified. Example 3.2.2:
suppose that at the elementary school level, the average score of students on the
standardized exam is determined by:

$$\mathit{avgscore} = \beta_0 + \beta_1\,\mathit{expend} - \beta_2\,\mathit{povrate} + u_i$$

where:
expend denotes school expenditure per student;
povrate is the poverty rate of the children in the school.

Ø Let's assume that we can only obtain data on the percentage of students with a
passing grade and on expenditure per student, and that we have no information on the
poverty rate.

Ø There is ample evidence that children living in poverty tend to get lower
scores, which implies a negative correlation between y and x2. There may also be a
negative correlation, corr(x1, x2) < 0, between average expenditure per student and
the poverty rate. With both relationships negative, omitting povrate biases the
estimated coefficient on expend upward.
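
The direction of the bias in this example can be checked with a small simulation. This is only a sketch under stated assumptions: the parameter values (`beta0`, `beta1`, `beta2`), the sample size, and the way `expend` depends on `povrate` are all hypothetical and chosen so that both correlations are negative, as in the text.

```python
import random

random.seed(0)
n = 5000

# Hypothetical "true" model: avgscore = beta0 + beta1*expend - beta2*povrate + u
beta0, beta1, beta2 = 50.0, 1.0, 2.0

povrate = [random.gauss(0, 1) for _ in range(n)]
# expend falls as povrate rises, so corr(expend, povrate) < 0
expend = [-0.5 * p + random.gauss(0, 1) for p in povrate]
avgscore = [beta0 + beta1 * e - beta2 * p + random.gauss(0, 1)
            for e, p in zip(expend, povrate)]


def simple_slope(x, y):
    """OLS slope from regressing y on x alone (with an intercept)."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


# povrate is omitted here, so its effect loads onto expend
short_slope = simple_slope(expend, avgscore)
print("true beta1:", beta1)
print("slope with povrate omitted:", round(short_slope, 3))
```

Under these assumptions the short regression slope should sit well above the true value of 1.0, which is the positive bias the table predicts when both relationships are negative.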

3.3.1 Matrix Notation:

$$Y_i = \beta_1 + \beta_2 X_{2i} + \beta_3 X_{3i} + \dots + \beta_k X_{ki} + u_i, \qquad i = 1, 2, 3, \dots, n$$

In matrix form, Y = Xβ + u, where:
Ø Y = an n × 1 vector of observations on the explained variable.
Ø X = an n × k matrix of observations on the explanatory variables (the first column
is a column of ones for the intercept).
Ø β = a k × 1 vector of parameters to be estimated.
Ø u = an n × 1 vector of errors.
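
As a minimal sketch of how these objects fit together, the code below builds X and Y from hypothetical, noise-free data and obtains the OLS estimate by solving the normal equations (X'X)b = X'Y. The helper names `solve` and `ols` are illustrative, not taken from the notes, and a statistical package would normally do this step.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        # partial pivoting: bring the largest entry in this column to the diagonal
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def ols(X, y):
    """Return the k x 1 OLS coefficient vector for an n x k matrix X."""
    k = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(k)]
           for i in range(k)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(k)]
    return solve(xtx, xty)


# Hypothetical data generated without noise from Y = 1 + 2*X2 - 1*X3
x2 = [1.0, 2.0, 3.0, 4.0, 5.0]
x3 = [1.0, 0.0, 2.0, 1.0, 3.0]
X = [[1.0, a, b] for a, b in zip(x2, x3)]  # n x k, first column of ones
Y = [1.0 + 2.0 * a - 1.0 * b for a, b in zip(x2, x3)]

beta_hat = ols(X, Y)
print([round(b, 6) for b in beta_hat])
```

Because the data contain no error term, the estimate recovers the coefficients used to generate Y; with real data the residual vector would be nonzero.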
3.4 Goodness of Fit:

Ø Goodness of fit is measured by R². We find out how “well” the sample regression
line fits the data by estimating the coefficient of determination R².
Ø TSS = ESS + RSS. Dividing both sides by TSS:
Ø 1 = ESS/TSS + RSS/TSS
Ø R² = ESS/TSS = 1 − RSS/TSS, known as the coefficient of determination.
Where:
Ø ESS: explained sum of squares.
Ø RSS: residual sum of squares.
Ø TSS: total sum of squares.

3.4.1 Adjusted R-Squared

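Ø Recall that R² never decreases as more variables are added to the model. The
adjusted R² takes the number of variables in the model into account and may decrease.
Ø It is easy to see that the adjusted R² is just 1 − (1 − R²)(n − 1)/(n − k − 1), but
most packages will give you both R² and adj-R².

$$\bar{R}^2 = 1 - \frac{SSR/(n-k-1)}{SST/(n-1)}$$

where SSR is the residual sum of squares (RSS above), SST is the total sum of squares
(TSS above), and k here counts the explanatory variables excluding the intercept.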
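
A short sketch of both measures follows. The function names and the sample values are hypothetical; in particular, n = 12 is only an assumed sample size used to show how the adjustment penalises extra regressors.

```python
def r_squared(y, y_hat):
    """R^2 = 1 - RSS/TSS for observed values y and fitted values y_hat."""
    mean_y = sum(y) / len(y)
    tss = sum((v - mean_y) ** 2 for v in y)
    rss = sum((v - f) ** 2 for v, f in zip(y, y_hat))
    return 1 - rss / tss


def adjusted_r_squared(r2, n, k):
    """Adjusted R^2, where k is the number of regressors excluding the intercept."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)


# Hypothetical observed and fitted values
y = [1.0, 2.0, 3.0, 4.0]
y_hat = [1.1, 1.9, 3.2, 3.8]
print(round(r_squared(y, y_hat), 4))

# Assumed inputs: R^2 = 0.86 with n = 12 observations and k = 3 regressors
print(round(adjusted_r_squared(0.86, 12, 3), 4))
```

With few observations the adjusted value falls noticeably below R², which is why it is the better guide when comparing models with different numbers of regressors.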

3.5 The coefficient of Correlation:

Ø The correlation coefficient is a statistical measure that quantifies the strength and
direction of the linear relationship between two continuous variables. It is typically
denoted by the symbol "r." The correlation coefficient can take on values between -1
and 1, where:

Ø r = 1 indicates a perfect positive correlation, meaning that as one variable increases,
the other also increases in a linear fashion.
Ø r = -1 indicates a perfect negative correlation, meaning that as one variable increases,
the other decreases in a linear fashion.
Ø r = 0 indicates no linear correlation, meaning that there is no systematic linear
relationship between the variables.

Ø The absolute value of the correlation coefficient (|r|) indicates the strength of the
relationship between the variables. The closer |r| is to 1, the stronger the linear
relationship. If |r| is close to 0, the linear relationship is weak or absent.
Ø To test significance, the null hypothesis (H0) typically states that the population
correlation is zero, while the alternative hypothesis (Ha) states that it is not.
You can use a t-test to assess the significance of the correlation coefficient: the
test statistic follows a t-distribution with n − 2 degrees of freedom, from which you
can calculate the p-value associated with the test.
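
The test can be sketched as follows, using the standard statistic t = r·sqrt(n − 2)/sqrt(1 − r²). The data are hypothetical and the function names are illustrative; the p-value step is left to a statistical table or package.

```python
import math


def pearson_r(x, y):
    """Sample correlation coefficient between x and y."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlation_t_statistic(r, n):
    """t statistic for H0: no correlation, with n - 2 degrees of freedom."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


# Hypothetical sample of five paired observations
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 1.0, 4.0, 3.0, 5.0]

r = pearson_r(x, y)
t = correlation_t_statistic(r, len(x))
print("r =", round(r, 4), " t =", round(t, 4))
# Compare |t| with the critical value of the t-distribution with n - 2 = 3
# degrees of freedom (3.182 for a two-sided test at the 5% level).
```

In this small sample a fairly high r still gives a t statistic below the 5% critical value, which shows how strongly the conclusion depends on n.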

Theme 3: Practical Applications: -

Ø A laboratory collected data on the cost of the materials used for testing necessary
products over a one-year period. They want to know whether the costs of materials A, B
and C have a significant effect on the overall cost of testing. Observe the following
tables and answer the questions below. P value: 0.043

a) Specify the MLR equation.

Answer: $\hat{Y} = 2921.79 - 5.64 X_1 + 4.037 X_2 - 20.597 X_3$

b) Determine and interpret the coefficient of determination.

Answer: The coefficient of determination R² is 0.86, which means that the three
independent variables X1, X2 and X3 together explain 86% of the variability in Y.

c) Using a significance level of 10%, analyse the global significance of the model.
Answer: The reported p-value (0.043) is below the 10% significance level, so we reject
H0 and conclude that there is a relation between the costs of materials A, B and C and
testing costs. Note that a 10% significance level is relatively high and increases the
chance of a Type I error (incorrectly rejecting a true null hypothesis).

d) Which of the three coefficients can be considered the most efficient? Why?
Answer: The most efficient coefficient is the one with the lowest standard error and
the highest significance, which is cost component C, as it has a p-value of 0.03.

e) Which regressor(s) should we keep in our equation? Why?
Answer: The ones that are statistically significant. A regressor that lacks
significance may have a high standard error, that is, an estimate that varies greatly
from sample to sample. Keeping it can lead us to misjudge the significance of the
other variables and can undermine the efficiency and unbiasedness of the estimates.
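
The fitted equation from answer a) and the decision rule from answer c) can be written out as a short sketch. The coefficients and the p-value 0.043 come from the exercise; the function names and the material costs passed to `predicted_cost` are hypothetical.

```python
def predicted_cost(x1, x2, x3):
    """Fitted testing cost from the estimated MLR equation in answer a)."""
    return 2921.79 - 5.64 * x1 + 4.037 * x2 - 20.597 * x3


def reject_null(p_value, alpha):
    """Reject H0 when the p-value is below the chosen significance level."""
    return p_value < alpha


# Hypothetical costs of materials A, B and C
print(round(predicted_cost(10.0, 20.0, 5.0), 3))

# Global significance with the p-value reported in the exercise
print(reject_null(0.043, 0.10))  # 10% level used in question c)
print(reject_null(0.043, 0.01))  # a stricter 1% level, for comparison
```

The comparison with a 1% level shows how the conclusion on global significance depends on the threshold chosen.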

2- Practical Example 2: CAPM and Fama-French benchmark factors

Ø We use monthly data on the excess returns of two industry portfolios (consumer goods
and hi-tech) compiled by French. We regress the excess returns of the two
industries on the excess market return based on a value-weighted average of all
NYSE, AMEX, and NASDAQ firms (all returns are measured in percentage terms).
Using data from January 2000 to December 2004 (n=60) we obtain the following
estimates for the consumer goods portfolio (p-values in parentheses).

Ø We briefly investigate one version of multi-factor models using the so-called Fama-
French benchmark factors SMB (small minus big) and HML (high minus low) to test
whether excess returns depend on factors other than the market return. The factor
SMB measures the difference in returns of portfolios of small and large stocks and is
intended to measure the so-called size effect. HML measures the difference between
value stocks (having a high book value relative to their market value) and growth
stocks (with a low book-to-market ratio).
Ø Consumer goods portfolio

Ø High Tech. portfolio

The coefficients 0.624 and 1.74 indicate that a change in the (excess) market return by one
percentage point implies a change in the expected excess return by 0.624 percentage points
and 1.74 percentage points, respectively. In other words, the hi-tech portfolio has much
higher market risk than the consumer goods portfolio. The beta-factor remains significant in
both industries and changes only slightly compared to the market model estimates. However,
the results indicate a significant return premium for holding value stocks in the consumer
goods industry. For the hi-tech portfolio we find support for a size-effect. Overall, the results
can be viewed as supporting multi-factor models.
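
For reference, the regression described above can be written in the usual three-factor form. The notes do not print the equation, so the symbols below ($\alpha_i$, $s_i$, $h_i$, $R_{ft}$ for the risk-free rate) are assumed notation rather than the author's own.

```latex
% Three-factor regression for portfolio i in month t (assumed notation)
R_{it} - R_{ft} = \alpha_i + \beta_i \,(R_{mt} - R_{ft}) + s_i \, SMB_t + h_i \, HML_t + \varepsilon_{it}

% Interpretation of the market beta used in the text:
% a one percentage point change in the excess market return changes the
% expected excess return of portfolio i by \beta_i percentage points
\Delta E[R_{it} - R_{ft}] = \beta_i \, \Delta (R_{mt} - R_{ft})
```

Setting $s_i = h_i = 0$ gives the single-factor market model that the text uses as the comparison case.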
