Multicollinearity: Nature of Multicollinearity
Types of Multicollinearity:
There are two types of multicollinearity:
1. Exact/perfect multicollinearity.
2. Near (less than perfect) multicollinearity.
Exact multicollinearity: If a perfect linear relationship exists among the explanatory
variables, it is called exact multicollinearity. In this case the design (data) matrix X is
not of full rank, and consequently (X′X)⁻¹ does not exist; that is, |X′X| = 0.
Example: For the k-variable regression model involving the explanatory variables
X1, X2, …, Xk (where X1 = 1 for all observations to allow for the intercept term), an exact
linear relationship is said to exist if the following condition is satisfied:
λ1X1 + λ2X2 + … + λkXk = 0,
where λ1, λ2, …, λk are constants such that not all of them are zero simultaneously.
Assuming λ1 ≠ 0, the equation can be written as
X1 = -(λ2/λ1)X2 - (λ3/λ1)X3 - … - (λk/λ1)Xk,
which shows that X1 is exactly linearly related to the other explanatory variables.
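As a minimal illustration, the sketch below (using made-up numbers) builds a design matrix whose third column is an exact linear combination of the others, and confirms that X is rank-deficient and X′X singular:

```python
import numpy as np

# Illustrative sketch: the third column is an exact linear combination
# of the first two (X3 = 2*X1 + 3*X2), so X is not of full column rank.
# The data values are made up for illustration.
X1 = np.ones(5)                      # intercept column
X2 = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
X3 = 2 * X1 + 3 * X2                 # exact linear dependence
X = np.column_stack([X1, X2, X3])

XtX = X.T @ X
print(np.linalg.det(XtX))            # ~0 (up to floating-point error)
print(np.linalg.matrix_rank(X))      # 2, not 3: X is rank-deficient
# np.linalg.inv(XtX) would raise LinAlgError or return numerically
# meaningless values, matching |X'X| = 0 above.
```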
Near multicollinearity: If the explanatory variables are strongly (but not perfectly)
correlated, it is called near multicollinearity. In this case (X′X)⁻¹ exists, but its
diagonal elements are relatively large; that is, |X′X| ≠ 0 but is close to zero.
Example: When the explanatory variables are intercorrelated but not perfectly so, a
near-linear relationship is said to exist if
λ1X1 + λ2X2 + … + λkXk + vi = 0,
where vi is a stochastic error term and the λ's are constants such that not all of them are
zero simultaneously.
Assuming λ1 ≠ 0, the equation can be written as
X1 = -(λ2/λ1)X2 - (λ3/λ1)X3 - … - (λk/λ1)Xk - vi/λ1,
which shows that X1 is not exactly linearly related to the other explanatory variables.
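A companion sketch (again with made-up numbers; a small noise term plays the role of the stochastic error vi above) shows the near case: (X′X)⁻¹ exists, but its diagonal elements are inflated:

```python
import numpy as np

# Illustrative sketch: X3 is almost, but not exactly, a linear function
# of X2. All numbers are made up for illustration.
rng = np.random.default_rng(0)
n = 100
X1 = np.ones(n)
X2 = rng.normal(size=n)
X3 = 3 * X2 + rng.normal(scale=0.01, size=n)   # near-perfect dependence
X = np.column_stack([X1, X2, X3])

XtX = X.T @ X
print(np.linalg.det(XtX))        # small but nonzero: |X'X| != 0
XtX_inv = np.linalg.inv(XtX)     # exists, unlike the exact case
print(np.diag(XtX_inv))          # large diagonal entries -> inflated variances
```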
Sources of multicollinearity: Multicollinearity may arise from several sources.
1. The data collection method employed, for example, sampling over a limited range
of the values taken by the regressors in the population.
2. Constraints on the model or in the population being sampled. For example, in the
regression of electricity consumption on income (X2) and house size (X3) there is a
physical constraint in the population, in that families with higher incomes generally
have larger homes than families with lower incomes.
3. Model specifications, for example, adding polynomial terms to a regression
model, especially when the range of the X variable is small.
4. An overdetermined model. This happens when the model has more explanatory
variables than the number of observations. This could happen in medical research
where there may be a small number of patients about whom information is
collected on a large number of variables.
Detection of multicollinearity:
The indicators for detecting multicollinearity are as follows:
1. Eigenvalues and condition index: Form X′X from the data matrix and solve the
characteristic equation |X′X − λI| = 0; the roots λ are the eigenvalues. The condition
index is then
CI = √(maximum eigenvalue / minimum eigenvalue).
As a rule of thumb, if CI lies between 10 and 30 there is moderate multicollinearity,
and if CI exceeds 30 there is severe multicollinearity.
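The following sketch (with a made-up, nearly collinear data matrix) computes the condition index exactly as defined above:

```python
import numpy as np

# Sketch: condition index from the eigenvalues of X'X.
# The data matrix is made up; X3 is nearly collinear with X2.
rng = np.random.default_rng(0)
n = 100
X2 = rng.normal(size=n)
X3 = 3 * X2 + rng.normal(scale=0.05, size=n)
X = np.column_stack([np.ones(n), X2, X3])

eigvals = np.linalg.eigvalsh(X.T @ X)        # eigenvalues of symmetric X'X
ci = np.sqrt(eigvals.max() / eigvals.min())  # CI = sqrt(lambda_max / lambda_min)
print(ci)  # 10-30: moderate; > 30: severe (rule of thumb from the text)
```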
2. High R2 but few significant t-ratios: This is the classic symptom of multicollinearity.
If R2 is high, say in excess of 0.8, the F-test will in most cases reject the hypothesis H0
that the partial slope coefficients are simultaneously equal to zero, yet the individual
t-tests will show that none, or very few, of the partial slope coefficients are statistically
different from zero. In short, multicollinearity is suspected when R2 is very high but
none of the regression coefficients is individually significant.
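A small simulation, with arbitrary coefficient values and noise scales chosen only for illustration, reproduces this symptom:

```python
import numpy as np
import statsmodels.api as sm

# Sketch of the classic symptom: y depends on two almost-collinear
# regressors. All parameter values here are made up for illustration.
rng = np.random.default_rng(1)
n = 50
x2 = rng.normal(size=n)
x3 = x2 + rng.normal(scale=0.05, size=n)     # x3 nearly duplicates x2
y = 1 + x2 + x3 + rng.normal(size=n)

X = sm.add_constant(np.column_stack([x2, x3]))
res = sm.OLS(y, X).fit()
print(res.rsquared)        # high R^2
print(res.f_pvalue)        # overall F-test strongly rejects H0
print(res.pvalues[1:])     # yet individual t-tests are often insignificant
```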
3. High pairwise correlations among regressors: Another suggested rule of thumb is that
if a pairwise (zero-order) correlation coefficient between two regressors is high,
say in excess of 0.8, then multicollinearity is a serious problem.
4. Examination of partial correlations:
If R2 is high but the partial correlations are comparatively low, this may suggest that
the explanatory variables are highly correlated.
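The sketch below (made-up data: three regressors driven by one common factor) contrasts the zero-order correlation matrix with the partial correlations, computed here from the inverse of the correlation matrix:

```python
import numpy as np

# Sketch: zero-order vs. partial correlations. Partial correlations are
# obtained from P = R^{-1} via r_ij.rest = -P[i,j] / sqrt(P[i,i]*P[j,j]).
# The data are made up for illustration.
rng = np.random.default_rng(2)
n = 200
z = rng.normal(size=n)                    # common driver of all regressors
W = np.column_stack([z + rng.normal(scale=0.3, size=n) for _ in range(3)])

R = np.corrcoef(W, rowvar=False)          # zero-order correlation matrix
P = np.linalg.inv(R)
D = np.sqrt(np.outer(np.diag(P), np.diag(P)))
partial = -P / D
np.fill_diagonal(partial, 1.0)
print(R.round(2))        # high pairwise correlations
print(partial.round(2))  # partial correlations are noticeably lower
```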
5. Tolerance and variance inflation factor: The speed with which variances and
covariances increase can be seen with the variance inflation factor (VIF), which for the
two-regressor case is defined as
VIF = 1 / (1 − r23²),
where r23 is the correlation coefficient between the regressors X2 and X3. The VIF
shows how the variance of an estimator is inflated by the presence of multicollinearity.
Its reciprocal, TOL = 1/VIF = 1 − r23², is called the tolerance; a tolerance close to
zero signals strong collinearity.
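As a minimal sketch of this definition, the following computes the VIF for one regressor through its auxiliary regression on the other (data made up for illustration):

```python
import numpy as np
import statsmodels.api as sm

# Sketch: VIF for regressor X2 via the auxiliary regression of X2 on the
# remaining regressor, so that VIF = 1 / (1 - R2_aux). Data are made up.
rng = np.random.default_rng(3)
n = 100
x3 = rng.normal(size=n)
x2 = 0.9 * x3 + rng.normal(scale=0.5, size=n)   # x2 correlated with x3

aux = sm.OLS(x2, sm.add_constant(x3)).fit()     # auxiliary regression
vif = 1.0 / (1.0 - aux.rsquared)
print(vif)   # VIF > 10 is a common rule of thumb for serious collinearity

# statsmodels also ships a helper with the same logic:
# from statsmodels.stats.outliers_influence import variance_inflation_factor
```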
6. Low value of |X′X|: in the case of exact multicollinearity, |X′X| = 0.
Remedial Measures:
If multicollinearity has serious effects on the coefficient estimates of important factors,
one should adopt one of the following solutions:
1. Model specification:
Multicollinearity may be overcome by re-specifying the model. This can be done in the
following ways:
a) One approach is to redefine the regressors.
b) Another is to re-specify lagged variables or other explanatory variables, for
example within a distributed-lag model.