0% found this document useful (0 votes)

43 views11 pages

Lecture 5 Dummy Variable

Uploaded by

Jakey Brown

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views11 pages

Lecture 5 Dummy Variable

Uploaded by

Jakey Brown

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

ECON3049: ECONOMETRICS I

DUMMY VARIABLES
Marlon Tracey
Summer 2012
Outline
2

 Nature and use of dummy variables

 Creation of dummy variables
 Dummy as explanatory variable

 Dummy variable trap

 Intercept dummy

 Slope dummy

 Dummy as dependent variable

 Linear probability model

Reading: Gujurati Chapt. 9 & 15; Wooldridge Chapt. 7

Nature and Use of Dummy Variables
3

 If a variable is qualitative or categorical, standard regression can still

be used to analyze the effect of such variables. This is done by using
dummy variables.
 Dummy variables are variables that take on only two values 0 and 1,
where 0 means the absence of a quality and 1 means the presence of
the quality.
 A dummy variable can be used as an explanatory variable:
 to capture the effects of qualitative characteristics of individuals (e.g. race, religion,
gender, level of education), firms (e.g. private /public, large/medium/small/micro),
countries (e.g. OECD or otherwise, developed, developing, undeveloped,
underdeveloped) etc.;
 to examine before and after situations (pre-liberalization/post-liberalization; pre-
recovery/post-recovery) or

 To test whether a regression function is different for one group versus another group.

 A dummy variable can also be used as a dependent variable to predict

“choices”, “options” or qualitative outcomes.
Creation of dummy variables
4

 Convert a variable with two categories (such as “distance from home” or

gender) to a dummy variable (D) as per the following examples:
1 if live near 1 if male
Dn   Dm  
0 if live far 0 if female

0 if live near 0 if male

Df   Df  
1 if live far 1 if female
 Convert a variable with three or more categories (such as quality of
degree) to dummy variables by creating for each category a separate
dummy variable as follows:
1 First Class 1 Upper Second Class
D1   D2  
0 Otherwise 0 Otherwise

1 Lower Second Class 1 Pass

D3   D4  
0 Otherwise 0 Otherwise
Dummy as explanatory variable
5

 When dummy variables are used as explanatory variables

in a regression, one must be careful to avoid the dummy
variable trap.
 A dummy explanatory variable can either be an intercept
dummy and/or a slope dummy:
 An intercept dummy is used when the researcher theorizes the
dummy variable to have a level effect on the dependent variable,
that is, the level of the dependent variable is higher for one
category versus another.
 A slope dummy is used when the researcher theorizes the dummy
variable to change the marginal effect of an explanatory variable
on the dependent variable. It changes the magnitude and/or
direction of the relationship between an explanatory variable and
a dependent variable.
Dummy variable trap
6

 In regression analysis, all possible dummies for a qualitative

explanatory variable should not be included in the equation.
 Doing so creates a dummy variable trap, which is due to perfect
multicollinearity.
 To avoid the dummy variable trap:
 If the variable has M categories, only M-1 dummies should be entered
in the regression equation.
 Or we can drop the intercept from the model and include all the
dummies.

 The final category not represented in the regression is referred to as

the reference category.
 The coefficients of the other dummy variables are interpreted
relative to the reference category.
Intercept dummy
7

 Consider a regression model with one non-categorical (edu) and one

dummy (Df = 1 if female) as follows:
wage = 0 + 1edu + 0Df + u
 This can be interpreted as an intercept shift
 If Df = 0, then wage = 0 + 1edu + u
 If Df = 1, then wage = (0 + 0) + 1edu + u

 In this case, the group that is left out is “Males” and is represented by Df =
0. This group is the reference group.

 Note that the dummy Df when equal to 1, shifts the intercept from 0 to
(0 + 0). It is therefore called an intercept dummy.

 The effect of the intercept dummy is interpreted as follows: the difference

between wage if Df =1 and wage if Df =0 is 0 , ceteris paribus. More
intuitively, the gender wage differential is 0 .
Slope dummy
8

 Consider a regression model with one non-categorical (edu) and one

dummy (Df = 1 if female) as follows:
wage = 0 + 1edu+ 1Df *edu+ u
 This can be interpreted as an intercept shift
 If Df = 0, then wage = 0 + 1edu + u
 If Df = 1, then wage = 0 + (1 +1) edu + u

 In this case, the group that is left out is “Males” and is represented by Df =
0. This group is the reference group.

 Note that the dummy Df when equal to 1, shifts the slope from 1 to (1 +
1). It is therefore called an slope dummy.
 The effect of the slope dummy is interpreted as follows: the difference
between the effect of education on wage if Df = 1 and the effect if Df = 0
is 1 , ceteris paribus. More intuitively, the difference between the marginal
return to education for females and males is 1 .
Dummy as dependent variable
9

 In many cases, a researcher may be interested in the set of

variables that predict certain binary “choices” or “options”.
For example, the researcher might be interested in the factors
that determine the following:

 Decision to work or go to school

 Voting (vote or not vote)
 Marital Status (married or not)
 Decision to Pass or fail a course.

 Can OLS regression analysis still be conducted when the

dependent variable is binary?
Linear Probability Model I
10

 Consider the simple linear regression model as follows:

Y   0  1 X  u
 Where the population regression function (PRF) is
E (Y | X )   0   1 X  E (u | X )  0

 Since Y is a dummy (or binary) variable at each value of X,

the E (Y | X )  p , where p is the probability that y = 1.
Therefore, in this case, the PRF is referred to as a linear
probability model.
 When the PRF is estimated by OLS on a particular sample,
the estimated linear probability equation is given by:
pˆ  ˆ 0  ˆ1 X
Linear Probability Model II
11

 However, the linear probability model violates several

assumptions of our standard regression model:
 Linearity assumption is no longer realistic.
 Since X has a linear effect β1 on y, it is possible that pˆ  0 or pˆ  1 but
probability must lie in the interval [0,1].

 Y and therefore, u is non-normal. However, in large sample this

is not so much of a problem.

 u is heteroskedastic, since V ( u )  p (1  p ) , where p depends

on X. However, this problem can be corrected using Weighted
Least Squares.

Lesso n1: Illustrating T-Distribution
80% (5)
Lesso n1: Illustrating T-Distribution
35 pages
Ten Big Statistical Ideas in Research
100% (1)
Ten Big Statistical Ideas in Research
32 pages
Chapter 1 Qualitative Variables Final
No ratings yet
Chapter 1 Qualitative Variables Final
74 pages
Business Econometrics: Session VII-VIII DR Tutan Ahmed IIT Kharagpur February 2020
No ratings yet
Business Econometrics: Session VII-VIII DR Tutan Ahmed IIT Kharagpur February 2020
21 pages
Regression With Qualitative Information
No ratings yet
Regression With Qualitative Information
25 pages
Econometrics II Chapter One
No ratings yet
Econometrics II Chapter One
87 pages
Econometrics II All Chapters
No ratings yet
Econometrics II All Chapters
240 pages
Econometrics I - Lecture 7 (Wooldridge)
No ratings yet
Econometrics I - Lecture 7 (Wooldridge)
34 pages
Econometrics 2
No ratings yet
Econometrics 2
84 pages
Chapter 7, Dummy Variable
No ratings yet
Chapter 7, Dummy Variable
13 pages
Econometrics 2
No ratings yet
Econometrics 2
135 pages
CH 4 Eco
No ratings yet
CH 4 Eco
42 pages
Presentation G1
No ratings yet
Presentation G1
21 pages
Lecture 10
No ratings yet
Lecture 10
37 pages
Econometrics II Chapter One
No ratings yet
Econometrics II Chapter One
71 pages
Chapter 7
No ratings yet
Chapter 7
31 pages
Econometrics II (N)
No ratings yet
Econometrics II (N)
30 pages
Econometrics Il
No ratings yet
Econometrics Il
14 pages
Ees 401 Econometrics II Module
No ratings yet
Ees 401 Econometrics II Module
77 pages
Multiple Regression Analysis With Qualitative Information
No ratings yet
Multiple Regression Analysis With Qualitative Information
4 pages
Econometrics II Slides-1
No ratings yet
Econometrics II Slides-1
61 pages
Chapter 4 (Compatibility Mode)
No ratings yet
Chapter 4 (Compatibility Mode)
66 pages
2022 Econometrics I Chapter Four
No ratings yet
2022 Econometrics I Chapter Four
83 pages
Part 2 - Regression With Indicator Variables
No ratings yet
Part 2 - Regression With Indicator Variables
26 pages
Dummy
No ratings yet
Dummy
7 pages
Econoch 7
No ratings yet
Econoch 7
32 pages
Ch07 - Dummy Variables - Ver1
No ratings yet
Ch07 - Dummy Variables - Ver1
29 pages
Econometrics Categorical Variables
No ratings yet
Econometrics Categorical Variables
12 pages
Chapter 1 Dummy Variable Regression
No ratings yet
Chapter 1 Dummy Variable Regression
45 pages
Binary
No ratings yet
Binary
47 pages
Chap 7
No ratings yet
Chap 7
7 pages
Chapter 2
No ratings yet
Chapter 2
97 pages
CHapter 5 Acct
No ratings yet
CHapter 5 Acct
8 pages
Econometrics II-1
No ratings yet
Econometrics II-1
56 pages
Binary
No ratings yet
Binary
40 pages
Lecture 08 Dummy Variables
No ratings yet
Lecture 08 Dummy Variables
6 pages
Econometrics II Notes
No ratings yet
Econometrics II Notes
72 pages
Chapter 1 Econometrics
No ratings yet
Chapter 1 Econometrics
21 pages
Microeconometrics S1 AZ
No ratings yet
Microeconometrics S1 AZ
71 pages
Lec 5 V 11
No ratings yet
Lec 5 V 11
44 pages
Anova
No ratings yet
Anova
14 pages
Chapter10 Econometrics DummyVariableModel
No ratings yet
Chapter10 Econometrics DummyVariableModel
8 pages
Chapter Three QM
No ratings yet
Chapter Three QM
77 pages
The Linear Regression Model
No ratings yet
The Linear Regression Model
36 pages
Multiple Linear Regression
No ratings yet
Multiple Linear Regression
43 pages
Introduction To Econometrics Ii (Econ-3062) : Mohammed Adem (PHD)
100% (5)
Introduction To Econometrics Ii (Econ-3062) : Mohammed Adem (PHD)
83 pages
Dummy Variable Regression Models
No ratings yet
Dummy Variable Regression Models
9 pages
L1090 Lecture8 2024
No ratings yet
L1090 Lecture8 2024
36 pages
1-6 Dummy Variable
No ratings yet
1-6 Dummy Variable
16 pages
Econometrics II Chapter Two
No ratings yet
Econometrics II Chapter Two
96 pages
Chapter 1
No ratings yet
Chapter 1
47 pages
Yusuf Notes
No ratings yet
Yusuf Notes
4 pages
Chapter 7
No ratings yet
Chapter 7
50 pages
Dummy Variable Regression
No ratings yet
Dummy Variable Regression
8 pages
Lecture 7
No ratings yet
Lecture 7
35 pages
EBE Dummy Variables
No ratings yet
EBE Dummy Variables
9 pages
Choosing A Functional Form
No ratings yet
Choosing A Functional Form
8 pages
Dummies
No ratings yet
Dummies
5 pages
Econometrics For Finance Chapter 5
No ratings yet
Econometrics For Finance Chapter 5
12 pages
Dummy Variable
No ratings yet
Dummy Variable
21 pages
Calculus Refresher
From Everand
Calculus Refresher
A. A. Klaf
3/5 (8)
A Treatise on the Calculus of Finite Differences
From Everand
A Treatise on the Calculus of Finite Differences
George Boole
4/5 (1)
Health Statistics Revision Questions
100% (2)
Health Statistics Revision Questions
8 pages
Mathematics For Management - Statistics Section
No ratings yet
Mathematics For Management - Statistics Section
19 pages
Data Analysis Activity 2
No ratings yet
Data Analysis Activity 2
4 pages
Bowman - Monotone Regresion PDF
No ratings yet
Bowman - Monotone Regresion PDF
12 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
10 pages
Graham, S., & Sandmel, K. (2011) The Process Writing Approach - A Meta-Analysis. The Journal of Educational Research, 104 (6), 396-407.
No ratings yet
Graham, S., & Sandmel, K. (2011) The Process Writing Approach - A Meta-Analysis. The Journal of Educational Research, 104 (6), 396-407.
14 pages
Biostatistics A Sample of Questions For The Final Exam
No ratings yet
Biostatistics A Sample of Questions For The Final Exam
3 pages
Correlation and Regression Analysis
No ratings yet
Correlation and Regression Analysis
3 pages
X Chart Control Charts: Steven S Prevette
No ratings yet
X Chart Control Charts: Steven S Prevette
12 pages
Pca PDF
No ratings yet
Pca PDF
33 pages
Lesson 8 Random Sampling Activity 12
No ratings yet
Lesson 8 Random Sampling Activity 12
6 pages
XL Miner User Guide
No ratings yet
XL Miner User Guide
420 pages
Primary Metric Process Map Project Plan Project Charter: Purpose Key Outputs Key Tools
No ratings yet
Primary Metric Process Map Project Plan Project Charter: Purpose Key Outputs Key Tools
1 page
Higgins 2002
No ratings yet
Higgins 2002
20 pages
Stats Assignment 1 and 2
No ratings yet
Stats Assignment 1 and 2
19 pages
(Ebook PDF) Principles of Econometrics, 5th Editioninstant Download
100% (4)
(Ebook PDF) Principles of Econometrics, 5th Editioninstant Download
49 pages
SP WS1
No ratings yet
SP WS1
3 pages
Data Analytics On Vechicle Insurance Data
No ratings yet
Data Analytics On Vechicle Insurance Data
22 pages
Npar Tests: Npar Tests /K-W Hasil by Perlakuan (1 2) /missing Analysis
No ratings yet
Npar Tests: Npar Tests /K-W Hasil by Perlakuan (1 2) /missing Analysis
59 pages
đề CLC số 1
No ratings yet
đề CLC số 1
2 pages
How To Prepare Statistics For SSC CGL Tier II Study Notes in PDF
No ratings yet
How To Prepare Statistics For SSC CGL Tier II Study Notes in PDF
10 pages
Project Cardio Good Fitness
No ratings yet
Project Cardio Good Fitness
29 pages
Tugas 2: Bayessian Method (Atribut Kontinyu) : Outlook Temperature Humidity Windy Play-Class
No ratings yet
Tugas 2: Bayessian Method (Atribut Kontinyu) : Outlook Temperature Humidity Windy Play-Class
3 pages
Analyze Phase
No ratings yet
Analyze Phase
33 pages
Stock Watson 3U ExerciseSolutions Chapter6 Students
No ratings yet
Stock Watson 3U ExerciseSolutions Chapter6 Students
8 pages
Student Database STD ID STD Name Marks Percentage Grade Remark Grade Description Marks
No ratings yet
Student Database STD ID STD Name Marks Percentage Grade Remark Grade Description Marks
18 pages
Pengaruh Beban Kerja, Kompensasi, Dan Kepuasan Terhadap Prestasi Kerja Karyawan Dengan Disiplin Kerja Sebagai Variabel Intervening Pada Pt. Indosawit Kecamatan Ukui Kabupaten Pelalawan Provinsi Riau
No ratings yet
Pengaruh Beban Kerja, Kompensasi, Dan Kepuasan Terhadap Prestasi Kerja Karyawan Dengan Disiplin Kerja Sebagai Variabel Intervening Pada Pt. Indosawit Kecamatan Ukui Kabupaten Pelalawan Provinsi Riau
11 pages
Question Paper of Math
No ratings yet
Question Paper of Math
2 pages

Lecture 5 Dummy Variable

Uploaded by

Lecture 5 Dummy Variable

Uploaded by

ECON3049: ECONOMETRICS I

 Nature and use of dummy variables

 Dummy variable trap

 Dummy as dependent variable

Reading: Gujurati Chapt. 9 & 15; Wooldridge Chapt. 7

 If a variable is qualitative or categorical, standard regression can still

 A dummy variable can also be used as a dependent variable to predict

 Convert a variable with two categories (such as “distance from home” or

0 if live near 0 if male

1 Lower Second Class 1 Pass

 When dummy variables are used as explanatory variables

 In regression analysis, all possible dummies for a qualitative

 The final category not represented in the regression is referred to as

 Consider a regression model with one non-categorical (edu) and one

 The effect of the intercept dummy is interpreted as follows: the difference

 Consider a regression model with one non-categorical (edu) and one

 In many cases, a researcher may be interested in the set of

 Decision to work or go to school

 Can OLS regression analysis still be conducted when the

 Consider the simple linear regression model as follows:

 Since Y is a dummy (or binary) variable at each value of X,

 However, the linear probability model violates several

 Y and therefore, u is non-normal. However, in large sample this

 u is heteroskedastic, since V ( u )  p (1  p ) , where p depends

You might also like