
Chapter 15

Qualitative Response Regression Models


Part 1
Introduction
• In the models considered in Chapters 1 to 13 the dependent variable was quantitative, whereas the explanatory variables were quantitative, qualitative, or a mixture of the two
• In this chapter the dependent variable (or the response variable) is qualitative
• Used especially in social sciences and medical research
Nature of qualitative response models
• E.g. labour force participation (LFP) of adult males
– In the labour force (=1)
– Not in the labour force (=0)
– Dependent variable = binary / dummy variable

• A qualitative dependent variable can also be a multiple-category response variable
• Keep in mind:
– When Y is quantitative, objective is to estimate its expected / mean value given the value of the
regressors
– When Y is qualitative, objective is to find the probability of something happening e.g. of being in the
labour force

• Therefore qualitative response models are known as probability models.


Objectives
• At the end of Chapter 15 you should know:
– How qualitative response models should be estimated;
– What special inference problems are involved;
– How we measure goodness of fit in such models; and
– How to handle models with more than two categories, ordinal categories, nominal categories, and count data / rare-event data
Probability models for binary response variables:
• Linear probability model (LPM)
• Logit model
• Probit model
• Tobit model
Linear probability model (LPM)

• Consider the model Yi = β1 + β2Xi + ui, where Y = 1 if a family owns a house and 0 otherwise, and X indicates family income
• It is called an LPM because Y is a binary variable
• The conditional expectation of Y given X, E(Yi | Xi), gives the probability of a family owning a house whose income is the given amount X
Justification of LPM
• Yi = β1 + β2Xi + ui ………………. (1)
• As with OLS, assume E(ui) = 0 (to obtain unbiased estimators), therefore
• E(Yi | Xi) = β1 + β2Xi ……………… (2)
• Also, if Pi denotes the probability that the event occurs:
– Pi = probability that Yi = 1 (the event occurs)
– 1 – Pi = probability that Yi = 0 (the event does not occur)
– Yi then has the following distribution:

  Yi       Probability
  0        1 – Pi
  1        Pi
  Total    1
Justification of LPM
• Yi follows the Bernoulli probability distribution. By the definition of mathematical expectation, we obtain:
• E(Yi) = 0(1 – Pi) + 1(Pi) = Pi ………….. (3)
• If we equate equations (2) and (3) we get:
• E(Yi | Xi) = β1 + β2Xi = Pi ……………….. (4)
• Thus the conditional expectation of model (1) can be interpreted as the conditional probability of Yi
• If there are n independent trials, each with probability p of success and probability (1 – p) of failure, and X represents the number of successes, then X follows the binomial distribution with:
– mean = np
– variance = np(1 – p)

• Binomial random variable with parameter n=1 is equivalent to Bernoulli random variable
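These Bernoulli moments can be verified numerically. A minimal Python sketch (the success probability p = 0.3 and the sample size are arbitrary illustrations, not values from the text):

```python
import random

random.seed(42)  # for reproducibility

p = 0.3            # illustrative success probability (not from the text)
n_draws = 100_000

# Simulate Bernoulli(p): 1 with probability p, 0 otherwise
draws = [1 if random.random() < p else 0 for _ in range(n_draws)]

mean = sum(draws) / n_draws
var = sum((d - mean) ** 2 for d in draws) / n_draws

print(mean)  # close to p = 0.3
print(var)   # close to p(1 - p) = 0.21
```

The sample mean and variance settle near p and p(1 – p), matching the theoretical moments quoted above.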
Justification of LPM
• E(Yi | Xi) must lie between 0 and 1, therefore the restriction:
– 0 ≤ E(Yi | Xi) ≤ 1
• The LPM poses several problems (we can therefore not simply extend OLS to binary dependent variable regression models):
– Non-normality of the disturbances
– Heteroscedastic variances of the disturbances
– Nonfulfillment of 0 ≤ E(Yi | Xi) ≤ 1
– Questionable value of R2 as a measure of goodness of fit
Non-normality of the disturbance
• Although OLS does not require the disturbances ui to be normally distributed, normality is assumed for the purpose of statistical inference
• Normality of ui is not tenable for LPMs because, like Yi, ui takes only 2 values and therefore follows the Bernoulli distribution
• Nonfulfillment of normality is not critical because the OLS point estimates remain unbiased. Also, by the central limit theorem, as the sample size increases indefinitely the OLS estimators tend to be normally distributed. Therefore, in large samples, statistical inference for the LPM will follow the usual OLS procedure under the normality assumption
Heteroscedastic variances of disturbances
• For the Bernoulli distribution the theoretical mean = p and the theoretical variance = p(1 – p), where p = probability of success (the event occurring). This shows that the variance is a function of the mean – therefore the error variance is heteroscedastic
• In the presence of heteroscedasticity, OLS estimators, although unbiased, are not efficient – they don't have minimum variance
• Solution: weighted least squares (WLS) – but adapted slightly for the LPM:
1. Run OLS despite the heteroscedasticity problem and obtain the fitted values Ŷi. Then obtain the estimate of wi:
– ŵi = Ŷi(1 – Ŷi)
2. Use the estimated ŵi to transform the data:
– Yi/√ŵi = β1(1/√ŵi) + β2(Xi/√ŵi) + (ui/√ŵi)
– Estimate the transformed equation by OLS (this is WLS)

• One may also use White's heteroscedasticity-corrected standard errors if the sample is reasonably large
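The two-step WLS procedure above can be sketched with NumPy on synthetic data; the data-generating process, sample size, and seed below are illustrative assumptions, not from the text:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative synthetic data: binary Y whose probability rises with "income" X
n = 200
x = rng.uniform(5, 25, n)
y = (rng.random(n) < np.clip(0.05 * x - 0.3, 0, 1)).astype(float)
X = np.column_stack([np.ones(n), x])  # design matrix with constant

# Step 1: run OLS despite heteroscedasticity and obtain fitted values Y-hat,
# then estimate the weights w-hat = Y-hat(1 - Y-hat)
beta_ols, *_ = np.linalg.lstsq(X, y, rcond=None)
y_hat = X @ beta_ols
keep = (y_hat > 0) & (y_hat < 1)   # drop fitted values outside (0, 1)
w = y_hat[keep] * (1 - y_hat[keep])

# Step 2: divide Y, the constant, and X by sqrt(w-hat) and re-run OLS (= WLS)
sw = np.sqrt(w)
beta_wls, *_ = np.linalg.lstsq(X[keep] / sw[:, None], y[keep] / sw, rcond=None)

print(beta_wls)  # WLS estimates of (intercept, slope)
```

Note that the constant is also divided by √ŵi, so the transformed regression has no ordinary intercept; its first coefficient is still the estimate of β1.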
Nonfulfillment of 0 ≤ E(Yi | Xi) ≤ 1

• There is no guarantee that Ŷi, the estimator of E(Yi | Xi), will necessarily fulfil this restriction, and this is the real problem with OLS estimation of the LPM. OLS does not take this restriction into account
• Ways of finding out whether Ŷi lies between 0 and 1:
– Estimate the LPM by the usual OLS and find out whether Ŷi lies between 0 and 1. Values < 0 are assumed to be zero; values > 1 are assumed to be 1
– Devise an estimating technique that will guarantee that the estimated conditional probabilities lie between 0 and 1. The logit and probit models guarantee this
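The first approach above (estimate by OLS, then truncate fitted values to the unit interval) can be sketched as follows; the synthetic data are an illustrative assumption:

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative synthetic data: binary Y, single regressor X
n = 100
x = rng.uniform(0, 30, n)
y = (rng.random(n) < np.clip(0.06 * x - 0.4, 0, 1)).astype(float)
X = np.column_stack([np.ones(n), x])

# Fit the LPM by ordinary least squares
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
y_hat = X @ beta

# A linear fit can produce "probabilities" below 0 or above 1;
# the simple fix from the text is to truncate them to [0, 1]
y_hat_trunc = np.clip(y_hat, 0.0, 1.0)
print(y_hat.min(), y_hat.max())              # may fall outside [0, 1]
print(y_hat_trunc.min(), y_hat_trunc.max())  # guaranteed inside [0, 1]
```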
Questionable value of R2 as a measure of goodness of fit
• R2 has a limited value in binary response models
• Because Y is either 0 or 1, corresponding to a given X – all Y values will
either lie along the X-axis (where Y=0) or along the line corresponding to 1
• Generally no LPM is expected to fit such a scatter well (whether constrained /
unconstrained) – R2 is likely to be much lower than 1 for such models (in
general it lies between 0.2 and 0.6)
• R2 will only be in excess of 0.8 if the actual scatter is very closely clustered around points A and B (fig. c), where the predicted Y will be very close to either 0 or 1
• Rather avoid using R2 in models with qualitative dependent variables
Example 15.1
• The intercept gives the probability that a family with R0 income will own a house. Remember that a probability can't be negative, therefore this (negative) value is treated as zero (which makes sense here)
• For a unit change in income, on average the probability of owning a house increases by 0.1021 (approximately 10 percentage points)
• We can also estimate the actual probability of owning a house given a particular level of income, e.g.:

– The probability that a family with an income of $12000 will own a house is approximately 28%

• Obtain estimated values of Y
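The fitted probabilities on this slide can be reproduced in a few lines of Python. The intercept value of –0.9457 used below is the textbook's estimate for this example and does not appear on the slide, so treat it as an assumption; the slope 0.1021 is quoted above (income X measured in thousands of dollars):

```python
# Assumed LPM estimates for Example 15.1: the intercept -0.9457 is not shown
# on the slide and is taken as an assumption; the slope 0.1021 is quoted there
b1, b2 = -0.9457, 0.1021

def lpm_prob(x_thousands):
    """Fitted probability of owning a house, truncated to [0, 1]."""
    p = b1 + b2 * x_thousands
    return min(max(p, 0.0), 1.0)

print(lpm_prob(0))    # zero income: negative fit, treated as 0
print(lpm_prob(12))   # income of $12 000: roughly 0.28
```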


Example 15.1
• It can be seen from the estimated values that there are negative values as well as values over 1
• The LPM is not the recommended model when the dependent variable is binary
• The model also suffers from heteroscedasticity, and therefore the standard errors can't be trusted – we can use WLS to obtain more efficient estimates
• To get w:
– Drop all observations with Ŷi ≤ 0 or Ŷi ≥ 1
– ŵi = Ŷi(1 – Ŷi)
– Obtain √ŵi (sqw)
• Estimate the transformed regression; in EViews the specification is:
– y/sqw c(1)/sqw x/sqw
• The standard errors are now smaller and the t ratios are larger. Keep in mind that we dropped 12 observations
Alternatives to LPM
• The LPM is plagued by several problems, as discussed, but the fundamental problem is:
– It is logically not a very attractive model, because it assumes that the probability increases linearly with X – thus the marginal effect of X remains constant throughout, which doesn't always make sense
– E.g. in the homeownership example, as X increases by a unit the probability of owning a house increases by the same constant amount of 0.1. In reality it is expected that P is nonlinearly related to X:
– At a very low income a family will not own a house
– At a sufficiently high level of income, e.g. X*, a family most likely will own a house
– Any increase in income beyond X* will have little effect on the probability of owning a house. At both ends of the income distribution, the probability of owning a house will be virtually unaffected by a small increase in X
Alternatives to LPM
• We need a probability model that has 2 features:
1. As Xi increases, Pi = E(Y = 1 | Xi) increases but never steps outside the 0–1 interval
2. The relationship between Pi and Xi is nonlinear – "one which approaches zero at slower and slower rates as Xi gets small and approaches one at slower and slower rates as Xi gets very large"
Alternatives to LPM
• Geometric form of model – probability lies between 0 and 1 and varies
nonlinearly with X
• S-shaped curve resembles cumulative distribution function of random variable
• For each random variable there is a unique CDF
• CDFs commonly chosen to represent the 0-1 response models are:
– Logistic (Logit) Model
– Normal (Probit/Normit) Model
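Both candidate CDFs can be computed with the standard library; each is S-shaped, maps any real value into the (0, 1) interval, and equals 0.5 at zero (a minimal sketch, not tied to any particular dataset):

```python
import math

def logistic_cdf(z):
    """CDF of the standard logistic distribution (basis of the logit model)."""
    return 1.0 / (1.0 + math.exp(-z))

def normal_cdf(z):
    """CDF of the standard normal distribution (basis of the probit model)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

for z in (-5.0, 0.0, 5.0):
    # Both stay strictly between 0 and 1 and pass through 0.5 at z = 0
    print(z, logistic_cdf(z), normal_cdf(z))
```

The normal CDF approaches 0 and 1 faster in the tails than the logistic, which is why logit and probit estimates differ slightly for extreme probabilities.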
Tutorials this week
• Please go through the practical worksheet, memo and instructional video to
make sure that you can apply the work practically
• The tutors will cover the tutorial during the tutorial sessions this week. Make sure to attend one of the sessions. The lecturing schedule contains all the information on contact sessions for the module.
