Panel Data 3: Conditional Logit/ Fixed Effects Logit Models

Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/

Last revised March 20, 2018

These notes borrow very heavily, sometimes verbatim, from Paul Allison’s book, Fixed Effects Regression Models
for Categorical Data. The Stata XT manual is also a good reference. This handout tends to make lots of assertions;
Allison’s book does a much better job of explaining why those assertions are true and what the technical details
behind the models are.

Overview. In experimental research, unmeasured differences between subjects are often
controlled for via random assignment to treatment and control groups. Hence, even if a variable
like Socio-Economic Status is not explicitly measured, because of random assignment, we can be
reasonably confident that the effects of SES are approximately equal for all groups. Of course,
random assignment is usually not possible with most survey research. If we want to control for
the effect of a variable, we must explicitly measure it. If we don’t measure it, we can’t control
for it. In practice, there will almost certainly be some variables we have failed to measure (or
have measured poorly), so our models will likely suffer from some degree of omitted variable
bias.

Allison notes, however, that when we have panel data (the same subjects measured at two or
more points in time) another alternative presents itself: we can use the subjects as their own
controls. With binary dependent variables, this can be done via the use of conditional logit/fixed
effects logit models. With panel data we can control for stable characteristics (i.e. characteristics
that do not change across time) whether they are measured or not. These include such things as
sex, race, and ethnicity, as well as more difficult to measure variables such as intelligence,
parents’ child-rearing practices, and genetic makeup. This does not control for time-varying
variables, but such variables can be explicitly included in the model, e.g. employment status,
income.
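
To see why conditioning removes the stable characteristics, it may help to sketch the two-period case. The notation here ($\alpha_i$ for the subject-specific effect, $\beta$ for the coefficient, $x_{it}$ for a time-varying predictor) is assumed for illustration and is not taken from the handout; the sketch also assumes the two responses are independent given $\alpha_i$.

```latex
% Logit model with a subject-specific intercept \alpha_i
\log\frac{p_{it}}{1-p_{it}} = \alpha_i + \beta x_{it}, \qquad t = 1, 2

% Condition on the subject having exactly one positive outcome
\Pr(y_{i1}=0,\; y_{i2}=1 \mid y_{i1}+y_{i2}=1)
  = \frac{e^{\alpha_i+\beta x_{i2}}}{e^{\alpha_i+\beta x_{i1}}+e^{\alpha_i+\beta x_{i2}}}
  = \frac{e^{\beta (x_{i2}-x_{i1})}}{1+e^{\beta (x_{i2}-x_{i1})}}
```

The $\alpha_i$ terms cancel, so anything constant within a subject drops out of the conditional likelihood. Subjects with $y_{i1}+y_{i2}$ equal to 0 or 2 contribute nothing, because their conditional probability is 1 regardless of $\beta$.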

Examples (from Allison): Suppose you want to know whether marriage reduced recidivism
among chronic offenders. We could compare an individual’s arrest rate when he is married with
his arrest rate when he is not. The difference in arrest rates between the two periods is an
estimate of the marriage effect for that individual. Or, you might see how a child’s performance
in school differs depending on how much time s/he spends playing video games. So, you could
compare how the child does when not spending much time on video games versus when s/he
does.

Allison notes there are two conditions for using fixed effects methods.

• The dependent variable must be measured on at least two occasions for each individual.
• The independent variables must change across time for some substantial portion of the
individuals. Fixed effects models are not much good for looking at the effects of
variables that do not change across time, like race and sex.

There are several other points to be aware of with fixed effects logit models.


• The good thing is that the effects of stable characteristics, such as race and gender, are
controlled for, whether they are measured or not. The bad thing is that the effects of these
variables are not estimated. Again, it is similar to an experiment with random assignment.
The effects of variables not explicitly measured are controlled for (because random
assignment makes the groups more or less similar on these characteristics) but their
effects are not estimated.
• Other methods (e.g. random effects) can be used when we want to estimate the effects of
variables like sex and race, but then the method is no longer controlling for omitted
variables.
• Fixed effects estimates use only within-individual differences, essentially discarding any
information about differences between individuals. If predictor variables vary greatly
across individuals but have little variation over time for each individual, then fixed
effects estimates will be imprecise and have large standard errors.
    o Why tolerate the higher errors? Allison says there is a trade-off between bias and
      efficiency. Other methods, e.g. random effects, will suffer from omitted variable
      bias; fixed effects methods help to control for omitted variable bias by having
      individuals serve as their own controls.
    o Keep in mind, however, that fixed effects doesn’t control for unobserved
      variables that change over time. So, for example, a failure to include income in
      the model could still cause fixed effects coefficients to be biased.
    o Allison likes fixed effects models because they are less vulnerable to omitted
      variable bias. But he cautions that “in applications where the within-person
      variation is small relative to the between-person variation, the standard errors of
      the fixed effects coefficients may be too large to tolerate.”
• Conditional logit/fixed effects models can be used for things besides panel studies. For
example, Long & Freese show how conditional logit models can be used for
alternative-specific data. If you read both Allison’s and Long & Freese’s discussions of the
clogit command, you may find it hard to believe they are talking about the same command!

Example. Here is an example from Allison’s 2009 book Fixed Effects Regression Models. Data
are from the National Longitudinal Survey of Youth (NLSY). The data set has 1151 teenage girls
who were interviewed annually for 5 years beginning in 1979. The data have already been
reshaped and xtset so they can be used for panel data analysis. That is, each of the 1151 cases has
5 different records, one for each year of the study. The variables are

• id is the subject id number and is the same across each wave of the survey
• year is the year the data were collected in. 1 = 1979, 2 = 1980, etc.
• pov is coded 1 if the subject was in poverty during that time period, 0 otherwise.
• age is the age at the first interview.
• black is coded 1 if the respondent is black, 0 otherwise.
• mother is coded 1 if the respondent currently has at least 1 child, 0 otherwise.
• spouse is coded 1 if the respondent is currently living with a spouse, 0 otherwise.
• school is coded 1 if the respondent is currently in school, 0 otherwise.
• hours is the hours worked during the week of the survey.
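
Before fitting anything, it can be worth checking Allison's second condition, i.e. that the predictors actually change over time for a reasonable share of subjects. The following is a minimal sketch, assuming the teenpovxt file used in this handout loads already xtset; it is a suggested check, not part of the original analysis.

```stata
* Assumes the data load already xtset (panel variable id, time variable year)
use https://www3.nd.edu/~rwilliam/statafiles/teenpovxt, clear

* Split each variable's variation into between-subject and within-subject parts;
* a small within standard deviation warns of imprecise fixed effects estimates
xtsum pov mother spouse school hours

* For a binary predictor, show how many subjects were ever in each state
xttab mother
```

Time-invariant variables such as black should show a within standard deviation of zero in the xtsum output.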


We can use either Stata’s clogit command or the xtlogit, fe command to do a fixed
effects logit analysis. Both give the same results. (In fact, I believe xtlogit, fe actually
calls clogit.) First we will use xtlogit with the fe option.

. use https://www3.nd.edu/~rwilliam/statafiles/teenpovxt, clear

. xtlogit pov i.mother i.spouse i.school hours i.year, fe nolog
note: multiple positive outcomes within groups encountered.
note: 324 groups (1,620 obs) dropped because of all positive or
all negative outcomes.

Conditional fixed-effects logistic regression   Number of obs     =      4,135
Group variable: id                              Number of groups  =        827

                                                Obs per group:
                                                              min =          5
                                                              avg =        5.0
                                                              max =          5

                                                LR chi2(8)        =      97.28
Log likelihood  = -1520.1139                    Prob > chi2       =     0.0000

------------------------------------------------------------------------------
         pov |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
    1.mother |   .5824322   .1595831     3.65   0.000      .269655    .8952094
    1.spouse |  -.7477585   .1753466    -4.26   0.000    -1.091431   -.4040854
    1.school |   .2718653   .1127331     2.41   0.016     .0509125    .4928181
       hours |  -.0196461   .0031504    -6.24   0.000    -.0258208   -.0134714
             |
        year |
          2  |   .3317803   .1015628     3.27   0.001      .132721    .5308397
          3  |   .3349777   .1082496     3.09   0.002     .1228124     .547143
          4  |   .4327654   .1165144     3.71   0.000     .2044013    .6611295
          5  |   .4025012   .1275277     3.16   0.002     .1525514     .652451
------------------------------------------------------------------------------

Here is how we interpret the results. The note “multiple positive outcomes within groups
encountered” is a warning that you may need to check your data, because with some analyses
there should be no more than one positive outcome. In the present case, that is not a problem, i.e.
there is no reason that respondents cannot be in poverty at multiple points in time.

The note “324 groups (1620 obs) dropped because of all positive or all negative outcomes”
means that 324 subjects were either in poverty during all 5 time periods or were not in poverty
during all 5 time periods. Fixed-effects models are looking at the determinants of within-subject
variability. If there is no variability within a subject, there is nothing to examine. Put another
way, in the 827 groups that remained, sometime during the 5 year period the subject went from
being in poverty to being out of poverty; or else switched from being out of poverty to being in
poverty. If poverty status were something that hardly ever changed across time, or if very few
people were ever in poverty, there would not be many cases left for a fixed effects analysis. Even
as it is, more than a fourth of the sample has been dropped from the analysis. (Other techniques,
like xtreg, fe, won’t cost you so many cases.)
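
The dropped groups can be counted directly from the data. This is a sketch under the assumption that pov has no missing values; the variable names povmean and onerec are made up for the illustration.

```stata
use https://www3.nd.edu/~rwilliam/statafiles/teenpovxt, clear

* Within-subject mean of pov: 0 = never in poverty, 1 = always in poverty
egen double povmean = mean(pov), by(id)

* Flag one record per subject so that subjects, not records, get counted
egen byte onerec = tag(id)

* Subjects whose outcome never changes contribute nothing to the estimates
count if onerec & inlist(povmean, 0, 1)
```

If the logic is right, the count should agree with the 324 groups reported in the note.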

In terms of interpreting the coefficients, it may also be helpful to have the odds ratios.


. xtlogit, or

Conditional fixed-effects logistic regression   Number of obs     =      4,135
Group variable: id                              Number of groups  =        827

                                                Obs per group:
                                                              min =          5
                                                              avg =        5.0
                                                              max =          5

                                                LR chi2(8)        =      97.28
Log likelihood  = -1520.1139                    Prob > chi2       =     0.0000

------------------------------------------------------------------------------
         pov |         OR   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
    1.mother |   1.790388   .2857157     3.65   0.000     1.309513    2.447848
    1.spouse |   .4734266   .0830137    -4.26   0.000     .3357355    .6675871
    1.school |    1.31241   .1479521     2.41   0.016     1.052231    1.636923
       hours |   .9805456   .0030891    -6.24   0.000     .9745098    .9866189
             |
        year |
          2  |   1.393447   .1415223     3.27   0.001     1.141931    1.700359
          3  |   1.397909   .1513231     3.09   0.002     1.130672    1.728308
          4  |   1.541515   .1796087     3.71   0.000      1.22679    1.936979
          5  |   1.495561   .1907255     3.16   0.002     1.164802    1.920242
------------------------------------------------------------------------------

The OR for mother is 1.79. This means that, if a girl switches from not having children to having
children, her odds of being in poverty are multiplied by 1.79. Remember, these are teenagers at
the start of the study, so having a baby while you are still very young is not good in terms of
avoiding poverty. Conversely, if a girl switches from being unmarried to married, her odds of
being in poverty get multiplied by .47, i.e. getting married helps you to stay out of poverty.
Being in school increases the odds of poverty by 31 percent (OR = 1.31), while each additional
hour you work reduces the odds of poverty by about 2 percent. The year coefficients are all comparisons with
year 1 and are all positive and significant; on an all other things equal basis, teens are more likely
to be in poverty in the later years.
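
The odds ratios and percentage changes quoted above are just transformations of the coefficients: OR = exp(b), and the percent change in the odds is 100*(exp(b) - 1). A minimal sketch, assuming the xtlogit, fe model above is still the most recent estimation in memory:

```stata
* Odds ratios are the exponentiated coefficients
display "OR for mother: " exp(_b[1.mother])
display "OR for spouse: " exp(_b[1.spouse])

* Percent change in the odds: positive = higher odds, negative = lower odds
display "Percent change in odds, in school: " 100*(exp(_b[1.school]) - 1)
display "Percent change in odds, per hour worked: " 100*(exp(_b[hours]) - 1)
```

The first two lines should reproduce the OR column of the table; the last two correspond to the roughly 31 percent increase for school and 2 percent decrease per hour worked.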

Notice that we did NOT include the time-invariant variables for age and black. Let’s see what
happens when we do.

. xtlogit pov i.mother i.spouse i.school hours i.year age i.black, fe nolog
note: multiple positive outcomes within groups encountered.
note: 324 groups (1,620 obs) dropped because of all positive or
all negative outcomes.
note: age omitted because of no within-group variance.
note: 1.black omitted because of no within-group variance. [Rest of output deleted]

The two variables get dropped because their values do not vary within each group. Something
that is a constant cannot explain variability in a dependent variable. (Allison, however,
demonstrates that interactions between time-varying and time-constant variables can be included
in the model.)
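
As an illustration of that last point, an interaction between a time-constant variable and a time-varying one does vary within subjects, so it can be estimated. The sketch below is hypothetical: the choice of black by hours and the name black_hours are assumptions made for the example, not a specification taken from Allison or from this handout.

```stata
use https://www3.nd.edu/~rwilliam/statafiles/teenpovxt, clear

* black is constant within subject, but black*hours changes whenever hours does
generate black_hours = black * hours

* The main effect of black is still absorbed by the fixed effects;
* black_hours estimates how the effect of hours differs for black respondents
xtlogit pov i.mother i.spouse i.school hours black_hours i.year, fe nolog
```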

To do the same thing with clogit,


. use https://www3.nd.edu/~rwilliam/statafiles/teenpovxt, clear
. xtset, clear
. clogit pov i.mother i.spouse i.school hours i.year, group(id) nolog
note: multiple positive outcomes within groups encountered.
note: 324 groups (1,620 obs) dropped because of all positive or
all negative outcomes.

Conditional (fixed-effects) logistic regression

                                                Number of obs     =      4,135
                                                LR chi2(8)        =      97.28
                                                Prob > chi2       =     0.0000
Log likelihood = -1520.1139                     Pseudo R2         =     0.0310

I did not need to clear the xtset settings, but I did so to illustrate that with clogit it isn’t
necessary to xtset the data. Instead, the panelvar is specified with the group option. Further,
neither method actually needed the timevar. Instead of years, these could have been children
within schools. The xt labeling of commands can be deceptive in that you do not necessarily
need to have longitudinal data to use some of the commands.

WARNING!!! Marginal effects and predicted values after xtlogit, fe and clogit can be
problematic. By default, margins is giving you “the probability of a positive outcome assuming
that the fixed effect is zero.” This may be an unreasonable assumption. For a discussion of the
problem and possible solutions, see Steve Samuels’ comments at

http://www.statalist.org/forums/forum/general-stata-discussion/general/1304704-cannot-estimate-marginal-effect-after-xtlogit
