Discrete Choice Analysis
Moshe Ben-Akiva
Fall 2008
Outline – This Lecture
Specification of Systematic Components
● Types of Variables
– Attributes of alternatives: Zin, e.g., travel time, travel cost
– Characteristics of decision-makers: Sn, e.g., age, gender, income,
occupation
– Therefore: Xin = h(Zin, Sn)
● Examples:
– Xin1 = Zin1 = travel cost
– Xin2 = log(Zin2) = log (travel time)
– Xin3 = Zin1/Sn1 = travel cost / income
● Functional Form: Linear in the Parameters
Vin = β1Xin1 + β2Xin2 + ... + βKXinK
Vjn = β1Xjn1 + β2Xjn2 + ... + βKXjnK
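A minimal sketch of the specification above: the function h builds the three example variables from attributes and characteristics, and the systematic utility is a linear combination of them. The function names and all numeric values are hypothetical and chosen only for illustration.

```python
import math

def build_features(z_cost, z_time, s_income):
    """Return X_in = h(Z_in, S_n) for the three example transformations."""
    return [
        z_cost,             # X_in1 = travel cost
        math.log(z_time),   # X_in2 = log(travel time)
        z_cost / s_income,  # X_in3 = travel cost / income
    ]

def systematic_utility(beta, x):
    """V_in = beta_1 X_in1 + ... + beta_K X_inK (linear in the parameters)."""
    return sum(b * xk for b, xk in zip(beta, x))

# Hypothetical coefficients and inputs (not taken from the lecture).
beta = [-0.05, -1.0, -2.0]
x_auto = build_features(z_cost=4.0, z_time=15.0, s_income=40.0)
v_auto = systematic_utility(beta, x_auto)
print(x_auto, v_auto)
```

Note that the utility stays linear in β even though the variables themselves are nonlinear transformations of the raw data.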
Data Collection
Model Specification Example
             β0    β1           β2
Auto         1     TTauto       ln(Income)
Transit      0     TTtransit    0
Probabilities of Observed Choices
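● Individual 1:
Vauto = β0 + 15.4 β1 + ln(35) β2
Vtransit = 58.2 β1
$$P(\text{Auto}) = \frac{e^{\beta_0 + 15.4\beta_1 + \ln(35)\beta_2}}{e^{\beta_0 + 15.4\beta_1 + \ln(35)\beta_2} + e^{58.2\beta_1}}$$
● Individual 2:
Vauto = β0 + 14.2 β1 + ln(45) β2
Vtransit = 31.0 β1
$$P(\text{Transit}) = \frac{e^{31.0\beta_1}}{e^{\beta_0 + 14.2\beta_1 + \ln(45)\beta_2} + e^{31.0\beta_1}}$$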
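The two probabilities above can be evaluated numerically once values of β are fixed. The sketch below uses the travel times and incomes shown for individuals 1 and 2; the β values are hypothetical placeholders, not estimates from the lecture.

```python
import math

def utilities(beta, tt_auto, tt_transit, income):
    """Systematic utilities following the specification table."""
    b0, b1, b2 = beta
    v_auto = b0 + b1 * tt_auto + b2 * math.log(income)
    v_transit = b1 * tt_transit
    return v_auto, v_transit

def prob_auto(beta, tt_auto, tt_transit, income):
    """Binary logit: P(Auto) = e^Vauto / (e^Vauto + e^Vtransit)."""
    v_auto, v_transit = utilities(beta, tt_auto, tt_transit, income)
    return 1.0 / (1.0 + math.exp(v_transit - v_auto))

beta = (1.0, -0.05, 0.2)  # hypothetical (beta0, beta1, beta2)
p1_auto = prob_auto(beta, 15.4, 58.2, 35.0)           # individual 1
p2_transit = 1.0 - prob_auto(beta, 14.2, 31.0, 45.0)  # individual 2
print(p1_auto, p2_transit)
```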
Maximum Likelihood Estimation
● Find the values of β that are most likely to result in the choices observed
in the sample:
– max L*(β) = P1(Auto)P2(Transit)…P6(Transit)
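● If yin = 1 if person n chose alternative i, and yin = 0 if person n chose alternative j, the likelihood of the sample is
$$L^*(\beta) = \prod_{n=1}^{N} P_n(i)^{y_{in}} \, P_n(j)^{1 - y_{in}}$$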
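As a rough illustration of the estimation criterion, the sketch below evaluates the log of the likelihood for only the two individuals whose data appear in the example (individual 1 chose auto, individual 2 chose transit). The trial β is hypothetical. A real estimation would use the full sample and a numerical optimizer, which is not shown here.

```python
import math

# (tt_auto, tt_transit, income, chose_auto) for individuals 1 and 2
observations = [
    (15.4, 58.2, 35.0, 1),
    (14.2, 31.0, 45.0, 0),
]

def log_likelihood(beta, data):
    """Sum over n of y_n * ln P_n(auto) + (1 - y_n) * ln P_n(transit)."""
    b0, b1, b2 = beta
    total = 0.0
    for tt_auto, tt_transit, income, y_auto in data:
        v_auto = b0 + b1 * tt_auto + b2 * math.log(income)
        v_transit = b1 * tt_transit
        p_auto = 1.0 / (1.0 + math.exp(v_transit - v_auto))
        total += y_auto * math.log(p_auto) + (1 - y_auto) * math.log(1.0 - p_auto)
    return total

ll_zero = log_likelihood((0.0, 0.0, 0.0), observations)    # all probabilities 1/2
ll_trial = log_likelihood((1.0, -0.05, 0.2), observations)  # hypothetical beta
print(ll_zero, ll_trial)
```

Maximizing the log-likelihood gives the same β as maximizing the likelihood itself, and sums are numerically safer than long products of probabilities.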
Sources of Data on User Behavior
Stated Preferences / Conjoint Experiments
Aggregation and Forecasting
characteristics, Xn
The Aggregate Forecasting Problem
● The fraction of population T choosing alt. i is:
$$W(i) = \frac{1}{N_T} \sum_{n=1}^{N_T} P(i \mid X_n)$$
where $N_T$ is the number of decision-makers in the population of interest
Sample Enumeration
● Use a sample to represent the entire population
● For a random sample:
$$\hat{W}(i) = \frac{1}{N_s} \sum_{n=1}^{N_s} \hat{P}(i \mid x_n)$$
where $N_s$ is the number of observations in the sample
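Sample enumeration can be sketched as follows: compute each sampled individual's predicted probabilities, then average them. The sketch assumes a logit model and a made-up sample of systematic utilities; the function names are illustrative only.

```python
import math

def mnl_probs(v):
    """Logit probabilities for one individual; v maps alternative -> utility."""
    m = max(v.values())  # subtract the max for numerical stability
    expv = {a: math.exp(x - m) for a, x in v.items()}
    total = sum(expv.values())
    return {a: e / total for a, e in expv.items()}

def sample_enumeration(sample_utilities):
    """Predicted share W(i) = (1/Ns) * sum over n of P(i | x_n)."""
    n_s = len(sample_utilities)
    shares = {a: 0.0 for a in sample_utilities[0]}
    for v in sample_utilities:
        for a, p in mnl_probs(v).items():
            shares[a] += p / n_s
    return shares

# Hypothetical utilities for a random sample of three individuals.
sample = [
    {"auto": 0.5, "transit": 0.0},
    {"auto": -1.0, "transit": 0.0},
    {"auto": 0.0, "transit": 0.0},
]
shares = sample_enumeration(sample)
print(shares)
```

Averaging probabilities is not the same as evaluating the model at average characteristics, which is why enumeration over individuals is used.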
Disaggregate Prediction
Generating Disaggregate Populations
[Figure: household surveys, exogenous forecasts, census data, and counts are combined through data fusion (e.g., IPF, HH evolution) to produce a representative population.]
Review
● Empirical issues
– Model specification and estimation
– Aggregate forecasting
Summary of Basic Discrete Choice Models
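● Binary Probit:
$$P_n(i \mid C_n) = \Phi(V_n) = \int_{-\infty}^{V_n} \frac{1}{\sqrt{2\pi}} \, e^{-\frac{1}{2}\varepsilon^2} \, d\varepsilon$$
● Binary Logit:
$$P_n(i \mid C_n) = \frac{1}{1 + e^{-V_n}} = \frac{e^{V_{in}}}{e^{V_{in}} + e^{V_{jn}}}$$
● Multinomial Logit:
$$P_n(i \mid C_n) = \frac{e^{V_{in}}}{\sum_{j \in C_n} e^{V_{jn}}}$$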
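The three formulas above translate directly into a few lines of code. This sketch assumes that V_n denotes the utility difference V_in − V_jn, which is what makes the two binary logit forms agree; the function names are illustrative.

```python
import math

def binary_probit(v_n):
    """P_n(i) = Phi(V_n), the standard normal CDF, computed with erf."""
    return 0.5 * (1.0 + math.erf(v_n / math.sqrt(2.0)))

def binary_logit(v_in, v_jn):
    """P_n(i) = 1 / (1 + exp(-(V_in - V_jn)))."""
    return 1.0 / (1.0 + math.exp(-(v_in - v_jn)))

def multinomial_logit(v):
    """P_n(i) = exp(V_in) / sum_j exp(V_jn) for a list of utilities."""
    m = max(v)  # subtract the max for numerical stability
    expv = [math.exp(x - m) for x in v]
    total = sum(expv)
    return [e / total for e in expv]

print(binary_probit(0.0))
print(binary_logit(1.0, 0.0), multinomial_logit([1.0, 0.0])[0])
```

With two alternatives the multinomial logit reduces to the binary logit, as the last line checks.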
Independence from Irrelevant Alternatives (IIA)
● Property of the Multinomial Logit Model
– εjn independent identically distributed (i.i.d.)
– εjn ~ ExtremeValue(0,µ) ∀ j
– $P_n(i \mid C_n) = \dfrac{e^{\mu V_{in}}}{\sum_{j \in C_n} e^{\mu V_{jn}}}$
so
$$\frac{P(i \mid C_1)}{P(j \mid C_1)} = \frac{P(i \mid C_2)}{P(j \mid C_2)} \quad \forall \; i, j, C_1, C_2$$
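The ratio property follows in one step from the logit formula, because the denominator is the same for every alternative in a choice set and cancels. Written out for any choice set C that contains both i and j (the summation index k is introduced here only for the derivation):

```latex
\frac{P(i \mid C)}{P(j \mid C)}
  = \frac{e^{\mu V_{in}} \big/ \sum_{k \in C} e^{\mu V_{kn}}}
         {e^{\mu V_{jn}} \big/ \sum_{k \in C} e^{\mu V_{kn}}}
  = e^{\mu (V_{in} - V_{jn})}
```

The right-hand side does not depend on C, so the ratio is the same for C1 and C2.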
Examples of IIA
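[Figure: network from origin O to destination D. Path 1 has travel time T. Paths 2a and 2b overlap on a segment of travel time T-δ and differ only on a segment of travel time δ.]
$$P(1 \mid \{1,2a,2b\}) = P(2a \mid \{1,2a,2b\}) = P(2b \mid \{1,2a,2b\}) = \frac{e^{\mu T}}{\sum_{j \in \{1,2a,2b\}} e^{\mu T}} = \frac{1}{3}$$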
Red Bus / Blue Bus Paradox
● Consider that initially auto and bus have the same utility
– Cn = {auto, bus} and Vauto = Vbus = V
– P(auto) = P(bus) = 1/2
● Suppose that a new bus service is introduced that is identical
to the existing bus service, except the buses are painted
differently (red vs. blue)
– Cn = {auto, red bus, blue bus}; Vred bus = Vblue bus = V
– Logit now predicts P(auto) = P(red bus) = P(blue bus) = 1/3
– We’d expect P(auto) = 1/2, P(red bus) = P(blue bus) = 1/4
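The logit prediction in this paradox is easy to reproduce. The sketch below sets all utilities to a common value V (any value gives the same shares) and compares the choice set before and after the blue bus is added; the helper name is illustrative.

```python
import math

def mnl(v):
    """Multinomial logit probabilities; v maps alternative -> utility."""
    expv = {a: math.exp(x) for a, x in v.items()}
    total = sum(expv.values())
    return {a: e / total for a, e in expv.items()}

V = 0.0  # common utility; the shares do not depend on its value
before = mnl({"auto": V, "bus": V})
after = mnl({"auto": V, "red bus": V, "blue bus": V})
print(before)
print(after)
```

The auto share drops from 1/2 to 1/3 even though nothing about the auto or the bus service itself has changed, which is the counterintuitive consequence of IIA.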
IIA and Aggregation
● Auto and red bus share ratios remain constant for each
group after introducing blue bus:
Population        Auto Share    Red Bus Share    Blue Bus Share
Auto people       81.8%         9.1%             9.1%
Transit people    5.2%          47.4%            47.4%
Total             43.5%         28.25%           28.25%
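The table can be reproduced under one assumption that is not stated in the extracted text: before the blue bus is introduced, the two equally sized groups have auto / red bus shares of 90% / 10% and 10% / 90%. These starting shares are inferred from the 9:1 and 1:9 ratios in the table and should be treated as hypothetical.

```python
def add_identical_bus(p_auto, p_red):
    """Logit shares after adding a blue bus with the same utility as the red bus.

    Within a group, IIA keeps the auto : red bus ratio fixed, and the blue bus
    receives the same weight as the red bus.
    """
    total = p_auto + 2.0 * p_red
    return p_auto / total, p_red / total, p_red / total

# Assumed shares (auto, red bus) before the blue bus exists.
groups = {"Auto people": (0.9, 0.1), "Transit people": (0.1, 0.9)}

after = {g: add_identical_bus(*s) for g, s in groups.items()}
# Equal group sizes are also an assumption.
total = tuple(sum(after[g][k] for g in groups) / len(groups) for k in range(3))

for g, s in after.items():
    print(g, [round(100 * x, 1) for x in s])
print("Total", [round(100 * x, 2) for x in total])
```

Within each group the auto to red bus ratio is unchanged, but in the total it moves from 1 (50% / 50% under the assumed starting shares) to about 1.54, so IIA does not carry over to the aggregate.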
Motivation for Nested Logit
Tree Representation of Nested Logit
[Figure: tree with a motorized nest and a non-motorized nest]
Tree Representation of Nested Logit
● Example: Route and Departure Time Choice (Multidimensional Choice)
Nested Model Estimation
Nested Model – Example
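[Figure: tree with two nests, Non-motorized (NM) and Motorized (M)]
$$P(i \mid NM) = \frac{e^{\mu_{NM} V_i}}{e^{\mu_{NM} V_{Walk}} + e^{\mu_{NM} V_{Bike}}}, \quad i = \text{Walk}, \text{Bike}$$
$$I_{NM} = \frac{1}{\mu_{NM}} \ln\left(e^{\mu_{NM} V_{Walk}} + e^{\mu_{NM} V_{Bike}}\right)$$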
Nested Model – Example
$$P(i \mid M) = \frac{e^{\mu_M V_i}}{e^{\mu_M V_{Car}} + e^{\mu_M V_{Taxi}} + e^{\mu_M V_{Bus}}}, \quad i = \text{Car}, \text{Taxi}, \text{Bus}$$
$$I_M = \frac{1}{\mu_M} \ln\left(e^{\mu_M V_{Car}} + e^{\mu_M V_{Taxi}} + e^{\mu_M V_{Bus}}\right)$$
Nested Model – Example
$$P(NM) = \frac{e^{\mu I_{NM}}}{e^{\mu I_{NM}} + e^{\mu I_M}}$$
$$P(M) = \frac{e^{\mu I_M}}{e^{\mu I_{NM}} + e^{\mu I_M}}$$
Nested Model – Example
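$$P(\text{Bus}) = P(\text{Bus} \mid M) \, P(M) = \frac{e^{\mu_M V_{Bus}}}{e^{\mu_M V_{Car}} + e^{\mu_M V_{Taxi}} + e^{\mu_M V_{Bus}}} \cdot \frac{e^{\mu I_M}}{e^{\mu I_{NM}} + e^{\mu I_M}}$$
$$= \frac{e^{\mu_M V_{Bus}}}{e^{\mu_M V_{Car}} + e^{\mu_M V_{Taxi}} + e^{\mu_M V_{Bus}}} \cdot \frac{e^{\frac{\mu}{\mu_M} \ln\left(e^{\mu_M V_{Car}} + e^{\mu_M V_{Taxi}} + e^{\mu_M V_{Bus}}\right)}}{e^{\frac{\mu}{\mu_{NM}} \ln\left(e^{\mu_{NM} V_{Walk}} + e^{\mu_{NM} V_{Bike}}\right)} + e^{\frac{\mu}{\mu_M} \ln\left(e^{\mu_M V_{Car}} + e^{\mu_M V_{Taxi}} + e^{\mu_M V_{Bus}}\right)}}$$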
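The nested logit example can be sketched end to end: compute each nest's inclusive value, the nest probabilities, and the conditional probabilities within the nest, then multiply. All utilities and scale parameters below are hypothetical; the scales are chosen so that µ ≤ µ_NM and µ ≤ µ_M, a condition commonly imposed on nested logit models.

```python
import math

def inclusive_value(alts, mu_m):
    """I_m = (1 / mu_m) * ln(sum over j in nest of exp(mu_m * V_j))."""
    return math.log(sum(math.exp(mu_m * v) for v in alts.values())) / mu_m

def nested_logit(nests, nest_scales, mu):
    """Return P(i) = P(i | nest) * P(nest) for every alternative."""
    incl = {m: inclusive_value(alts, nest_scales[m]) for m, alts in nests.items()}
    upper_denom = sum(math.exp(mu * i_m) for i_m in incl.values())
    probs = {}
    for m, alts in nests.items():
        p_nest = math.exp(mu * incl[m]) / upper_denom
        mu_m = nest_scales[m]
        lower_denom = sum(math.exp(mu_m * v) for v in alts.values())
        for i, v in alts.items():
            probs[i] = p_nest * math.exp(mu_m * v) / lower_denom
    return probs

# Hypothetical systematic utilities and scale parameters.
nests = {
    "NM": {"Walk": -1.0, "Bike": -0.8},
    "M": {"Car": 0.5, "Taxi": -0.5, "Bus": 0.0},
}
scales = {"NM": 2.0, "M": 1.5}
probs = nested_logit(nests, scales, mu=1.0)
print(probs)
```

When every nest scale equals µ, the nest structure has no effect and the result coincides with the multinomial logit.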
Extensions to Discrete Choice Modeling
Summary
● Introduction to Discrete Choice Analysis
● A simple example
● The Random Utility Model
Additional Readings
● Ben-Akiva, M. and Bierlaire, M. (2003), ‘Discrete Choice Models With Applications to
Departure Time and Route Choice,’ The Handbook of Transportation Science, 2nd ed.,
(eds.) R.W. Hall, Kluwer, pp. 7 – 38.
● Ben-Akiva, M. and Lerman, S. (1985), Discrete Choice Analysis, MIT Press, Cambridge,
Massachusetts.
● Train, K. (2003), Discrete Choice Methods with Simulation, Cambridge University Press,
United Kingdom.
Appendix
Cross-Nested Logit
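$$V_{C_{mn}} = \tilde{V}_{C_{mn}} + \frac{1}{\mu_m} \ln \sum_{j \in C_{mn}} e^{\mu_m \tilde{V}_{jn}}$$
$$P(C_{mn} \mid C_n) = \frac{e^{\mu V_{C_{mn}}}}{\sum_{l} e^{\mu V_{C_{ln}}}}$$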
Continuous Logit Mixture
Example:
● Combining Probit and Logit
● Error decomposed into two parts
– Probit-type portion for flexibility
– i.i.d. Extreme Value for tractability
● An intuitive, practical, and powerful method
– Correlations across alternatives
– Taste heterogeneity
– Correlations across space and time
● Requires simulation-based estimation
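Cont. Logit Mixture: Error Component Illustration
● Utility:
$$U_i = \beta X_i + \xi_i + \varepsilon_i, \quad i \in \{\text{auto}, \text{bus}, \text{subway}\}$$
with $\varepsilon_i$ i.i.d. Extreme Value
● Probability conditional on ξ:
$$\Lambda(\text{auto} \mid X, \xi) = \frac{e^{\beta X_{auto} + \xi_{auto}}}{e^{\beta X_{auto} + \xi_{auto}} + e^{\beta X_{bus} + \xi_{bus}} + e^{\beta X_{subway} + \xi_{subway}}}$$
● ξ is unknown, so integrate over its density f(ξ):
$$P(\text{auto} \mid X) = \int_{\xi} \Lambda(\text{auto} \mid X, \xi) \, f(\xi) \, d\xi$$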
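The integral over ξ generally has no closed form, which is why simulation is needed. A minimal Monte Carlo sketch is shown below: draw ξ, evaluate the logit kernel, and average. It assumes independent normal error components with hypothetical standard deviations and hypothetical values of βX; a real application would typically allow correlated components.

```python
import math
import random

def logit_kernel(v):
    """Lambda(i | X, xi): logit probabilities; v maps alternative -> utility."""
    m = max(v.values())
    expv = {a: math.exp(x - m) for a, x in v.items()}
    total = sum(expv.values())
    return {a: e / total for a, e in expv.items()}

def simulated_probabilities(beta_x, sigma, n_draws=2000, seed=0):
    """Approximate P(i | X) by averaging the kernel over draws of xi."""
    rng = random.Random(seed)
    avg = {a: 0.0 for a in beta_x}
    for _ in range(n_draws):
        # Assumption: xi_i ~ Normal(0, sigma_i), independent across alternatives.
        v = {a: beta_x[a] + rng.gauss(0.0, sigma[a]) for a in beta_x}
        for a, p in logit_kernel(v).items():
            avg[a] += p / n_draws
    return avg

beta_x = {"auto": 0.5, "bus": 0.0, "subway": 0.2}  # hypothetical beta * X
sigma = {"auto": 1.0, "bus": 1.0, "subway": 1.0}   # hypothetical std. deviations
p_mixture = simulated_probabilities(beta_x, sigma)
print(p_mixture)
```

Setting every standard deviation to zero removes the error component and recovers the plain logit probabilities.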
Continuous Logit Mixture
Random Taste Variation
Discrete Logit Mixture
Latent Classes
Main Postulate:
• Unobserved heterogeneity is “generated” by discrete or categorical constructs such as
– Different decision protocols adopted
– Choice sets considered may vary
– Segments of the population with varying tastes
• Above constructs characterized as latent classes
Latent Class Choice Model
$$P(i) = \sum_{s=1}^{S} \Lambda(i \mid s) \, Q(s)$$
– Λ(i | s): class-specific choice model (probability of choosing i conditional on belonging to class s)
– Q(s): class membership model (probability of belonging to class s)
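The latent class formula is a probability-weighted average of class-specific choice probabilities. In the sketch below both the class-specific probabilities and the membership probabilities are hypothetical numbers; in practice each would come from its own estimated model.

```python
def latent_class_prob(class_choice_probs, class_membership):
    """P(i) = sum over classes s of Lambda(i | s) * Q(s)."""
    alternatives = class_choice_probs[0].keys()
    return {
        i: sum(q * lam[i] for lam, q in zip(class_choice_probs, class_membership))
        for i in alternatives
    }

# Hypothetical two-class example.
class_choice = [
    {"auto": 0.9, "bus": 0.1},  # Lambda(i | s=1)
    {"auto": 0.1, "bus": 0.9},  # Lambda(i | s=2)
]
membership = [0.3, 0.7]         # Q(s=1), Q(s=2)
p = latent_class_prob(class_choice, membership)
print(p)
```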
Summary of Discrete Choice Models
6. Revealed and Stated Preferences
• Revealed Preferences Data
– Travel Diaries
– Field Tests
• Stated Preferences Data
– Surveys
– Simulators
Stated Preferences / Conjoint Experiments
Framework for Combining Data
[Figure: framework diagram; recoverable labels: Attributes of Alternatives & Characteristics of Decision-Maker; Revealed Preferences]
MIT OpenCourseWare
http://ocw.mit.edu
For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.