ITS2094-On Ordinal Utility Uploadable
ITS2094-On Ordinal Utility Uploadable
Article:
Batley, R. (2007) On ordinal utility, cardinal utility, and random utility. Theory and Decision,
Online. ISSN 1573-7187
https://doi.org/10.1007/s11238-007-9046-2
Reuse
See Attached
Takedown
If you consider content in White Rose Research Online to be in breach of UK law, please notify us by
emailing [email protected] including the URL of the record and the reason for the withdrawal request.
[email protected]
https://eprints.whiterose.ac.uk/
White Rose Research Online
http://eprints.whiterose.ac.uk/
Published paper
Batley, Richard (2007) On ordinal utility, cardinal utility, and random utility.
Theory and Decision, Online First, May 2007
Richard Batley
ABSTRACT
Though the Random Utility Model (RUM) was conceived entirely in terms of
that operation remain faithful to ordinal utility. The paper considers whether
surplus change may depart from ordinal utility, and exploit the cardinality
Keywords: ordinal utility, cardinal utility, Random Utility Model, log sum,
rule of a half
1
JEL Classification: B41 Economic Methodology; D01 Microeconomic
1. INTRODUCTION
The Random Utility Model (RUM) was conceived by Marschak (1960) and
from the same finite choice set. Subsequent to its theoretical conception,
Marschak et al. (1963) introduced much of the apparatus by which RUM could
models of judgement and choice, in particular Fechner’s (1859) model and the
2
markets than individuals, and data to match. McFadden’s adjustment to the
economic methodology.
Since the unwary might, quite understandably, draw confidence from such
theory and psychophysical models. More specifically, the metric of the latter
Classical theory. Several previous authors have alluded to this property (e.g.
Goodwin and Hensher, 1978; Daly, 1978), and many have considered its
empirical consequences usually under the label of the ‘scale factor problem’
(e.g. Swait and Louviere, 1993). But one might ask: ‘why are RUM
3
The paper begins by rehearsing the derivation of RUM from the fundamental
This is not in itself a problem; cardinal utility can be used quite defensibly as a
does not actually depart from the ordinality on which it is founded. The
Whilst the general theory will be well known to many readers, the following
attends to a particular interest in finite choice sets, and this may be less
basis for section 3, which seeks to illuminate the theoretical origins of RUM in
terms of ordinal utility. The informed reader may however wish to omit this
4
Before proceeding, and following RUM convention, deference is made to
space. Again following usual practice, define within this space a finite set T
Marschak (1960), although the reader is referred to Luce and Suppes (1965)
5
The theory rests fundamentally on two axioms1, as follows:
Axiom of Completeness:
xm f xn .
Axiom of Transitivity:
rn ≤ rm if x n f x m
Now define the vector r = (r1 ,..., rN ) , which is an integer valued function
1
Noting that relaxation of the assumption of a finite choice set necessitates an additional
axiom imposing continuity of preference. Continuity is discussed further in section 7.1, albeit
in a different regard.
6
function on r will induce a real valued function U on T that preserves the
U n ≥ U m iff x n f x m (1)
assigned to the function simply on the basis that the U value of a preferable
(i.e. the class of strictly monotonic transformations) is, in these terms, entirely
Uˆ n ≥ Uˆ m iff x n f x m (2)
It is important to be clear that the above theory provides no basis for drawing
any ratio thereof) across alternatives. The ordinal utility function simply
7
3. ORDINAL UTILITY AND PROBABILISTIC CHOICE
(or both) of two phenomena; first, violations of the Axiom of Transitivity; and
second, the observation that an individual, when faced with a repeated choice
task, may not always make the same decisions. Having introduced the theory
8
There is a random vector U = (U 1 ,...,U N ) , unique up to an increasing monotone
n
n =1
Note for example Marschak’s (1960) assertion: ‘Unless specified to the contrary,
all concepts in this paper are associated with a single given person, the ‘subject’ of
and Marschak (1960) make clear this proposition by re couching the analysis
9
Remembering from section 2 that the first ranked alternative can be
( )
Pn (r ) = Pr U nr ≥ U mr ≥ ... ≥ U Nr ≥ 0 (5)
Then summing across all orderings where x n is ranked first, the probability
P (x n T ) = ∑r∈R Pn (r ) (6)
n
Since the above interpretation of RUM may be unfamiliar to some readers, let
repeated occasions, such that these five observations constitute the entire
sample space. With reference to (4), the possible rankings under which each
10
Rn = {U n ≥ U m ≥ U l ;U n ≥ U l ≥ U m }
Rm = {U m ≥ U n ≥ U l ;U m ≥ U l ≥ U n }
Rl = {U l ≥ U n ≥ U m ;U l ≥ U m ≥ U n }
Let us now introduce some data, by assuming that the utilities and choices
remembering of course that these utilities are defined arbitrarily, save for
their ordering).
Repetition Un Um Ul Choice
1 1 2 3 xl
2 4 2 3 xn
3 1 2 3 xl
4 4 2 3 xn
5 1 2 3 xl
therefore never chosen. Second, whilst the utilities of both x m and x l are
fluctuating preferences for x n and x l . Now applying the data of Table 1, let
11
us calculate, for each alternative, the probability of being first ranked by each
possible ordering (5) and the probability of being chosen (6), thus:
Pr (U n ≥ U m ≥ U l ) = 0 , Pr (U n ≥ U l ≥ U m ) = 2 , hence P(x n ) = 2
5 5
Pr (U m ≥ U n ≥ U l ) = 0 , Pr (U m ≥ U l ≥ U n ) = 0 , hence P(x m ) = 0
Pr (U l ≥ U n ≥ U m ) = 0 , Pr (U l ≥ U m ≥ U n ) = 3 , hence P(x l ) = 3
5 5
The worked example serves to reiterate that RUM is defined entirely in terms
of ordinal utility2. Indeed, this can be shown by applying the data of Table 1
2
Should the sample space be extended from five repetitions to ‘arbitrarily many’ repetitions,
then the above interpretations of ordinal utility and choice probability would apply in exactly
the same manner.
12
Table 2: Worked example of ordinal RUM, after transformation
Repetition Û n Û m Û l Choice
1 1 4 9 xl
2 16 4 9 xn
3 1 4 9 xl
4 16 4 9 xn
5 1 4 9 xl
More generally, and analogous to (2), the probability of being first ranked
( ( ) ( ) ( ))
Pn (r ) = Pr f U nr ≥ f U mr ≥ ... ≥ f U Nr ≥ 0
13
If min[P(x n x n , x m ), P(x m x m , x l )] ≥ 1 , then the binary choice probabilities
2
Discussion of other properties will be avoided here, since these are dealt with
relies entirely on ordinal utility, such metrics are not particularly amenable to
14
RUM to inform public policy3. It is rarely acknowledged, however, that a
Marschak et al. considered the relationship between the binary RUM and
P(x n S ) = φ (vn − v m )
15
Marschak et al. prove that every Fechner model such that φ (q ) is the
and v1 ,..., v N are the constants appearing in the Fechner model. Then for any
S = {x n , x m } ⊆ T :
= Pr{v n − v m ≥ ε m − ε n } = φ (v n − v m )
A few comments are appropriate. First, the above analysis relates specifically
stimulus b?’. This is not unduly restrictive, however, since the model can be
joint distribution pertaining to each and every pair in the choice set. Second,
16
ε 1 ,..., ε N must be defined independently of v1 ,..., v N ; subsequent presentations
of RUM such as Daly and Zachary (1978) are more explicit about this. Third,
the IID assumption is not essential to the definition since any distribution
function φ (q ) will yield binary RUM; the IID assumption is however relevant
section expands upon the latter point, whilst the penultimate point is
= Pr{v n − v m ≥ ε m − ε n }
Second, the utility of each and every alternative may similarly be multiplied
17
P(x n S ) = Pr{λU n ≥ λU m } = Pr{λvn + λε n ≥ λv m + λε m } (10)
= Pr{v n − v m ≥ ε m − ε n }
where λ > 0
transformations (9) and (10), where the latter constitute the class of increasing
linear transformations. Contrast this with the probability statement (6), which
then the above analysis shows that the probability statement now exhibits
Let us summarise the story thus far, and consider the implications that follow.
The theory of deterministic choice under certainty is explicit about the ordinal
basis of utility (section 2). This basis is preserved in the translation from
18
deterministic to probabilistic choice under certainty, and indeed to RUM
analogy between the RUM and Fechner models (section 4). An implication of
properties of cardinal utility. The latter does not in itself constitute a problem;
Let us now digress slightly by reconciling the above discussion with the
19
understand the nub of his approach, however, a brief excerpt from McFadden
(1976) suffices:
‘… the assumption that a single subject will draw independent utility functions
from a population with differing, but fixed, utility functions, and offers each a
single choice; the latter model is consistent with the classical postulates of
population. The probability statement (11) is thus distinct from (3) in that
20
repeated choices by an individual he might well have asked just how useful
would seem to imply that, within McFadden’s RUM, ε ni could be a catch all
least latent).
4
Whilst the distinction between inter and intra individual variation is a pertinent one, it
might be noted that the popular data collection technique of Stated Preference (SP) offers a
facility for pursuing both interests. That is to say, SP data is typically collected from a sample
of individuals, and each individual typically responds to several repetitions of a given choice
task.
5
Manski (1977) develops this proposition, postulating several specific sources of randomness.
21
The Fechnerian properties of RUM outlined in section 4 enter McFadden’s
analysis via the elegant vehicle of his Generalised Extreme Value (GEV)
RUM carries the same implication as Marschak et al.’s for the cardinality of
utility.
UTILITY
The practice of RUM has evolved through the adoption of the Fechner model
that utility be interpreted only in ordinal terms. We can develop this interest
whether this results in any substantive change to the inferences deriving from
22
the Fechner model. In particular, let us consider three measurements that are
consider the choices of a single individual, though it might be noted that the
transitivity alone, hence the need for an additional axiom, that of continuity
23
Axiom of Continuity
′ ′
z (x m ) exists such that for any bundle x l ∈ z (x m ) , x n f x l
attributes x n that embody the alternative, with the money budget entering
additively6. This yields the so called Additive Income RUM (or ‘AIRUM’)
U n = α ( y − p n ) + g (x n ) + ε n
Then explicating the redundant degrees of freedom (9) and (10), we arrive at
the following:
U n = λ [α ( y − p n ) + g (x n ) + ε n + K ] (13)
6 It might be noted that recent contributors have considered the possibility of non linearity,
for example McFadden (1999), Karlström (1998, 2001), Dagsvik (2001), Dagsvik and Karlström
(2005).
24
Equation (13) reveals K to be confounded with y , such that choice
our interest in the marginal utility of an attribute, let us differentiate (13) with
∂U n ∂U n ∂g ∂g
= =λ for all x n ∈ T , x kn ∈ x n (14)
∂x kn ∂g ∂x kn ∂x kn
Equation (14) demonstrates that marginal utility derived from the Fechner
Uˆ n = f {λ [α ( y − pn ) + g (x n ) + ε n + K ]}
25
Comparison of (14) and (15) reveals that the former is subject to the cardinal
scale λ , whereas the latter is subject to both the same cardinal scale and the
ordinal scale f .
ordinal utility, and thus establishes the basis for admitting order preserving
requirement, adopting for working purposes the marginal utility (14), which
managing the λ scale, usually under the banner of the ‘scale factor problem’
(e.g. Swait and Louviere, 1993). With reference to (10), the scale factor
problem acknowledges the relation between the λ scale and the variance of
var[λ (ε m − ε n )] = λ2 var (ε m − ε n )
Rearranging:
s.d.[λ (ε m − ε n )] = λ s.d.(ε m − ε n ) ,
s.d.[ ]
λ= (16)
s.d.( )
26
Whilst IID within any dataset, the random variables may not be identically
distributed across datasets, and (16) therefore provokes the ‘problem’ that the
different λ scale. RUM practice has shown considerable support for this
Stated Preference (SP) data, where the latter invariably carry the lower
(e.g. Morikawa, 1989, 1994; Bradley and Daly, 1997) have proposed methods
that combine one or more datasets within the same RUM, explicitly
amount to setting the cardinal scale of one dataset as the base, and the scales
difference in scale between RP and SP, and one would expect a priori that
θ > 1 ). Whilst this serves to establish a unique λ scale across the various
transformation.
27
It is well established in microeconomic theory that the ratio between the
marginal utility of income net of price then the marginal rate of substitution
marginal utilities are of the form (14) or (15), the marginal valuation of an
∂U n ∂U n ∂g
Vo( x kn ) = = α for all x n ∈ T , x kn ∈ x n
∂x kn ∂( y − p n ) ∂x kn
cardinal, such that the distinction between ordinal utility and cardinal utility
surplus arising from a change in price. Note that such a relation exists only if
28
this with the ordinal basis of RUM, a change in attributes may impact upon
change, then a new set of choice probabilities could feasibly result. Two
The rule of a half method (Lane, Powell and Prestwood Smith, 1971;
all else constant. Now introducing the subscripts 0 and 1 to represent the
states before and after the price increase, define Q0 and Q1 to be the number
Having equipped ourselves with the necessary notation, we can now write a
7
Or in other words, total units consumed in each state; i.e. in the example of Table 1, Q = 5 .
8
( ) ( )
Note in passing that if P1 x m T ≠ P0 x m T , such that the price increase ∆p m results in a
change in the probability of choosing x m ∈ T , then the requirement from (3) that all
probabilities sum to one implies the following: ∑l∈T
P1 (x l T ) ≠ ∑l∈T P0 (x l T ), l ≠ m . That
(
is to say, any change in P x m T ) must be compensated by changes in one or more of the
other probabilities.
29
consumer surplus ∆CS m (in this instance a loss) arising from the price
increase ∆p m , thus:
reference to (6), they can be interpreted in ordinal terms. Accepting that, and
orderings that give rise to these probabilities, it becomes clear that (17) can be
inevitably, the cardinal properties of the Fechner model. That is to say, the
redundant degrees of freedom (9) and (10) become relevant, and the forecast
Moreover, the rule of a half remains faithful to ordinal utility only if the
probabilities before and after the price change are observed, and therefore
30
Now consider the second method of measuring the change in consumer
surplus the ‘log sum’ method. Although the origins of this method are
evident in Williams (1977), Small and Rosen (1981) were first to offer a full
specifically to AIRUM. The log sum method captures, in the context of RUM,
individual in order leave him or her on the same indifference curve as before
and Karlström (2005), the log sum method derives from an equality
established between the maximal utilities that arise before and after a price
change, wherein the utilities are of the form (12). To illustrate, let us consider
the same price increase as before, that is an increase in the price of alternative
max [v n (x n ; y + c − ( p n + δ mn ∆p m )) + ε n ] = max [v n (x n ; y − p n ) + ε n ]
n =1,..., N n =1,..., N
31
where c represents the compensating variation, p n is price in the
⎧0 m ≠ n
before state, and δ mn = ⎨
⎩1 m = n
(18)
For the price increase ∆p m , any compensating variation must be given to the
alternative may not be utility maximising in both states. Indeed, let us denote
where these alternatives may or may not be one and the same. Furthermore
let the prices of these alternatives in the before state be denoted p n 0 and p n1
λ [α ( y + c − ( p n + δ mn ∆p m )) + g (x n ) + ε n + K ]
1 1 1 1
[
= λ α ( y − p n o ) + g (x n 0 ) + ε n 0 + K ]
⎧0 m ≠ n1
where δ mn = ⎨
⎩1 m = n
1
1
(19)
discussion of section 7.1, this has the effect of imposing a common cardinal
32
scale across the two states. This assumption of identical distributions is rather
of the consumer surplus loss ∆CS m associated with the price increase ∆p m ,
thus:
∆CS m = c =
[g (x ) − αp ] − [g (x ) − α ( p
n0 n0 n1 n1
]
+ δ mn1 ∆p m )
for all x m ∈ T
α
(20)
this does not mean that cardinality is absent from (20). Rather the cardinal
scale has been standardised across the two states. We can demonstrate this
the maximal utilities within (20), noting that h is a member of the class of
33
preserving. By contrast, the difference between the maximal utilities in those
states is not preserved under the same transformation. Hence with reference
∆CS m = c ≠
[ ] [ ]
h g (x n0 ) − αp n 0 − h g (x n1 ) − α ( p n1 + δ mn1 ∆p m )
α
ordinal terms (i.e. to say whether maximal utility in the after state is greater
than or less than the maximal utility in the before state), it has been necessary
Finally, note that where the random variables ε 1 ,..., ε N of (8) are specifically
IID Gumbel, the Fechner model adopts the logit form. Then adhering to the
analysis outlined above, but taking expectations of the maximal utilities over
9
It might in passing be remarked that the act of taking expectations would in itself seem to
carry a strong inference of cardinality.
34
⎧N ⎫ ⎧N ⎫
ln ⎨∑ [g (x n ) − αp n ]⎬ − ln ⎨∑ [g (x n ) − α ( p n + δ mn ∆p m )]⎬
∆E (CS m ) = E (c ) = ⎩ n =1 ⎭ ⎩ n =1 ⎭
α
⎧0 m ≠ n
where p n is price in the before state, and δ mn = ⎨
⎩1 m = n
from the Fechner model (as with any marginal utility, it might be added)
inferring marginal valuation, and the latter is free of either cardinal or ordinal
scale. The cardinality of utility within the Fechner model does however
probability forecasts to the rule of a half method. Hence, to the extent that
35
8. SUMMARY AND CONCLUSION
the original presentation by Marschak (1960) and Block and Marschak (1960)
from the popular McFadden (e.g. 1968, 1975) presentation. Consistent with
RUM yields a model that carries the properties of cardinal utility. Though
this is not in itself a problem cardinal utility can be used quite defensibly as
from the Fechner model, namely the marginal utility of an attribute, the
36
As would seem inherent in the very notion of marginal utility, marginal
utility derived from the Fechner model carries a cardinal scale. RUM
establish a unique cardinal scale across data pooled from different sources. If
however the ratio of two marginal utilities is taken, thereby yielding the
marginal rate of substitution (or in the particular case of the price attribute,
marginal valuation), then the cardinal scale is removed and the resultant
method for measuring consumer surplus change, then this would seem to be
the latter itself exploits cardinality. The log sum method for measuring
and may therefore stand accused of operating outside of the bounds for
realigning theory and practice; this is to apply the rule of a half method using
37
1960). The original presentation is defined entirely in terms of ordinal utility,
ACKNOWLEDGEMENTS
Institute for Transport Studies (ITS), University of Leeds, and by the Platform
Daly, Tony Fowkes, Nicolás Ibáñez, Jeremy Toner, Chris Nash, David Watling
Any errors and misunderstandings that remain are of course the sole
REFERENCES
38
Block, H.D. and Marschak, J. (1960) Random orderings and stochastic theories
Bradley, M.A. and Daly, A.J. (1997) Estimation of logit choice models using
Hensher, D.A. and Dalvi, Q. (eds) Determinants of Travel Choice. Saxon House,
Farnborough.
39
Daly, A.J. and Zachary, S. (1978) Improved multiple choice models. In
Hensher, D.A. and Dalvi, Q. (eds) Determinants of Travel Choice. Saxon House,
Farnborough.
function. In Thrall, R.M., Coombs, C.H. and Davis, R.L. (eds) Decision
Fechner, G.T. (1859) Elemente der Psychophysik. Breitkopf and Härtel, Leipzig.
Goodwin, P.B. and Hensher, D.A. (1978) The transport determinants of travel
40
Lancaster, K.J. (1966) A new approach to consumer theory. Journal of Political
Lane, R., Powell, T.J. and Prestwood Smith, P. (1971) Analytical Transport
Luce, R.D. and Suppes, P. (1965) Preference, utility and subjective probability.
unpublished.
New York.
41
McFadden, D. (1975) The revealed preferences of a government bureaucracy:
theory. The Bell Journal of Economics and Management Science, 6 (2), pp401 416.
Karlqvist, A., Lundqvist, L., Snickars, F. and Weibull, J. (eds) Spatial Interaction
Models. In Moore, J., Riezman, R. and Melvin, J. (eds.) Trade, Theory and
42
Manski, C.F. (1977) The structure of random utility models. Theory and
Marschak, J., Becker, G.M. and DeGroot, M.H. (1963) Stochastic models of
Neuburger, H. (1971) User benefit in the evaluation of transport and land use
43
Small, K.A. and Rosen, H.S. (1981) Applied welfare economics with discrete
Swait, J. and Louviere, J. (1993) The role of the scale parameter in the
44