The Index Number Problem
The Index Number Problem
This manuscript has been reproduced from the microfilm master. UMI
films the text directly from the original or copy submitted. Thus, some
thesis and dissertation copies are in typewriter free, while others may be
from any type o f computer printer.
In the unlikely event that the author did not send UMI a complete
manuscript and there are missing pages, these will be noted. Also, if
unauthorized copyright material had to be removed, a note will indicate
the deletion.
UMI
A Bell & Howell Information Company
300 North Zed) Road, Ann Arbor MI 48106-1346 USA
313/761-4700 800/521-0600
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
H A R V A R D U N IV E R S IT Y
THE GRADUATE SCHOOL OF ARTS AND SCIENCES
Division
Department Economics
Committee
presented by PiaMalaney
Signature
Typed name
Eric Maskin. Chair
Signature ...
Typed name Peter Timmer
Signature
Typed name Richard Goldman
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
T h e Index N um ber Problem : A
D ifferential G eom etric Approach.
A thesis presented
by
to
Economics
Harvard University
Cambridge, Massachusetts
December 1996
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
UMI Number: 9721685
Copyright 1996 by
Malaney, Pia Nandini
All rights reserved.
UMI
300 North Zeeb Road
Ann Arbor, MI 48103
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Copyright (c) 1996 by Pia Nandini Malaney
All rights reserved.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
A b stract
The first part of this thesis looks at issues in index number theory. By
using techniques developed in differential geometry, it is shown that the so-
called index number problem can be resolved by the development of a special
economic derivative operator constructed for this purpose. This derivative
is shown to give rise to a unique differential geometric index number which
is then demonstrated to equal the Divisia index. It is then shown in the
second chapter (co-authored with Eric Weinstein) that when placed under
similar assumptions, this data based index equals the Konus index, previously
asserted to be incalculable in the absence of a knowledge of preference maps.
The third chapter deals with rural to urban migration. Based on the risk
neutral Todaro model, it has been suggested that raising rural incomes will
have the effect of controlling migration to urban labour markets. However, it
is shown in this chapter that it in the case of decreasing absolute risk aversion
it is possible for increasing income to have the perverse effect of increasing
the desirability of migration.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
C ontents
I n tro d u ctio n 3
A ck n o w led g m en ts 6
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
2 W elfare Im p lica tio n s o f D ivisia Indices 1 50
2.1 Introduction............................................................................................50
2.2 Welfare and Path D ependence........................................................... 53
2.3 Implicit Assumptions Underlying a Cost-of-Living Index. . . . 54
2.3.1 C h a n g in g Preferences and P sych ological N e u
tr a lity ......................................................................................... 57
2.4 The Konus Index under Changing Preferences..............................65
2.5 C on clu sio n s............................................................................................71
department.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Introduction
The theory of index numbers has been of interest to economists for over a cen
tury. This interest has been rejuvenated in the 1990s as the U.S. government
grapples with the question of how to construct a true cost of living adjustment
(C.O.L.A.), or how to correct for the problems of the current consumer price
index (CPI). The inconsistencies between the hundreds of available algebraic
index numbers, and their uncertain effects on welfare (when implemented as
C.O.L.A) have led to extensive debate, and some pessimism about the ability
of economic theory to answer the fundamental questions lying at the core of
this issue.
The first chapter of this thesis takes a new approach to understanding
the question of what the “correct” index number is. We first observe that
all index numbers can be viewed as a combination of two components: an
algebraic formula and a notion of constancy. A general implicit choice has
been made using the ordinary calculus derivative as the notion of constancy.
By using techniques in differential geometry, we construct here an economic
derivative precisely to perform the separation of income and substitution
effects necessary for an economic index. It is shown that when the choice
of derivative is made explicit and the ordinary non-economic derivative is
replaced by the differential geometric ‘economic co-variant derivative’, the
discrepancy between all algebraic formulas for index numbers disappears,
thereby resolving the index number problem. The unique index resulting
from the use of the economic derivative in any of the previously developed
index number formulas is then shown to equal the Divisia index.
The second chapter looks at the welfare implications of the Divisia in
dex based on the new understanding of its mathematical underpinnings. We
compare the welfare effects of the Divisia index with those of the theoretical
Konus index. While the Konus is referred to as the ‘true’ economic index, it is
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
asserted to be incalculable in the absence preference maps. The Divisia index
is, however, entirely data based, and therefore computable. In comparing the
two, however, it is found that there is a major difference in the assumptions
upon which these indexes have been developed. The Konus index makes the
implicit assumption of unchanging preference. The Divisia makes no such
assumption. It is essential therefore, when comparing the two, to uniformise
the assumptions under which they are studied. It is shown in the first part
of this chapter that if an individual is compensated using the Divisia price
index as a cost of living adjustment, under constant preferences the Divisia
price index is in fact equal to the Konus price index. The Divisia therefore
provides us with a revealed preference method for finding the Konus index,
previously thought to be unknowable in the absence of knowledge on pref
erence maps. This is in fact an indication that the path dependent nature
of the Divisia C.O.L.A. axises entirely from the failure of the assumption of
unchanging preferences in the real world. In the second part of the chapter
this unrealistic assumption is relaxed, and both indexes are analysed under
the assumption of changing preferences. In a situation of unchanging pref
erences, two distinct Konus indexes are defined, the Laspeyres Konus or the
Paasche Konus, depending on whether the indifference curve picked is that
of the base time or the current time. Once preferences are allowed to change,
one must select not only between base and current time indifference curves,
but also base and current time preferences. We therefore define four Konus
price indexes, with the various combinations of base and current indifference
curves, and base and current preferences. It is proved that chaining these
indexes also gives equivalence with the Divisia.
It is therefore shown that when viewed in its natural mathematical set
ting, the Divisia index not only resolves the consistency issues that have
heretofore plagued index number theory, but also answers the welfare con
cerns that have spurred the deep and longlasting economic interest in the
issue.
The third chapter deals with issues of migration from rural to urban areas.
Since the seminal work of Todaro in the 1960s, there has been much interest
in the decision making process that leads to the migration of labour into
urban areas which often suffer from high unemployment. The conclusion of
the Todaro model was that the discrepancy in incomes between the low wage
rural labour market and the high wage urban market was sufficient that it
was possible for the present discounted value of the expected urban income to
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
outweigh that of rural income, even with a high probability o f unemployment
in the urban market. The policy suggested by this conclusion has been to
reduce the discrepancy in incomes either by raising the level o f rural wages or
by lowering the level of urban wages. In this chapter it is shown that these
two approaches may in fact not be equivalent. The Todaro model makes
two implicit assumptions which are unlikely to hold in reality. The first is
that an individual maximises expected income as opposed to expected utility.
This is equivalent to making the assumption of risk neutrality. It has been
shown that in general individuals are risk averse, especially at the low levels
of income being discussed here. The second assumption is that the decision
making unit is the individual. Here too much evidence has been accumulated
showing that migration decisions are often made at the level of the family
unit and not solely by the individual. In this chapter we relax these two
unrealistic assumptions. It is proved that when households facing migration
decisions are characterised by decreasing absolute risk aversion, it is in fact
possible for increasing rural wages to lead to increased migration to urban
areas. This ‘perverse’ migration arises because with the increased security of
higher wages for those family workers remaining in the rural labour market,
it is possible for the family to ‘gamble’ more of its members on the high wage-
low probability urban market. It is therefore essential that before using this
policy as tool for stemming rural to urban migration, one ensures that wages
are not within the range of perversity defined herein.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Acknowledgm ents
I would like to begin by thanking my advisor E ric M askin for guidance and
assistance in the writing of this thesis. This dissertation has benefited from
his extraordinary breadth, precision and intellectual flexibility in guiding me
through such non-standard approaches as the application of geometry to
welfare considerations.
I would like also to thank P e te r T im m er for his generous advice and
encouragement throughout this process. In particular, I took continuous
inspiration from his ability to focus my analysis on the core economic points,
even amidst the most seemingly arcane mathematics.
The author benefited from a research assistantship working with Prof.
A m artya Sen during the 1994-1995 academic year. Additionally, I have
found Sen’s discourse on index theory contained in the article “Welfare Basis
of Real Income Comparisons” (cited here as Sen [32]) to be invaluable as
a reference laying bare the otherwise implicit assumptions underlying the
theory of index numbers.
I am grateful to W . E rw in D iew ert for his thoughtful input and for
freely sharing his comprehensive knowledge of index theory. Specifically,
I received invaluable information on the state of the literature concerning,
changing preferences, chaining arguments, and welfare discussions of the Di
visia index, from a thorough letter which is cited here as Diewert [10]. Per
haps the most helpful resource in understanding the background and history
of Index Number theory has proved to be his compilation of articles Essays
in Index Number Theory Vol. /(c ite d here as Diewert and Nakamura [11]).
Many of the ideas contained in this dissertation are the result of an ongo
ing discussion, debate, and intellectual interchange with my co-author E ric
W einstein.
Throughout the course of writing this thesis I received a great deal of
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
valuable input. In particular, I would like to thank S c o tt A x elro d , Dick
G old m an , T h o m a s H ellm ann, S tep han K lasen , T h o m a s Sjostrom
and Isadore Singer, for helpful conversations. It is probable that this list
contains omissions; they are entirely unintentional.
To my parents E sth er and H ira M alaney: You always expected the
best from me, and it is only in striving to give it to you that I have made
it this far. I am grateful to have had the opportunity to fulfill some of the
dreams which when placed out of your reach, you put within mine. Thank
you for all the love and support that have enabled me to achieve this.
I would like to dedicate this thesis to my intellectual inspiration and best
friend, Eric: I have learned so much more than differential geometry from
you along the way, and I know the journey continues to hold ever more.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 1
The Index Num ber Problem : A
Differential G eom etric
Approach
1.1 Introduction
“A man with one watch knows what time it is; a man with
two is never sure.” -Proverb
The primary diagnostic measures of an economy are price and quantity in
dex numbers. The plethora of existing indexes can therefore leave economists
with an inability to evaluate accurately the most basic of our economic con
cepts. The most widely used index numbers, the Paasche and Laspeyres, can
disagree not only in terms of the magnitude of change but also the direction.
These contradictions in qualitative results (with regards to growth versus
shrinkage or inflation versus deflation) caused by our index number problem,
puts us as economists in the embarrassing position of being doctors unable to
take a temperature without counseling the patient to get a second opinion.
In the words of W.E. Diewert:
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
base Laspeyres index, while the implicit deflator for the consump
tion expenditures component of GNP is a fixed base Paasche In
dex. Given the importance of consumer price indexes in indexing
wages and cost of living supplements, it is important that official
price indexes be consistent with each other.” -Diewert 1978 pp.
268
This points out that in practical terms, the social welfare of those who
depend on cost of living adjustments is affected to a large extent by which of
several gauges is currently in favor. To put it simply, consistency issues are
pre-requisite to welfare analysis.
The fundamental nature of this problem has, over the last century, at
tracted the attention of many economists. Starting in the late 1800s the quest
for the “true” index number led to the development of hundreds of different
formulas, each offering a different result. The inability to evaluate the rela
tive validity of any of these formulae led to the development of what is known
as the test approach to index numbers. Since all index numbers agree in the
case of a single good, an attempt was made to formulate plausible properties
for index numbers in the one good case and test whether a proposed index
satisfies them for many goods. There were systematic attempts made by
Walsh, Fisher and other economists to develop tests to isolate the “true”
index number. However the people developing the tests, not coincidentally,
were often the people developing the indexes. Needless to say this lent an air
of relativism to the entire process. In the 1930’s, however, the test approach
showed up an even bigger problem: Frisch proved an impossibility theorem
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
showing that no bilateral index number could pass three basic tests1. This
led to a pessimism as to the very existence of this ‘‘true” index that had been
quested.
An entirely different approach was developed by Konus in the 1920’s
which has come to be called the ‘economic’ approach to index numbers.
The Konus index, a somewhat theoretical concept, is based upon economic
optimisation under constraints. A Konus cost of living adjustment would
compensate (or penalise) a person by the amount it would cost him to con
sume the least expensive basket of goods on his original indifference curve.
While this in some sense captures what we as economists would like to think
of as the “true” cost of living adjustment, there axe several problems with
this index and its interpretation.
Undoubtedly, the most obvious practical shortcoming of the Konus in
dex is the general unavailability of information on preference maps. While
this practical concern may limit its computation, there are even more chal
lenging theoretical problems. One basic flaw is its unnatural assumption of
unchanging preferences, without which the answers it returns can be more
psychological than economic in nature2. Finally we once again are left with
the decision of whether to use base or current yeax preferences, bringing us
back to the original problem of consistency.
As has been noted by Sen and others, it is of the utmost importance in
work on index theory to clearly identify the scope of inquiry and the implicit
assumptions present. The focus of this paper will be on those price and
quantity indices which axe directly computable from time-dependent vectors
p(t), q(t) of price and quantity data for an individual or group between
two time periods t = 0,1. We will refer to such indices as bilateral temporal
indices. Since the disagreement between these indices can reduce what should
be an objective measurement (viz. of inflation or growth) to a subjective
choice of measurement instrument, the most vexing problem in the field is the
consistency or so-called index number problem. The solution of this problem
may in turn be thought of as the first step in the application of data-based
indexes to the analysis of welfare. It is this problem to which this paper is
addressed. It should be stressed that this is a theoretical paper attempting
1Frisch 1930
2Sen (1979) addresses a broad range of issues underlying the assumption of unchanging
preferences.
10
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
to further develop the logical foundations of index number theory.3
It is possible to look at any bilateral index number as a combination of two
ingredients: an algebraic formula and a notion of constancy. We claim that
given such a decomposition there is no inherent problem vrith the algebraic
formulas of any of the common bilateral index numbers. The problem, when
properly isolated, is seen to arise from a consistent implicit choice of an
incorrect notion of constancy.
As the field of economics shifted in the last century to incorporate devel
opments in calculus, economists have adopted the standard notion of deriva
tives: df = 0, as our notion of constancy. It was only later that mathe
maticians themselves refined their concept of derivatives to allow for them
to be adapted to particular problems. The field of differential geometry was
developed around work with these “adapted derivatives” or “connections”.
By using differential geometry to adapt our notion of constancy to fit the
unique problem of index numbers, we are able to eradicate inconsistencies
between all common bilateral index numbers.
Updating the field of index numbers to incorporate these developments in
mathematics in this way enables us to cast age old issues into an entirely new
framework. Just as differential calculus was the framework within which to
think of optimisation, differential geometry is the correct framework within
which to think of index numbers. As with previous mathematical develop
ments, this framework immediately helps clarify the questions we are asking.
Not only is it able to recover various scattered results from index theory in
a cohesive picture, it enables us to prove new and valuable results.
We give here a plan of this chapter. In section 2 we use a simple toy
example to show explicitly how a seemingly innocuous use of the ordinary
derivative must be altered in order to recover the economics of constant
purchasing power; this guiding analogy is intended to bring out the logical
necessity of the differential geometric approach. In section 3 we will de
fine the differential geometric set-up and show how it produces a preferred
index which we refer to as the geometric index. A simplified 2 -good econ
omy is constructed in section 4 which exhibits the index number problem
for the Paasche and Laspeyres indexes. We show in detail how the some
what abstract geometric set-up is used to carry out an explicit computation
3For previous related results see: Richter 1966, Hulten 1973, Samuelson and Swamy
1974, and Diewert and Nakamura 1993 and the references therein.
11
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
producing agreement between the aforementioned offenders. In section 5 we
show that this framework actually leads to a resolution of the index number
problem for temporal bilateral indices. Section 6 contains a derivation and
discussion of the Divisia index. In section 7 we summarize our results and
discuss the merits of path dependence in the computations of C.O.L.A.s;
we also discuss plans for future extensions. Finally, in appendix A we give
some of the terminology from differential geometry underlying the previous
discussions.
12
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
1.2 A d ap ted D erivatives and Differing N o
tio n s o f Constancy
Each index number can be viewed as a combination of two elements :
1. An algebraic formula
2. A notion of constancy.
8m P(t) _
( 1. 1 )
St = 8t
13
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
St dt
With this ansatz let us solve for g(t):
0 — ^ m ^ ^ + am)P{t))
4. g{t Ptm n( 1-3)
This is our adapted derivative, i.e. The derivative which recognizes a constant
purchasing power salary as constant. We will call this adapted derivative a
covariant d eriv a tiv e or con n ection.
Thus we see that we have a choice between two different derivatives where
we may have believed that only one existed. It is plain to see that failure to
select the correct derivative leads to wrong economics. Fortunately, in the one
good case it is easy to recognise the problem and it can be corrected for in
several ways; this contrasts markedly with the more complicated situations.
Our claim in this paper is that the so-called “index number problem” arises
from a consistent improper choice made from an analogous pair of derivatives.
All bilateral index numbers implicitly choose to use the ordinary derivative as
their notion of constancy; what we propose to show in this paper, is that when
instead of the ordinary derivative, we use the derivative uniquely adapted to
our problem, the discrepancies between the various bilateral index numbers
disappear.
14
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
1.3 T he D ifferential G eom etric Index
1.3.1 Notation and Introduction to the Index
Let V n be a space of n goods and services under consideration. Let V* be
the space of pricing systems o f V n. (V and V* can be thought of as column
and row vectors respectively.) Then if q € Vn and p 6 V* , we have
Assume p(t) • q(t) ^ 0 V£, i.e our basket is never worthless. Any combi
nation of goods and services w 6 V can be uniquely described by a sum
15
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
of chaining indexes, but because of time gaps in chained baskets, the ability
to translate any particular basket is unavailable. For instance, the U.S. CPI
is a prominent representative example of a chained index whose links are
approximately 10 years in length. Rapid changes in fields such as computers
and the automotive industry mean that several generations of changes may
occur between recalibrations of the reference basket. Consider, for example,
the category referred to by the Bureau of Labor Statistics as “Information
Processing Equipment”; there has been only one recalibration (in 1982-84)
between the last slide rules and the modem lap tops of the 1990s. Obviously,
this kind of rough taxonomy is insufficient to deal with rapidly changing
components of the basket.
What the index should require is the ability to allow the composition of
the basket to change, while maintaining constant some notion of value. We
can maintain a “constant value” basket while allowing the composition of
the basket to change by allowing trade-offs in the basket at any time based
on the current market value of the trade-offs.
This can be accomplished mathematically using an adapted or covariant
derivative. The tools we use to develop this are:
1. Projections onto the space of goods [q(f)] and barters /3p^) i.e. t> =
A(<)q(f) + 6 (f)
D efinition 3 We define E to be V x V * —C .
16
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
For most realistic purposes, the baskets of goods under consideration will
have non-zero value. Thus confining ourselves to consideration of only those
baskets and pricing system represented by elements of H C R2n will not
present a serious restriction.
Given this motivation we give the following definition.
n w ‘,l(“) = (? M ) ,( i) (L8)
= w - n Wt)](w) (l.ii)
where v € V and uj € V '. So, for example, it is immediate that
v = % ( * ) ] ( * ) + n ^P(t)(u )- (L 12)
17
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
suggests, composites of debts and assets whose total value is identically zero.
The corresponding subspace (3q can be defined within V*.
y*
Good 2 Mce 2
Good 1 . Price 1
18
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Good 2
v
[q]
Good 1
(1.13)
We can then assemble these real valued functions into a single vector valued
function
given by
n
t=l
19
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
constituents. The real-valued functions /,• which determine the vector-valued
function o are given by the contributions of the various basket components
in their various units of measurement (kilograms, gallons, etc.).
The total differential, or Jacobian, of a is defined to be
W = 5 3 dH q , p)et ( 1 .16)
i=l
= t ( t ^ , + t ^ d Pk)e, (1.17)
i=l j = l UQj k= 1 ° P k
This is simply the Jacobian evaluated in the direction of the tangent field
* = i alonS Q-
Let us now define the derivative uniquely adapted to achieve the ap
propriate index number. The non trivial ‘economic’ covariant derivative of
o’ : E — ►V along a is:
The interested reader may check that V “ satisfies the axioms for a covariant
derivative found in Appendix A. 4
If V%o = 0, we will say that o is covariantly con stan t along a with
respect to the covariant derivative V°. If a is covariantly constant along a
with respectto agiven covariant derivative V and <r(a(s)) = v we will refer
to o(a(d)) asthe parallel translation of v along a from the‘source’ time
4It should be noted that when the vector valued function <r(x) is a multiple of the
quantity vector comprising z 6 E , the second summand is identically zero; this simplifies
matters considerably and is the major case of interest in this paper. For example, if we
are attempting to track the growth of a basket ?(0) from time t = 0 to time t — 1, the
vector <r(0) at the source time t = 0 will lie entirely in the subspace cut out by g(0). The
projection of or onto the barter space, lip <r, will therefore be zero. In fact if V%<r = 0
then it will be the case that along a we will have IIpFt<r = 0 for all t.
20
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
t = s to the ‘destination’ time t = d. We will denote this parallel translate
by T,(d); that is rs(d) is determined as a particular value of a vector valued
function a, covariantly constant along a curve. It should be noted that it is
possible to parallel translate using any covariant derivative. If the parallel
translation is done using the ordinary derivative, it will simply give us the
original basket. This corresponds to the traditional notion of constancy. We
will denote the parallel translate of q(s) (or p(s)) along a with respect to the
ordinary derivative V° by r°; likewise, the parallel translate of q(s) (or p(s))
along a using the adapted covariant derivative V a will be written r°.
At this point one might ask ‘what is the economic interpretation of the
equation V|<r = 0?’ In order to address this question, it will help to recall
our earlier interpretation of the various projection maps. Let us begin by
making the assumption that cr(a(0 )) = q(0 ); while this is certainly not a
generic assumption, all er’s considered in this paper will satisfy similar con
ditions. In order to induce modernisation or antiquation of the a(t) basket
components while disallowing ‘growth’, we would like to require that cr{t)
be changing at any given time strictly by barters. This allows the time de
pendent basket <r(t) to maintain the ‘value’ of the original basket q r ( 0 ) while
changing its composition as the composition of q(t) changes. This goal is
achieved by selecting the covariant derivative V£<r, for by construction, the
parallel translation equation requires that V|<r = n[9]V|(II[g]<7 ) = 0. In
prose, this means that the projection of the “change in <r(<)” (namely V |)
onto ‘basket space’ (namely [q]), must be zero, thereby confining all change
to lie in barter space. This parallel translation of the basket g(0) from time
t = 0 to time t = 1 using V£er will thereby ensure that at time t = 1 , a(t)
will be some multiple of the basket q(t).
We are therefore going to seek a vector valued function a : E — > V such
that for a source time s and a destination time d we have:
1 . <r(a(s)) = q(s)
2 . V%<r = 0
The first condition requires that our vector valued function a agree with
the source-time basket q(s) at the point from which we are parallel trans
lating. The second condition requires that we parallel translate this basket
using the adapted derivative.
21
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
We note that <r(a(t)) is uniquely determined by a theorem in differential
geometry. 5
It is clear that depending on the covariant derivative used, such a solution
will have quite different properties. For the ordinary derivative V°, a solution
will be of the form
*(«(*)) = ( L2 0 )
For the adapted derivative V°, however, we will have,
As discussed above, the parallel translate of the basket from the source time
to the destination time using the adapted derivative is a multiple of the
basket atthe destination time t. This multiple, Aexpresses the “size” of the
source time basketrelative to the destination-time basket q{t).. T herefore
the g eo m etric in d ex is sim p ly th e m u ltip le A.
We should note at this point that there is a complementary version of
this picture where we consider vector valued functions
cr: E — y Vm ( 1 .2 2 )
n : V m— ►V . (1.23)
22
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
us assume that the islanders produce lumber with a saw mill which is grad
ually rusting into disrepair, but that in partial compensation the islanders
become increasingly inventive in their ability to produce coconuts. We fur
ther assume that the consumers on the island are developing a mild preference
for coconuts over lumber. Mild in this context means that the increased de
mand is not strong enough to keep the coconut price from decreasing relative
to lumber. The toy economy given below will be seen to exhibit the index
number problem for the Laspeyres and Paasche quantity indices. While the
example has obviously been abstracted, it does exhibit the full complexity of
the paradox that we are trying to resolve.
Exam ple 1 Consider an economy with two goods gi, <72 examined over a
one year period 0 < t < 1 . Let q : [0,1] — > V = R2 given by q(t) =
(< + l)ei + (—<+ 2)e 2 be the output function of the economy at time t. That is
q(t) gives the production o f goods (<7i, <72 ) at time t. Let p : [0,1] — > V" = R 2
given by p(t) = (—lOt + 100)e* + (91 + 92)ej be the price function of the
economy. Likewise p{t) gives the prices per unit quantity of the goods (<71 , <72 )
at time t.
1. cr(<) = A(t)q(t)
2. A(0) = 1
3. vs(*W) = nw«,i(^Snw<B<r) = o
Inside of
[0,1] x V (1.24)
(referred to as a trivial vector bundle in Appendix A) the section
given by
? (0 = ((^ + l ) ei + (~* + 2 )e2) (1-26)
23
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
cuts out a changing 1-dimensional vector sub-space from V (a non-trivial
sub-bundle with 1-dimensional fiber in the geometric language of Appendix
A). Conversely, the price history
Thus we calculate
vjAMrfi) = (1-29)
dA
= H[9(*)](-^(*)((f + l)ei + (—* + 2)e2) -I- A(i)(ex — e2)) (1-31)
Recalling that
24
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
with initial condition A(0) = 1 (which is an ordinary linear differential equa
tion). We solve it as follows. Since
— (t) = __________ + _8 )___ . .
dt W (—19*2 + 16t + 284) W 1 j
we should expect that A(*) will have the form A(*) = e3^ with
*£.(*) = —(—19* + 8 )
d t { ) (—19*2 + 16* + 284) ’ [ }
We next note that
rf(-19*2 + 16* + 284)
= -38* + 16 = 2 • (-1 9 * + 8 ) (1.39)
dt
so we are looking for a function g{t) with
* = _ 1 (-191*+ 161+ 284)'
dt 2 (-19t» + 1 6 1 + 284) ' ’ 1
which indicates that g{t) is of the form g(t) = —| • /n (—19*2 +16* + 284) + c
where c is a constant to be determined from initial conditions. Thus we have
A(<) = e-f/n(-19^+16<+284)+c = (1.41)
w V - 1 9 t2 +16* + 284 v '
which when when supplemented by our requirement that A(0) = 1 fixes the
value c = /n(\/284) giving our final solution
A(f) = , ^ (1.43)
N/( —19*2 + 16* + 284)
25
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
1.5 R esolution O f th e Index N um ber P rob
lem
As noted previously, the two most commonly used indexes, the Paasche and
Laspeyres, have an inherent discrepancy. Let us start by observing the effects
of updating our notion of constancy for these two indexes. We will later
extend these results to all bilateral indexes.
The Paasche and Laspeyres quantity indexes have the formulas
In both of these indices, three of the four factors which appeax are from a
common time period (the period used to make the comparison)and the odd
factor is imported unchanged acrossthe time separating t — 0 from t = 1 .
It has been held constant in the usual sense of the term. This leads us to
suspect that the problem is arising from the use of a derivative, un-adapted
to our situation. We thus rewrite (1.44) in the form
26
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
thus resolving the so called index number problem.
Let us verify this for our example from the previous section:
In our example we see that the Laspeyres quantity index
K O M I) 292
(1.48)
p(O)KO) 284
indicates positive growth, whereas the Paasche quantity index
p (l)g (l) _ 281
(1.49)
p(l)<z(0) 292
indicates negative growth.
Let us begin with the Paasche quantity index. We first observe that since
the quantity q(0 ) in the expression p(t)q(0) is being evaluated at p(t) prices,
it is really the parallel translate of q( 0 ) from a ( 0 ) to a(t) using the ordinary
derivative. That is
P (l)g (l) P ( lM l) n enx
P(i)<z(0) p ( i K ( i ) 1 ;
where Tq(1) denotes the parallel translation from time t = 0 to time t = 1
using the ordinary derivative.
Likewise for the Laspeyres quantity index we have
1. <r(t) = A(t)q(t)
2. A(0) = 1
3 . V ‘M ‘ » = n h w l( v ? n w i r ) = o
From our calculations in the last section we see that this function is given
27
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
enabling us to calculate the corrected Paasche index:
2. A(0) = 1
3- v s (# (i» = n w^ (v sn win5) = o
which we see from (equation 43) gives us
A ( l) = ^ (1.54)
y ( - 1 9 t 2 + 16(4-284)
As the Laspeyres and Paasche are actually equal for all t, the agreement with
the Fisher index is immediate.
This result is not restricted to the Paasche, Laspeyres and Fisher in
dexes. What follows is a formal proof of the equivalence of all bilateral index
numbers when the correct adapted derivative is used.
28
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
1.5.1 Technical Proof of Equivalence o f Bilateral Index
Numbers
We take the opportunity below to set notation before going into the proof
of the equivalence of all bilateral adapted indices. Readers should consult
appendix A for terminology. We follow the proof of equivalence with explicit
formulas for the major indices and their differential geometric adaptations.
29
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
parallel translates.
When one chooses the ordinary derivative, the parallel translation of the
source-time vector q(s) to the destination-time is merely the original vec
tor q(s). However, when the adapted derivative is selected, we are ensured
that the parallel translate of the vector q(s) will be a multiple of the vector
q(d) at the destination time. Therefore, when this data is input into any
index number with an algebraic formula passing the proportionality test, it
is guaranteed to give us the same answer: namely A.
D efinition 5 We define sub-bundles of'Ey. V x V m by
Eq = E x V x V m (1.58)
S i = {(x, 0, v m) € So s.t. vm6 [pr] C V*} (1.59)
s 2 = { ( x ,o ,u * ) € S 0 s x D ' e f t .c r } ( 1 .60 )
5 3 = {(x ,u ,0 ) € S 0 s.t. v € [?i] C V} (1-61)
5 4 = {(x, v , 0) € S 0 s.t. v e P Px C V } (1.62)
Definition 6
P(E) = { a |a : [0,1] — > E (with a piecewise differentiable)} (1.63)
D e fin itio n 7 Given an element <r(s) in the fiber Ea(3) of a vector bundle E
over E with connection V, we will let (<r(s), d) denote the parallel translate
ofa(s) at a(s) to a(d) along a relative to V.
30
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
D efinition 9 Let us define 0 := ©*=1 V ©*=1 V*and k C 0 by the rule
Definition 11 Let
T : P(E) x A —4 0 (1.68)
be given by
r ( a ,V ) = (1.69)
( p ( 0 ). r j ( p ( l) , 0 ), p(l), r j(p ( 0 ) ,l) ,
9 (0 ), r j ( , ( l ) , 0 ), „( 1 ), r j ( 9 ( 0 ) ,l) )
F : 0 — >R (1.70)
F((A,y-A, (1.71)
y
31
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
P roof o f Equivalence
Theorem 13 Let / i i2 • F(E) x A — > R be a pair o f quantity index formulas
where / lt2 = Fl f 2 o T,
T : P(E) x A — + e (1.72)
is as before and
Fh2 : 0 — > R (1.73)
are two formulae which satisfy the proportionality test. Then if V “ = =
V “, we have /i(*, V a) = / 2 (-,V a) and /{(a(t), V a) = y») (time rever
sal).
Proof: The proof proceeds in two parts. First we we will show that any
adapted connection on So is “reducible” in that it is the sum of 4 connec
tions on the Ej sub-bundles of So. The implication of reducibility is that
if we parallel translate a vector u(a(s)) belonging to a sub-bundle E,- using
a reducible connection, then the parallel translate at the destination time d
will remain in the same sub-bundle as the original vector at source time s. As
both the sub-bundle of pricing systems S i, and the sub-bundle of baskets S 3
have 1 -dimensional fibers, we are guaranteed to have our parallel translates
of p(s) and q(s) expressible as non-zero multiples of the price and quantity
data at the destination time d.
The second part of the proof will be to show that for any such reducible
connection V°, the 8 pieces of data comprising T( a , V°) are guaranteed to
fit the pattern found in the proportionality hypothesis. Since all index num
bers can be represented by formulas satisfying the proportionality hypothesis
above, the conclusion follows.
To begin, we must demonstrate that if V is an adapted connection and
a is a section of one of the four sub-bundles of S,- C So, then V * < 7 is again
in E,* for any X € T3. For concreteness, let us look at <7 C S 3 . Now by
hypothesis we have
proving our claim. The claim that given any connection V on S 0, IIe,-V(IIe,-*)
is a connection on S,- can be verified by checking the axioms for a connection
in Appendix A, and is straightforward.
32
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Because the 4 summands of an adapted connection V* are individually
connections on the sub-bundles we are assured that the parallel transla
tion of a vector in one of the summands £,• will remain in that summand. In
particular, for the one dimensional [9 ] sub-bundle we have
This proves:
(p(0 ), y -p ( 0 ), p(l), ^ - p ( i ) 5
If
9 (0 ), 2 - 9 (0 ), ?(1 ), 7 - 9 ( 1 ) ).
33
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
This means that in the case of an adapted covariant derivative V°, we
are guaranteed that T (q ,V °) € k . Thus by the hypothesis that our index
formulas satisfy the proportionality test we can see that
A) = A) = --- (1.82)
/i ( a ( l - 1 ), A) / 2 (a (l - 1 ), A)
34
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
A dapted Index N um bers
I^et us now see how this adjustment in the notion of constancy corrects
some common bilateral index numbers. In the following price index number
formulas the <r’s refer to the parallel translates of prices:
Laspeyres
Original:
(1.83)
Adapted:
f w p v .iv ) s (1.84)
Paasche
Original:
Adapted:
( 1 ' 8 6 )
Fisher
Original:
n fJO
PF( p , _p1 , q_o, q_u
) _= P1 • 9° P1 ' Q1 „
(1.87)
Adapted:
Pf (p i P \ q »? ) = ( 1.88)
Tomqvist
Original:
N f p l \ (P n 9 n /p ° '9 ° ) / ^ l \ ( P n f l i / p 'V )
^ h p 0,? 1,? 0,? 1) = n \ (1.89)
n=l ' JsJ 'U J
Adapted:
AT
^ 1(0 )V pOn,a/pO°',O) ( pk \ ^ k/pW) (1.90)
l Pn J \^ o ( l ) j
35
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Walsh
Original:
.0 „I „o _ Dfai
* ( 1 JX)
T . U («?«})I/2 P?
Adapted:
Jevons
Original:
p t f j ) = n ( 4 ) ,/w (i.93)
«=i p
Adapted:
36
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
1.6 The D ivisia In d ex
What is thisnew index? In this section we will prove that this index is
the same as the Divisia index. Divisia developed thisindex inthe 1920’s
in a somewhat ad hoc fashion. In fact, Bennet [1920] writing a few years
before Divisia using essentially the same argument, came up with almost the
same index which differs from Divisia’s by its absence of Log functions. The
following is a description of the traditional development of the Divisia index6:
Q :R — P :R —t R (1.95)
such that
p(t) • q(t) = (* )« (0 = P(t)Q(t) Vf. (1.96)
t '= i
m - g (i.97)
with Q(t) arbitrary. Divisia’s idea was to take the derivatives of the loga
rithms
dln(p(t) • q(t)) _ dln(P(t)Q(t))
(1.98)
dt dt
expand using the chain and product rules
37
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
to be equal. We can then see that this does specify unique function P(t), Q(t)
by the formulas
= n w ( v j n w (<r)) = n p( ^ ( i ) . P( 0 + m % M) ( 1. 107)
= ((fW -P W + A W fW ) . , W )
p(t) • q(t)
38
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
hence
(u n >
= ~ * - J * P l M ^ )d r) (U 1 2 )
= « * - / (U I4)
J a i=l P'% Pi
Thus
QED.
39
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
1.7 R esu lts
As we have shown in this paper, much of index number theory can be viewed
as a special topic in differential geometry. When the traditional index number
problem is formulated in the context of differential geometry, a new feature
emerges. Unlike the ordinary calculus, differential geometry points out that
there is not a unique derivative as is often assumed. There is however a
unique choice of derivative adapted from the fam iliar one to the problems
of index theory. Once that choice is made successfully, all index numbers
currently in favor become equivalent and are equal to the Divisia index. The
shift in emphasis from the algebraic formulas of various index numbers, and
towards the choice of derivative thereby enables us to resolve the so called
Index Number Problem. Thus by using these mathematical techniques, we
have a natural answer to the question “What is the correct index number?”
Casting the question in the right framework not only allows us to unify and
recover previous results, it also enables us to understand better the nature
of the question.
One of the primary debates one sees in the current literature on index
number theory centers around whether or not to chain bilateral indexes.
With the sophistication of the differential geometric framework, we can cast
this question in an entirely new light. As we have seen above, all index
numbers make a choice of derivative. The choice has till now been made
implicitly. When we view the issue differential geometrically we are forced
to make the choice explicit. What then becomes clear is that there are two
different derivatives being used. The ordinary derivative, which has no justifi
cation whatsoever economically, and the adapted derivative, whose economic
justification has been presented in this paper. Once one is forced to justify
the choice of derivative, one must realise the arbitrary nature of all bilateral
indexes using the ordinary derivative. The Divisia is the only index which
uses an economically justified derivative and is therefore superior to all bi
lateral indexes using the ordinary derivative. Any further approximation to
the Divisia is therefore an improvement. Chaining is simply one such ap
proximation. The issue should no longer be whether or not to chain bilateral
indexes, but rather to find the best way of approximating the Divisia.
The one characteristic of the Divisia index most troubling to economists
is that it is given by a path dependent line integral. What we would claim is
that even with this path dependence (or perhaps because of it), the Divisia
40
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
is the fair and correct way to calculate cost of living adjustments in our real
world filled with changing preferences. Let us attempt to make an argument
for this in a particular setting in order to show that path dependence can be
more of a virtue than a vice. Let us assume that we have two individuals
starting and finishing by consuming exactly the same basket of goods, and
look at a situation where the path dependent Divisia might give us a different
index for each of these individuals. Let us say that the first, as a connoisseur
of the finer things in life, starts cultivating his taste in red wine, while this
might not be a particularly popular habit. As he is functioning under a
budget constraint he thus shifts consumption away from other goods, for
instance soda pop. If we assume there is then a faddish move toward the
consumption of red wine we should expect that wine prices will start rising.
The second consumer, following the fad, starts shifting his consumption only
after the prices have already risen. The Divisia would require compensating
the first consumer, but not the second, precisely because their paths were
different. The first started consuming wine while prices were low, and then
got hit with the price increase. The second chose to shift his consumption
only after prices had increased. What we would claim is that it is only the
path dependent Divisia index that can give a fair cost of living adjustment.
What we see in the mathematics is that a Divisia index C.O.L.A. is a
guarantee that that the recipient will be able to maintain his previous life
style with the freedom to change to an equivalently priced life-style at any
time. Unlike the Konus C.O.L.A. discussed in Sen [1979], the person will
not be compensated if they become increasingly depressed nor penalized for
greater happiness. Unlike the Paasche and Laspeyres C.O.L.A., her adjust
ment is based on a lifestyle which is allowed to change at any time. But there
is an important feature of Divisia compensation not found amongst the other
indices: the element of risk. Under a Divisia C.O.L.A., your lifestyle is an
investment. If you change your spending patterns, there is no guarantee that
you will be able to get back to your previous level of consumption. Fax from
being exotic, these risks are well known to all those who must decide how
much currency to change when visiting nations with volatile exchange rates.
What the differential geometry emphasizes is that path dependence is prefer
able and that the path dependence of the Divisia is no more counter-intuitive
than the path dependence underlying the theory of investment.
As previously noted, the differential geometric framework allows us to
see that this index is the natural one to emerge from both the mathematics
41
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
and the economics, and that it enables us to resolve consistency issues raised
by the plethora of existing indexes. What it also enables us to do is recover
previous results about the Divisia in a unified framework and to extend these
results in ways that can practically facilitate our use of the index.
For economists concerned with issues of path dependence, an important
result is path independence of the Divisia under unchanging homothetic pref
erences [see Hulten 1973]. What does the mathematics have to say about
this?
Differential geometry allows us to calculate something called a curvature
tensor. Recall that with the ordinary notion of a derivative, mixed partials
commute. This is no longer true of general covariant derivatives. The cur
vature tensor measures the failure of the covariant derivative to commute,
i.e. V x V y ^ V y V x- What differential geometry tells us is that when the
curvature tensor is equal to zero on a subspace, our index will be path in
dependent. Homothetic preferences are a special case of a zero curvature
tensor.
Another important result is Diewert’s 1980 demonstration that when cer
tain approximations are valid, the Paasche, Laspeyres and Tomqvist indexes
can be regarded as discrete approximations to the Divisia. In addition, it has
been noted that chaining usually improves the approximation and leads to
less variation among index numbers. As discussed previously, the differential
geometry shows us that chaining is a crude way of approximating our adapted
derivative. Given that the adapted derivative resolves the consistency issue,
greater agreement under chaining is therefore explained.
Thus we see that not only does our approach recover the above results,
but it also gives a unifying framework in which to understand them.
1.7.1 Extensions
Besides recovering previous results in a consistent framework, differential
geometry enables us to extend our knowledge about the Divisia in practically
applicable ways:
I.) Using the theory of connections, it is possible to do for the space of
barters what has been done for the space of baskets. In this paper we have
shown how to modernize or antiquate the components of the usual ‘fixed
basket’ to reflect changes in production and/or consumption.
42
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Understanding how barters change is essential to understanding the econ
omy as a whole. For example, if our basket q(t) is taken to be the U.S. GDP,
an individual’s basket will consist of a non-zero barter vector B added to
a scaled basket vector A • q(t) (here B is asserted to be non-zero because
individuals never purchase multiples of the GDP). Further, the entire theory
of option pricing rests on the fact that barters today will not in general be so
tomorrow. It can be shown however that the adapted derivative gives us a
natural identification between the spaces of barters at two different instants
of time. This in turn results in an n-1 x n-1 matrix analog for barters of the
usual Divisia index for baskets. The significance of this matrix index is not
yet clear and and is the subject of work in progress with E. Weinstein.
II.) Differential geometry allows us to discuss path independence in the
absence of knowledge of homothetic preferences. Let us assume that we axe
not given access to preferences, but that we are instead given some region
R C V x V* such that when we restrict the curvature tensor F v° to R we get
zero. Then if cci, axe two curves with the same end points and a i can be
deformed to within R while keeping the endpoints fixed, then the Divisia
of a i equals c*2 .
III.) When the curvature tensor F v ° ^ 0 we can use the ‘size’ of the
curvature tensor to estimate how fax we are from path independence. In
other words, we can use the size of the curvature tensor to estimate the local
contribution to the path dependence of the Divisia index. Thus, this allows us
to deviate from pure homothetic preferences and estimate data needs based
on the curvature intensity of a region.
These issues will be further explored in following papers.
43
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
ment. Thus gasoline will be measured in gallons and labor in hours. We will
also assume that all prices are linear and are therefore measured in .
Let us order the goods and services together with their system of units and
denote the collection by {fir,-, Given this initial data we will describe
the evolution of our sector by the collection of data T = (V n, or, V°) given
as a triple. We will postpone the definition of V° until section 1.8.3.
Let V n be the vector space of all quantities of goods and services in the
sector of the economy under consideration. That is we let {e,}JL1 C Vn be
a basis for V indexed by the (ordered) set of n goods and services under
consideration in their natural units of measurement (kilograms, units, etc...).
We will interpret negative multiples of these goods as debts or obligations to
provide.
We will let V * be the dual space of linear functionals <f>: V — > R. This
is a vector space of the same dimension as V. We note that given our basis
{e , } " = 1 C Vn there is a corresponding basis {e * }" = 1 C V* where e* is defined
by specifying that e*(ey) is equal to 1 if * = j and 0 otherwise.
We wish to comment here that the reader may choose to represent the
elements of V as ‘column’ vectors and the elements of V * as ‘row’ vectors.
In this notation the act of evaluating an element p* of V “ on a vector v is
obtained by matrix multiplying the l x n row vector against the n x 1 column
vector to get a real number (i.e. a 1 x 1 matrix).
If we assume that we are interested in an initial state of the economy
at time To contrasted against a later state of the economy at time Ti then
we can define p : [r0, Ty] — > [0 , 1 ] by p(x) = ( ^ ~ ^ ) in order to restrict our
formalism to the interval [0,1]. From now on we will assume that the time
interval of interest will always be [0 , 1 ].
Let E = {(u, <f>) v € V, <£ € V* s.t. <j>(v) ^ 0} be the collection of pairs
of baskets of goods and possible pricing systems.
Then let a = (m, ir) where a : [0,1] — > E and 7r : [0,1] — > V* gives
the prices of the goods and services under consideration. Conversely m :
[0,1] — y V denotes the (possibly) time dependent basket of goods and
services with which we are attempting to calibrate the economy.
Given any non-zero vector to in a vector space W , we will denote by [to]
the linear subspace spanned by to.
With this defined we argue that if we are given any element (o, <f>) € E
we get a decomposition of our vector space V as follows. In the first place
44
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
<f>determines a subspace Py] defined by Py] = {h € V s .t. <f>(h) = 0}. Our
bracket notation is justified by the fact that Py] is independent of whether
one chooses to work with <j>or r • <f> (r ^ 0). Now by our above definition of
E we are guaranteed that
Proof: The argument above goes over word for word in this case with V ’
and V interchanged. QED
The importance of the decompositions is that they determine projection
maps onto the various subspaces. Given vectors y € V, u> € V* and our
point (v, 4>) 6 E we get projection maps
n * i : v — 1►f a (U 1 7 )
nw : v — >M (1.118)
n % i: ^ t1-119)
defined as follows.
n w ( ! ,) = l w - v (1' 120)
8One might ask whether there exists a word corresponding to ‘barter’ for currencies
which view our basket rn(t) as worthless. Unfortunately the word exists; the Oxford English
Dictionary defines Floccinaucinihilipilification as “The action or habit of estimating as
worthless” so we rise to the challenge and suggest to the brave reader that the noun
describing the elements of is probably ‘Floccinaucinihilipilificators.’
45
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
IW y ) = y - n M(y) ( 1.121)
( 1.122)
46
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
ir o er(x) = x. The space of all sections is denoted by E). The space of
smooth (that is infinitely differentiable) sections is denoted by r°°(X, E).
It is important for our paper to know that given a curve a : [0,1] — ►X ,
then every vector bundle over our space X determines a vector bundle over
[0,1] as follows. If E is a vector bundle over X with projection map tt, define
the pull-back bundle c f{E ) to be the set of pairs a*(E) = {(t, u) s.t. t 6
[0,1 ], u € 7r- 1 (a(<))}. Thus if E = X x W n then o f(E ) is simply the trivial
bundle [0, 1 ] x W n. Likewise if we have a sub-bundle of X x W n given by a
map Z : X — ►Pm(W n) then we get a sub-bundle of [0,1] x W n given by the
map Z : [0, 1 ] — > Pm(W n)which sends t to the point Z (a {t)) € Pm(W n).
47
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
is that it is a total differential of ‘vector valued functions’ which are the
sections we have met above.
The simplest connection is just the familiar toted differential of ordinary reed
valued functions.
Vn
v s(z) = (x, £ £ € W ,. (1.126)
t'=i i= i axi
If we have a vector bundle S over a base space R 1 (or any connected subset
thereof) and a (possibly non-trivial) connection V on S, then we define the
48
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
parallel translations T (w p, V) of a vector tvp in the fiber above p to be
given by the section <r : R 1 — >■£ which satisfies
49
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 2
Welfare Im plications o f Divisia
Indices
2.1 Introduction
In recent months the U.S. Senate has been actively debating the Consumer
Price Index (CPI) and its relation to the “true” cost of living index. The
importance of this question for the U.S. economy is undeniable. According
to recent Senate hearings on the CPI, 30% of federal outlays and 45% of
federal revenues are indexed to the CPI, and a 1 % overstatement in the CPI
costs the U.S. government $280,000,000,000 over 7 years.
The current CPI issued by the U.S. Bureau of Labor Statistics is based
on a Laspeyres index Pl, = for a representative consumer whose
fixed basket is updated once every 10 years or so. It is widely accepted,
however, that this is not a cost of living index. There are several reasons for
this, both technical (eg. the way data on housing is handled by the BLS)
and theoretical. We will focus here on that part of the failure of the CPI to
evaluate the true cost of living adjustment that arises from the theoretical
issues such as substitution effects, the new goods bias, and changes in quality.
The goal of a ‘cost of living adjustment’ P#(t) is that a utility maximising
consumer purchasing a basket q(0 ) with expenditure E ( 0 ) at time 0 , should
1This chapter is co-authored with Eric Ross Weinstein in the MIT mathematics
department.
50
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
receive the same utility from an expenditure of
E (t) = Jfe(t)£(0) (2 . 1 )
at time t ?
The Konus index (often referred to as the ‘true cost of living index’ or
simply as ‘the economic index’), is believed by most economists to be a
complete theoretical (if incalculable) solution to this welfare problem. Poliak
(1983) defines the Konus as a ratio of expenditures in the following way:
p (pa p b . D\ __ E { P isiR) fn n \
PK(P ,P E (p„tS R) (2.2)
51
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
it suffers from none of the preference restrictions under which the Konus
labours.
It is clear that the Laspeyres index, currently being used for the CPI
is an overestimate of the Konus index, Ffr. What is not clear, however, is
by how much. The economists who have been asked by the government for
their estimations of the overstatement have delivered a range of suggested
adjustments to the CPI. This raises two important questions:
52
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
of living are at the heart of index number theory, the importance of this
question was clear. When viewed from a differential geometric standpoint,
the answer to the question also becomes apparent. In this paper we attempt
to translate these insights into a framework familiar to economists using
advanced calculus. The geometric details will appear elsewhere.
53
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
the same.
The point here is that A was hit with a price increase in the basket she
was already consuming, while B increased her consumption into a good whose
price had already undergone much of the increase.
A continuous C.O.L.A. based on the Divisia price index would protect the
rights of both consumers to keep their life styles intact in the face of a price
increase but it would compensate both consumers differently. This led us in
[Malaney 1 ] to make the point that the path dependence of the Divisia index
C.O.L.A represents something directly analogous to the path dependence of
currency exchange or stock portfolio composition; with a Divisia C.O.L.A.
your lifestyle is an investment. You can always retain your way of life but
you cannot necessarily return to it if you change your preferences.
rf pa pb _ m _
i{p , p , s, r ) - k{Pr - - R) '/n
The notation emphasizes that the index depends not only on the
two sets of prices, P a and P b, but also on an initial choice of
an indifference map or preference ordering, R and the choice of
a base indifference curve s from that map. One set of prices is
called “reference prices” and the other, “comparison prices’. If
the comparison prices are twice the reference prices, the index is
2; if they are one-half, the index is one-half.” -The Theory of the
Cost of Living Index Robert Poliak Pg. 94
54
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
It is plain from the above definition that (the cost of living index’ (otherwise
known as the Konus price index P k ) is only defined under the hypothesis
of unchanging ordinal preferences. The assertion commonly made is that if
a consumer with a given cardinal utility function U maintains such a static
indifference map R, then Pk = I (P a,P b,s ,R ) by its definition insures ‘con
stant utility’. We do not take issue with this statement in this paper except
to point out that this assertion appeals to an implicit condition on U which
we wish to make explicit.
Let us assume for the moment that we were interested in a consumer
whose preference map Rt was derived from a dynamic utility function U :
R-rim. x V — y R. It is quite possible that our consumer’s “efficiency as
a [cardinal] pleasure machine” might change even if her indifference map
Rt was constant. For example, consider a static cardinal utility function
Uq : V — y R with an indifference map Ro. Then
and
Ub(t, u) = (^Sin{2‘Kt) + l)U 0(v) (2.6)
are both time dependent utility functions with a common indifference map:
Ro-
Let us then assume that our utility maximising consumer possesses just
such a static indifference map R derived from the cardinal utility function
U(s, v) and at time t expends an adjusted income of I(t) = P # (t)/(0). In this
case, the assumption that her Konus compensation yields constant cardinal
utility uc.
U (t,q(t)) = uc Vf (2.7)
implies that
dU = d U d s , d U d q = Q (0 8)
dt ds dt dq dt
We know however from the definition of the Konus that ^ lies tangent to
the indifference curve while = V U is orthogonal to the tangent planes of
U’s level sets implying
S 5-*
55
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Therefore, for U (t, q(t)) to be constant under Konus compensation, we must
make the single assumption
= 0 whenever t= s. (2 . 1 0 )
os
Relative to the basket q(t), equation (2.10) refers to the constancy of
what what Sen calls the ‘change in man’s efficiency as a pleasure machine’
and what Balk refers to as the ‘cost of living effect of pure preference change’.
56
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
2.3.1 Changing Preferences and Psychological N eutral
ity
The Konus index in its standard form is of no use in evaluating welfare
considerations in the context of variable preference maps. However, if we
restrict our attention to those utility functions which:
then the Konus guarantees constant utility. This raises the question of
whether there is any way to guarantee constant cardinal utility when the
first assumption is lifted.
Happily this question can be answered in the affirmative; we will show
in this paper that the Divisia index Pd can be seen to be an extension of
the Konus index into the realm of changing preferences where only the above
“condition 2” is retained. To this end we make the following precise definition
based on equation (2 . 1 0 ).
dU ( s ,v ) t n ,o l 1 ^
-------|a=t,v=<j(t) = o ( 2 .1 1 )
for all t. Given a dynamic utility function U, we will refer to any U which
shares the same indifference curves as U while satisfying (2.11) as a psycho
logical neutralisation o fU .
57
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
With this in mind, we can introduce the function
58
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
T h eorem 2 0 Consider a representative consumer with budget constraint
/ ( 0 ) at time t = 0 and a differentiable time-dependent convex utility function
U : R x V — ►R . Let us further assume the consumer’s expenditure at time
t is given by an experimental price index PE(t) according to the formula
Let p(t) be a differentiable 1-parameter fam ily of price vectors with q(t) be
the basket o f goods which maximises U{t, •) subject to the income constraint
/(<). Then i f U i s any psychological neutralisation of U and Pk , Pd -> Q d
are the Konus price, Divisia price and Divisia quantity indices we have:
Proof: For part 1 , we first show that if the consumer’s PN utility U (t,q(t))
is constant for all time i then his Divisia quantity index Q d (1) is equal to
unity.
4SamueIson & Swamy 1974, Page 568 and Sen 1979, Page 404
59
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Now define /?* to be the time dependent space of barters
f t = {» 6 V s.t. pt • v = 0 } (2.16)
i.e. all baskets which our pricing system at time t evaluates as representing
a ‘fair trade’.
Then by the assumption that the basket q(t) represents the purchases of
a utility maximiser, we know that any instant of tim e t there exists a strictly
positive constant Xt such that (VC/),(t) = AtPq(t)
... . ^ (| , = „ * M | , = 0 ^ € ^ Q d = 1 (, 17)
To establish the converse, assume that the Divisia quantity index is equal
to unity for all t.
= * Pt( ^ ) = 0 (2.18)
$ • VU = 0 (2.20)
at
dU dq n
Tqir° ( 2 -2 1 )
By PN:
dU ds
= 0 at s = t, v = q(t) (2 .2 2 )
d s dt
d U d s Md U d q _ d U
• • ds dt + dq dt dt 1 ' }
The second point can be deduced from the above argument.
60
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
but by P.N., maximisation and line (2.17) we have
a , ( 2 _2 5 )
At dq dt dt
r* ^ vAs) • ^4^- /* 1 V U is ■) • —
exP( I n™ 7 \ ~ r xds) = exp( I -r—= i rds) (2.26)
■'O i=i 5Zy=i Pj(s ) ' 9i(s ) Hy=i P/(5) ’ ^j(5)
— d~ = |e * p ( j j % W ( V t /( 5, ■) • ds (2.28)
= u { t) ty u ( t, •) • ^ ) (2.30)
but since Pd(0) = Qo(0) = 1 we must have c = —ln( u(0) )yielding the
expression
Pd M dM = (2.33)
61
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
since q(t) is the basket of minimal cost on the indifference curve C by the
maximisation hypothesis.
QED.
62
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
C orollary 21 Given the same assumptions and notation as in Theorem 2:
Q ED .
63
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
first part of Corollary 3 shows that in this respect the Divisia index outper
forms the more brittle Konus. That is, it agrees with the Konus when both
are defined under constant preferences and utility but continues to guaran
tee constant utility in the absence of constant preferences. This suggests the
idea that the definition of the Konus fixed utility price index be extended
by defining it to equal Pd when Q d = 1 Vt. In the next section we back
up this assertion by showing that even more is possible. First we define four
variable-preference Konus indexes and then show through tin advanced cal
culus argument (translated from differential geometry) utilising a traditional
chaining approach, that they are all directly equal to the Divisia even under
changing utility.
64
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
2.4 T he K onus In dex under C hanging P ref
erences
As discussed before, the Konus Price Index
E (P a, s, R)
Pk = (2.39)
E (P b, s, R)
measures the change in the minimal cost of getting the consumer to the level
of utility ur associated with a reference basket. Depending on whether we
take the reference basket to be the base time basket or the current time
basket, therefore, we get different indexes. The Laspeyres Konus (P l k ) uses
the base time basket as the reference basket, whereas the Paasche Konus
( P p k ) uses the current time basket. In general these are not the same,
except in the case of homothetic preferences.
Let us define Q(Ua(q(b)),p(c)) € V to be the basket of minimal cost in
the pricing at time c with utility equivalent to the consumer’s basket at time
6 as measured by the consumer’s utility function at time a. With the above
notation, the traditional Laspeyres Konus and Paasche Konus price indices
are given by the formulas:
65
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
D efin ition 22 The dynamic utility Konus price indexes are defined by the
formulas:
P l k l (P u Po ; U o (q o )) = (2.42)
p (0) Q m q ( 0 ) ) , p( 0 ) )
K i) Q m q ( 0 ) ) , P ( l ) ) (2.43)
P l k p ( p i ,P o ] U i {<1o) ) —
p(0) Q(Vi(q(0)),p(0))
p(l) Q(U0(q(l)),p(l)) (2.44)
P p K l { P u P o \ =
p(0) Q(Uo(q(l)),p(0))
p(l) Q(tfi(<z(i)),p(i)) (2.45)
P p k p (P i ,P o ; U i (<Ii ) ) =
Pl k lgives the change in the cost of achieving the utility associated with
the base time basket under base -time preferences, whereas P l k p gives the
change in the cost of achieving the utility associated with the base time basket
under current preferences. Similarly, P p k l represents the change in the cost
of achieving the utility associated with the current basket under base time
preferences, while P p k p gives the change in the cost of achieving the utility
associated with the current basket under current preferences.
In this section we will prove the equality between the infinitely chained
versions of these indexes and the Divisia.
The infinitely chained Laspeyres-Konus-Laspeyres price Index over the interval [to,
66
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
For the sake of simplicity we shall assume that the intervals under consider
ation have been shifted and scaled to coincide with [0 , 1 ].
Our strategy to demonstrate the equivalence
^ L ( 0 , l ) = f t > ( 0, l )
will be to
“S n
a=0
p£kt = Ug> a=0
n e'n(P&?1) = .Urn c x p C £ f n ( P & f ))
a=0
(2.48)
67
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
where s represents the number of ‘links’ in our chain and a is a non-negative
integer less than s. These functions have been specifically constructed so
that their integrals
limO
J-fO 5
= *0. (2-53)'
V
Setting
r = - (2.55)
s
we get
lim t f KL(x0) = lim l^ P-LK-L^ 7 ^ 31, ~!? 1 + (2.56)
5—loo v ' r—(-0 r
= Um I r t P u c L ^ , ^ + r)) - ^ ) ) „ 57,
r—
f0 r ’
: lim -
r - ♦0
12. 5 8 )
68
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
_ dln(PLKL{to,h)) - (e) _Qv
|fo=ti=io (2.59)
1__________ ! _ ,M h ) . d Q { U M t o ) ) , p h)
j(<o) ■»(*)) *i lio=<i=*o
(2.60)
^ = ? (« .) (2.62)
p(h) • (2.63)
ut l
= xd U M ( U M t o ) ) , P u ) ) ) . a g ^ t e f a ) ) . ! * , ) (2 64)
dQ dti
= ^ U M iU M to )),^ ))) (2 65)
d ti
- A w y fc» ( 2 .6 6 )
t/tl
= 0 (2.67)
by the definition of Q(£/t0(9(*o))iPti) implying
- - 1 ( ^ • .(X .) + P (M . g g g ^ ^ a l ) ^ (2.68)
p(x0) • ?(®o) dti dti
p(xo) • g(sp)
(2.69)
p(xQ) • q(x0)
69
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
= Divisia Integrand (2.70)
Now, by the assumption that q(t) and p(t) are continuous on the closed
unit interval, we know that they must be uniformly continuous. This in
turn guarantees that the <f%KL converge uniformly to the Divisia integrand.
Finally this implies that the limit of the integrals is integral of the limit by
a theorem in advanced calculus. Thus, with the definitions and arguments
above we have:
dt) (2.75)
eXp(Jo p(t) • q(t)
rl JL TV . n, dpi
= exP(L E (2.76)
•'° i=i 2Jj=i Pj ■*7i Pi
QED
70
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
2.5 C onclusions
A comparatively small amount of work seems to have been done on the
analysis of the welfare implications of the “little understood” 5 Divisia index.
This is especially curious as in this paper we have seen that for an idealised
representative consumer with static or dynamic preferences, the Divisia price
index is peerless in its ability to maintain constant utility.
In retrospect this is perhaps not so surprising for two main reasons:
71
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
price index. The relationship between the two is also shown to provide a
means for evaluating estimates of the CPI, which is of particular relevance
given the current Congressional debate on this issue.
72
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Chapter 3
H ousehold M igration D ecisions
Under U ncertainty
3.1 Introduction
The problems involved with the control of rural migration into overtaxed
urban areas presents one of the great challenges in development economics.
Thirty years of intensive study by economists have spawned numerous pol
icy initiatives intended to combat the phenomenon. Nevertheless, despite
scattered success, many LDCs continue to report an increasing demographic
shift towards urban areas.
The pioneering work of Lewis (1954) and Todaro (1969) approached the
question of rural-to-urban labor migration in the face of high urban unem
ployment and a high urban to rural wage ratio as an individual decision
making problem. The basic behavioral equation1 developed by Todaro can
be expressed by
73
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
While this straightforward approach has enabled economists to under
stand some of the decision making involved in migration, it has been pointed
out that there are pervasive phenomena that are unexplained by this model.
One such issue is that of remittances. It has been found that 10%-30% of a
migrant’s income may be transferred in remittances (Lucas and Stark 1988).
There are various theories based on pure self interest or pure altruism to
explain these remittances. Lucas and Stark also discuss migration as a con
tractual risk sharing proposition: as there are basically non-correlated risks
involved in rural production and urban job search, the two parties can co-
insure, thereby diversifying the risk. The extent to which such co-operative
behavior is found however, suggests that it might be useful to think of the
decision making process not at the individual level, but instead at the house
hold level. As Lauby and Stark (1988) contend
“the decision to migrate and the choice of migration patterns are
often not made by the individual alone but rather are determined
by family resources and needs.”2
In fact there were already indications from the authors of the the original
individual migrant models that the ultimate goal should be to move the level
of analysis from laborer to household. According to Harris and Todaro:
“...this notion that migrants retain their ties to the rural sec
tor is quite common and manifested by the phenomenon of the
extended family system and the flow of remittances to rural rela
tives of large proportions of urban earnings. However, the reverse
flow, i.e. rural-urban monetary transfers is also quite common in
cases where the migrant is temporarily unemployed and therefore,
must be supported by rural relatives.”3
In this paper we will model rural-to-urban migration as a household de
cision. Once this shift is made, it is possible to view a family’s migration
pattern as a portfolio allocation problem. The low wage rural market, where
it has traditionally been assumed that employment is full, is a safe, low return
asset. The high wage urban market is associated with uncertain employment,
and represents a risky asset. The wealth to be ‘invested’ in these assets is
taken to be the number of working family members.
3Lauby J. and Stark O. 1988.
3Harris and Todaro 1970 pg. 127.
74
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
P o lic y C onsiderations
In the face of high urban unemployment there has been an ongoing discussion
of the best policies by which to control rural to urban migration. This dis
cussion has focused mainly on whether the migration incentive is best viewed
as a “push” from the low wage rural sector, or a “pull” from the high wage
urban sector. Throughout the development literature it is often stated as
self-evident that
75
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
during the very same period both risk and (especially) risk avoid
ance have assumed major significance in mainstream economics.
Yet the expected income hypothesis, even in its revised formu
lations, is devoid of any explicit decisional risk content (Todaro,
1976, 1980); the hypothesis does not incorporate a random vari
able (multiplicative or other), and the implied utility function is
linear.”6
It is only once one incorporates the effects of risk aversion into the fam
ily’s decision making that one realises that there may be an income effect
associated with the rise in rural incomes that may have adverse effects.
It is shown in this paper that if household utility can be characterised by
Decreasing Absolute Risk Averse functions, then it is possible for increases
in rural wages to in fact lead to ‘perverse migration’; i.e. depending on which
point of their utility function the family is at, an increase in rural wages may
increase the incentive to send workers into the city.
76
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
The following proposition is well known, and the notation and discussion
here is based closely on the treatment in Green, Whinston and Mas-Collel.
Proposition 24 Assume that u(x) is the Bernoulli utility function of an
investor displaying decreasing absolute risk aversion (DARA). Assume fur
ther that the investor is given the opportunity to invest amounts of wealth
w = Wi,W2 in shares of a safe asset returning r dollars for every r dollars
invested and a risky asset returning z dollars with cumulative distribution
function F. Then ifw \ < ty2, the optimal investments in the risky asset will
obey aj < a \ .
Proof:
We will show that if the utility function displays the property of Decreas
ing Absolute Risk Aversion, then at higher levels of wealth the individual
will invest more in the risky asset.
Consider the two levels of initial wealth w\ < iy2. Denote the increments
or decrements to wealth by z. Then the individual evaluates risk at W{ by
the Bernoulli utility functions U{(z) = u(iut- -f z).
Because u is posited to exhibit DARA we are assured that whenever
W\ < W2 , 1*1 (2 ) = u(wi + z) is a concave transformation of 1*2 (2 ) = **(102 + 2 ).
The expected utility maximising investor with wealth tu,- seeks to max
imise the expression
77
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
The reasoning is that integral
78
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
What we see here is that with DARA utility functions, and the fact that the
increase in income is coming precisely from an increase in the return to the
safe asset, income and substitution effects move in opposite directions.
In the case o f consumer goods it is assumed that even when these ef
fects conflict, it is highly unlikely that the income effect will outweigh the
substitution effect. This is generally true because in order for the income
effect of a price rise in any good to be large, the good must constitute a large
proportion of the consumption basket. While this is unlikely to be true for
any good in a consumption situation, it is almost certainly true in the case of
employment. As there are only two assets between which a consumer must
choose, it is very likely that income effects will be substantial. The omission
of any discussion of this when forming policy can therefore lead to counter
productive efforts.
79
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
3.3 A R isk S en sitive H ousehold M igration M odel.
Our aim in this section is to construct household migration models which
incorporate risk sensitivity into labour allocation decisions. While this seems
straightforward enough, a literature search indicates that such models are
either quite obscure or have not been introduced at all.7
There is much debate in the current development literature about the
appropriate methods o f modeling household decision making. The problem
is rich in intricacies which include principal-agent problems, gender specific
labour8, migrant network structures and various game theoretic considera
tions. Given the wide variety of phenomena which are being uncovered, it is
unlikely that a ‘universal’ model will be settled upon any time soon. For the
sake of simplicity we will thus work with a household whose decision making
is well approximated by the maximisation of a household utility function that
views the decisions as being made by a single entity. While such a choice
does not pretend to universality, the situation is known to arise from a variety
of situations (e.g. a benevolent dictatorship, a nuclear family whose inter
ests are similar enough to work well in aggregation or an individual decision
maker) and may lend itself to most easily to varied adaptation.
We define below the category of rural household models used in this paper.
where
80
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
2. The number of workers in the household is given by a positive integer
n € Z+.
isO jsO
Proof:
Let us assume that we have an ordered set of r identical though unrelated
binary events {X ,};-! (e.g. urban job searches) which all carry the probabil
ity ka of outcome A (urban employment) and 1 —tta of outcome B (urban
unemployment). Then the probability ir that only the first s events yield the
outcome A is given by
7T= 71^(1 V A y - . - (3.10)
81
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
If one wishes to instead calculate the probability fr that exactly s of the
possible outcomes result in A (without respect to which events return the
result) the probability is given by the discrete binomial distribution:
*= 1A)r“ = ( I) *s(1 ~ T
aY" <3-u >
m K -m
isO j= 0
=i=0z j=0
f ( ^7 )/ (\ nT
J
)J (7Tr )*(1- 7Tr r - ' K )J(1 -7TU)n ~ m ~J l/(iw r + jw u - (n-m)c)
QED
82
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
3.4 P erverse M igration In cen tives
We will show below that the migration model introduced in the previous
section may behave in a fashion exactly opposite to that expected from the
migration literature. In such situations, attempts to boost the attractiveness
of rural labour will serve to increase the appeal of migration to the urban
labour market.
While such phenomena can be shown to occur for families with any num
ber of workers n exceeding 2, the higher cases do not contribute any quali
tatively new phenomena; we will thus restrict our attentions to a 2 worker
household for ease of illustration.
In order to clarify our exposition, it will be helpful to define a condition
somewhat weaker than perverse migration which refers only to the effect
on incentive structure. The following definition is given for the two worker
household although the concept can be defined more generally:
D efin itio n 27 We will define a rural wage increase to have a perverse effect
on incentives (perverse utility for short) if the marginal increase of 17(1,1)
is greater than 1 7 (2 ,0 ). Thus perverse migration implies perverse utility.
D efin itio n 28 The functions representing CARA and CRRA equal to unity
are taken to be —exp(—x) and ln(x) respectively.
With these definitions set, let us look at the case of a two worker house
hold. In this particular case we have the explicit formulas for the expected
utility of the 3 possible labour allocations:
^ ( 2 , 0 ) = 2u(2wr) (3.17)
dwr
83
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
duWr(1, i) = (l _ Hu)j>(u;r) + +Wu) (3.i8)
dw r
dUWr(0,2)
= 0 (3.19)
dwr
relative to changes in the rural wage.
As can be seen from the formula in line (3.16), the strategy of sending both
workers to the urban labour market carries a significant risk of a complete
loss of income. It is thus important to make sure that whatever functional
form is posited for v{x) above U7r , the value for t'(O) must be low enough
to prevent ‘gambling’ with the (0,2) strategy in all but the most extreme
circumstances as the migrants under consideration are taken to belong to
risk averse subsistence level households.9
If we assume that the family was initially indifferent between having 1 or
2 workers in the rural market
UWr(2 ,0) —UWr(l, 1) = 0 (3.20)
then asking whether rural wage increases will lead to perverse migration is
equivalent to asking whether
d£/«,p(2r,0u) a t ^ r( l r, l u)
— dw--------- dw— = n-fn \ a tt \ •/• \ / tt \ • /
2u^2Wr>~ ~ n«M«v) - (n.Mt* +wu)
, \
(3-21)
is positive or negative.
In order for the increase in rural wages to cause an increase in the utility
of sending workers to the urban labour market,
2i/(2it>r) < (1 —n„)i>(u7r) + (IIuMuv + Wu) (3.22)
On the pages that follow it is shown that (for a two worker household)
any unbounded DARA utility function v which is a concave transformation
of the CRRA function ln (x) is capable of producing perverse migration at
every rural wage level. Intuitively this indicates that functions which are
‘intermediate’ between CARA and CRRA will exhibit perversity for a two
worker household.
9See for example Sen (1984) Pg.260
“In matters of ‘life and death’ such as these decisions involve, affecting one’s
entire economic existence, the assumption of expected utility maximization
is recognized to be quite restrictive.”
84
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
First, we establish that for any concave transformation of the CRRA
function /n(x), there are always urban employment probabilities leading to
perverse incentives (without restriction on the urban wage).
Proof:
By hypothesis we have
1 - n tt > c (3.26)
implying the inequality
(3.27)
xl>'(ln[wr))
1
2tlf>(ln(2wr) ) ~ — < (1 - nu)^'(/n(iur)) (3.29)
zwr Wr
85
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
implying
From this theorem we can see that while the standard CARA function
—exp(—x) is not everywhere a concave transformation of the CRRA ln(x),
it nevertheless leads to perverse incentives.
86
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
as /n (l) = 0. By the fact that if)(x) is an. increasing concave function above
x = ln( 1) = 0, we are assured that for wr large
if)'{ln{wr))
and there will thus exist probabilities 1 > IIU > 0 such that
0 ,(/n(2uv))
Ha r \\ < 1 (3.40)
if)\ln{wr))
While perverse incentives indicate that raises in the rural wage are ineffec
tive for combating rural to urban migration, they do not necessarily indicate
that effort is counterproductive to the point of inducing migration. The fol
lowing theorem shows that if the concave transformation if) is unbounded,
then for every choice of wr, it will always be possible to find an urban wage
u>„ which will lead to perverse migration for a marginal increase in the rural
wage.
T h eo rem 31 Let the notation be as in the above theorem with if) a concave
transformation and IIU a probability leading to perverse utility for wr G [a, 6].
If if) has
J im ^ O r ) = o° (3.41)
then
Vwr € [a, 6] 3u;tt > wr (3.42)
such that the family is indifferent between the pure rural strategy and the
mixed rural/urban strategy
with
dUWr( 2,0) duwr(l,l) (3.44)
dwr dw r
leading to perverse migration.
87
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Proof:
First, we recall that the expected utility for the ‘mixed strategy’ is given
by:
UWr( 1,1) = (1 - n„)i/(u;r) + (n tt)i/(uv + iuu). (3.45)
Now by hypothesis we know that
with ifi(x) unbounded from above. However, since ln (x) is also unbounded,
we see from (3.45) that
(3.47)
However, since the family is assumed to be risk averse we are assured that if
wu were set equal to wr we would have
C orollary 32 Let k > 1. Consider the DARA function v(x) whose coeffi
cient of absolute risk aversion for x > k is given by
v"(x) 1 1
- 4 o = ; < 1 + ]^ j> (3-50>
88
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Then if wr is any value greater than k , the choices
n._i_£feL (3.5i)
ln(2wr) K ’
, > ( / n(2 * r) ) - g g ^ n ( ( n K ) ) „
wu = exp(exp(---------------- ■ )) - wr (3.52)
/n(2wr )
lead to a 2 worker household being indifferent between the pure rural strategy
and a mixed urban/rural strategy with marginal increases in the rural wage
leading to perverse migration.
Proof: First, it can be readily checked that (3.50) is decreasing for x > 1
and can be solved by
i/(x) = ln(ln(x)) + c (3.53)
where for simplicity we take c = 0.
Next we see that if we take the difference in utilities between the two
strategies under consideration we have
89
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Relations to Income Stabilisation
While this paper has focused on raising a certain rural wage, the problem
of increasing effective rural incomes may take a slightly different form. We
discuss below a proposed method of relating the models of the current paper
with models which deal with uncertainty in the agricultural incomes.
The policies proposed for raising the attractiveness of rural labour may
attempt to do so by raising rural wages directly, or by attempting to reduce
the fluctuations associated with agricultural incomes. In the latter case it is
possible to model rural incomes as lotteries in which the certainty of rural
employment is preserved but the possible incomes fluctuate with agricultural
prices. The importance of this change in perspective is that it could be used
to incorporate the effects of agricultural price stabilisation policy into the
rural-to-urban migration decision.
In this situation we would imagine that we would have two cumulative
distribution functions with G i(x) representing the natural variation in niral
incomes and (?2 (x) giving the variation which would result from government
stabilisation efforts. If the government is successful in doing no more that sta
bilising rural incomes we could expect G i(x) to determine a mean preserving
spread of Gz(x). In either case, if we call Zq 2 the certainty equivalents
Z* = v ~ l ( J v ( x ) dGi(x)) (3.59)
of the two lotteries Gri(x), Crj(ar), we know that for a household governed by
a risk averse utility function u(x) that
Z \ < Zq (3.60)
90
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
3.5 Exam ples
Once one is made aware of the possibility of perverse migration, examples of
perverse models appear to be ubiquitous. It is thus not too difficult to find
examples of models where:
2. The values for probabilities, and wages axe not terribly different from
those found in actual labour markets.
While such models are not intended to mirror any particular labour mar
ket, they do indicate that such models exist without recourse to unrealistic
hypotheses.
Let us then look at examples in which we expect perverse migration. We
will illustrate the phenomena by graphing the utility differences of the two
main allocation strategies as a function of the rural wage.
It has been shown that any unbounded DARA utility function i/(x) which
is a concave transformation of the CRRA function ln(x) is capable of pro
ducing perverse migration. Here we look at a specific function and evaluate
ranges of rural and urban wages and urban probabilities that cause perversity.
Let us assume the household’s utility function is equal to
u(x) = l n ( l n (3.61)
91
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
into the realm where the RAF becomes negative, the wage increase is seen
to cause perverse migration as it becomes preferable for the family to send
another worker into the city.
We will look at two specific cases; in the first case we set the urban prob
ability equal to .5 and observe the range of rural wages wr which will cause
perverse migration with an urban wage of Rs. 200 (It will be noted in this
example that a rural wage increase would have to more than double the rural
earnings before it brought the migrant out of the range of perversity). In the
second case we look at a situation with an urban probability of employment
of .4 and urban wage rate set at Rs. 500.
We assume in these examples that the negative utility of a zero wage
is taken to be large enough that the family will consider it prohibitive to
gamble both workers in the city (as this strategy necessarily entails a high
risk of earning nothing).
92
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Case 1
We set:
H u = .5
wu = Rs. 200
Perverse migration is caused whenever rural wages wr rise or fall to any
wage within the range of Rs. 15 to Rs. 44. (See Graph 1)
Graph 1
Probability o f Urban Em ployment c .5
Urban Wage * R s.200
o.
o.
0.
20 100
93
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Case 2
We set:
Hu = .4
wu = its.500
Perverse migration is caused whenever rural wages tvr rise or fall to any
wage within the range of Rs. 20 to Rs. 56. (See Graph 2)
Graph 2
Probability of Urban Em ploym ent • .4
Urban Wage ■ Rs. 500
o.
o.
0.
0.
94
Reproduced with permission o f the copyright owner. Further reproduction prohibited without permission.
3.6 Conclusion
When developing migration control policy, understanding the decision mak
ing processes behind rural to urban migration is essential. It has in general
been assumed in the literature that there is a theoretical equivalence between
raising rural wages to dampen the “push” factors of the low wage rural sec
tor, and putting in place urban wage controls, to deal with the “pull” factor
of a high wage urban sector. It is shown here, however, that once attitudes
to risk are incorporated, it is is possible to find that raising rural wages may
in fact have a counterproductive effect.
The migration literature has, until very recently, been based closely on
the Todaro model. While this model captures the basic incentive structure
for migration, it makes two simplifying assumptions that are generally ac
knowledged to be unrealistic, i.e.
Once these assumptions axe lifted, it is found that one of the basic conclusions
of the Todaro model can no longer be relied upon.
Perverse incentives have been shown in this paper to afflict the category
of Decreasing Absolute Risk Averse utility functions. As this is precisely
the utility structure one expects to find in the general situation we axe left
with the question of how to evaluate the likely migration effects of a generic
rural wage support program. This paper would argue that without specific
information on the particulars of the potential migrants being targeted (e.g.
family sizes, objective functions, wage rates and probabilities) the usual con
clusion drawn in the literature is not assured.
95
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
3.7 A p p en d ix B: The R elationship B etw een
th e R ural H ousehold Labour A llocation
M od el and the Todaro M od el
The original Todaro equation serves something of a dual role in the devel
opment literature; while it is in some sense one of the simplest migration
models, it also functions as the progenitor for the more sophisticated models
which followed it. It may therefore be worthwhile to see how the data spec
ifying a rural labour allocation model (RHLMAM) compares with the data
specifying Todaro’s equation.
In Todaro’s model (3.1), the basic behavioral equation gives a single num
ber V^O) whose sign is the answer to the discrete optimisation problem; if
and only if the number is positive will the rational individual choose to mi
grate. In the case o f a household, the number of options is enlarged so that
there are decisions to evaluate. We thus posit that the natural ana
log of Todaro’s equation for a remitting household is given by the following
‘behavioral matrix’ M H.
D efin ition 33 The (n + 1 ) x (n + 1) skew-symmetric behavioral matrix M H
associated to a RHLM AM is given by the entries { M f j }",-=0 where Mff =
us {i) - v % )
The number of rural workers m which will optimise the expected utility of
the household H is then the number of the row whose entries are all non
negative. The greatest number row with this property will be called 0 ( H) .
If we denote the collection of household models by TL:
D efin ition 34 Let O : % — > TL be the integer valued function which gives
the number of workers who should be allocated to rural labour in order to
maximise the household’s expected utility.
P rop osition 35 Let us assume that the following hypotheses are made:
96
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
1. The household workforce consists of an individual: n = 1.
This shows that in a well defined sense the category of RHLMAMs can be con
sidered as an expansion of the seminal Todaro model generalised to include
risk preferences and household objective functions. The following corollary
is then immediate.
97
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Bibliography
[1] Afriat S., The Price Index, Cambridge: Cambridge University Press
[2] Allen, R.G.D., Index Numbers in Theory and Practice, London: Macmil
lan 1975
[3] Banerjee B. and Kanbur, S.M. “On the Specification and Estimation of
Macro Rural-Urban Migration functions: With an Application to Indian
Data” Oxford Bulletin of Economics and Statistics 43, 7-29, 1981
[5] Besse, A.L. Einstein Manifolds, Ergebnisse der Mathematik und ihrer
Grenzgebiete: 3. Folge, Band 10, Springer-Verlag, Berlin 1987.
[6] Bishop, R.L., and Goldberg, S.I. Tensor Analysis on Manifolds, New
York: Dover Publications 1980
98
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
[12] Dubrovin, B.A., Fomenko, A.T. and Novikov, S.P. M odem Geometry -
Methods and Applications. Part II: The Geometry and Topology of Man
ifolds., Graduate Texts in Mathematics, Vol.124, New York: Springer-
Verlag 1985
[13] Friedman, M.. and Savage, L.J. “The Utility Analysis of Choices Involv
ing Risk” Journal o f Political Economy 56, 279-304,1948
[18] Hulten, Charles R., “Divisia Index Numbers” Econometrica, 41, No. 6,
1973
[20] Jorgenson, D.W., and Z.Griliches, “Divisia Index Numbers and Produc
tivity Measurement,” Review of Income and Wealth, 17, 227-29, 1971
[21] Katz, E. and 0 . Stark, “ Migration and Risk Aversion in Less Developed
Countries” Journal of Labour Economics 4, 134-149, 1986.
99
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
[24] Lucas, R. and Stark, 0 . “Migration, Remittances and the Family” Eco
nomic Development and Cultural Change 93, 465-481, 1988
[28] Poliak, Robert A. “The Theory of the Cost of Living Index” Price Level
Measurement, Diewert, W.E., and Montmarquette, C., Eds., 1983
[33] Sen, A.K. “Employment, Institutions and Technology: Some Policy Is
sues” Reprinted in Resources Values and Development Harvard Univer
sity Press 1984
[35] Todaro, M. “Urban Job expansion, Induced Migration and Rising Un
employment in Less Developed Countries” American Economic Review
59, 138-148, 1969
100
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
[36] Weinstein, E., 1992. Extension of Self-Dual Yang-Mills Equations Across
the 8 ^ Dimension, PhD. Dissertation, Mathematics Department, Har
vard University.
101
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.