Solutions To Problems in Asset Pricing
Solutions To Problems in Asset Pricing
John H. Cochrane∗
Graduate School of Business
University of Chicago
1101 E. 58th St.
Chicago IL 60637
[email protected]
This is a very preliminary draft; it’s incomplete and I’m sure full of typos. Still, I welcome comments
on any problems you Þnd with these notes.
u0 (c1 ) = λ
λ
βu0 (c2 ) = .
R
Differentiating the Þrst order conditions,
dc1 c1 u00 (c1 ) dc1 dλ
γ = 0
=
c1 u (c1 ) c1 λ
dc2 00
c2 u (c2 ) dc2 dλ dR
γ = = −
c2 u0 (c2 ) c2 λ R
2. The expected return of the asset is the same as that of its mimicking portfolio, proj(R|m)
3.
(a) We know there are a, b, such that m = a + bRmv . Determine a,b, by pricing Rmv and the risk
free rate Rf
1
³ ´
1 = aE(Rmv ) + bE Rmv2
1 = aRf + bE(Rmv )Rf
1 E(Rmv ) − Rf
b = −
Rf var(Rmv )
1
a = − bE(Rmv ).
Rf
An easier way to do this is to parameterize the linear function by a mean and shock:
|ρ| = 1 : m = E(m) + a(Rmv − E(Rmv ))
E(m) = 1/Rf : m = 1/Rf + a(Rmv − E(Rmv ))
E(Rmv )
1 = E(mRmv ) : 1 = + aσ2 (Rmv )
Rf
E(Rmv ) − Rf
a = − f 2 mv
R σ (R )
1 E(Rmv ) − Rf mv
m= − (R − E(Rmv ))
Rf Rf σ2 (Rmv )
(b) We had
E(Ri ) = Rf + βi,m λm
We have
cov(Ri , a + bRmv ) = bcov(Ri , Rmv ).
5.
q
£ ¤
σ (ct+1 /ct )−γ = E (e−2γ∆ ln ct+1 ) − E (e−γ∆ ln ct+1 )2
q
2 σ 2 (∆ ln c 2 σ2 (∆ ln c
= e−2γE(∆ ln ct+1 )+2γ t+1 ) − e−2γE(∆ ln ct+1 )+γ t+1 )
q
1 2 2
= e−γE(∆ ln ct+1 )+ 2 γ σ (∆ ln ct+1 ) 2 σ 2 (∆ ln c
eγ t+1 ) −1
h i ³ ´ 1 2 σ 2 (∆ ln c
E (ct+1 /ct )−γ = E e−γ ln ∆ct+1 = e−γE(∆ ln ct+1 )+ 2 γ t+1 )
.
Dividing, we get the Þrst result. For the second result, use the approximation for small x that
ex ≈ 1 + x.
2
6. You wouldn’t put all your money in such an asset, but you might well put some of your money in
such an asset if it provides insurance — if its beta is low. (Graph!)
7.
(a) Rather obviously, use the equation at t and t + 1, i.e. start with
µ ¶
u0 (ct+2 ) 0
2 u (ct+3 )
pt+1 = Et+1 β d t+2 + β dt+3 + ...
u0 (ct+1 ) u0 (ct+1 )
The last term is not automatically zero. For example, if u0 (c) is a constant, then pt = β t or
greater growth will lead to such a term. It also has an interesting economic interpretation. Even
if there are no dividends, if the last term is present, it means the price today is driven entirely
by the expectation that someone else will pay a higher price tomorrow. People think they see
this behavior in “speculative bubbles” and some models of money work this way.
The absence of the last term is a Þrst order condition for optimization of an inÞnitely-lived
P 0
j u (ct+j )
consumer. If pt < (>) Et ∞ j=1 β u0 (ct ) dt+j , he can buy (sell) more of the asset, eat the
dividends as they come, and increase utility. This lowers ct , increases ct+j , until the condition
is Þlled.
If markets are complete — if he can also buy and sell claims to the individual dividends — then he
can do even more. For example, if pt >, then he can sell the asset, buy claims to each dividend,
pay the dividend stream of the asset with the claims, and make a sure, instant proÞt. He does
not have to wait forever. (Advocates of bubbles point out that you have to wait a long time to
eat the dividend stream, but they often forget the opportunities for immediate arbitrage that a
bubble can induce. The plausibility of bubbles relies on incomplete markets.)
Bubble type solutions show up often in models with overlapping generations, no bequest motive,
and incomplete markets. The OG gets rid of the individual Þrst order condition that removes
bubbles, and the incomplete markets gets rid of the arbitrage opportunity. The possibility of
bubbles Þgures in the evaluation of volatility tests.
8.
Λ = e−δt uc (c, l)
· ¸
−δt 1 1
dΛ = −δΛdt + e ucc dc + ucl dl + uccc dc2 + ucll dl2 + uccl dcdl
2 2
· ¸
dΛ ucc ucl 1 uccc 2 1 ucll 2 uccl
= −δdt + dc + dl + dc + dl + dcdl
Λ uc ucc 2 uc 2 uc uc
3
After multiplication by dP/P only the dc and dl terms will have anything left, so
µ ¶ µ ¶
dp D dp dΛ
Et + dt − rtf dt = Et
p p p Λ
µ ¶ µ ¶
ucc dp ucl dp
= Et dc + Et dl
uc p uc p
or,
ucc ucl
Et (Ri ) − Rf ≈covt (Ri , c) + covt (Ri , l)
uc uc
this is your Þrst view of a multifactor model, one with multiple betas or factors on the right hand
side. Of course, there is nothing deep about multiple factors — the same model is expressed with the
single Λ on the right hand side. But there may be more economic intuition in having the c and l
separately rather than combining the two into Λ.
9.
If you increase leverage α in R = (1 − α)Rf + αRm you increase mean and volatility. If R can get
anywhere near zero, ln R goes off to -∞. Thus, increasing α eventually leads to a decrease in E ln R.
For example, if returns are normal, then
1 2 (R)
E (R) = eE(ln R)+ 2 σ
1
ln E(R) = E(ln R) + σ2 (R)
2
1
E(ln R) = ln E(R) − σ2 (R)
2
h i 1
E(ln R) = ln αE(Rm ) + (1 − α)Rf − α2 σ2 (Rm ).
2
As α increases, the second term eventually dominates.
(a)
X µ ¶
ct+j −γ
pt = Et βj
ct+j
ct
pt X µ ct+j ¶1−γ
= Et βj .
ct ct
If γ = 1,
p 1
= β/(1 − β) =
c δ
where β = 1/(1 + δ).
4
(b) If γ < 1, then a rise in ct+j raises pt . If γ > 1, however, a rise in ct+j lowers pt . Any piece of
news has two possible effects: cashßows and discount rates. In this case the discount rate rises
faster than the payoffs, so the price actually declines.
2.
ct − c∗ = Et [Rβ(ct+1 − c∗ )]
Taking expectations,
∞
X ∞
X
kt + β j+1 Et et+j = β j+1 Et ct+j .
j=0 j=0
Intuitively, the present value of future consumption must equal wealth plus the present value of
future endowment (labor income).
The j + 1 comes from the timing, alas standard in the macro literature and national income
accounts . If you adopt the more common Þnance timing convention
kt+1 = (1 + r) (kt + it )
5
Consumption equals the annuity value of wealth (capital) rkt plus the present value of future
labor income (endowment). This is the permanent income hypothesis. It is not a “partial
equilibrium” result — it is a general equilibrium model with linear technology and an endowment
income process.
Now to the random walk in consumption. Just quasi-Þrst difference, and use kt+1 −kt = rkt +it ,
³ ´
ct = rkt + r βet + β 2 Et et+1 + β 3 Et et+2 + ...
³ ´
ct−1 = rkt−1 + r βet−1 + β 2 Et−1 et + β 3 Et−1 et+1 + ...
ct − ct−1 = r(kt − kt−1 ) + ...
ct − ct−1 = r(rkt−1 + et−1 − ct−1 ) + ...
h ³ ´i
ct − ct−1 = r rkt−1 + et−1 − rkt−1 − r βet−1 + β 2 Et−1 et + β 3 Et−1 et+1 + ... + ...
³ ´
ct − ct−1 = ret−1 + r βet + β 2 Et et+1 + β 3 Et et+2 + ...
³ ´³ ´
− r2 + r βet−1 + β 2 Et−1 et + β 3 Et−1 et+1 + ...
³ ´ ³ ´
ct − ct−1 = ret−1 + r βet + β 2 Et et+1 + β 3 Et et+2 + ... − r et−1 + βEt−1 et + β 2 Et−1 et+1 + ...
∞
X
ct = ct−1 + (Et − Et−1 ) rβ β j et+j .
j=0
Consumption is a random walk. Changes in consumption equal the innovation in the present
value of future income.
Bob Hall (1979) noticed the random walk nature of consumption in this model, and suggested
testing it by running regressions of ∆ct on any variable at time t−1. This paper was a watershed.
It is the Þrst “Euler equation” test of a model; note it does not require the full model solution
tying the shocks in ∆ct to fundamental taste and technology shocks — the second term in our
random walk equation. The Hansen-Singleton (1982) Euler equation tests generalize to non-
quadratic utility, random asset returns for which it is impossible to fully solve the model.
Technical details: I have assumed no free disposal - you follow the Þrst order conditions even if
past the bliss point. If you can freely dispose of consumption, then you will always end up at
the bliss point c∗ sooner or later. (Thanks to Ashley Wang for pointing this out. Hansen and
Sargent’s treatments of this problem deal with the bliss point issue.)
£ ¤
By the way, the algebra is much easier if you use lag operators, i.e. write ct = rkt +rβEt (1 − βL−1 )−1 et .
But if you know how to do that, you’ve probably seen this model before.
(b)
∞
X ∞
X rβ
ct = rkt + r β j+1 Et et+j = rkt + rβ β j ρj et = rkt + et .
j=0 j=0
1 − βρ
∞
X ∞
X rβ
ct = ct−1 + (Et − Et−1 ) rβ β j et+j = ct−1 + rβ β j ρj εt = ct−1 + εt .
j=0 j=0
1 − βρ
The top equation does look like a consumption function, but notice that the parameter relating
consumption c to income e depends on the persistence of income e. It is not a “psychological law”
or a constant of nature. If the government changes policy so that income is more unpredictable
(i.e. it gets rid of the predictable part of recessions), then this coefficient declines dramatically.
The income coefficient is not “policy-invariant.” This is the basis of Bob Lucas (1974) dramatic
6
deconstruction of Keynesian models based on consumption functions that were used for policy
experiments.
In both equations, you see that consumption responds to “permanent income” and that as
shocks get more “permanent” — as ρ rises — consumption moves more.
(c) R was the rate of return on technology. Despite the symbol, it is not (yet) the interest rate —
the equilibrium rate of return on one-period claims to consumption. That remains to be proved.
The logic is, Þrst Þnd c, then price things from the equilibrium consumption stream. To be
precise and pedantic, call the risk free rate Rf , and
µ ¶ µ ¶ µ ¶
1 u0 (ct+1 ) ct+1 − c∗ ct − c∗ 1
= Et β = βEt =β =β=
Rft u0 (ct ) ct − c∗ ct − c∗ R
Now, the fun stuff. We can approach the price of the consumption stream by brute force,
∞ ∞ ∞
X X c∗ − ct+j X c∗ ct+j − c2t+j
pt = Et mt,t+j ct+j = Et βj c t+j = Et β j
j=1 j=1
c∗ − ct j=1
c∗ − ct
³ ´
∞
X c∗ ct − Et c2t+j ∞
X c∗ ct − c2t − vart (ct+j )
= βj = βj
j=1
c∗ − ct j=1
c∗ − ct
rβ
ct+1 = ct + εt+1
1 − βρ
rβ
ct+2 = ct + (εt+1 + εt+2 )
1 − βρ
rβ
ct+j = ct + (εt+1 + .. + εt+j )
1 − βρ
³ ´2
rβ
∞
X ct (c∗ − ct ) − j 1−βρ σε2
pt = βj
j=1
c∗ − ct
³ ´2
rβ
∞
X j 1−βρ σε2
= β j ct −
j=1
c∗ − ct
³ ´2
rβ
∞
X ∞
X 1−βρ σε2
= β j ct − jβ j
j=1 j=1
c∗ − ct
∞
X β
jβ j =
j=1 (β − 1)2
7
³ ´2
rβ
β β 1−βρ σε2
pt = ct −
1−β (1 − β)2 c∗ − ct
³ ´2
rβ
1
1+r
1
1+r 1−βρ σε2
= 1 ct −³ ´2
1 − 1+r 1− 1 c∗ − ct
1+r
1 β 1
pt = ct − 2 ∗
σε2
r (1 − βρ) c − ct
Wow. The Þrst term is the risk-neutral price — the value of a perpetuity paying c. (Don’t forget
Et (ct+j ) = ct ) The second term is a risk correction. It lowers the price. If σε2 is high — more risk
—the price is lower. If ρ is high — more persistent consumption — the price is lower.
Now, the hard term — the effect of consumption. At the bliss point, the consumer is as happy
as can be, and marginal utility falls to zero. Hence, the consumer is inÞnitely risk averse.
(u00 (c)/u0 (c) rises to inÞnity). There is no consumption you can give him to compensate for risk,
since he’s at the bliss point. No surprise that the price goes off to −∞ here. As consumption
rises towards the bliss point, the consumer gets more and more risk averse (u00 is constant,
u0 is falling), so the price declines. Above the bliss point, the consumer values consumption
negatively, so the price is higher than the risk-neutral version.
This feature — that risk aversion rises as consumption rises — is obviously not a good one.
Quadratic utility is best used as a local approximation. Find a c∗ that gives a sensible risk
aversion, and then make sure the model doesn’t get too far away!
The question says price as a function of e and k. I’m curious how I ever got that, since it seems
a much more natural function of c. c is a function of e and k, of course, but substituting that
in does not seem very easy.
3. This is not only a historically important model, it introduces a very important method. Evaluating
inÞnite sums as in the last problem is a huge pain. In most models, conditioning information is a
function of only a few state variables, xt . Everything you could want to know about the current state
of the economy, and the conditional distribution of everything you could want to know in the future
is contained in the state variables. Hence, prices (at least properly scaled) have to be a function
of the state variables. Instead of solving for p in terms of a huge inÞnite sum, you can solve the
functional equation p(x) = Et [mt,t+1 (xt , xt+1 ) (p(xt+1 ) + dt+1 )]. Here we go...
pb = πx.
The riskfree rate is of course
Rf = 1/pb .
8
(b) The consumption stream: h i
pt = Et β∆c−γ
t+1 (pt+1 + ct+1 )
· µ ¶¸
pt pt+1
= βEt ∆c1−γ
t+1 +1
ct ct+1
Solve this as a functional equation, as explained above. Find p/c in the h state and in the l
state (functions from two points to the real line are easy to determine— you just Þnd the values
at the two points.)
µ ¶ µ ¶
p p p
(h) = βπh→h h1−γ (h) + 1 + βπh→l l1−γ (l) + 1
c c c
" # " # Ã" # " #!
p/c(h) πh→h h1−γ πh→l l1−γ 1 p/c(h)
=β +
p/c(l) πl→h h1−γ πl→l l1−γ 1 p/c(l)
pc = βπ∗ (1 + pc)
pc = (1 − βπ∗ )−1 βπ∗ 1
We can Þnd returns from pt+1
ct+1 + 1 ct+1
Rt+1 = pt .
ct ct
Note when p/c is constant, R is just a constant times consumption growth. You need a very
small p/c before R is much different from consumption growth.
Conditionally expected returns follow from the probabilities.
(c) Start with the calibration. It’s most natural to take the two points to be equally above and
below the mean, h = 1.01 + x, l = 1.01 − x and equal probabilities. Then, you want
γ=5
bond price 0.943 0.943
Rf 6.01 6.01
p/c 19.96 19.96
R h 7.11 7.11
l 5.01 5.01
9
The major failing is the equity premium. The mean stock return is almost exactly the same
as the riskfree rate. Also, stock returns are perfectly correlated with consumption growth.
The
h standardi deviation of stock returns is about 1%, not about 20%. The Sharpe ratio
f
E(R) − R /σ(R) is way too low.
(d) To get serial correlation in consumption growth, I tried π of the form
" #
1/2 + θ 1/2 − θ
π=
1/2 − θ 1/2 + θ
Now,
E(dct+1 |dct = h) = (1/2 + θ) ∗ (g + x) + (1/2 − θ) ∗ (g − x) = g + 2θx
E(dct+1 |dct = l) = (1/2 − θ) ∗ (g + x) + (1/2 + θ) ∗ (g − x) = g − 2θx
Here are my results for a positive serial correlation.
h l
γ = 5, θ = 0.1
pb 0.934 0.953
Rf 7.07 4.97
.
p/c 19.93 20.3
R h 7.12 5.05
l 6.99 4.92
ρ(∆ct , ∆ct−1 ) 0.21
The main reason I put this in at this stage is to get variation in prices with the initial state.
In the previous case, the world looks the same from any starting date, so there is no variation
in prices (ex-ante). The interest rate and stock return are higher from the high state, because
expected future consumption growth is higher. Higher return means lower price or p/c.
10
(a) R−1 is a discount factor. It is not necessarily in the payoff space, since that space is constructed
of linear combinations of the assets. x∗ is the unique discount factor in the payoff space, but
not the only discount factor. Often, R > 0, i.e. for limited liability securities like stocks. In this
case, R−1 is always positive, but so is R/E(R2 ). Securities do not have to be limited liability,
so in general R−1 can be negative. The biggest trouble with this discount factor is that it can
be inÞnite if R = 0 can happen, in which case the expectation may not be deÞned. (It may be
out of the set of random variables with second moments).
(b) The Þrst order conditions are µ ¶
1
E R = λ.
α0 R
Thus,
1
m=
λα0 R
is a discount factor. In general, you can’t solve the Þrst order conditions for α analytically.
Another more beautiful way to do this. We know that every payoff in X can be priced by a
discount factor m > 0 State the problem as
11
4 Problems for Chapter 5
1. You have to Þnd equations that express the right angles in the picture. Right angles means orthogonal
with second moment norm, so we want to prove that the line from any Re to its projection on Re∗
lies at right angles to Re∗ , E [(Re − proj(Re |Re∗ )) × Re∗ ] = 0. Working on the latter expression,
2. Start with a nonstochastic economy. In this case, x∗ is typically below the set of returns. x∗ is a
discount factor, so typically less than one. Returns are returns, hence typically greater than one.
Precisely, in a nonstochastic economy,
x∗ = 1/Rf
If Rf > 1, then x∗ < 1 < Rf . It’s possible that Rf < 1, if consumption is declining drastically, but
not typical.
Now, let’s do it in a stochastic economy. R∗ is the return parallel to x∗ ,
x∗ R∗
R∗ = ; x∗
= .
E(x∗2 ) E(R∗2 )
so we just have to Þgure out if x∗ is longer or shorter than R∗ . Now, from the deÞnitions,
1 1
|x∗ |2 = E(x∗2 ) = =
∗2
E(R ) |R∗ |2
|x∗ |2 |R∗ |2 = 1
Thus, |x∗ | < |R∗ | if |R∗ | > 1 or if |x∗ | < 1. This is very nice: In a nonstochastic economy x∗ Rf = 1;
in a stochastic economy this generalizes to |x∗ | |R∗ | = 1.
So is the second moment of the return with smallest second moment greater or less than one? As you
can see in the drawing below, this can happen if risk premia (slope of the mean-variance frontier) is
high, and if the riskfree rate is low, not much more than 1.0.
12
Mean
MVF
Rf
R*
Std. Dev.
There are lots of ways to continue from here, to see if typical numbers give one of these conditions.
Easiest, by just looking at the frontier, I am able to show that
Rf 2 1.01
E(R∗2 ) = ¯ ¯ = <1
¯ E(Re ) ¯2 1 + 0.25
¯ σ(Re ) ¯ + 1
(1% is about the average real interest rate and stocks have averaged roughly 9% mean and 16%
standard deviation.) This means, we should actually expect x∗ to lie somewhat above the return
line.
Derivation: h i2
E(R2 ) = E(R)2 + σ 2 (R) = Rf + E(Re ) + σ2 (Re )
where Re = R − Rf ;
The minimum second moment return occurs (minimize over E(Re ), holding Sharpe ratio constant)
occurs at
Rf
E(Re ) = − µ ¯ ¯ ¶
¯ σ(Re ) ¯2
1 + ¯ E(R e) ¯
13
1
E(R∗2 ) = Rf 2 1 − ³ ¯ ¯´
¯ σ(Re ) ¯ 2
1 + ¯ E(Re ) ¯
¯ ¯
¯ σ(Re ) ¯2
¯ E(Re ) ¯ Rf 2
= Rf 2 ¯ ¯ = ¯ ¯
¯ σ(Re ) ¯2 ¯ E(Re ) ¯2
1 + ¯ E(R e) ¯ ¯ σ(Re ) ¯ + 1
A little more formally, or using some of the tools and representations of the course, form x∗
x∗ = p0 E(xx0 )−1 x
To get some more intuition, price the risk free rate with x∗ ,
1
= E(x∗ 1) = E [proj(x∗ |1)1]
Rf
Again, in a nonstochastic economy this reduces to x∗ = 1/Rf and we learn that the projection of
x∗ on 1 should lie below the line of returns. But when there are large risk premia — high Sharpe ratios;
large distortions between actual and risk neutral probabilities; large distortions between contingent
claims prices and probabilities — the stochastic discount factor is very different from a constant, so
x∗ itself must lie above R as shown in the graph below. If you believe the size of the equity premium,
we live in an economy with severe distortions from risk neutrality, enough to get x∗ above the plane
of returns.
3. We already had from (5.20)
Rf = R∗ + Rf Re∗ .
Rearranging,
Re∗ = (Rf − R∗ )/Rf .
This is a useful formula to show that you can construct Re∗ from knowledge of R∗ and Rf .
14
x*
Rf
1
Proj(x*|1)
R
Figure 1:
4.
(a) R∗ and 1 are collinear, so Re∗ collapses to zero. The frontier collapses to a point, R∗ = Rf . In
mean-variance space, all returns have the same mean, so the frontier collapses to a line, with
R∗ = Rf at the leftmost point.
(b) The projection of 1 on Re is still zero, so R∗ is still the mean variance frontier. When the
payoffs are generated from x, Re∗ = E(x)0 E(xx0 )−1 x. But if consumers are risk neutral, p(x) =
E(mx) = E(x) for all assets, so E(x) = 0 for excess returns and Re∗ = 0.
5. No. This is subtle. R∗ = x∗ /p(x∗ ) = proj(m|X)/p [proj(m|X)]. R is a set, but not a space, since
it does not include zero, so you can’t project on it. There is a rather general point in here. For
example, you don’t form factor-mimicking portfolios by projecting on R, you project on X instead.
2. No. The β are different, so the λ must also be different. As a simple example, if the factor is 2 times
a return, then the factor mimicking portfolio is the return, so β is cut in half and λ doubles.
³ ´
3. m is in X, so it is x∗ . R∗ = x∗ /p(x∗ ) = (a − bRm ) / a/Rf − b .
4. They can stay the same. If you span the frontier by R∗ and Rf , as you increase or decrease weight
on R∗ , you change the amount of one particular linear combination of factor mimicking portfolios,
but not the relative weights of the factors.
15
var(R∗ ) − wE(R∗ )E(Re∗ )
y =
E(R∗ )E(Re∗ ) − wvar(Re∗ )
6. In a risk-neutral economy Re∗ = 0, so the whole mean-variance frontier, including the various risk-
free rate proxies, collapses to the minimum-variance point R∗ . R∗ is not a constant — no risk free
rate is traded. In a risk neutral economy, R∗ is the return closest to the unit vector.
E Info set 1
Info set 2
σ2
2. No. This is really the same question. Put another way, the unconditional mean-variance frontier will
not intersect the vertical axis. This happens all the time. The 3 month T bill rate is (nominally)
conditionally risk free. Yet a plot of the unconditional mean-variance frontier will not intersect the
vertical axis since the T bill rate varies over time. If you’re running E(R) = γ +βλ, the question is, is
there is risk free rate to identify γ, or must you use a zero-beta rate. Even if there is a conditionally
risk free rate, if it is not constant, the unconditional representation will need a zero-beta rate.
16
3. Neither, since you didn’t include the managed portfolios. Unless, of course, everything is i.i.d. so
there are no instruments.
1.
¡ ¢
max Eu(c) st. c = α0 R , α0 1 = W
∂ h i
0 i
: E u (c)R =λ
∂αi
u0 (ct+1 ) c∗ − ct+1 c∗ − RW
t+1 Wt
mt+1 = = =
λt λt λt
2. This is a case where Þnite-dimensional intuition can be misleading. If there were a Þnite-dimensional
state space, then
min (or max) p(x) = E(mx) s.t. m ≥ 0, p(f ) = E(mf)
{m}
would have Þnite upper and lower bounds. In an inÞnite-dimensional state space — the typical case
for factor pricing as applied to equities — it does not. Intuitively, there are no arbitrage portfolios of
stocks — portfolios that dominate in every state of nature. Thus, the absence of negative prices for
such portfolios doesn’t help us at all. (Problems like this generate arbitrage bounds in option pricing
problems. In option pricing, though, there are strictly dominating portfolios; a call option is better
in every state of nature than the portfolio that holds the stock and borrows the strike payment.)
3. No, it has to be the risk free rate, or a zero-beta rate. Here’s the potential confusion. With a risk-free
rate, the CAPM is h i
E(Ri ) = Rf + βi,m E(Rm ) − Rf
You can difference the left hand variables,
h i
E(Ri − Rj ) = βi−j,m E(Rm ) − Rf
Covariance is a linear operator, so βi,m − βj,m = βi−j,m . The question is whether we can do this on
the right hand side too. Can we write the CAPM in terms of an excess return on the right hand
side? Can we write
E(Rei ) = βi,Rm −Rj E(Rm − Rj )?
Once posed, you can see that the answer is no. Betas are not linear in the denominator. In discount
factor language,
m = a − bRm
can’t be written
m = a − b(Rm − Rj )!
In fact, the CAPM is frequently tested with the T-bill rate as a “proxy” for the risk free rate. Though
the riskiness of the T bill rate does not matter for the left hand side, it does for the right. In practice,
this proxy is probably not a big deal
17
8 Problems for Chapter 11
so
1
∂φ 1
= 2 2E(x)ρj
∂b σ (x)
−ρj
The S matrix is
∞
X xt xt−j xt−k xt−j−k xt xt−j xt−k xt xt−j x2t−k
Sj = E xt xt−k xt−k−j xt xt−k xt x2t−k
k=−∞ x2t xt−k xt−k−j x2t xt−k x2t x2t−k
The standard error is · ¸ · ¸
1 ∂φ 0 ∂φ
var(corrT ) = S
T ∂µ ∂µ
This is straightforward to calculate, but I can’t simplify it further.
If the series is i.i.d., then only the k = 0 terms survive,
x2t x2t−j x2t xt−j x3t xt−j E(x2t )2 0
¡ 2¢
0
¡ 3¢
2 2 3
Sj = E xt xt−j xt xt = 0 E ¡xt ¢ E ¡xt ¢ ,
x3t xt−j x3t x4t 0 E x3t E x4t
and ρj = 0 in ∂φ/∂µ. Thus,
0
1 E(x2 )2 0
¡ 2¢
0
¡ 3¢
1
2 1 1
σ (ρ̂j ) = 0 0 E x
¡ 3¢
E x
¡ 4¢ 0
T σ4 (x)
0 0 E x E x 0
³ ´2· ¸ £ ¤2
1 1 2 1 σ2 (x) + E(x)2
= E x =
T σ4 (x) T σ4 (x)
18
If, in addition, E(x) = 0, we obtain the
√ classic result: If the series is i.i.d. mean zero, the standard error
of the autocorrelation coefficient is 1/ T . Of course, you can calculate standard errors without mean zero,
and without i.i.d.!
You can do the correlation coefficient from p.207 the same way:
· ¸ · ¸
1 ∂φ 0 ∂φ
var(corrT ) = S
T ∂µ ∂µ
∞
X
S = cov(ut , u0t−j )
j=−∞
I must have seen a pretty way to simplify this when I wrote the problem, but I can’t seem to see one now.
Still, it’s easy enough to compute.
2, 3. The general formula:
X∞
1
var(β̂) = E(xx0 )−1 E(εt xt x0t−j εt−j )E(xt x0t )
T j=−∞
19
If you believe in homoskedasticity, then E(εt xt x0t−j εt−j ) = E(εt εt−j )E(xt x0t−j ) = σε2 ρj E(xt x0t−j ). Then,
X∞
1 2
var(β̂) = σε E(xx0 )−1 ρj E(xt x0t−j )E(xt x0t )
T j=−∞
In retrospect, 2 and 3 seem the same. I think I meant in 2 to correct for heteroskedasticity but not
autocorrelation, in which case the answer is
1
var(β̂) = E(xx0 )−1 E(ε2t xt x0t )E(xt x0t )
T
4. Under the null that the model is true, a good asset pricing model should have
1 = Et (mt+1 Rt+1 ) .
Thus, the errors, ut+1 = mt+1 Rt+1 − 1 should be unpredictable from anything zt at time t, including
zt = ut−1 . Thus, E(ut ut−j ) = 0. Yes, even if returns are predictable. The point of the asset pricing model
is that even if E(zt Rt+1 ) 6= 0, discounted returns should not be forecastable, so E [zt (mt+1 Rt+1 − 1)] = 0,
including zt = ut . It’s still true if the error is formed from an instrument or managed portfolio. If
ft+1 = zt (mt+1 Rt+1 − 1), the model predicts Et (ft+1 ) = 0, so E(ft ft+1 ) = 0.
where f˜ = f − E(f) and K is the number of factors. The estimates aT gT (b, E(f)) = 0 give the OLS
cross-sectional regression for b, and the sample mean for E(f).
£ ¤−1
b̂ = C 0 C C 0 ET (Re ) (1)
E(f) = ET (f).
20
where
C ≡ E(Re f˜0 )
denotes the covariance matrix of returns and factors. (It is best to reserve d for the d matrix, and
the two are no longer equal.)
To Þnd the standard errors, just plug in to the general GMM formulas. The general formula (??) is
à !
b̂ 1
cov = (ad)−1 aSa0 (ad)−10 .
Ê(f ) T
We can simplify S somewhat with the null hypothesis that pricing errors ut should not be forecastable
from t − j information,
P∞ ³ ´
E (ut u0t ) E ut f˜t+j
0
S = P∞ ³ ´ P j=0 ³ ´ .
E f˜t+j u0
j=0
∞
E f˜t f˜0
t j=−∞ t−j
However, the factors need not be unpredictable, and may comove with the pricing errors, so no
further simpliÞcation is possible in general. The other terms are
" #" # " #
−C 0 0 −C E(Re )b0 C 0 C −C 0 E(Re )b0
ad = =
0 −IK 0 −IK 0 IK
" #
−1 (C 0 C)−1 (C 0 C)−1 C 0 E(Re )b0
(ad) =
0 IK
" #
−1 (C 0 C)−1 C 0 (C 0 C)−1 C 0 E(Re )b0
(ad) a=− .
0 IK
Under the null, asymptotically, we will have E(Re ) = C 0 b so we can simplify the formula now with
that substitution. " #
−1 (C 0 C)−1 C 0 bb0
(ad) a = − .
0 IK
We’re interested in the top left and bottom right elements of
" #
1 S11 S12
(ad)−1 a a0 (ad)−10 .
T S21 S22
21
This equation reminds us a great deal of the correction for cross-sectional regressions of average
returns on betas. The Þrst term is the same standard error we derived ignoring sampling variation in
the sample mean, and looks like the usual formula for OLS regressions with standard errors corrected
for covariation. The remaining terms add the effects of the fact that the sample mean must be
estimated, as the extra terms in the Shanken formula correct for the fact that the betas had to be
estimated.
Next, we want to test the pricing errors. Use the genral formula,
h i ³ ´ ³ ´0
T cov gT (b̂) = I − d(ad)−1 a S I − d(ad)−1 a .
Under the null, E(Re ) − Cb = 0, so the top right term vanishes. Since the E(f) moment is zero in
every sample, the last K diagonal elements of cov(gT ) are zero. The top left part, in which we are
interested, gives us
1³ ´ ³ ´
cov(α̂) = I − C(C 0 C)−1 C 0 S11 I − C(C 0 C)−1 C 0
T
Thus, the pricing error test statistic is not affected by the fact that the factor mean E(f) is estimated.
This is natural, since the E(f) moments are set to zero in each sample.
2. If we really want to distinguish factor models based on factor risk premia λ, we can do it by using
single regression betas in the expected return - beta model. However, though the new λ are useful for
testing for the marginal importance of factors, they lose their interpretation as the expected returns
on factor mimicking portfolios.
s as the single regression coefficient of return i on a constant and factor j alone.
To see this, deÞne βij
s cov(r, fj )
βij = .
var(fj )
Picking up our proof that m = f 0 b is equivalent to a beta model, we can multiply and divide by the
0
diagonal of cov(f˜f˜ ) rather than that covariance matrix itself to express the model in terms of single
regression betas, ³ ´ h i
0
E Ri = α 1−cov(Ri , f˜ )b̃
· h i−1 h i ¸
0
=α 1−cov(R , f˜ )diag cov(f,
i ˜ f˜0 ) ˜ f˜0 ) b̃
diag cov(f,
h h 0
i i
˜f) b
=α 1 − βis0 diag cov(f,
22
E(Ri ) = α + β s0 λs .
Now the “factor risk premia” are deÞned as
h 0
i
˜f) b
λs ≡ −α diag cov(f,
Now λsj = 0 if and only if bj = 0, so a test on factor risk premia so deÞned is equivalent to a test
whether factor j is marginally important. Surprisingly, the λs s deÞned from single regression betas
deliver a multiple regression test for the importance of a factor given all the others!
However, single regression based λs do not have the interpretation as the price of the factors in the
interesting case that factors are correlated. A factor that is a return will have a single regression beta
of one on itself, but will also have nonzero single regression betas on the other (correlated) factors.
Therefore, the beta pricing model does not imply that the factor risk premium λs equals the expected
excess return of the factor.
Rei 0 0 i
t = αi + βi λ + β i [ft − E(ft )] + εt .
2. Instead of writing a regression, build up the ML for the CAPM a little more formally. Write the
statistical model as just the assumption that individual returns and the market return are jointly
normal, " # Ã" # " #!
Re E(Re ) Σ cov(Rem , Re0 )
∼N ,
Rem E(Rem ) cov(Rem , Re ) 2
σm
The model’s restriction is
E(Re ) = γcov(Rem , Re ).
Estimate γ and show that this is the same time-series estimator as we derived by presupposing a
regression.
Answer: The likelihood function is
T
Ã" # " #!0 " #−1 Ã" # " #!
1X Re E(Re ) Σ cov(Rem , Re0 ) Re E(Re )
L=− − − .
2 t=1 Rem E(Rem ) em e
cov(R , R ) 2
σm Rem E(Rem )
23
Imposing the model
E(Re ) = γcov(Rem , Re )
E(Rem ) = γσm
2
T
" #0 " #−1 Ã" # " #!
∂L X cov(Rem , Re ) Σ cov(Rem , Re0 ) Re cov(Rem , Re )
= 2 −γ = 0.
∂γ t=1
σm cov(Rem , Re ) 2
σm Rem σm2
To make it easy, use only one test asset (you can use the partitioned matrix inverse formulas to do
the same thing with many test assets)
" #
2
σm −cov(Rem , Re )
" #−1
Σ cov(Rem , Re ) −cov(Rem , Re ) Σ
=
cov(Rem , Re ) σm2 2 − cov(Rem , Re )2
Σσm
Then,
" #0 " #" #
cov(Rem , Re ) 2
σm −cov(Rem , Re ) ET (Re )
σm2 em e
−cov(R , R ) Σ ET (Rem )
γ = " #" #" #
cov(Rem , Re ) 2
σm −cov(Rem , Re ) cov(Rem , Re )
σm2 em e
−cov(R , R ) Σ σm2
ET (Rem ) = γσm
2
E(Re ) = γcov(Rem , Re )
24
12 Problems for Chapter 17
1. You might want to exercise American puts early.
2. Retrace the steps in the integral derivation of the Black-Scholes formula and show that the dw does
not affect the Þnal result.
A:
à !
σ2 √
ln ST = ln S0 + µ − T + σ Tε
2
à µ ¶2 !
1 µ−r 1 2 µ − r√ √
ln ΛT = ln Λ0 − r + + σw T− T ε − σw T δ
2 σ 2 σ
Z ∞ Z Z ∞ Z
ΛT (ε) ΛT (ε)
C0 = ST (ε) df (ε) df(δ) − X df (ε) df (δ).
ST =X Λt ST =X Λt
Z Z ³ ´ √ √
∞ 2 √
− r+ 12 ( µ−r
σ )
2 T − µ−r
+ 12 σw T ε−σw T δ 1 2
C0 = e σ
S0 e(µ− 2 σ )T +σ Tε
f (δ) dδf(ε)dε
ST =X
Z ³ ´ √ √
∞ 2
− r+ 12 ( µ−r
σ )
2 T − µ−r
+ 12 σw σ
T ε−σw T δ
−X e f(δ)dδf (ε) dε
ST =X
Z ∞ ·Z √ ¸ h ³
2
´i √
− 12 σw
2 T −σ Tδ µ−r− 12 σ 2 +( µ−r
σ )
T +(σ− µ−r
σ )
Tε
= S0 e w
f (δ)dδ e f (ε)dε
ST =X
Z ·Z ¸ ³ ´ √
∞ √ 2
− 12 σw
2 T −σ − r+ 12 ( µ−r
σ )
T − µ−r Tε
w Tδ σ
−X e f (δ)dδ e f (ε)
ST =X
25
13 Problems for chapter 18
Then, take E(·|It ) − E(·|It−1 ) of both sides. As long as pt , dt and rt are in It , we can conclude
∞
X
0 = [E(·|It ) − E(·|It−1 )] ρj (∆dt+j − rt+j )
j=0
Hence,
(a) The equation can work for information sets that are coarser than agent’s, including the infor-
mation set of a VAR. However,
(b) The results will depend on the information set you choose. Adding more variables to a VAR
can change the fraction of return variance attributed to the different components.
2. (Fama and Bliss 1987) See the term structure chapter for notation.
The basic idea, is that there is a mechanical connection between the Þrst period holding period return
and the second period yield. The following picture considers what happens if the one year yield is
5% and the two year yield is 10%. As you can see, the expectations hypothesis — expected returns
the same for the Þrst year — mean that the one period yield must rise to 15% the next year. You can
also see that as we change the Þrst year holding period return, we mechanically change the second
year return (one year yield in second year). Move point “A” up and down.
26
Time
0 1 2
-0.05
lnP
-0.10
-0.15 A
-0.20
This holds for expected values as well as ex post values, or values forecast by a variables such as
(1) (1)
the forward - spot spread — the more goes to a change in yield yt+1 − yt the less goes to a holding
(2)
period return hprt+1 .
Now, to express the same idea formally.
(2) (1) (2)
pt = −yt+1 − hprt→t+1
that’s cool in its own right. It says that the bond price is the discounted value of 1 dollar (ln(1) = 0),
discounted by the bond’s returns. This should remind you of the Campbell Shiller identity that you
can discount a stock price by its ex-post returns. You can see that given prices at t, the change in
(1) (2)
yield yt+1 and the holding period return hprt→t+1 are mechanically linked. We want to say something
about forward rates, so let’s get there
(1) (1)
pt = −yt
(1→2) (1) (2) (1) (2)
ft = pt − pt = −yt − pt
(1→2) (1) (1) (2)
ft = −yt + yt+1 + hprt→t+1
³ ´ ³ ´
(1→2) (1) (1) (1) (2) (1)
ft − yt = yt+1 − yt + hprt→t+1 − yt .
Aha! The left hand variable is the forward-spot spread in the Fama Bliss regression. The right hand
term is the change in one year yield, and the holding period return on two year bonds. Now run a
27
regression of both sides on the forward-spot spread. The left hand side is 1 — forward spot on forward
spot. The right hand side gives the coefficient b1 in
³ ´ ³ ´
(1) (1) (1→2) (1)
yt+1 − yt = a + b1 ft − yt + εt+1
Thus,
1 = b1 + b2 .
See Fama and Bliss (1987) for the general case — longer maturities.
∞
X
rt − Et−1 (rt ) = ∆dt − Et−1 (∆dt ) + ρ (Et − Et−1 ) ρj−1 (∆dt+j − rt+j )
j=1
∞
X
j
∆dt − Et−1 (∆dt ) + (Et − Et−1 ) ρ (∆dt+j − rt+j )
j=1
3. Conceptually, what we’re doing is simple. Imagine simulating out a huge number of data points from
the VAR. Then, take only the return data, ignoring data on other variables. Run a regression of
returns on lagged returns. We’re looking for that regression. That’s straightforward to do numerically
for a speciÞc example. The algebra is hard, because we’re analytically deriving the result of this
operation. You just have to go through and do for prices what we did for returns. The case is
simple enough that you can follow the same procedure as in “How to Þnd the univariate return
representation” on p.418. (You don’t have to follow the general procedure, constructing the spectral
density and factoring it.) The equation for prices is, from (20.19)
µ ¶
1
∆pt+1 = (1 − b)(dt − pt ) + εdt+1 − δt+1
1 − ρb
Thus, the system — analogous to the equations in the middle of p.419 — is
(Yes, that’s a typo on p.419 just above “Then, write returns.” It should be pt+1 not pt .) As for
returns, Þnd an expression with just price growth and shocks,
(1 − bL)(dt − pt ) = εdpt
1−b
∆pt+1 = εdpt + (εdt+1 − εdpt+1 )
1 − bL
(1 − bL) ∆pt+1 = (1 − b) εdpt + (1 − bL) (εdt+1 − εdpt+1 )
(1 − bL) ∆pt+1 = (εdt+1 − εdpt+1 ) + (εdpt − bεdt )
28
Again, the form suggests an ARMA(1,1)
1 − γL
∆pt = vt
1 − bL
Let’s try it. As in the book, deÞne yt = (1 − bL)∆pt so
h i
E yt2 = σd2 + σdp
2 2
− 2σd,dp + σdp + b2 σd2 − 2bσd,dp = (1 + b2 )σd2 + 2σdp
2
− 2(1 + b)σd,dp
2
E(yt yt−1 ) = −σdp − bσd2 − (1 + b)σd,dp
E(y 2 ) = (1 + γ 2 )σv2
E(yt yt−1 ) = −γσv2
Thus,
1 + γ2 (1 + b2 )σd2 + 2σdp
2 − 2(1 + b)σ
d,dp
= 2 2 = 2q
γ σdp + bσd + (1 + b)σd,dp
q
γ = q− q2 − 1
(Looks like we have another typo on the top of p.420, the Þnal - in the denominator should be +).
Now, look through the cases on p.,415:
1 + γ2 (1 + b2 )σd2
=
γ bσd2
γ = b
Again, the two roots cancel so ∆pt = vt . Returns are i.i.d, dividend growth is i.i.d., so price
growth is also i.i.d.
(b) Constant dividend growth σd = 0
2
1 + γ2 2σdp
= 2 =2
γ σdp
γ=1
1−L
∆pt = vt
1 − bL
1
pt = vt
1 − bL
If there is no dividend growth shock, then prices become stationary. This isn’t as surprising as
it seems initially. Bond prices are stationary, because the only reason a bond price changes is
that interest rates change. If we turn off dividend growth uncertainty, stocks become like bonds.
You could see this in the VAR too. More subtly, this shows that when we turn off one source
of noise, the univariate price shocks now reveal the underlying expected return shocks.
29
(c) Dividend growth uncorrelated with expected return shocks σd,dp = 0
1 + γ2 (1 + b2 )σd2 + 2σdp
2
1 + b2 bσd2 1+1 2
σdp
= 2 + bσ 2 = 2 + bσ 2 + 2 + bσ 2
γ σdp d b σdp d 1 σdp d
Thus, following the logic on p.415, we see that γ is between b and 1 rather than between b and
ρ. Using the numbers on the top of p.405,
That’s a little higher than the 0.928 that the book reports for returns.
4. Here we go again.
30
µ ¶
1
b− σxr = aσx2
b
Substituting σxr = ρσx σr ,
ab σx
ρ =
(b2
− 1) σr
ab σx
= −
(1 + b)(1 − b) σr
As you see, you cannot get a zero correlation to do it. You need a negative correlation between
expected return and actual return shocks to generate uncorrelated returns.
5. First, the long run variance of a stationary series must be zero. The deÞnition of covariance stationary
is that variances exist and variances and covariances are not functions of time. If a series is stationary,
variances exist, so
1 1
var(xt+k − xt ) = (var(xt+k ) + var(xt ) − 2cov(xt , xt+k ))
k k
The covariance is bounded by the variance, so the whole thing goes to zero as k → ∞. If x has a
unit root, var(x) does not exist (inÞnite) so you can’t do the Þrst step of this.
Now, apply this logic to the stationary series xt − yt . and the same for y.
1
var [xt+k − yt+k − (xt − yt )]
k
1
= var [(xt+k − xt ) − (yt+k − yt ))]
k
1
= (var (xt+k − xt ) + var (yt+k − yt ) + 2cov [(xt+k − xt ) , (yt+k − yt )])
k
Since each of xt and yt have unit roots, we know
1
lim var(xt+k − xt ) = vx > 0.
k→∞ k
Thus, the only way for the above expression to go to zero is if the series are perfectly correlated in
the long run — if vx = vy and cov [(xt+k − xt ) , (yt+k − yt )] = σ (xt+k − xt ) σ (yt+k − yt )
6.
Then
31
and the long horizon regression is
(yt+1 + yt+2 + yt+3 + ...yt+N ) = (A+A2 +..+AN )yt +(εt+N +(I+A)εt+N−1 +...+(I+A+..+AN−1 )εt+1 )
Thus, since
N
X b − bN + 1
bj =
j=1
1−b
b − bN+1
rt→t+N = (1 − ρb) dpt + er,t+k
1−b
As you can see, these rise close to linearly at Þrst, but eventually approach a limit
b
(1 − ρb) .
1−b
To do the R2 we need to evaluate the error covariance matrix, the bottom right element of
I didn’t get anywhere with analytical manipulation of this. To see the R2 rise with horizon, you have
to compute σ2 (xβ)/σ2 (ε) numerically using this formula. (See Hodrick 1992 for lots of calculations
like this.)
1. Suppose habit accumulation is linear, and there is a constant riskfree rate or linear technology equal
to the discount rate, Rf = 1/δ. The consumer’s problem is then
∞ ∞
X (Ct − Xt )1−γ X X X
max δt s.t. δ t Ct = δ t et + W0 ; Xt = θ φj Ct−j
t=0
1−γ t t j=1
where et is a stochastic endowment. In an internal habit speciÞcation, the consumer considers all the
effects that current consumption has on future utility through Xt+j . In an external habit speciÞca-
tion, the consumer ignores such terms. Show that the two speciÞcations give identical asset pricing
32
predictions in this simple model, by showing that internal-habit marginal utility is proportional to
external-habit marginal utility, state by state.
A: The Þrst order conditions are
MUt = Et [MUt+1 ]
where MU denotes marginal utility. In the external case, marginal utility is simply
The sum measures the habit-forming effect of consumption. Now, guess the same solution as for the
external case, h i
(Ct − Xt )−γ = Et (Ct+1 − Xt+1 )−γ . (4)
and plug in to (3). We Þnd that the internal marginal utility is simply proportional to marginal
utility (2) in the external case,
µ ¶
θδφ
MUt = 1 − (Ct − Xt )−γ . (5)
1 − δφ
Since this expression satisÞes the Þrst order condition MUt = Et MUt+1 , we conÞrm the guess (4).
Ratios of marginal utility are the same, so allocations and asset prices are completely unaffected by
internal vs. external habit in this example.
2. Many models predict too much variation in the conditional mean discount factor, or too much
interest rate variation. This problem guides you through a simple example. Introduce a simple form
of external habit formation,
u = (Ct − θCt−1 )1−γ
and suppose consumption growth Ct+1 /Ct is i.i.d. Show that there interest rates still vary despite
i.i.d. consumption growth.
A:
(Ct+1 − θCt )−γ
mt+1 =
(Ct − θCt−1 )−γ
µ ¶
(Ct+1 /Ct − θ)−γ Ct −γ
mt+1 =
(Ct /Ct−1 − θ)−γ Ct−1
µ ¶−γ
£ −γ ¤ Ct
Et (mt+1 ) = E (Ct+1 /Ct − θ)
Ct − θCt−1
"µ ¶−γ # µ ¶γ
Ct+1 Ct−1
= E −θ 1−θ
Ct Ct
The Þrst term is constant, but the second varies as consumption varies.
33
17 Problems for the Appendix
1. Find the diffusion followed by the log price,
y = ln(p).
A: Applying Ito’s lemma,
µ ¶
1 1 dp2 1
dy = dp − 2
= µ − σ 2 dt + σdz.
p 2 p 2
This is not
dp
.
d(ln(p)) =
p
You have to include the second order terms. It matters whether you specify
dp
= µdt + σdz
p
or
d ln p = µdt + σdz.
The two µ terms are not the same; you have to add or subtract 1/2σ 2 to go from one to the other.
2. Find the diffusion followed by xy.
A: Usually, we write
d(xy) = xdy + ydx
But this expression comes from the usual Þrst order expansions. When x and y are diffusions, we have
to keep second order terms. Write f(x, y) = xy. ∂f/∂x = y, ∂f/∂y = x, ∂ 2 f /dy 2 = 0, ∂ 2 f/∂x2 = 0,
∂ 2 f/∂y∂x = 1, so
d(xy) = xdy + ydx + dydx.
We used this fact in expanding d(Λp).
3. Suppose y = f (x, t) Find the diffusion representaiton for y. (Follow the obvious multivariate extension
of Ito’s lemma.)
A: Recognizing ahead of time that terms dt2 and dtdz will drop,
∂f ∂f 1 ∂2f 2
dy = dt + dx + dx
∂t ∂x 2 ∂x2
à !
∂f ∂f 1 ∂ 2f 2 ∂f
dy = + µx + σ dt + σx dz
∂t ∂x 2 ∂x2 x ∂x
4. Suppose y = f(x, w), with both x, w diffusions. Find the diffusion representation for y. Denote the
correlation between dzx and dzw by ρ.
A: First do a second order expansion,
à !
∂f ∂f 1 ∂ 2f 2 ∂2f 2 ∂2f
dy = dx + dw + dx + dw + 2 dxdw
∂x ∂w 2 ∂x2 ∂w2 ∂x∂w
Then, get rid of terms dt2 and dzdt, and organize the result into dt and dz terms
à à !!
∂f ∂f 1 ∂ 2f 2 ∂2f 2 ∂2f
dy = µx + µw + σ + σ + 2 σx σw ρ dt
∂x ∂w 2 ∂x2 x ∂w2 w ∂x∂w
∂f ∂f
+ σx dzx + σw dzw
∂x ∂w
34