Forwards, Swaps, Futures and Options
Forwards, Swaps, Futures and Options
1 Forwards
Definition 1 A forward contract on a security (or commodity) is a contract agreed upon at date t = 0 to
purchase or sell the security at date T for a price, F , that is specified at t = 0.
When the forward contract is established at date t = 0, the forward price, F , is set in such a way that the initial
value of the forward contract, f0 , satisfies f0 = 0. At the maturity date, T , the value of the contract is given2
by fT = ±(ST − F ) where ST is the time T value of the underlying security (or commodity). It is very
important to realize that there are two “prices” or “values” associated with a forward contract at time t: ft
and F . When we use the term “contract value” or “forward value” we will always be referring to ft , whereas
when we use the term “contract price” or “forward price” we will always be referring to F . That said, there
should never be any ambiguity since ft is fixed (equal to zero) at t = 0, and F is fixed for all t > 0 so the
particular quantity in question should be clear from the context. Note that ft need not be (and generally is not)
equal to zero for t > 0.
Examples of forward contracts include:
• A forward contract for delivery (i.e. purchase) of a non-dividend paying stock with maturity 6 months.
• A forward contract for delivery of a 9-month T-Bill with maturity 3 months. (This means that upon
delivery, the T-Bill has 9 months to maturity.)
F = S/d(0, T ) (1)
where S is the current spot price of the security and d(0, T ) is the discount factor applying to the interval [0, T ].
Proof: The proof works by constructing an arbitrage portfolio if F 6= S/d(0, T ).
Case (i): F < S/d(0, T ): Consider the portfolio that at date t = 0 is short one unit of the security, lends S
until date T , and is long one forward contract. The initial cost of this portfolio is 0 and it has a positive payoff,
S/d(0, T ) − F , at date T . Hence it is an arbitrage.
Case (ii): F > S/d(0, T ): In this case, construct the reverse portfolio and again obtain an arbitrage
opportunity.
Forward Price for a Security with Non-Zero Storage Costs: Suppose a security can be stored for
period j at a cost of c(j), payable at the beginning of the period. Assuming that the security may also be sold
short, then the forward price, F , for delivery of that security at date T (assumed to be M periods away) is given
by
M −1
S X c(j)
F = + (2)
d(0, M ) j=0
d(j, M)
where S is the current spot price of the security and d(j, M ) is the discount factor between dates j and M .
Proof: As before, we could prove (2) using an arbitrage argument. An alternative proof is to consider the
strategy of buying one unit of the security on the spot market at t = 0, and simultaneously entering a forward
contract to deliver it at time T . The cash-flow associated with this strategy is
market. Eventually the security is repurchased and returned to the original lender. Note that a profit (loss) is made if the
security price fell (rose) in value between the times it was sold and purchased in the market.
Forwards, Swaps, Futures and Options 3
and its present value must (why?) be equal to zero. Since the cash-flow is deterministic we know how to
compute its present value and we easily obtain (2).
Exercise 1 Convince4 yourself that we can indeed only conclude that (4) is true if short-selling is not
permitted.
4 See Luenberger, Chapter 10, for a discussion of tight markets.
Forwards, Swaps, Futures and Options 4
In such circumstances, we say that the market is tight. An artifice that is often used to restore equality in (4) is
that of the convenience yield. The convenience yield, y, is defined in such a way that the following equation is
satisfied.
M −1
S X c(j) − y
F = + . (5)
d(0, M ) j=0
d(j, M )
The convenience yield may be thought of as a negative holding cost that measures the convenience per period
of having the commodity on hand.
2 Swaps
Another important class of derivative security are swaps, perhaps the most common of which are interest rate
swaps and currency swaps. Other types of swaps include equity and commodity swaps. A plain vanilla swap
usually involves one party swapping a series of fixed level payments for a series of variable payments.
Swaps were introduced primarily for their use in risk-management. For example, it is often the case that a party
faces a stream of obligations that are floating or stochastic, but that it will have to meet these obligations with
a stream of fixed payments. Because of this mismatch between floating and fixed, there is no guarantee that the
party will be able to meet its obligations. However, if the present value of the fixed stream is greater than or
equal to the present value of the floating stream, then it could purchase an appropriate swap and thereby ensure
than it can meet its obligations.
C = P × (0, r0 − rf , ... , rM −1 − rf )
| {z } | {z }
st th
At end of 1 period At end of M period
where rf is the constant fixed rate and ri is the floating rate that prevailed at the beginning of period i. In
general, ri will be stochastic and so the swap’s cash-flow, C, will also be stochastic. As is the case with forward
contracts, the value X (equivalently rf ) is usually chosen in such a way that the initial value of the swap is zero.
Even though the initial value of the swap is zero, we say that party A is “long” the swap and party B is “short”
the swap.
Exercise 2 Make sure you understand how to use the terms “long” and “short” when referring to a swap.
the swap while a German company might receive the dollar payments. Note that the value of the swap to each
party will vary as the USD/Euro exchange rate varies. As a result, the companies are exposed to foreign
exchange risk but if necessary this risk can be hedged by trading in the forward foreign exchange market.
Why might the US and German companies enter such a transaction? A possible explanation might be that the
US company wishes to invest in the Eurozone while the German country wishes to invest in the U.S. Each
company therefore needs foreign currency. However, they may have a comparative advantage borrowing in their
domestic currency at home as opposed to borrowing in a foreign currency abroad. If this is the case, it makes
sense to borrow domestic currency at home and use a swap to convert it into the foreign currency.
C = N × (0, S1 − X, S2 − X, . . . , SM − X) .
Note that this cash-flow is stochastic and so we cannot compute its present value directly by discounting.
However, we can decompose C into a stream of fixed payments (of −N X) that we can easily price, and a
stochastic stream, N (0, S1 , S2 , . . . , SM ). The stochastic stream is easily seen to be equivalent to a stream
of forward contacts on N units of the commodity. We then see that receiving N Si at period i has the same
value of receiving N Fi at period i where Fi is the date 0 forward price for delivery of one unit of the commodity
at date i. As the forward prices, Fi , are deterministic and known at date 0, we can see that the value of the
commodity swap is given by
XM
V = N d(0, i)(Fi − X).
i=1
C = P (0, r0 − rf , r1 − rf , . . . , rM −1 − rf ).
where ri is the short rate for the period beginning at date i. Again this cash flow can be decomposed into a
series of fixed payments that can be easily priced, and a stochastic stream, P (0, r0 , r1 , . . . , rM −1 ). We can
6 As mentioned above, the fixed payment stream of a swap is usually chosen so that the initial swap value is zero. However,
once the swap is established its value will then vary stochastically and will not in general be zero.
7 Later in the course we will develop the theory of martingale pricing. Then we will be able to price swaps directly, without
value the stochastic stream either using an arbitrage argument or by recalling that the price of a floating rate
bond is always par at any reset point. Note that the stochastic stream is exactly the stream of coupon payments
corresponding to a floating rate bond with face value P . Hence the value of the stochastic stream must be
(why?) P (1 − d(0, M )) and so the value of the swap is given by
" M
#
X
V = P 1 − d(0, M ) − rf d(0, i) . (6)
i=1
As before, rf is usually chosen so that the initial value of the swap is zero.
3 Futures
While forwards markets have proved very useful for both hedging and investment purposes, they have a number
of weaknesses. First, forward markets are not organized through an exchange. This means that in order to take
a position in a forward contract, you must first find someone willing to take the opposite position. This is the
double-coincidence-of-wants problem. Second, because forward contracts are not exchange-traded, there can
sometimes be problems with price transparency and liquidity. Finally, in addition to the financial risk of a
forward contract, there is also counter-party risk. This is the risk that one party to the forward contract will
default on it’s obligations. These problems have been eliminated to a large extent through the introduction of
futures markets. That is not to say that forward markets are now redundant; they are not, and they are used, for
example, in the many circumstances when suitable futures markets are not available.
Perhaps the best way to understand the mechanics of a futures market is by example.
Remark 1 You should make sure that you fully understand the mechanics of this futures market as these are
the same mechanics used by other futures markets.
In Example 5 we did not discuss the details of margin requirements which are intended to protect against the
risk of default. A typical margin requirement would be that the futures trader maintain a minimum balance in
her trading account. This minimum balance will often be a function of the contract value (perhaps 5% to 10%)
multiplied by the position, i.e., the number of contracts that the trader is long or short. When the balance drops
below this minimum level a margin call is made after which the trader must deposit enough funds so as to meet
the balance requirement. Failure to satisfy this margin call will result in the futures position being closed out.
• It is easy to take a position using futures markets without having to purchase the underlying asset. Indeed,
it is not even possible to buy the underlying asset in some cases, e.g., interest rates, cricket matches and
presidential elections.
• Futures markets allow you to leverage your position. That is, you can dramatically increase your exposure
to the underlying security by using the futures market instead of the spot market.
• They are well organized and designed to eliminate counter-party risk as well as the
“double-coincidence-of-wants” problem.
• The mechanics of a futures market are generally independent of the underlying ‘security’ so they are easy
to “operate” and easily understood by investors.
Forwards, Swaps, Futures and Options 8
• The fact that they are so useful for leveraging a position also makes them dangerous for unsophisticated
and/or rogue investors.
• Futures prices are (more or less) linear in the price of the underlying security. This limits the types of risks
that can be perfectly hedged using futures markets. Nonetheless, non-linear risks can still be partially
hedged using futures. See, for instance, Example 7 below.
When perfect hedges are not available, we often use the minimum-variance hedge to identify a good hedging
position in the futures markets. To derive the minimum-variance hedge, we let ZT be the cash flow that occurs
at date T that we wish to hedge, and we let Ft be the time t price of the futures contract. At date t = 0 we
adopt a position9 of h in the futures contract and hold this position until time T . Since the initial cost of a
futures position is zero, we can (if we ignore issues related to interest on the margin account) write the terminal
cash-flow, YT , as
YT = ZT + h(FT − F0 ).
Our objective then is to minimize
Var(YT ) = Var(ZT ) + h2 Var(FT ) + 2hCov(ZT , FT )
and we find that the minimizing h and minimum variance are given by
Cov(ZT , FT )
h∗ = −
Var(FT )
Cov(ZT , FT )2
Var(YT∗ ) = Var(ZT ) − .
Var(FT )
Such static hedging strategies are often used in practice, even when dynamic hedging strategies are capable of
achieving a smaller variance. Note also, that unless E[FT ] = F0 , it will not be the case that E[ZT ] = E[YT∗ ]. It
is also worth noting that the mean-variance hedge is not in general the same as the equal-and-opposite hedge.
9 A positive value of h implies that we are long the futures contract while a negative value implies that we are short. More
generally, we could allow h to vary stochastically as a function of time. We might want to do this, for example, if ZT is path
dependent or if it is a non-linear function of the security price underlying the futures contract. When we allow h to vary
stochastically, we say that we are using a dynamic hedging strategy. Such strategies are often used for hedging options and
other derivative securities with non-linear payoffs.
Forwards, Swaps, Futures and Options 10
R = D1 P1 + D2 P2
where Pi represents the price per widget at time ti . We assume that Pi is stochastic and that it will depend in
part on the general state of the economy at date ti . In particular, we assume
Pi = aSi ei + c
where a and c are constants, Si is the time ti value of the market index, and 1 and 2 are independent random
variables that are also independent of Si . Furthermore, they satisfy E[ei ] = 1 for each i. The firm wishes to
hedge the revenue, R, by taking a position h at t = 0 in a futures contract that expires at date t2 and where the
market index is the underlying security. The date t2 payoff, Y , is then given by
Remark 2 A more sophisticated hedge would be to choose a position of size h1 at date t = 0 and then to
update this position to h2 at date t1 where h1 and h2 are constants that are chosen at date t = 0. In this case
the resulting hedging strategy is still a static hedging strategy.
Note, however, that since h2 need not be chosen until date t1 , it makes sense to allow h2 to be a function of
available information at date t1 . In particular, we could allow h2 to depend on P1 and S1 , thereby obtaining a
dynamic hedging strategy, (h1 , h2 (P1 , S1 )). Such a strategy should be able to eliminate most of the uncertainty
in R.
Exercise 4 How would you go about solving for the optimal (h∗1 , h∗2 (P1 , S1 ))? Would you need to make an
assumption regarding F1 ?
Note that the most general class of dynamic hedging strategy would allow you to adjust h stochastically at
every date in [0, t2 ) and not just at dates t0 and t1 .
Exercise 5 What types of risk do you encounter when you roll the hedge forward?
Forwards, Swaps, Futures and Options 11
In order to answer Exercise 5, assume you will have a particular asset available to sell at time T2 . Today, at time
t = 0, you would like to hedge your time T2 cash-flow by selling a single futures contract that expires at time T2
with the given asset as the underlying security. Such a futures contract, however, is not yet available though
there is a futures contract available at t = 0 that expires at time T1 < T2 . Moreover, upon expiration of this
contract the futures contract with expiration T2 will become available. You therefore decide to adopt the
following strategy: at t = 0 you sell one unit of the futures contract that expires at time T1 . At T1 you close out
this contract and then sell one unit of the newly available futures contract that expires at time T2 . What is your
net cash-flow, i.e. after selling the asset and closing out the futures contract, at time T2 ?
Note that we have only discussed the mechanics of futures markets and how they can be used to hedge linear
and non-linear risks. We have not seen how to compute the futures price, Ft , but instead will return to this after
we have studied martingale pricing.
Definition 3 An American call (put) option gives the right, but not the obligation, to buy (sell) 1 unit of the
underlying security at a pre-specified price, K, at any time up to an including a pre-specified time, T .
K and T are called the strike and maturity / expiration of the option, respectively. Let St denote the price of
the underlying security at time t. Then, for example, if ST < K a European call option will expire worthless and
the option will not be exercised. A European put option, however, would be exercised and the payoff would be
K − ST . More generally, the payoff at maturity of a European call option is max{ST − K, 0} and its intrinsic
value at any time t < T is given by max{St − K, 0}. The payoff of a European put option at maturity is
max{K − ST , 0} and its intrinsic value at any time t < T is given by max{K − St , 0}.
Put-Call Parity
A very important result for European options is put-call parity. Suppose the underlying security does not pay
dividends. We then have
pE (t; K, T ) + St = cE (t; K, T ) + Kd(t, T ) (7)
where d(t, T ) is the discount factor used to discount cash-flows from time T back to time t. We can prove (7)
by considering the following trading strategy:
• At time t buy one European call with strike K and expiration T
• At time t sell one European put with strike K and expiration T
• At time t sell short 1 unit of the underlying security and buy it back at time T
• At time t lend K d(t, T ) dollars up to time T
Forwards, Swaps, Futures and Options 12
Regardless of the underlying security price, it is easy to see that the cash-flow at time T corresponding to this
strategy will be zero. No-arbitrage then implies that the value of this strategy at time t must therefore also be
zero. We therefore obtain −cE (t; K, T ) + pE (t; K, T ) + St − Kd(t, T ) = 0 which is (7).
When the underlying security does pay dividends then a similar argument can be used to obtain
Therefore the price of an American call on a non-dividend-paying stock is always strictly greater than the
intrinsic value of the call option when the events {ST > K} and {ST < K} have strictly positive probability.
We have thus shown that it is never optimal to early-exercise an American call on a non-dividend paying stock
and so cA (t; K, T ) = cE (t, K, T ). Unfortunately there is no such result relating American put options to
European put options. Indeed it is sometimes optimal to early exercise an American put option even when the
underlying security does not pay a dividend.
Suppose now that S0 = 100, R = 1.01, u = 1.07 and d = 1/u = .9346. Some interesting questions now arise:
Definition 4 A type A arbitrage is a security or portfolio that produces immediate positive reward at t = 0
and has non-negative value at t = 1. i.e. a security with initial cost, V0 < 0, and time t = 1 value V1 ≥ 0.
Definition 5 A type B arbitrage is a security or portfolio that has a non-positive initial cost, has positive
probability of yielding a positive payoff at t = 1 and zero probability of producing a negative payoff then. i.e. a
security with initial cost, V0 ≤ 0, and V1 ≥ 0 but V1 6= 0.
Theorem 1 There is no arbitrage in the 1-period binomial model if and only if d < R < u.
Forwards, Swaps, Futures and Options 13
Proof: (i) Suppose R < d < u. Then at t = 0 we should borrow S0 and purchase one unit of the stock.
(ii) Suppose d < u < R. Then short-sell one unit of the stock at t = 0 and invest the proceeds in cash-account.
In both cases we have a type B arbitrage and so the result follows.
We will soon see the other direction, i.e. if d < R < u, then there can be no-arbitrage. Let’s return to our
earlier numerical example and consider the following questions:
1. How much is a call option that pays max(S1 − 102, 0) at t = 1 worth?
2. How will the price vary as p varies?
To answer these questions, we will construct a replicating portfolio. Let us buy x shares and invest y in the cash
account at t = 0. At t = 1 this portfolio is worth:
Can we choose x and y so that portfolio equals the option payoff at t = 1? We can indeed by solving
107x + 1.01y = 5
93.46x + 1.01y = 0
and the solution is x = 0.3693 and y = −34.1708. Note that the cost of this portfolio at t = 0 is
The arbitrage-free time t = 0 price of the derivative must (Why?) then be C0 := xS0 + y. Solving (10) and
(11) then yields
1 R−d u−R
C0 = Cu + Cd
R u−d u−d
1
= [qCu + (1 − q)Cd ]
R
1 Q
= E [C1 ] (12)
R 0
where q := (R − d)/(u − d) so that 1 − q = (u − R)/(u − d). Note that if d < R < u then q > 0 and 1 − q > 0
and so by (12) there can be (why?) no-arbitrage. We refer to (12) as risk-neutral pricing and (q, 1 − q) are the
risk-neutral probabilities. So we now know how to price any derivative security in this 1-period binomial model
via a replication argument. Moreover this replication argument is equivalent to pricing using risk-neutral
probabilities.
We also note that the price of the derivative does not depend on p! This at first appears very surprising. To
understand this result further consider the following two stocks, ABC and XYZ:
Forwards, Swaps, Futures and Options 14
Note that the probability of an up-move for ABC is p = .99 whereas the probability of an up-move for XYZ is
p = .01. Consider now the following two questions:
Question: What is the price of a call option on ABC with strike K = $100?
Question: What is the price of a call option on XYZ with strike K = $100?
You should be surprised by your answers. But then if you think a little more carefully you’ll realize that the
answers are actually not surprising given the premise that two stocks like ABC and XYZ actually exist
side-by-side in the market.
and we note the risk-neutral probabilities for ST are displayed at the far right in the binomial lattice above.
Risk-neutral pricing pricing via (13) has the advantage of not needing to calculate the option price at every
intermediate node.
Question: How would you find a replicating strategy for the option?
For example, the value of the option at the lower node at time t = 2 is given by
1
12.66 = max 12.66, (q × 6.54 + (1 − q) × 18.37)
R
where 12.66 = 100 − 87.34 is the intrinsic value of the option at that node. More generally, the value, Vt (S), of
the American put option at any time t node when the underlying price is S can be computed according to
1
Vt (S) = max K − S, [q × Vt+1 (uS) + (1 − q) × Vt+1 (dS)]
R
1 Q
= max K − S, Et [Vt+1 (St+1 )] .
R
We will return to option pricing in much greater generality when we study martingale pricing.
Forwards, Swaps, Futures and Options 16