A Practical Guide To Quantitative Volatility Trading (2016)
A Practical Guide To Quantitative Volatility Trading (2016)
The copyright to this computer software and documentation is the property of Quant Finance Ltd. It may be
used and/or copied only with the written consent of the company or in accordance with the terms and conditions
stipulated in the agreement/contract under which the material has been supplied.
Daniel B LOCH 1
Q UANT F INANCE LTD
Working Paper
Financial time series exhibit multifractal scaling behaviour indicating a complex behaviour with long-range time
correlations manifested on different intrinsic time scales. Such a behaviour typically points to the presence of recurrent
economic cycles, crises, large fluctuations, and other nonlinear phenomena. We review quantitative volatility trading in
classical economics before discussing some necessary modifications needed to account for multifractality in inefficient
markets. We then present an arbitrage-free model of implied volatility surface, which is robust, easy to implement and
computationally fast, enabling for systematic volatility trading. We consider risk management and discuss some
applications on variance swaps and dispersion trading.
0.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Appendices 265
B.4.2
Correlation : the general case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 302
B.4.2.1 Case 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 302
B.4.2.2 Case 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 302
B.4.3 Decorrelating Brownian motions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303
B.5 Numerical tools and others . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303
B.5.1 Approximating the normal functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303
B.5.2 Pseudorandom number generators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304
B.5.3 About continuous functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304
0.1 Introduction
In general, uncertainty related to traditional economic variables (interest rates and foreign exchange), as well as new
variables (credit, commodity, weather etc), imposing costs and risks on the society, must be hedged away. Thus, fi-
nancial markets developed, where derivative products were created to transfer risk between different economic agents.
Both parties entering the deal must agree on the price of the contingent claim, even though they have different risks.
The decisions for the appropriate pricing of such claims are made contingent on the price behaviour of the underlying
securities. The uncertainty affecting the underlying is modelled by considering future trajectories of the risky asset
seen as possible scenarios. Thus, it is important to understand the properties of market returns in order to devise their
dynamics.
Assuming stock prices to be the sum of many small terms, stable models have been considered to describe financial
systems. For instance, Mandelbrot [1963] described the financial market as a system with fat tails, stable distributions
and persistence. The argument of large data sets exhibiting heavy tails and skewness combined with the generalised
CLT theorem was put together to justify the use of stables models. However, it was largely ignored by Fama [1965a]
(Fa65) [1970] and Sharpe [1970] who followed Bachelier’s assumption of normally distributed returns (see Bachelier
[1900]) to formalise the weak form efficient market where the price changes are independent and may be a random
walk. The justification for this assumption is provided by the simplicity of the central limit theorem and the fact that
variance and covariance are the only measures of risk. A direct consequence of the efficient market hypothesis (EMH)
is that the most important concepts in theoretical and empirical finance developed around the assumption that asset
returns follow a normal distribution. This includes the classical portfolio theory, the Black-Scholes-Merton option
pricing model and the RiskMetrics variance-covariance approach to Value at Risk (VaR).
Using the concepts of self-similarity, scaling, and fractional processes to assess market returns, various authors
found that long-range dependence (LRD) was subject to debate for raw (signed) returns, but was plainly visible in
absolute returns, square returns, or any other measure of the extent of fluctuations. Thus, financial models developed
to capture the thick tails and long-memory volatility persistence, where returns themselves have no autocorrelation,
but their amplitudes have LRD (FIGARCH, MMAR). However, these statistical approaches are based on the moment
properties of stochastic processes and must be restricted to second-order stationary processes. In addition, fractal
properties are only verified in the infinitesimal limit, making LRD very difficult to be measured on sample data. Fur-
ther, when the time series of asset returns possess the two features of heavy tails and LRD, most statistical tests fail
to work. Yet, stock returns and FX returns are complex systems suffering from systematic effects mainly due to the
periodicity of human activities, and can not be considered as processes with stationary increments. Once methods
for the multifractal characterisation of non-stationary series developed (MFDFA, WTMM, GMWP), academics found
correlations present in systems and highlighted the multifractal nature of financial time series. Further, using the ef-
fective local Holder exponents (ELHE) to detect and localise outliers in financial time series, some academics showed
that these series exhibit stochastic Hurst exponent with characteristics of abrupt changes in the fractal structure. Such
10
behaviour typically points to the presence of recurrent economic cycles, crises, large fluctuations, and other nonlinear
phenomena. As a result, the multifractal nature of financial markets contradict the EMH, and thus the Black-Scholes
pricing model and its associated assumptions. Nonetheless, it seems that the academics who developed the option
pricing theory (OPT) focused on the empirical results that demonstrated thick tails and long-memory volatility persis-
tence exhibited in the financial time series, but did not account for the evidence of long memory in raw returns. Such
models of uncorrelated returns (white spectrum) and semimartingale prices are consistent with economic equilibrium
and the EMH of Samuelson [1965]. For example, Engle [1982] proposed to model volatility as conditional upon its
previous level, that is, high volatility levels are followed by more high volatility, while low volatility is followed by
more low volatility. This led jump-diffusion models to develop in continuous time, such as the Merton model (see
Merton [1976]), the Heston model (see Heston [1993]), or a combination of both.
One consequence of the weak efficient market hypothesis (EMH) (Fa65) is that statistical approaches must be
restricted to second-order stationary processes. In that setting, variance is finite and measures the risk of a single
underlying asset, while the portfolio risk becomes a weighted sum of covariation of all stocks in the portfolio. These
statistics are easy to understand and to estimate. As a result, volatility trading via vanilla options developed to hedge
away uncertainty related to economic variables. In the 1990s volatility became an asset class of its own, and new
products such as volatility swaps and variance swaps were created. When measuring portfolio risk, Value at Risk (VaR)
is the risk expressed in dollar terms showing what amount of money your portfolio could lose during a defined interval
with a given probability. It is commonly understood that VaR is in essence the volatility of a portfolio expressed in
dollar terms. This concept has been applied not only to estimate the risks of a portfolio of assets, but also for trading
the market itself, since the market is considered as one large portfolio of various assets each having a weight in the
global market place. As a result, relative value trading developed as a way to profit from the mean reversion of two
related assets that have diverged. Trades are usually chosen on correlated assets. A pair trade can be carried out via
straddles / strangles or variance swaps. As the volatility of indices tends to trade significantly less than its constituents,
traders tried to profit from this anomaly by either selling correlation swaps, or through dispersion trading. In dispersion
trading, a short index volatility position is traded against a basket of long single stock volatility positions. Such a long
dispersion trade is short implied correlation. Considering a probability formula and assuming identical correlation
between each stock, the index correlation was estimated as the variance of the index divided by the weighted average
single stock variance. Various financial products, developed around the idea of buying or selling correlation, were sold
in the market by investment banks, such as worst-of / best-of, Altiplano and other Himalayas products. The vendors,
being short implied correlation, put buying pressure to lift the implied correlation above fair value. Since dispersion
trading is not a pure correlation trade, correlation swap developed as a swap between the average pairwise correlation
of all members of an index and a fixed amount determined at inception. Basket options, with fixed weighting of the
members over time, have been proposed, leading to covariance risk. Thus, covariance swaps paying out the correlation
multiplied by the volatility of the two assets developed.
Even though the Black-Scholes formula assumes constant volatility, market practice consists in using one BS-
model for every pair (K, T ), leading to the generation of a non-flat implied volatility (IV) surface. Acknowledging
the existence of non-Gaussian market returns and the need to consider higher moments, practitioners had to tweak
the Black-Scholes formula. This approach being a proxy for accounting for stochastic variations of the volatility
surface, traders use the wrong volatility parameter into the wrong formula. They must therefore consider the sensitivity
of option prices to changes in the IV surface. When the volatility σt changes with a certain volatility of its own,
the sensitivity of the option price with respect to the volatility is called Vega. Hence, the vega risk is associated
with unexpected changes in volatility and should be accounted for. Theoretically, the BS-formula does not apply to
stochastic volatility, although the explicit BS-vega calculation is a useful approximation to the actual vega over a small
time horizon. More importantly, the profit and loss (PnL) of financial products on the variance and covariance of asset
returns are fully exposed to moves in volatility of volatility terms, leading to new measures of risk such as the Volga
(second derivative of the price with respect to volatility) and the Vanna (cross derivative of the price with respect to the
underlying asset and the volatility). Ironically, the only possible forms of market risk within the weak form efficient
market, namely the variance and covariance, are themselves stochastic. Going further, we know that standard time
11
series models, including the GARCH(1, 1) process and its squares, the stochastic volatility models and solutions to
stochastic recurrence equations converge in distribution to an infinite variance stable distribution (see Bartkiewicz et
al. [2011]). In the presence of dependence, the conditions for the convergence depend on regular variation of the
stationary sequence with index α ∈ (0, 2) plus some other conditions. This would explain the difficulties encountered
by practitioners to estimate the volatility of volatility parameters of stochastic volatility models. Since the absence of
long range dependence (LRD) in returns is still compatible with its presence in absolute returns, models accounting
for LRD in volatility developed. However, recent studies on asset returns highlighted their non-stationary scaling
properties (stochastic Hurst exponent) with characteristics of abrupt changes in the fractal structure. Thus, financial
time series exhibit multifractal scaling behaviour indicating a complex behaviour with long-range time correlations
manifested on different intrinsic time scales. Such a behaviour typically points to the presence of recurrent economic
cycles, crises, large fluctuations, and other nonlinear phenomena which must be included in the pricing and risk
management of derivative products. These phenomena can easily be observed in the market via the existence of an
implied volatility surface for single stock options and an implied correlation matrix for multi-asset options. Moreover,
since the variance and covariance of financial time series are not necessarily defined, their historical dynamics are
stochastic with erratic jumps. Thus, in order to account for these dynamics when pricing path-dependent options,
practitioners and academics focused on modelling the variance and covariace with jump-diffusion processes. Variance
and covariance should not be considered as measures of risk and the valuation of variance swaps and other correlation
swaps should be revised.
12
13
14
such that the value of the forward contract is zero to both parties, so that neither party pays any money to obtain the
contract. In general the payoff at time T from a long position in a forward contract on one unit of an asset is
ST − K
where K is the delivery price and ST is the spot price of the asset at maturity of the contract. Similarly, the payoff
from a short position is K − ST . The main purpose of forward contracts is to reduce risk since the holder can reduce
future uncertainty. For example, a company facing the need to make a large fixed payment in foreign currency at a
fixed future date can lock in the exchange rate now by buying a forward contract. The seller of such a contract is a
speculator who would make a profit if the exchange rate remain below the value fixed at inception. Note, the forward
contract entails no cost now, and only potential cost in the future.
An option is a security giving the right to buy or sell an asset, subject to certain conditions, within a specified
period of time. Options, or derivatives, on stock prices were first traded on an organised exchange in 1973. Since
then there has been a dramatic growth in terms of volumes of options traded in many exchanges throughout the world.
Factors affecting option prices are the spot price S, the strike price K, the time to maturity T , the volatility of the
underlying σ, and the risk-free interest rate r. The most common underlying assets include stocks, indices, foreign
currencies, debt and commodities. More recently a new type of underlying assets was introduced which is link to the
realised variance or volatility of the underlying such as the variance swap or the volatility swap.
European, or vanilla, options such as a call option give the holder the right to buy the underlying asset by a certain
date called the maturity for a specified price called the exercise price or strike price, while the put option gives him
the right to sell it. An American call or put option is as its European counter part except that it can be exercised at any
time up to the expiration date, while a Bermudan option is exercisable only on certain specified days during its life.
Contrary to the forward contract defined above, the holder of an option has the right to exercise the option but it is not
an obligation. However, whereas it costs nothing to enter into a forward contract, there is a cost to entering into an
option contract. We let K be the strike price and ST be the underlying price at time T , then the payoff from a long
position in a European call option is
max (K − ST , 0) = (K − ST )+
so that the option will be exercised if ST < K and the option can never be worth more than K while the payoff
of a short position in the European put option is −(K − ST )+ . Also, a lower bound for the put option price on a
non-dividend paying stock is Ke−r(T −t) − St . Stoll [1969] first noted the relation between the value of a call option
and that of a put option. The put-call parity is a relationship at a given time t between the price of a European call
option C(t, K) and a European put option P (t, K) given by
The intrinsic value at time t of a call option is the maximum of zero and the value it would have if it was exercised
immediately, that is (St − K)+ . The option is then said to have time value. So, one can identify the total value of
the option at time t as the sum of its intrinsic value and its time value. Options are referred to as in-the-money (ITM),
15
at-the-money (ATM), or, out-the-money (OTM). At a given time t, an in-the-money call option would lead to a positive
cash flow to the holder if it was exercised immediately, that is St > K. Similarly, an at-the-money option would lead
to zero cash flow if it was exercised, that is St = K, while an out-the-money call option would lead to negative cash
flow, that is St < K. OTM options have a high risk of expiring worthless, but they tend to be relatively inexpensive.
As the time value approaches zero at expiration, OTM options have a greater potential for total loss if the underlying
stock moves in an adverse direction. Options that are deep ITM generally trade at or near their actual intrinsic values.
16
v
u n
u 1 X
σ
bn = t (xi − x)2
n − 1 i=1
Si
where xi = log Si−1 is the logarithm return over one business day and x is the arithmetic average of all log-returns.
Sti
In the Black-Scholes formula, the returns Rti = Sti−1 for ti = ih, i = 1, .., n for some time step h > 0 are i.i.d.
2
distributed with N ((µ− σ2 )h, σ 2 h). Hence, the sample standard deviation σbn corresponds to the maximum likelihood
estimator for the parameter σ. One can then plug in the historical volatility to the BS-formula to compute an estimate
of the option price. However, this approach fit very poorly the observed market prices. Since demand and supply
determines the prices of liquid market options, one can use these prices to infer an implied volatility. Contrary to
the historical volatility, the calculation of the market implied volatility requires an option pricing model. For pricing
simple options on a single underlying the financial world uses the Black-Scholes model, which leads to a closed
form solution since the stock price at a fixed time follows a lognormal distribution. In the Black-Scholes model all
parameters are observed except for the volatility parameter σ. Hence, for practical reasons the BS-formula is used to
infer the implied volatility. However, stock prices jump on occasions and do not always move in the smooth manner
predicted by the geometric Brownian motion (GBM) model. Further, stock prices tend to have fatter tails than those
predicted by the GBM. If the BS-model was correct, we should infer a flat implied volatility surface which is not the
case in practice. In addition, the volatility surface varies significantly with time. The volatility skew is a pattern of
the implied volatility among different strike prices, while the term structure of volatility is a pattern of the implied
volatility with varying time to maturity. The slope of the skew is often regarded as a measure of risk aversion. The
implied volatility reflects the market expectation of the stock’s future level of volatility.
We assume that one unit of the underlying is expressed in dollar and that the spot price, or index, is either quoted
in the market or comes from an independent publication representing a consensus between the different parties. The
price of liquid options is determined by supply and demand on the market within a bid-ask spread. There exists various
pricing methods among which some are based on hedging arguments, on the law of large numbers, or, as actuaries
know it, on the standard deviation principle 1 to name a few. In a probabilistic approach, we consider the probability
space (Ω, F, P) where Ft is a right continuous filtration including all P negligible sets in F. Using the concept of
absence of arbitrage opportunities (AAO) (see Harrison and Kreps [1979]), asset returns must reflect the fact that the
riskier the asset the higher the returns, resulting in an instantaneous drift higher than the risk-free rate. Hence, for the
market to be risk neutral all the tradable assets must share the same returns equal to the risk-free rate. As a result, in an
arbitrage-free market, the assumption of linearity of prices leads to the existence of a risk-neutral probability measure
1 P (X) = E[X] + λσ[X] where λσ[X] corresponds to a risk premium.
17
Q equivalent to the historical measure P. Since our underlying is exchanged in a market, its discounted price needs
to be a martingale under the risk-neutral measure Q (see Delbaen et al. [2004]). For an event A ∈ F, while P(A)
1
represents its probability of occurrence, Q(A) represents the value of an option with terminal payoff equal to P (t,T )
if A occurs and zero otherwise. Therefore, the risk-neutral measure represents a market consensus on the values of
derivative instruments rather than an objective description of the evolution of the market (see Cont [2005]).
We let Q be the set of coexistent equivalent measures Q. In a complete market the risk-neutral probability is
unique, but it is not the case in an incomplete market where it must be specified. That is, in an incomplete market,
even if P is known with certainty, we still face uncertainty in the choice of the pricing model Q. In that sense, the
pricing of contingent claims is very sensitive to the assumptions made and the choice of a model for the underlying
process. Financial markets consists of benchmark instruments with payoff (Hi )i∈I , where observed market prices
(Pi∗ )i∈I are in the range Pi∗ ∈ [Pib , Pia ], corresponding respectively to the market bid and ask prices. Thus, we
consider a set of arbitrage free model Q such that the discounted asset price (S t )t∈[0,T ] is a martingale under each
Q ∈ Q with respect to its own history Ft and
In general, in an incomplete market we can choose any equivalent martingale measure as a self-consistent pricing
rule, but option prices no-longer correspond to the cost of a specific hedging strategy. That is, the expected value
of the option is purely mathematical and does not relate to its future price in the sense of replication (see Ayache et
al. [2004]). One solution is to work in a risk-neutral framework by adding a market price of risk to the underlying
dynamics. For simplicity of exposition, given a market price of risk λ, we will assume that there exist an equivalent
martingale measure Qλ denoted by Q. As a result of absence of arbitrage opportunities (AAO), contingent claims can
be valued by taking expectation of their discounted payoffs under the risk-neutral measure. Therefore, the price of a
European contingent claim C(t, x) on ]0, T ]×]0, +∞[ under the risk-neutral measure Q is
RT
C(t, Xt ) = E Q [e− t
rs ds
h(XT )|Ft ] (1.1.1)
where XT is a Q-random variable and h is a sufficiently smooth payoff function. In that setting, the difficulty comes
from the meaning that one gives to the market price of risk λ as it depends on the preference of investors. Therefore,
one need to know where to observe it and how to calibrate it. In the special case where the market price of risk is set
to zero we recover the so-called historical option price.
18
where V0 = E Q [H]. The strategy is self-financing, meaning the cost of the portfolio is a constant
Z t
V (t) − α(s)dX(s) = V0
0
where V0 is a perfect hedge.
where V0∗ = V0 + G0 and α∗ = α + αH . Market incompleteness implies the existence of an equivalent measure in
the set Q, which is not necessarily a martingale and/or a unique measure.
19
No analytical solution exists to this problem, and so we must rely on other methods to compute the prices of American
options in practice.
Proposition 1 Given an American call option with payoff function h : [0, ∞) → R given by h(St ) = (St − K)+ ,
for asset price St at time t and strike price K, the value of this security is the same as the value of the equivalent
European option with payoff at maturity h(ST ). That is,
20
EQ [(1 + r)−ti+1 h(Sti+1 )|Fti ] = (1 + r)−ti EQ [(1 + r)−(ti+1 −ti ) h(Sti+1 )|Fti ]
≥ (1 + r)−ti EQ [h((1 + r)−(ti+1 −ti ) Sti+1 )|Fti ]
≥ (1 + r)−ti h EQ [(1 + r)−(ti+1 −ti ) Sti+1 |Fti ]
= (1 + r)−ti h(Sti )
where we have also used Jensen’s inequality. Hence, we can see that the process {(1 + r)−ti h(Sti )}ni=0 is a sub-
martingale, and so by the Optional Sampling Theorem (Appendix A.5) we have
Since the option holder will always have the possibility of exercising at expiration time T , it must be the case that the
strict inequality in the above can not hold. Hence, we must conclude
In fact, the proof of Proposition (1) will hold for any positive convex function h. However, the result no longer
holds when the underlying asset pays dividends. It can, on occasion, be optimal for the owner of an American call
option to exercise immediately prior to an ex dividend date. For example, as discussed by Hull [2012], let us assume
N ex-dividend dates td1 < ... < tdN during the lifetime of an American call option, paying dividends d1 , ..., dN
respectively. Approximating in the region around the ex-dividend date tdN , if the call option is exercised immediately
prior to the payment the holder will receive S(tdN ) − K, where as the stock price will fall to S(tdN ) − dN if the option
is not exercised, and the value of the option will be S(tdN ) − dN − Ke−r(T −tdN ) . Hence, if
then it can not be optimal to exercise the American call option at prior to time tdN . Conversely, if
21
even without dividends, the holder of an American put option may benefit from exercising their right to sell prior to
maturity. In fact, it is regarded that the put option should always be exercised if at any point it is sufficiently deep in
the money during the lifetime of the option. This is related to the lower bound on the stock price and therefore upper
bound on the put option payoff, that does not exist in the case of a call option. With a stock price very close to zero,
and a strike price such that the put option is deep in the money, the holder of a put option can do little better than to
exercise the option and take the current value. Since the stock price could only rise, the payoff of the put option could
only fall. Moreover, exercising immediately is preferable to waiting, even with a constant stock price, due to the time
value of money and the fact that the strike value you receive, will be worth less in the future. Hull [2012] notes that
both call and put options provide insurance against movements in the value of the underlying asset. A put option held
with stock insures against the risk of the underlying value becoming too low. However, due to the zero bound on stock
prices, such insurance becomes irrelevant when the bound is hit, making it optimal to forgo the insurance and receive
the payoff immediately.
1 2
with initial condition S0 = x. Applying Ito’s lemma to the function f (t, z) = xeµt+σz− 2 σ t , we compute the time
and space derivatives
0 1 0 00
ft (t, z) = f (t, z)(µ − σ 2 ) , fz (t, z) = f (t, z)σ , fzz (t, z) = f (t, z)σ 2
2
and we get the dynamics of the stock price as
dSt
= µdt + σdW
ct
St
Given the initial condition S0 = x, the first two moments of the stock price are given by
2
E[St ] = xeµt , E[St2 ] = x2 e(2µ+σ )t
2
V ar(St ) = x2 e2µt (eσ t − 1)
22
which is independent from the initial value x. Further, given a bounded positive function h, the option price in Equation
(1.1.1) simplifies to
√
Z Z
1 2
E[h(Stx )] = h(y)φ(t, x, y)dy = h(xe(µ− 2 σ )t+σ tu )ψ(u)du
1 2
where ψ(z) = √12π e− 2 z is the Gaussian density. In the special case where the volatility is set to zero (σ = 0) the
rate of return of the asset should equate the risk-free rate r
dSt0
= rdt
St0
In all other cases (σ 6= 0) the market compares the rate of return per unit of time µ to the return a risk-free asset, so
that the market reference is µ − r. In that setting the Sharpe ratio is assumed to be the market price of risk, λ, that the
market assign to the noise Wc as
dt
1
E[ dS
St ] − r
t
µ−r
RS = λ = q =
1
ar( dS σ
dt V St )
t
23
Under these assumptions, the value of the option will only depend on the price of the stock and time, so that it is
possible to create a hedged position, consisting of a long position in the stock and a short position in the option. In a
complete market, every terminal payoff can be replicated by a portfolio, so that the problem of defining the price of
an option is linked with that of finding a replicating portfolio. That is, using the option premium, one can buy and sell
the underlying to hedge risk against the derivative. Following that argument, Black and Scholes defined the price of
a derivative option as the price of its hedge. Knowing the dynamics of the underlying price we need to formalise the
evolution of the self-financing replicating portfolio dynamically managed (without adding or removing capital during
the trading period). Following the approach in Section (1.1.3.1), we put ourself in the Black-Scholes world and we
assume a market made of a risk-free asset St0 and a risky asset with price St . We model the dynamics of the future
stock prices under the historical measure P as
dSt = St µt dt + σt dW
ct
µt = rt + σt λt
We denote the quantity of cash that one can hold as β(t)St0 with dynamics given by
Vt = α(t)St + β(t)St0
and we denote the value invested in stocks as πt = α(t)St . Over a very short period of time the variations of the
portfolio are only due to the variations of the asset price and the interest rate rewarding the cash held at the bank.
Hence, the portfolio dynamics are given by
dVt = rt Vt dt + πt σt (dW
ct + λt dt)
Market completeness is equivalent to assuming that markets are efficient, that is, the price of an asset at a given time
contains all past information as well as the market expectation of its future value. This means that there exists a
strategy leading to a final P/L of null value in all possible future configuration. As a result, we can conclude that
the option price is equal to the value of the replicating portfolio at the initial time. A direct application of the above
assumption is that we can express the Black-Scholes price dynamics as
24
where the pair (α(t), β(t)) allows replication of the contingent claim paying h(ST ) at maturity. Note, a consequence
of the option price being expressed under the risk-neutral measure is that it neither depend on the return µ of the
risky asset nor the market price of risk λ. However, the risk due to the variations of the risky asset is still present and
significantly impact the option price via the volatility parameter σ. Thus, to avoid arbitrage, a market price of risk is
computed such that the discounted spot price is a martingale with respect to the equivalent probability measure. In
the risk-neutral measure, the drift of the spot price is equal to the risk-free rate so that the key parameter becomes the
volatility of the dynamics. Volatility is a key parameter when risk managing option prices as it allows the portfolio
manager, or trader, to define his hedging strategies by computing the necessary Greeks (see details in Section (1.6.2)).
The Vega is a risk measure quantifying exposure to a misspecified volatility.
CBS (t, x, K, T ) = xe−q(T −t) N (d1 (T − t, F (t, T ), K)) − Ke−r(T −t) N (d2 (T − t, F (t, T ), K)) (1.2.6)
where F (t, T ) = xe(r−q)(T −t) is the forward price and N (•) is the normal cumulative distribution function (see
details in Appendix (B.5.1)). Further, we have
1 x 1 √ √
d2 (t, x, y) = √ log − σ t and d1 (t, x, y) = d2 (t, x, y) + σ t
σ t y 2
with the following properties
lim di = ∞ for i = 1, 2
σ→0
and
From the limits of the CDF, limx→−∞ N (x) = 0 and limx→∞ N (x) = 1, we get
25
+
lim CBS (t, x, K, T ) = xe−q(T −t) N (∞) − Ke−r(T −t) N (∞) = xe−q(T −t) − Ke−r(T −t)
σ→0
and
lim CBS (t, x, K, T ) = xe−q(T −t) N (∞) − Ke−r(T −t) N (−∞) = xe−q(T −t)
σ→∞
PBS (t, x, K, T ) = Ke−r(T −t) N (d1 (T − t, K, xe(r−q)(T −t) )) − xe−q(T −t) N (d2 (T − t, K, xe(r−q)(T −t) )) (1.2.7)
and
lim PBS (t, x, K, T ) = Ke−r(T −t) N (∞) − xe−q(T −t) N (−∞) = Ke−r(T −t)
σ→∞
Being long a call option and short a put option, when the volatility tends to infinity, we get
lim CBS (t, x, K, T ) − lim PBS (t, x, K, T ) = xe−q(T −t) − Ke−r(T −t)
σ→∞ σ→∞
and we recover the call-put parity. Hence, we see that for σ → 0 both the call and put price tend to their intrinsic
values. Further, option price is postively related to volatility, no matter if the option is a call or a put. For a call option,
the higher the volatility is, the greater the chance that the underlying value raises above strike price. For a put option,
the higher the volatility is, the greater the chance that the underlying value falls below strike price.
K KP (t, T )
η= =
F (t, T ) xRe(t, T )
to be the forward moneyness of the option. It leads to the limit case
1 √
lim d2 (.) = − σ t
η→1 2
1 √
lim d1 (.) = σ t
η→1 2
It is well known that when the spot rate, repo rate and volatility are time-dependent, we can still use the Black-Scholes
formula (1.2.3) with the model parameters expressed as
Z T Z T Z T
1 1 1
r= r(s)ds , q = q(s)ds , σ 2 = σ 2 (s)ds
T −t t T −t t T −t t
Further, we let the Black-Scholes total variance be given by ω(t) = σ 2 t, and rewrite the BS-formula in terms of the
total variance, denoted CT V (t, x, K, T ), where
1 x 1p p
d2 (t, x, y) = p log − ω(t) and d1 (t, x, y) = d2 (t, x, y) + ω(t) (1.2.8)
ω(t) y 2
26
Expressing the strike in terms of the forward price K = ηF (t, T ), the call price in Equation (1.2.3) becomes
= xe−q(T −t) N (d1 (η, ω(T − t)) − ηN (d2 (η, ω(T − t)))
CT V (t, x, K, T ) K=ηF (t,T )
(1.2.9)
p
where d2 (η, ω(t)) = √ 1 log 1
η − 1
2 ω(t), or equivalently
ω(t)
1 1p p
d2 (η, ω(t)) = − p log η − ω(t) and d1 (η, ω(t)) = d2 (η, ω(t)) + ω(t) (1.2.10)
ω(t) 2
which only depends on the forward moneyness and the total variance. This is the scaled Black-Scholes function
discussed by Durrleman [2003].
1p 1p p
≈ xe−q(T −t) N ( ω(T − t)) = 0.4xe−q(T −t) ω(T − t)
CT V (t, x, K, T ) K=F (t,T )
ω(T − t)) − N (−
2 2
(1.2.12)
which is linear in the spot price and the square root of the total variance. It comes from the fact that
1 1
N (x) − N (−x) ≈ √ (2x − x3 )
2π 3
√
For x = 21 σ T − t, which is small, so is the quantity 13 x3 and we get
1 1 √
N (x) − N (−x) ≈ √ 2x = √ σ T − t
2π 2π
where √12π ≈ 25 . One of the property of the call and put prices in the Black-Scholes model is that they are homoge-
neous functions
27
er∆t − d u − er∆t
p= and 1 − p =
u−d u−d
We let τ = T − t = N ∆t where N is the number of time periods, such that there are 2N possible price paths from
(t, S) to (T, ST ). Further, we let (i, j) denotes the jth node at the ith time step, with price Sij given by
fN j = F (S0 uj di−j )
Using backward induction, the price of an option at node (i, j) is found by taking the expected valus of the option at
time i + 1
where pN,j is the risk-neutral binomial probability to end up in state (N, j), given by
N j
pN,j = p (1 − p)N −j
j
28
with nj = j!(n−j)!
n!
. We need to relate the discrete-time multiplicative binomial model with parameters (u, d, p, r) to
the continuous time BS-model with parameters (σ, r). In the limit of infinitesimally small time steps, N → ∞ and
∆t → 0, the binomial option pricing formula must converge to the BS-formula
Z ∞
−rτ
f =e F (ST )pQ (ST , T |S, t)dST
0
for the two formulations to be consistent.
Proposition 2 The distribution of the terminal stock price in the binomial model with parameters (u, d, p, r) con-
verges to the Black-Scholes lognormal distribution with parameters (σ, r) as ∆t → 0 if and only if
u
p ln + ln d = µ∆t + o(∆t)
d
and
u
p(1 − p)(ln )2 = σ 2 ∆t + o(∆t)
d
Proof can be found in the book by London [2005]. Since there are two equations for three unknowns (u, d, p) relating
them to (σ, r) which must hold only in the first order in ∆t, one can find an infinite number of binomial models
consistent with the BS-model. One way forward is for the mean and variance of the stock price distribution at the end
of the period ∆t to match exactly the mean and variance of the lognormal distribution. That is,
pu + (1 − p)d = er∆t
2
pu2 + (1 − p)d2 = e(2r+σ )∆t
er∆t − d
p=
u−d
2
er∆t (u + d) − du = e(2r+σ )∆t
However, we still need an additional equation to solve for u and d in terms of r and σ.
Assuming u = d1 , Cox et al. [1979] obtained a solution to the above system of equations
p
u = A+ A2 − 1
p
d = A− A2 − 1
2
1
e−r∆t + e(r+σ )∆t
where A = 2 , which can be approximated as
√
u = eσ ∆t
√
−σ ∆t
d = e
In the limit ∆t → 0, both the binomial and BS-model converge to the same limit.
Jarrow et al. [1983] considered equal probabilities for up and down price movements, p = 1 − p = 12 , leading to
√
u = eµ∆t+σ √∆t
d = eµ∆t−σ ∆t
29
with µ = r − 12 σ 2 . Note, the CRR tree is symmetric since ud = 1, but the up and down probabilities are not equal,
while in the JR tree the probabilities are equal, but the tree is skewed since ud = e2µ∆t . We can combine the two
approaches by setting ud = e2ν∆t for some scalar ν. We then get
√
u = eν∆t+σ √∆t
d = eν∆t−σ ∆t
with probabilities
1 1 µ−ν√ 1 1 µ−ν√
p= + ∆t and 1 − p = − ∆t
2 2 σ 2 2 σ
where ν = 0 is the CRR and ν = µ is the JR model.
Trigeorgis [1991] proposed to relax the constraint of small time intervals by considering the natural logarithm of
the asset price X = ln S with drift µ = r − 21 σ 2 going up to ∆Xu = X + ∆X with probability pu , or down to
∆Xd = X − ∆X with probability pd = 1 − pu . This is the additive binomial model, as opposed to the multiplicative
one. Equating the mean and variance of that model to that of BS-formula, we get
1 1p 2
∆Xu = µ∆t + 4σ ∆t − 3µ2 (∆t)2
2 2
3 1p 2
∆Xd = µ∆t − 4σ ∆t − 3µ2 (∆t)2
2 2
Assuming equal jump size, we get the system
which we solve as
p
∆X = µ2 (∆t)2 + σ 2
1 1 µ∆t
pu = +
2 2 ∆X
We can also extend the model to price American-style options on foreign currencies by setting r → r − rf where rf is
the foreign risk-free rate. Further, we can obtain options on stock indexes by setting r → r − q where q is the dividend
yield on the index. When computing the hedge statistics of an option, we can read the delta and gamma off the tree.
The delta can be approximated by finite difference as follow
∂C ∆C C1,1 − C1,0
∆= ≈ =
∂S ∆S S1,1 − S1,0
and the gamma is given by
30
and one would have to build a new tree from every node after each dividend date (see Hull [2012]). Vellekoop et
al. [2006] proposed a way around to get a recombining tree by letting the dividends be stock-price dependent and
using an interpolation technique after each dividend date. This approach, which we briefly describe, was introduced
by Wilmott et al. [1993] in the case of a partial differential equation (PDE). Assuming a single dividend at time tD ,
we define the set of nodes at that date as
An = {S : Qn (Sm(n)∆n = S) > 0}
where the time step m(n) ∈ N satisfies m(n)∆n ≤ tD < (m(n) + 1)∆n . Then the procedure is as follow:
• Build a no-dividend binomial tree with n time steps, compute the payoff at maturity and work backwards until
we reach the dividend date at time step m(n). We then obtain the values fm(n) = F (S) for the option contract
in all points S of the binomial tree, that is, for all S ∈ An . These values F (S) approximate the option values at
time just after the dividend has been paid, given the stock price at that time is S.
• Since the stock price jumps down with the amount D(St− ) when it goes ex-dividend, the option value for a stock
price S just before the dividend date equals the option value for the stock price S − D(S) just after the dividend
date. Hence, we need the option values fm(n) = F (•) in all the points S − D(S). Since we have calculated the
values of fm(n) = F (S) for S ∈ An , we can devise a function approximating fm(n) on the whole of R+ , based
on the values of fm(n) on An . We let BF be such an interpolation function, and we compute BF (S − D(S)) as
an approximation to the values of F (S − D(S)) needed to continue the backward propagation.
31
• We then proceed backward as in the no-dividend binomial tree until we reach time zero.
We can then extend this approach to the case of multiple discrete dividends. Posing some explicit conditions on the
interpolation procedure, the authors prooved convergence of the method.
subject to
n
X
pj = 1 and pn ≥ 0
j=0
n
1 X
S0 = pj S(j)
rn j=1
n
1 X
Cib < Ci < Cia where Ci = n pj F (S(j), Ki )
r j=0
32
We then use a recursive algorithm to go backwards through the tree to build the IBT tree. For each node on the tree
we need to find the path probability Q at that node, the cumulative return R, and the probability of an up movement q.
The sum of the path’s probabilities at a node is the total (nodal) probability that the price will arrive at this node at that
time step in the tree. Both CRR and Rubinstein’s tree assume that each path leading to a node is of equal probability,
so that the nodal probability is the path probability multiplied by the number of paths leading to that node. Hence,
n!
at the nth time step and the jth node there are j!(n−j)! paths through the tree leading to this node, so that the path
probability at this node is the nodal probability divided by the number of path, that is, pj j!(n−j)!
n! . For example, at a
particular node we consider (Q, R, q) where R denotes one plus the cumulative risk-neutral return through the tree
to this node, Q is the path probability of arriving at this node, and q is the probability of an up move from this node.
Then, if the one-step ahead probabilities Q+ and Q− correspond to the ending nodes of the tree, we calculate them
using pj j!(n−j)!
n! . The path probabilities are additive backwards Q = Q+ + Q− , and the path probabilities determine
Q+
the up probabilities q = Q− . At last, the returns cumulate probabilistically as
1
qR+ + (1 − q)R−
R=
r
subject to
pj ≥ 0 for all j (0 ≤ j ≤ n)
Xn
pj = 1
j=0
n
X
−(r−q)T
S0 = e pj S(j)
j=0
where Vimodel is the model price and Vimkt the market price, T is the common maturity of all options, q is the dividend
yield of the underlying asset, and α > 0 is a parameter specifying the penalty for not matching the option price exactly.
In the case of European options, the model prices are linear function of the ending nodal probabilities given by
n
X
Vimodel = e−rT pj F (S(j), Ki )
j=0
where F (S(j), Ki ) is the payoff at maturity. While European options are linear function of the ending nodal probabili-
ties, when considering American options the problem is no-longer a simple quadratic program, but a general nonlinear
optimisation problem due to the early exercise feature.
33
C : [0, ∞) × [0, ∞) → R
(K, T ) → C(K, T )
along with a real number s > 0.
However, market practice is to use the implied volatility when calculating the Greeks of European options. Thus, we
let the implied volatility (IV) be a mapping from time, spot prices, strike prices and expiry days to R+
Σ : (t, St , K, T ) → Σ(t, St ; K, T )
Hence, we define the implied volatility as follow:
Definition 1.4.2 Given the option price C(t, St , K, T ) at time t for a strike K and a maturity T , the market implied
volatility Σ(t, St ; K, T ) satisfies
34
Given (S, K, T ) and the price of an option, there is a unique implied volatility associated to that price, since
(K, T ) → Σ(K, T )
as the implied volatility surface. Note, we will some time denote ΣBS (K, T ) the Black-Scholes implied volatility.
A representation of the volatility surface is given in Figure (1.1). When visualising the IVS, it makes more sense
to consider the two-dimensional map (η, T ) → Σ(η, T ) 2 where η is the forward moneyness (corresponding to the
strike K = ηF (t, T )). We can also use the forward log-moneyness η such that the two-dimensional map becomes
(η, T ) → Σ(η, T ) 3 . Further, we let the total variance be given by
35
As discussed by Figlewski [1989], the IV is a free parameter containing expected volatility and everything else
that affects option demand and supply, but it is not the model, making it very difficult to disentangle the different risk
factors. As a results, all these risks are incorporated into the implied volatility in such a way that the skew can be seen
as the view from the market that options with different strikes and different expiries have different risks and should
be valued accordingly. That is, one expect a normal behaviour of the stock prices near ATM options which can be
reasonably hedged, but when the stock prices exhibit large downward movements, the fear of non-headgeable jumps
dominate. Bollen et al. [2004] showed that changes in the level of an option implied volatility were positively related
to variation in demand for the option, and then argued that demand for out-of-the-money puts to hedge against stock
market declines pushes up implied volatilities on low strike options in the stock index options market. Hence, the out-
of-the-money (OTM) put prices have been viewed as an insurance product against substantial downward movements
of the stock price and have been overpriced relative to OTM calls that will pay off only if the market rises substantially.
For example, in the case of short maturity options where OTM put prices should have a zero market value, they actually
exhibit positive values representing exclusively a market risk premium. As a result, the implicit distribution inferred
from option prices is substantially negatively skewed compared to the lognormal distribution inferred from the Black-
Scholes model. That is, given the Black-Scholes assumptions of lognormally distributed returns, the market assumes
a higher return than the risk-free rate in the tails of the distributions.
36
In this section we are going to introduce formally the no arbitrage conditions. Further, understanding the behaviour
of the implied volatility (IV) surface for far expiries and far strikes is fundamental for extrapolation problems. Thus,
various necessary conditions for an implied volatility surface to be properly defined have been proposed (see Durrle-
man [2003], Lee [2005]). We are going to briefly describe some well known results on the no-arbitrage constraints
imposed on the IV surface and refer the readers to Fengler [2005] for more detailed results. Note, this section assumes
deterministic interest rates and the underlying is not paying discrete dividends.
CA ≤ S0
C ≤ S0
where CA is an American call price and C is a European call price. Similarly, for American put price PA and European
put price, we get the upper bound
PA ≤ K
P ≤ K
P (t; K, T ) ≤ P (t, T )K
where P (t, T ) is a zero-coupon bond price. We get the following lower bound for European call options
+
C(t; K, T ) ≥ St − P (t, T )K
which also applies to American call options as they are more flexible than the European ones and are worth more. In
the case of a European put option on non-dividend paying stock, we get the lower bound
+
P (t; K, T ) ≥ P (t, T )K − St
which also applies to American put options. As European call prices are monotonically decreasing functions of strike
prices and puts are monotonically increasing functions of strike prices, we can get broad no-arbitrage bounds on the
slope of the smile. First, given K1 ≤ K2 for any expiry date T , we have
37
P (t; K1 , T ) P (t; K2 , T )
≤
K1 K2
Moreover, the call price must be a decreasing and convex function with respect to the strike K
Σ : (t, K, T ) → Σ(t; K, T )
K
Further, we define the forward moneyness as η = F (t,T ) , where F (t, T ) is the forward price seen at time t for the
maturity T , and let the total variance be given by ν (η, T ) = Σ2 (η, T )(T − t).
2
Proposition 3 Assuming deterministic rates and dividend yield, if ν 2 (η, Ti ) is a strictly increasing function for i =
1, 2 with T1 < T2 , then there is no calendar arbitrage.
Assuming the explicit dependence of volatility on strikes, we differentiate the market prices with respect to the strike
getting
∂K CBS (t; K, T )
∂K Σ(t; K, T ) ≤ −
∂Σ CBS (t; K, T )
√ √
Since the Vega is given by ∂Σ CBS (t; K, T ) = St τ n(d1 ) = Ke−rτ τ n(d2 ), we write the upper bound as
N (d )
∂K Σ(t; K, T ) ≤ √ 2
K τ n(d2 )
1 √1 ,
In the special case where η = 1, as τ → 0 then N (d2 ) → 2 and n(d2 ) → 2π
such that
1
lim ∂K Σ(t; K, T ) K=F (t,T )
≤ O(τ − 2 )
τ →0
Similarly, to get a lower bound we differentiate the put price and use the call-put parity to get
N [−d1 ] N [−d1 ]
−√ ≤ ∂K Σ(t; K, T ) ≤ √
τ KN [d1 ] τ KN [d1 ]
38
since given Et [ST ] < ∞ we can interchange the limit and the expectation by the dominated convergence theorem.
Similarly, applying L’Hopital’s rule to the right hand side we get
r
2|x| p e−rτ
lim CBS (t; x, T, ) = e−rτ F (t, T ) N [0] − lim ex N [− 2|x|] = F (t, T )
x→∞ T x→∞ 2
Further, Lee [2004] showed that there is a precise one-to-one correspondence between the asymptotic behaviour of the
smile function and the number of finite moments of the distribution of the underlying ST and its inverse S1T . The idea
is to find a coefficient that can replace the 2 in Equation (1.4.16) with the number of finite moments in the underlying
distribution of ST and S1T . Hence, define
and
Σ2 (x, T )
βR = lim sup { }
x→∞ |x|/T
Σ2 (x, T )
βL = lim sup { }
x→−∞ |x|/T
Here, the coefficients βR and βL can be interpreted as the slope coefficients of the asymptotes of the implied variance
function. Lee showed that their values are βR , βL ∈ [0, 2] and
1 βL 1
q̃ = + −
2βL 8 2
39
That is, the IV smile must carry the same information as the underlying risk-neutral transition density. Hence, the
asymptotic behaviour of the smile is shaped by the tail behaviour of the risk-neutral transition density and vice versa.
Further, as options are bounded by moments, which can be interpreted as exotic options with payoffs, and moments
are bounded by mixtures of a strike continuum of plain vanilla options, then the tail decay of the risk-neutral density
determines the number of finite moments in the distribution. This has implications in the extrapolation of the IVS
since the prices of exotic options depend significantly on the specific extrapolation. The results show that linear or
1
convex √ the O(|x| ) behaviour. So, the smile (IV wings) should not grow faster than
skews for far strikes are wrong by 2
√
x and it should not grow slower than x unless we assume that ST has finite moments of all orders.
This is to relate to the result found by Gatheral [1999] where the gradient of the implied volatility, if it exists,
decays pointwise like T1 . More formally, Tehranchi’s results made precise the rate of flattening of the IV skew from
a theoretical stand point. They emphasised that the flattening of the implied volatility smile is a universal property of
all martingale models. For example, one of the inequality states that there exists a martingale (St )t≥0 with St → 0 as
t → ∞ in probability, such that
T ∂η Σ2 (η, T ) → −4
as T → ∞ uniformly for η ∈ [−M, M ]. Hence, given the total variance ν 2 (η, T ) = Σ2 (η, T )T , the implied variance
skew must be bounded by |∂η ν 2 | ≤ 4 and it should decay at a rate of T1 between expiries.
Further, the literature on the modelling of the stochastic dynamics of the Black-Scholes IV being flourishing, to prove
that the IV surface can not move by parallel shifts, as the shape must also change, Rogers et al. [2010] derivied
certain model-independent properties of the implied volatility surface. It was originally a conjecture established by
Ross [2006]. It represents an important result showing that one can not blindly impose dynamics on the IV surface,
such as moving up or down the IVS by parallel shifts, as it may leads to inconsistency. Given the forward moneyness
−(r−q)(T −t)
η=K xe and taking its logarithm η = log η, they showed that if there exists a process (ξt )t≥0 such that for
all t ≥ 0, (T − t) > 0
40
such that the price of an option is the discounted expected value of the payoff under the measure Q. We first follow
the definition given by Roper [2010].
Theorem 1.4.1 Let s > 0 be a constant, T be the maturity, and let C : (0, ∞) × [0, ∞) → R satisfy the following
conditions:
(A1) (Convexity in K)
C(·, T ) is a convex function, ∀T ≥ 0
(A2) (Monotonicity in T )
C(K, ·) is non-decreasing, ∀K ≥ 0
(A3) (Large Strike Limit)
limK→∞ C(K, T ) = 0, ∀T ≥ 0
(A4) (Bounds)
(s − e−rT K)+ ≤ C(K, T ) ≤ s, ∀K > 0, T ≥ 0
(A5) (Expiry Value)
C(K, 0) = (s − K)+ , ∀K > 0
Then, there exists a non-negative Markov martingale X, with the property that
Remark 1.4.1 The implied volatility surface being a mapping from Black-Scholes prices, necessary and sufficient
conditions for the surface to be free from static arbitrage must be defined in terms of the properties and limits of the
Black-Scholes formula.
Remark 1.4.2 To infer static arbitrage from implied volatility surface, one must first establish necessary and sufficient
conditions on the call price surface for it to be free of static arbitrage, and then translate these conditions into
conditions on the implied volatility surface.
We state the theorem given by Roper [2010] that is sufficient to ensure that the implied volatility surface remains free
K
from static arbitrage. In that setting, we get x = ln ( F (t,T ) ) and we define the time scaled implied volatility as
41
Theorem 1.4.2 Let F (t, T ) > 0, T be the maturity, and let Ξ : R × [0, ∞) → R. Let Σ satisfies the following
conditions:
(IV1) (Smoothness)
∀T > 0, Ξ(•, T ) is twice differentiable.
(IV2) (Positivity)
∀x ∈ R and T > 0, Ξ(x, T ) > 0.
(IV3) (Durrleman Condition)
∀x ∈ R and T > 0,
x∂x Ξ 2 1 2
1− − Ξ (∂x Ξ)2 + Ξ∂xx Ξ ≥ 0 (1.4.19)
Ξ 4
(IV4) (Monotonicity in T)
∀x ∈ R, Ξ(x, •) is non-decreasing.
(IV5) (Large-Moneyness Behaviour)
∀T > 0, limx→∞ d1 (x, Ξ(x, T )) = −∞.
(IV6) (Value at Maturity)
∀x ∈ R, Ξ(x, 0) = 0.
Then the call price surface parameterised by F (t, T ) is free from static arbitrage. In particular, there exists a non-
negative Markov martingale X, with the property that
Definition 1.4.4 An implied volatility surface is free from static arbitrage if and only if the following conditions are
satisfied.
(V1) For a given forward moneyness, η, the surface is free from calendar-spread arbitrage.
(V2) For each time-to-maturity, the surface is free from butterfly-spread arbitrage.
The butterfly-spread arbitrage ensures the existence of a (non-negative) implied risk-neutral density for the underly-
ing, whilst calendar-spread arbitrage implies the monotonicity of option prices with respect to maturity. The latter
motivates the following definition:
Proposition 4 Assuming deterministic rates and dividend yield, if ω(η, Ti ) is a strictly increasing function for i = 1, 2
with T1 < T2 , then there is no calendar arbitrage.
or quivalently,
Definition 1.4.5 Assuming deterministic rates and proportional dividend yield, the IVS is free of calendar spread
arbitrage when
42
Note, this is the numerator of the local volatility expressed in terms of the implied volatility. In the special case where
we assume that the implied volatility neither depends on the underlying St nor on the strike, we get
Z T
2
ω(T ) = Σ (T )T = σ 2 (s)ds
0
so that the implied volatility is equal to the quadratic mean of the local volatility during the life of the option. In that
case, the local volatility does not depend on the strike price, and after differentiating with respect to maturity, it must
satisfy
where η = log η and d1 (η, ω(t)) is given in Equation (1.2.10). This condition is equivalent to call option prices tending
to zero as the strike tends to infinity, which is the Large Strike Limit in Theorem (1.4.1). Note, the price formulation
of arbitrage freeness given by Roper [2010] in Theorem (1.4.1) is minimal in the sense that the regularity conditions
on the Call option prices are necessary and sufficient: to be convex in the strike direction and non-decreasing in the
maturity direction. However, Roper assumed that the total variance was twice differentiable in the strike direction.
This regularity is certainly not required. Guo et al. [2012] showed that by starting from option prices convex in K,
their first derivative are defined almost everywhere, and so are those of the total variance (in K or η), since the Black-
Scholes mapping in total variance is smooth. As a result, one can assume that for any t, the function k → w(η, t) is
continuous and almost everywhere differentiable.
In addition, some other asymptotic behaviours of d1 and d2 hold in great generality. For instance, the Small-
Moneyness Behaviour (SMB) condition, proved by Rogers et al. [2010], states
Further, Rogers et al. [2010] imposed two further constraints on the LMB and SMB, obtaining the following Lemma:
Equivalently, using Equation (1.4.22), Guo et al. [2012] showed that the marginal law of the stock price at some fixed
time t > 0 has no mass at zero if and only if
43
which is a statement about a Small-Moneyness Behaviour. These conditions were then recasted in terms of implied
volatility providing a complete characterisation of an IVS free from static arbitrage.
2 CBS + xRe(t, T )
σ=√ N −1 ( )
T −t 2xRe(t, T )
where N −1 (•) the inverse standard normal distribution. In that setting, Brenner et al. [1988] used a first order
Taylor polynomial approximation for the inverse standard normal distribution around 12 , and obtained the volatility
approximation
44
r
2π CBS
σ≈
T − t xRe(t, T )
Later, considering a quadratic approximation to the implied volatility and using a higher order Taylor expansion on
the normal distribution function, Corrado et al. [1996b] extended that result to cases where stock prices deviate from
discounted strike prices. The quadratic equation is
q1 σ 2 + q2 σ + q3 = 0
where
q1 = (T − t)(xRe(t, T ) + K)
xRe(t, T ) − K p
q2 = − CBS 8π(T − t)
2
xRe(t, T )
q3 = 2 xRe(t, T ) − K ln
K
with solutions given by
p s
−q2 ± q22 − 4q1 q3 −q2 q22 q3
σ= = ± 2 −
2q1 2q1 4q1 q1
The largest root is
r
2π 1 h xRe(t, T ) − K
σ ≈ CBS −
T − t xRe(t, T ) + K 2
s
xRe(t, T ) − K 2 xRe(t, T ) i
+ CBS − − 2(xRe(t, T ) + K)(xRe(t, T ) − K) ln
2 K
Setting η = ln xRe(t,T
K
)
= 2 xRe(t,T )−K
xRe(t,T )+K
, the accuracy of the quadratic formula is significantly improved by minimising
its concavity. The resulting approximated volatility is
r s
2π 1 h xRe(t, T ) − K xRe(t, T ) − K 2 (xRe(t, T ) − K)2 i
σ≈ CBS − + CBS − −
T − t xRe(t, T ) + K 2 2 π
For short maturities and strikes away from the money, the terms inside the square root may become negative. The
approximation is accurate enough in the domain η ∈ [0.9, 1, 1] for maturities greater than three months, and it is
accurate in the domain η ∈ [0.95, 1, 05] if the maturity is longer than one month. Chance et al. [2014] considered the
same quadratic equation as above with
xRe(t, T ) + K
q1 =
2
xRe(t, T ) + K p
q2 = − CBS 2π(T − t)
2
xRe(t, T )
q3 = xRe(t, T ) + K ln
K
45
and showed that the larger root σ + corresponds to the solution proposed by Corrado et al. [1996b]. The smaller
root can be positive when q2 < 0, which is equivalent to xRe(t,T 2
)−K
< CBS . Isengildina-Massa et al. [2007]
studied various implied volatility approximations and concluded that the method of Corrado et al. yields relatively
more accurate results than other methods when call premiums are used. Increasing the range of the domain, Liang et
al. [2009] gave under certain conditions an approximation of the implied volatility with an error estimate for it. They
first averaged the Black-Scholes terms di for i = 1, 2, to d3 , and performed a Taylor expansions of N (d1 ) and N (d2 )
around d3 . That is, assuming d3 close to zero and |dξ3 | ≤ 1, or equivalently η ∈ [0.1, 2], they expanded the pricing
1
formula given in Equation (B.1.2), getting rid of the factor 1−η and decreasing the approximation error. Then, they
0
performed a Taylor expansions of N (d3 ) and N (d3 ) around zero and approximated the pricing formula in Equation
bBS (t, x, K, T ) = g3 (d3 (T − t, xC(t, T ), K)) where
(B.1.2) with C
1 4−m m 1
g3 (d3 (T − t, xC(t, T ), K)) = + √ d3 + √
2 4 2π 2 2π d3
The function g3 (•) being an approximation of g1 (•), its solution d˜3 , obtained by solving a quadratic equation, is itself
an approximation to d3 given by
√ √
− 2π(1−2CbBS (t,x,K,T ))− 2π(1−2CbBS (t,x,K,T ))2 −2m(4−m) if 0.1 < η < 1
d˜3 = √ √4−m
− 2π(1−2CbBS (t,x,K,T ))+ 2π(1−2CbBS (t,x,K,T ))2 −2m(4−m) if 1 < η < 2
4−m
Since σ = − √log η 1
T −t d3
the approximated implied volatility σ̃(T, K) is given by
log η 1
σ̃(T, K) = − √ (1.4.24)
T − t d˜3 (T − t, xC(t, T ), K)
This formula being an approximation to the implied volatility, it may not be accurate enough for certain strikes and
maturities. Therefore we choose to use the iterative algorithm discussed in Section (1.4.4.2), but with the first guess
given by Equation (1.4.24).
Therefore, the Black-Scholes formula is strictly increasing in the volatility. Hence, if an approximated solution exists,
it is unique by the strictly positive vega. Moreover, the Volga in Equation (B.1.8) shows that the function σ → CBS (σ)
is convex on the interval [0, σrf ] and concave on the interval [σrf , ∞], where
r
| − log η|
σrf = 2
T −t
is the reflection point of f (σ) derived by Manaster et al. [1982]. Therefore, the equation CBS (σ) = CM , where CM
the market call price in the range
46
σ0 = σrf
1
σn+1 = σn + (CM − CBS (σn )) max ( , 100)
V ega
where V ega = ∂σ CBS (σ), and max(., .) is used to avoid explosion when the vega gets too small, and σ0 is the first
guess of the algorithm since the series (σn ) ≥ 0 is monotonic. The NR algorithm terminates when the iterations
yield an estimated price that is within a preset tolerance of the market price. With initial guess σrf , one can show
that if the NR algorithm converges, it must converge to the correct solution. However, the algorithm itself can not
guarantee convergence (see Chance et al. [2014]). We can also calculate the second order derivative terms and use
2
a higher order iterative root finding algorithm such as the Halley algorithm. One can show that ∂ ∂σf (σ)
2 > 0 provided
2
that 0 < σ < σrf and that ∂ ∂σ f (σ)
2 < 0 provided that σ > σrf . Note, even if f (σ) is differentiable everywhere, the NR
method may not lead to a root. When the strike prices are far from the money or the price is close to the no-arbitrage
bounds, the vega becomes insignificant and the above ratio explode. In that case one must switch to the Bisection
method. Also, as discussed by Jackel [2006], this algorithm is very sensitive to its initial guess and the volatility can
easily be in the range [10−4 %, 1000%]. Recognising the near flat shape of the normalised Black function for small
volatilities, to enlarge the space domain in that region (σ < σ0 ), he performed a change of variable expressing the
prices in the logarithm space. In that setting the initial guess must be modified and asymptotic expansion is performed
on the prices around σ → 0 to calculate the new initial guess. A similar approach is used for σ → ∞ and interpolation
is performed between the two initial guesses. Alternatively, as explained in Section (1.4.4.1), we can estimate the
initial guess by Taylor expanding the normal cumulative integral in the Black-Scholes formula.
∂C(St )
= I{St ≥K}
∂St
∂ 2 C(St ) ∂H
= = δ(St − K)
∂St2 ∂St
0
where I{St ≥K} = H(St − K) is the Heaviside function and H = δ(St − K) is its derivative the Dirac function. We
can therefore calculate the prices of some special contingent claims via the derivation of the call price C(t, St , T, K)
as follow
∂C(t, St , T, K)
Digital = − = P (t, T )Et [I{ST ≥K} ]
∂K
∂ 2 C(t, St , T, K)
Density = = P (t, T )Et [δ(ST − K)]
∂K 2
47
1 ∂C(t, St , T, K)
P (ST ≥ K) = −
P (t, t) ∂K
and the cumulative distribution becomes
1 ∂C(t, St , T, K)
P (ST < K) = 1 + (1.5.25)
P (t, t) ∂K
As a result, the price of a call option can be expressed in terms of those quantities as
C(t, St , T, K) = P (t, T )Et [(ST − K)+ ] = P (t, T )Et [(ST − K)I{ST ≥K} ]
= P (t, T ) Et [ST I{ST ≥K} ] − KEt [I{ST ≥K} ]
which gives
C(t, St , T, K)
= Et [ST I{ST ≥K} ] − KEt [I{ST ≥K} ] = ∆S + K∆K (1.5.26)
P (t, T )
where ∆K is the probability that the stock price end up higher or equal to the strike price at maturity, and ∆S is a
modified probability that the stock price end up higher or equal to the strike price at maturity. Following Ingersoll
[1998] [2000], we call DB (S, t, T ; ξ) the Digital Bond and DS (S, t, T ; ξ) the Digital Stock for the event ξ. The former
is the value at time t of receiving one dollar at the maturity T if and only if a probabilist event ξ occurs, while the latter
is the value at time t of receiving one share of the stock at the maturity T (excluding any intervening dividends) if and
only if a probabilist event ξ occurs. In the special case where ξ = {ST > K}, we have DB (S, t, T ; ξ) = P (t, T )∆K
and DS (S, t, T ; ξ) = P (t, T )∆S . Geman et al. [1995] showed that ∆K = P (ST ≥ K) and ∆S = P S (ST ≥ K),
where the latter is a probability under the spot measure with the stock price as numeraire. We can rewrite ∆S as
C(t, St , T, K)
Et [ST I{ST ≥K} ] = + KEt [I{ST ≥K} ]
P (t, T )
From the definition of the digital option, the above expression becomes
C(t, St , T, K) 1 ∂C(t, St , T, K)
Et [ST I{ST ≥K} ] = −K
P (t, T ) P (t, T ) ∂K
and we see that the Digital Stock satisfies
1 C(t, St , T, K) K 1 ∂C(t, St , T, K)
∆S = −
P (t, T ) St St P (t, T ) ∂K
which can be statically replicated with a call option and a call-spread. Therefore, the pricing of other European
derivatives with piecewise linear and path-independent payoffs only requires valuing Digital Bond and Digital Share
with event ξ = {L < ST < H} for some constants L and H. For example, the call option price is
48
RT
DB (S, t, T ; ξ) = E Q [e− t
rs ds
IST >K |Ft ] = e−r(T −t) N (d2 (T − t, St e(r−q)(T −t) , K))
RT
DS (S, t, T ; ξ) = E Q [e− t
rs ds
ST IST >K |Ft ] = St e−q(T −t) N (d1 (T − t, St e(r−q)(T −t) , K))
where
1 x 1 √ √
d2 (t, x, y) = √ log − σ t and d1 (t, x, y) = d2 (t, x, y) + σ t
σ t y 2
However, when the volatility of the stock price is stochastic, and more generally when the instantaneous volatility
of the stock price, the spot rate and repo rate are stochastic, one can no-longer use the Black-Sholes formula. Under
general Markov processes for the model parameters, the conditional probabilities of the Digital Bond and Digital Share
are difficult to solve analytically under any probability measure, and numerical tools must be used.
49
100
Call and its Approximation
90
80
70
60
50
Call
Call Approx
40
30
20
10
0
0 20 40 60 80 100 120 140 160 180 200
-10
Figure 1.2: The call price and its linear approximation vs strikes.
50
80
70
60
50
Put
40 Put Approx
30
20
10
0
0 20 40 60 80 100 120 140 160 180 200
-10
Figure 1.3: The put price and its linear approximation vs strikes.
51
∂ ∂ ∂ ∂
D(K, T ) = − CBS (K, T ; ΣBS (K, T )) = − CBS (K, T ; ΣBS ) − CBS (K, T ; Σ(K, T )) Σ(K, T )
∂K ∂K ∂Σ ∂K
We can express the Digital option in terms of the Vega and the Skew as
∂
D(K, T ) = − CBS (K, T ; ΣBS ) − V ega(K, T )Skew(K, T ) (1.5.28)
∂K
where V ega(K, T ; ΣBS (K, T )) is the Black-Scholes vega for the strike K and maturity T , and ∂K CBS (K, T ; ΣBS )
is the BS digital price for the volatility ΣBS (K, T ). √
In the special case where r = q = 0, T = 1 and for S0 = 100
and K = 100, then η = 1 and we get d2 = − 21 ΣBS T . Given a skew of 2.5% per 10% change in the strike and an
ATM volatility ΣAT M = 25%, the Digital option simplifies to
Ignoring the skew, the price is 45% of notional which is significantly lower than 55% of notional when the skew is
included.
1 ∂2
φ(t0 ; T, K) = C(K, T ) (1.5.29)
P (t0 , T ) ∂K 2
Hence, for any time T one can recover the marginal risk-neutral distribution of the stock price from the volatility
surface. However, it tells us nothing about the joint distribution of the stock price at multiple times T1 , .., Tn . This
is because the volatility surface is constructed from European options prices which only depend on the marginal
distribution of ST .
52
SBS (t, x, K, T ) = xe−q(T −t) 2N (d1 (T − t, F (t, T ), K)) − 1 − Ke−r(T −t) 2N (d2 (T − t, F (t, T ), K)) − 1
Further, we can compute the limit case of a long straddle when the volatility tends to zero as
+ +
lim SBS (t, x, K, T ) = xe−q(T −t) − Ke−r(T −t) + Ke−r(T −t) − xe−q(T −t)
σ→0
which gives
lim SBS (t, x, K, T ) = Ke−r(T −t) − xe−q(T −t) if xe−q(T −t) < Ke−r(T −t)
σ→0
0 if xe−q(T −t) = Ke−r(T −t)
Hence, we can rewrite the limit of a long straddle when the volatility tends to zero as
which is the absolute value of the payoff. When the volatility tends to infinity, we get
Assuming that the options in the straddle are ATM forward, we use Equation (1.2.12) and get
√
SBS (t, x, K, T ) K=F (t,T )
≈ 0.8xe−q(T −t) σ T − t
From the above equation we can easily compute the delta of an ATM forward straddle option as
√
∆S (t, x, K, T ) K=F (t,T )
≈ 0.8e−q(T −t) σ T − t
Straddles are a good strategy to pursue if an investor believes that a stock’s price will move significantly, but is unsure
as to which direction it will go. The stock price must move significantly if the investor is to make a profit. This
position has a limited risk, since the purchaser of the strategy may lose at most the cost of both options, and there is
unlimited profit potential. However, when shorting a straddle the profit is limited to the premium received from the
sale of put and call. The risk is virtually unlimited as large moves of the underlying security’s price either up or down
will cause losses proportional to the magnitude of the price move. A maximum profit upon expiration is achieved if
the underlying security trades exactly at the strike price of the straddle. In that case both puts and calls comprising
the straddle expire worthless. Note, on stocks that are expected to jump, the market tends to price options at a higher
premium, which ultimately reduces the expected payoff should the stock move significantly.
53
+ +
lim SbBS (t, x, K1 , K2 , T ) = K1 e−r(T −t) − xe−q(T −t) + xe−q(T −t) − K2 e−r(T −t)
σ→0
which gives
lim SbBS (t, x, K1 , K2 , T ) = K e−r(T −t) − xe−q(T −t) if xe−q(T −t) < K1 e−r(T −t)
σ→0 1
0 if K1 e−r(T −t) ≤ xe−q(T −t) ≤ K2 e−r(T −t)
and when the volatility tends to infinity
Assuming that the options in the strangle are close to ATM forward, with K1 ≈ F (t, T ) and K2 ≈ F (t, T ), we use
Equation (1.2.11) and get
1 1 √ 2 √ 1 1 √
SbBS (t, x, K1 , K2 , T ) ≈ K1 e−r(T −t) + σ T − t + xe−q(T −t) σ T − t − K2 e−r(T −t) − σ T − t
2 5 5 2 5
Setting K1 = K2 = F (t, T ), we recover the formula, SBS (t, x, K, T ) K=F (t,T ) , for the ATM forward straddle. In
general, the strategy involves buying an out-of-the-money call and an out-of-the-money put option so that its intrinsic
value is zero and stays there if the underlying asset hardly moves. A strangle is generally less expensive than a straddle
as the contracts are purchased out of the money. The owner of a long strangle makes a profit if the underlying price
moves far enough away from the current price, either above or below.
which gives
54
(K2 − K1 )e−r(T −t) if xe−q(T −t) ≥ K1 e−r(T −t) and xe−q(T −t) ≥ K2 e−r(T −t)
lim CSBS (t, x, K1 , K2 , T ) =
σ→0 0 if xe−q(T −t) < K1 e−r(T −t) and xe−q(T −t) < K2 e−r(T −t)
and
which gives
(K2 − K1 )e−r(T −t) if xe−q(T −t) ≤ K1 e−r(T −t) and xe−q(T −t) ≤ K2 e−r(T −t)
lim P SBS (t, x, K1 , K2 , T ) =
σ→0 0 if xe−q(T −t) > K1 e−r(T −t) and xe−q(T −t) > K2 e−r(T −t)
and
lim P SBS (t, x, K1 , K2 , T ) = K2 e−r(T −t) − K1 e−r(T −t) = (K2 − K1 )e−r(T −t)
σ→∞
Note, in the special case where xe−q(T −t) < K1 e−r(T −t) and xe−q(T −t) < K2 e−r(T −t) then the limit of the call-
spread when σ → 0 and σ → ∞ is zero. Similarly, the limit of the put-spread when σ → 0 and σ → ∞ is the
difference in strike discounted, (K2 − K1 )e−r(T −t) , when xe−q(T −t) ≤ K1 e−r(T −t) and xe−q(T −t) ≤ K2 e−r(T −t) .
That is, stressing volatility values for out-of-the-money (OTM) call-spreads in the BS-formula, we observe that
while stressing volatility values for in-the-money (ITM) put-spreads in the BS-formula, we get
• put option premium converges to discounted strike when volatility is very low.
• put option premium converges to the strike price when volatility is very high (infinity).
Since the call-spread and put-spread are continuous and differentiable with respect to volatility, we can apply the
Theorem of de Rolle (B.5.1) and deduce an optimum when the vega equals zero. We plot in Figure ( 1.4) an OTM
call-spread versus different level of volatility with S0 = 100, T = 0.5, r = 0.03 together with strikes K1 = 130 and
K2 = 140. We plot in Figure ( 1.4) its associated vega. We plot in Figure ( 1.5) an ITM put-spread versus different
level of volatility with S0 = 100, T = 0.5, r = 0.03 together with strikes K1 = 130 and K2 = 140. We also plot in
Figure ( 1.5) its associated vega.
55
Call_Spread CS_Vega
2.5 6
5
2
1.5
Call_Spread 3
1 CS_Vega
0.5
0
0 1 2 3 4 5 6 0
0 1 2 3 4 5 6
-1
56
Put_Spread PS_Vega
12 1
10 0
0 1 2 3 4 5 6
8 -1
6
Put_Spread -2
PS_Vega
4
-3
2
-4
0
0 1 2 3 4 5 6 -5
-6
57
1 1 √
= CBS (t, x, K1 , T ) − CBS (t, x, K2 , T ) ≈ (K2 − K1 )e−r(T −t)
CSBS (t, x, K1 , K2 , T ) − σ T −t
2 5
1 1 √
βxe−q(T −t)
= − σ T −t
2 5
where β ≈ 2. Since in a no-arbitrage world we have CBS (t, x, K1 , T ) ≥ CBS (t, x, K2 , T ), we must have
5 √
≥σ T −t
2
Assuming now that we have some skew, such that σ1 > σ2 then we approximate the call spread as follow
1 1 √ 1 1 √
CSBS (t, x, K1 , K2 , T ) ≈ xe−q(T −t)+ σ1 T − t − K1 e−r(T −t) − σ1 T − t
2 5 2 5
1 1 √ −r(T −t) 1 1 √
−xe−q(T −t)
+ σ2 T − t − K2 e + σ2 T − t
2 5 2 5
−q(T −t) 1
√ √ 1 1 √ 1 1 √
σ1 T − t − σ2 T − t + K2 e−r(T −t) − σ2 T − t − K1 e−r(T −t) − σ1 T − t
= xe
5 2 5 2 5
Replacing Ki for i = 1, 2 as above, we get
1 √ √ 1 1 √
CSBS (t, x, K1 , K2 , T ) ≈ xe−q(T −t) σ1 T − t − σ2 T − t + α2 xe−q(T −t) − σ2 T − t
5 2 5
1 1 √
−α1 xe−q(T −t)
− σ1 T − t
2 5
which simplifies to
1 1h √ √ √ √ i
CSBS (t, x, K1 , K2 , T ) ≈ (α2 −α1 )xe−q(T −t) +xe−q(T −t)
σ1 T − t−σ2 T − t + α1 σ1 T − t−α2 σ2 T − t
2 5
σ2 = Rα σ1
58
where Rα = (1 + α) and α is a percentage. In that case, differentiating the call spread with respect to σ1 , we get
∂ ∂ ∂ dσ2
CSBS (t, x, K1 , K2 , T ; σ1 ) = CBS (t, x, K1 , T ; σ1 ) − CBS (t, x, K2 , T ; σ2 )
∂σ1 ∂σ1 ∂σ2 dσ1
dσ2
with dσ1
= Rα . Adding a skew of −0.1 to the eamples above, we plot in Figure ( 1.6) the call-spread and its associated
vega in Figure ( 1.6). We also plot in Figure ( 1.7) the put-spread and its associated vega in Figure ( 1.7).
59
Call_Spread CS_Vega
8 10
6
5
4
Call_Spread
4
CS_Vega
3
1 0
0 1 2 3 4 5 6
0
0 1 2 3 4 5 6
-2
-4
Figure 1.6: Call-spread and its vega with with −0.1 skew.
60
Put_Spread PS_Vega
12 4
2
10
0
0 1 2 3 4 5 6
8
-2
PS_Vega
6
Put_Spread
-4
4
-6
2 -8
-10
0
0 1 2 3 4 5 6
Figure 1.7: Put-spread and its vega with with −0.1 skew.
61
62
opposite signs, and if the portfolio is a mixture of such positions, the deltas will at least partially cancel each other.
v(t, x, r, q; T, K; σ)
where the underlying asset (Stx )t≥0 and the time t are state variables, the risk-free rate r, the repo rate q and the
volatility σ are fixed parameters of the model, the maturity T and the strike K are fixed parameters of the contingent
claim. The sensitivities of a call price with respect to these parameters play a fundamental role in the risk management
of derivative products. For instance, a call option with maturity T and strike K can be hedged with the quantity
e−q(T −t)
∂xx C = Γ= √ n(d1 ) > 0
xσ T − t
√
∂σ C = V ega = xe−q(T −t) T − tn(d1 )
∂K C = −e−r(T −t) N (d2 )
∂r C = ρ = (T − t)Ke−r(T −t) N (d2 ) > 0
xσ
∂t C = Θ=− √ e−q(T −t) n(d1 ) + qxe−q(T −t) N (d1 ) − rKe−r(T −t) N (d2 )
2 T −t
and the Black-Scholes partial differential equation (PDE) can be written as
1 2 2
σ x Γ + (r − q)x∆ − rC + Θ = 0 (1.6.30)
2
63
Since we can express the Gamma in terms of the Vega (see Equation (B.1.6)), we can rewrite the PDE as
σ
V ega + (r − q)x∆ − rC + Θ = 0
2T
While ∆(t, St ) is a quantity of the risky asset to buy or sell when building the replicating portfolio, the other Greeks are
not easily interpreted as financial quantities and must therefore be modified for risk management purposes. Assuming
constant volatility, the dynamics of the stock price in discrete time is given by
√
∆St = µSt ∆t + σSt y ∆t
where ∆St is the variation of the stock price in the time interval ∆t, and y ∼ N (0, 1). As a result, ∆S
St follows a
t
√ √
normal distribution with mean µ∆t and standard deviation σ ∆t. That is, ∆S St ∼ N (µ∆t, σ ∆t). We let S
t +
=
+
S + α where α = S with << 1, and we define a change in spot as ∆S = S − S = α. In general, the market
assumes = 1%. Using finite difference, we can approximate the Delta with a one-sided finite difference as
C(S + ) − C(S)
∆≈
α
In order to include some convexity in the delta, by considering the difference in value between an option after an
upward bump and one after a downward bump, we can use a two-sided finite difference as
C(S + ) − C(S − )
∆≈
2α
In the case of the one-sided delta we can deduce that
C(S + ) ≈ α∆ + C(S)
We see that we can obtain a modified Delta as ∆
b = α∆ such that
C(S + ) ≈ ∆
b + C(S)
∆(S + ) − ∆(S)
Γ≈
α
and deduce that
∆(S + ) ≈ αΓ + ∆(S)
We see that we can obtain a modified Gamma as Γ
b = αΓ such that
∆(S + ) ≈ Γ
b + ∆(S)
where we get the new Delta simply by adding Γb to the old Delta. In other words, if the spot price jumps from S to S + ,
the trader must immediately modify his position in the risky asset by buying the quantity Γb (or selling that quantity if
Γ
b < 0) of the risky asset. Hence, the modified Gamma corresponds to a change in the hedging strategy induced by
a sudden jump in the underlying. We now let σ + = σ + α where α = σ with << 1, and we define a change in
volatility as ∆σ = σ + − σ = α. Using one-sided finite difference, we can approximate the Vega as
C(σ + ) − C(σ)
V ega ≈
α
64
Since volatility is already expressed in percentage, market practice is to directly set α = 1% and obtain the modified
Vega as Vb ega = αV ega such that
b = Θ × ∆t
Θ
1
where ∆t = 365 .
When risk managing options, traders need to estimate their risk over a short time period. Relaxing the constraint of
constant volatility and setting V = V (t, S, σ), where σ represents a stochastic volatility, we apply a Taylor expansion
on ∆V with respect to the time, the stock and the volatility, getting the variation
65
ΠB = C(St , K2 ) + P (St , K2 ) − P (St , K1 ) − C(St , K3 ) − ∆C2 + ∆P2 − ∆P1 − ∆C3 St (1.6.34)
νc2 νc
ΠB = C(St , K2 ) + P (St , K2 ) − 3 P (St , K1 ) − C(St , K3 )
νp2 νp1
νc2 νc3
− ∆C2 + ∆P2 − ∆P1 − ∆C3 St
νp2 νp1
66
and so any attempt to hedge can only be a true hedge for an arbitrarily small time interval. As such, ones position must
be dynamically adjusted to account for movements in the value of the Greeks. Whilst high-frequency trading systems
may be capable of maintaining neutral positions to a reasonable accuracy, the impact of transactions costs and bid-ask
spreads makes continuous adjustment infeasible in practice. It is instead standard for traders to adjust their positions
daily and discretely approximate the continuous hedge. Others maintain a Delta within a certain interval, adjusting
their hedge when the Delta exists this region, while some will hedge dependent upon the moving of the underlying. In
order to mitigate the risks of inaccurate hedging, second moments are often considered, the most common including
Gamma, Vanna, Vomma and Charm. Gamma hedging for instance, can reduce the impact that movements in the
underlying will have on a positions delta, and hence reduce the inaccuracy of the Delta-hedge. In the case of the
skew trade above, ∆P increases more steadily than ∆C when the underlying decreases (and vice versa as it increases),
meaning a bullish market favours the position, whilst a bearish market is unfavourable. Hence, if the Delta-hedged
position is left too long without re-balancing, the skew trade remains exposed to movements in the underlying asset.
Going around, we can locally approximate the change in value of an option with the Taylor expansion given in Equation
(1.6.32), and the VaR becomes concerned with the calculation of the Greeks (see Kwiatkowski [1997]). However,
as discussed in Section (4.1.3), the Greeks are model dependent and we need to make some assumptions about the
dynamics of the implied volatility surface. In general, assuming Gaussian risk factors, one can compute analytically
the VaR for some asset class. Alternatively, we can use historical changes in risk factors getting historical VaR method,
or we can simulate the variations of the risk factors from a fully calibrated model, getting Monte Carlo VaR method.
It is very time consuming as the derivative prices must be evaluated for each simulation path. Even though complex
derivatives need to be priced numerically, for vanilla options we can rely on the characteristic function method or some
expansions.
∆V = β1 X1 + ... + βn Xn (1.6.36)
67
where βi is the direct exposure of the ith risk factor. It represents the dollar change in the market value of the portfolio
in response to a unit change in the ith risk factor. Thus, the variance of the change in value of the portfolio is given by
n
X
σP2 = V ar(∆V ) = βi βj Cij
i,j=1
In the special case where the factors (Xi )i=1,..,n are jointly normally distributed, then the VaR at 99% confidence
level is the 99-percentile change for a normally distributed random variable V aR(∆V ) = zp σP with zp = 2.33.
Irrespective of the probability distribution function (pdf), by simulating the factors (Xi )i=1,..,n we can simulate the
total unexpected change in market value ∆V and estimate the VaR as the level of loss exceeded by a fraction of
simulated outcomes of ∆V .
There exists several approaches for simulating the risk factors, one of which being to assume a stationary statistical
environment and simulate underlying prices in an historically realistic manner by bootstrapping from historical data.
Considering historical returns as the source of simulated returns we can capture the correlations, volatilities, tail fatness
and skewness present in the data without assuming a model. However, market price changes and historical returns are
significantly non-stationary in terms of volatilities and correlations. It was suggested to update the historical return
distribution by for instance drawing from the returns (R̂i )i=1,..,n defined by
V̂
R̂i = Ri
V
where V is the historical volatility and V̂ is a recent volatility estimate. Similar approaches were proposed for correla-
tions, even though correlation estimates are relatively unstable. In general, the change in market value of an option is
non-linear in the factor Xi such that the parameter βi is not a constant as time passes. Nonetheless, when dealing with
options it is assumed that for small changes in the underlying the delta approach is sufficiently accurate, such that we
get the approximation
∆V ≈ ∆1 X1 + ... + ∆n Xn
where ∆i is the delta of the total portfolio with respect to the ith factor. That is, assuming k options on the same
underlying S with prices P1 , .., Pk , the delta of the portfolio satisfies
d ∂ ∂
[P1 (S) + ... + Pk (S)] = P1 (S) + ... + Pk (S)
dS ∂S ∂S
which is the sum of the individual deltas. Improving on the accuracy, the same method can be applied with the gamma
of an option. In the case of a portfolio exposed to several underlying assets, the delta-gamma approximation of the
market value of the portfolio can be computed in terms of the deltas and gammas of the book with respect to each
underlying asset and each pair of underlying assets. The (i, j)-gamma of the portfolio for the pair (i, j) is merly the
sum of the (i, j)-gammas of all individual positions. Combining all deltas and gammas the total change in value of
the portfolio is approximated as
n n n
X 1 XX
∆Vt = ∆j Xj + Γjk Xj Xk
j=1
2 j=1
k=1
n
X X 1 X
V ar(∆Vt ) = ∆j ∆k Cov(Xj , Xk ) + ∆i Γjk Cov(Xi , Xj Xk ) + Γjk Γkl Cov(Xi Xj , Xj Xk )
4
j,k i,j,k i,j,k,l
68
In the case of normal returns (Xi )i=1,..,n the covariance terms involving the products Xi Xj for i, j = 1, .., n can be
computed explicitly.
Using the BS-formula, we can calculate the value of the portfolio as a function of the underlying asset price.
∆Vt St
≈ ∆S RS
Vt Vt
with variance given by
∆Vt St 2
V ar( )≈ ∆S V ar(RS )
Vt Vt
Thus, for small changes in the spot price, we approximate the change in market value of an option as that of a fixed
position in the underlying whose size is the delta of the option. Since the delta ∆S is a constant, the potential profit
and loss ∆Vt is a linear function of the changes in S, and the worst loss for V can be obtained from the worst loss
for S. Thus, V aR(∆V ) = |∆S | × V aR(∆S). Even though the delta-based approximation is fairly accurate for
short periods of time, VaR is typically concerned with large changes. Fom the convexity of the BS-formula, we see
that this approach over-estimates the loss on a long option position and under-estimates the√loss for a short option
position. Assuming that RS = ∆S S is √ a Gaussian variable with mean zero and volatility σS ∆t, then the volatility
of the portfolio becomes σP = ∆S σS ∆tSt . Note, we recover the volatility of the BS-price dynamics obtained by
applying Ito’s lemma to the option price and given in Equation (1.2.5). Hence, the VaR follows as
69
∂V 1 ∂2V 1
∆Vt = ∆St + (∆St )2 = ∆S ∆St + Γ(∆St )2
∂S 2 ∂S 2 2
Since the delta ∆S and gamma Γ are constants, we can obtain the variance of the portfolio return as
1
V ar(∆Vt ) = ∆2S V ar(∆St ) + Γ2 V ar((∆St )2 ) + ∆S ΓCov(∆St , (∆St )2 )
2
In the case of Gaussian returns, we canpcompute explicitly the moments of ∆St and obtain a VaR with a 99% confi-
dence level given by V aR(∆Vt ) = zp V ar(∆Vt ) with zp = 2.33. Alternatively, the worst loss of the spot price is
∆St = −zp σSt , so that the change in value of the portfolio becomes
1
∆Vt = −∆S (zp σSt ) + Γ(zp σSt )2
2
and we get the VaR
1
V aR(∆Vt ) = |∆S | × zp σSt − Γ(zp σSt )2
2
Note, the approximation in the delta-gamma method corresponds to a parabolic function over-estimating long option
positions resulting in an under-estimated VaR. Further, it assumes that the underlying risk and its square are joint
normally distributed, which is not the case in practice. The underlying risks having non-normal distributions, some
authors proposed closer approximations to the true distribution (see Britten-Jones et al. [1997]). The Greeks being
model dependent and very sensitive to the dynamics of the volatility surface (see Section (4.1.3)), we should include
the additional terms defined in Equation (1.6.31) when calculating the VaR. However, it vould implies a model for the
dynamics of the IV surface.
70
and rewriting the Theta in terms of the Gamma, the total variation of the portfolio becomes
1 1 ∆St 2 1 ∆St 2
∆V (t) = ∆γ ∆St + St2 γ 2 Γγ ∆t − ∆γ ∆St − St Γγ ( ) = St2 Γγ γ 2 ∆t − ( )
2 2 St 2 St
and the worst case happen when the trader is under-hedged
i j
σi σj ρij , such that given ∆St = St RS we get Ωt,ij = St St σi σj ρij . As a result, the variation of the portfolio is
approximated by
N X
d
X ∂P i
∆V ≈ ωi ∆S j
i=1 j=1
∂S i
71
72
RT
C(t, St ) = E Q [e− t
rs ds
h(ST )|Ft ] (1.7.38)
Z ∞
= P (t, T ) h(x)φS (t, x)dx
−∞
where h is a sufficiently smooth payoff function and φS (t, ·) is a continuous density function. Furthermore, we assume
that these price functions are of class C 2 , such that for T > t the law of ST knowing St = x has a density φ(T, K)
given by
1
φ(T, K) = EtQ [δ(ST − K)] = RT ∂KK C(t, St ; K, T ) (1.7.39)
EtQ [e− t
rs ds
|ST = K]
where δ(·) is the Dirac function.
73
exists a Markov process X̃t with the same marginal distributions as those of our original process. That process follows
the dynamics
dX̃t
= b(t, X̃t )dt + v(t, X̃t )dW̃ (t)
X̃t
where at time t0 < t
are respectively the local drift and local variance of the one-dimensional process. Note, they are still deterministic
functions of time t and spot value x even though the instantaneous volatility and instantaneous drift are stochastic.
Since the marginal distributions of our model are matched to those of a simple Markov process, we can use this
approach to calibrate the parameters of a high-dimensional dynamics to European market prices. Therefore, given the
one-dimensional underlying process St defined in Equation (1.7.37), the local drift and variance are given by
where S is an arbitrary realisation of St for t ≥ 0. Using Bayes relationship, the local variance becomes
dS̃t
= µt dt + σLV (t, S̃t )dW̃S (t)
S̃t
with
2
σLV (t, S) = EtQ0 [σ(t, ω)σ(t, ω)> |St = S] (1.7.40)
which is sometime called the stochastic local variance (SLV) (see Alexander et al. [2004]). Hence, the process S̃t
has the same marginal distributions as those of the process St for all t ≥ 0. However, St and S̃t do not follow the
same dynamics, resulting in different exotic options prices as well as hedge ratios. In the special case where the
instantaneous volatility is a deterministic function of time t and the spot price St , then the local volatility is equal to
the instantaneous volatility, that is,
2
σDLV (t, S) = σ(t, S)σ(t, S)>
which we call the deterministic local volatility (DLV). It is the only case where the processes St and S̃t follow the
same dynamics. Dupire [1994] showed that when n = 1, the deterministic local volatility can be estimated from
74
market call and put prices of all strikes and maturities. He considered call prices as function of strike K and maturity
T and solved the forward PDE
1 2
∂T C(T, K) = σ (T, K)K 2 ∂KK C(T, K) − µT K∂K C(T, K) (1.7.41)
2 DLV
with initial condition C(t0 , St0 , K, t0 ) = (St0 − K)+ , obtaining a unique solution to the function of volatility
This result is crucial because it allows perfect fit to the market prices leading to a complete modification of the equity
market.
Remark 1.7.1 The Black-Scholes implied volatility is no longer the only form of observable volatility, and one can
directly use the local volatility for calibration.
So, in principle one can directly diffuse the process St with stochastic instantaneous volatility provided that its local
variance matches that of the DLV
2 2
σLV (t, S) = σDLV (t, S) (1.7.43)
That is, we may impose any dynamics such that the above equality holds and the local variance stays non-negative.
This is very appealing because we get perfect fit to the market prices, but it is difficult to implement in practice.
75
where the local volatility σLV (ti+1 , ·) now appears on both side of the equation. It is usually solved with a fixed point
algorithm. Starting at time t0 a first guess is chosen σLV (t0 , ·) and then simulating the underlying process between
the times ti and ti+1 the local volatility σLV (t1 , ·) is solved. Then, we iterate the process for all time ti , i = 1, ..., n.
In the special case where we assume that the process Sti+1 is only a function of σ(ti , ·), then simplifications can be
made. For example if we discretise Sti+1 with the Euler scheme
p
Sti+1 = Sti+1 1 + σLV (ti , Sti ) Vti (Wti+1 − Wti )
the local variance simplies to
2
2 σDLV (ti+1 , K)
σSLV (ti+1 , K) = Q
E [Vti+1 |Sti+1 σLV (ti , ·) = K]
and we no-longer need to iterate to solve for the local variance. This simplification is called the practical method.
Note, a solution may not exist, leading the algorithm to diverge.
2 Σ2 (T, K) + 2Σ(T, K)(T − t) ∂T Σ(T, K) + (rT − qT )K∂k Σ(T, K)
σ (T, K) =
2d1 √ d1 d2
1 + Σ(T, K)(T − t)K 2 KΣ(T,K) T −t
∂K Σ(T, K) + ∂ KK Σ(T, K) + Σ(T,K) (∂K Σ(T, K)) 2
(1.7.44)
or we can factorise the numerator with Σ(T, K)(T − t). The implied volatility being a function of the strike K and the
maturity T , we use the chain rule to express the derivatives of the market prices in terms of the Black-Scholes Greeks.
Concentrating on the density CKK (t, St , T, K) (see Equation (1.7.39)), we get
Σ(T, K)(T − t) √ 2
K2 CKK (t, St , T, K) = 1 + Kd1 T − t∂K Σ(K, T )
V ega(K, T )
√
K Σ(K, T )(T − t) ∂KK Σ(K, T ) − d1 (∂K Σ(K, T ))2 T − t
2
+
Quantitatively speaking, the implied risk-neutral probability density function (pdf) in the local volatility model is given
by
2 2 !!
√ ∂2Σ √
1 ∂Σ ∂Σ
p(d1 ) = √ 1 + ST d1 T −t + ST2 (T − t)Σ − d1 T −t
ST Σ T − t ∂K ∂K 2 ∂K
76
∂Σ(T,K)
is a quadratic function in d1 . Note, the risk-neutral density is a function of the skew ∂K and the curvature
2
∂ Σ(T,K)
∂K 2 . However, for the density to be valid we must impose some constraints on the shape of the skew and the
curvature. We can easily observe that if the quantity
2
∂2Σ √
∂Σ
− d1 T −t (1.7.47)
∂K 2 ∂K
∂Σ
is too negative, which happens when the slope of the smile ∂K is too high, the pdf can become negative. This means
that the European call prices are not the result of the expectancy under the risk-neutral measure of future pay-off and
∂Σ
therefore allow for arbitrage opportunities 4 . Note, we can rewrite the risk-neutral density in terms of the skew ∂K
2
∂ Σ
and the curvature ∂K 2 as follow
1 2d1 ∂Σ √ 1 ∂Σ 2 √ ∂2Σ
− ST T − t T − td1 − d21 (
√
p(d1 ) = √ + ) + ST T − t (1.7.48)
ST Σ T − t Σ ∂K Σ ∂K ∂K 2
In the case where we model the logarithm of the stock price, we denote the implied volatility as Σ(ek , T ) and get the
local volatility as
Σ(T, ek )(T − t)
∂kk C(t, St , ek , T ) − ∂k C(t, St , ek , T )
V ega(K, T )
2d1 d1 d2
= 1 + Σ(T, ek )(T − t) k k k 2
√ − 1 ∂k Σ(T, e ) + ∂kk Σ(T, e ) + (∂k Σ(T, e ))
Σ(T, ek ) T − t Σ(T, ek )
Σ(T, ek )(T − t)
∂kk C(t, St , ek , T ) − ∂k C(t, St , ek , T )
V ega(K, T )
√
k
2d1 k k d21 − d1 Σ(T, ek ) T − t k 2
= 1 + Σ(T, e )(T − t) √ − 1 ∂k Σ(T, e ) + ∂kk Σ(T, e ) + (∂ k Σ(T, e ))
Σ(T, ek ) T − t Σ(T, ek )
Σ(T, ek )(T − t)
∂kk C(t, St , ek , T ) − ∂k C(t, St , ek , T )
V ega(K, T )
√ 2 √
= 1 + d1 T − t∂k Σ(T, ek ) + Σ(T, ek )(T − t) ∂kk Σ(T, ek ) − ∂k Σ(T, ek ) − d1 T − t(∂k Σ(T, ek ))2
√
The Vega for the logarithm of the stock is given by V ega = ex Re(t, T ) T − tφSN (d1 ), so that the density becomes
4 In reality the slope of the smile can be higher than the one calculated using the risk-neutral measure as the market is incomplete. However we
77
p(d1 ) =
S √ 2 √
√T 1 + d1 T − t∂k Σ(T, ek ) + Σ(T, ek )(T − t) ∂kk Σ(T, ek ) − ∂k Σ(T, ek ) − d1 T − t(∂k Σ(T, ek ))2
Σ T −t
We can rewrite the risk-neutral density for the logarithm of the spot price in terms of skew ∂k Σ(T, ek ) and curvature
∂kk Σ(T, ek ) as follow
S d1 √
√T − T − t ∂k Σ(T, ek )
p(d1 ) = + ST 2 k
Σ T −t Σ(T, e )
√ d21 √ √
− d1 T − t (∂k Σ(T, ek ))2 + ST T − t∂kk Σ(T, ek )
+ ST T − t k
Σ(T, e )
Z +∞
= e−r(t1 −t0 ) E (St1 − K)+ |Ft1 = e−r(t1 −t0 )
C(t, K, t1 − t0 ) (x1 − K)φ(x1 , t1 , St0 , t0 )dλ(x1 )
K
Z +∞
∂C
(t, K, t1 − t0 ) = −e−r(t1 −t0 ) φ(x1 , t1 , St0 , t0 )dλ(x1 )
∂K K
∂2C
(t, K, t1 − t0 ) = e−r(t1 −t0 ) φ(K, t1 , St0 , t0 )
∂K 2
By definition C(t, K, t1 − t0 ) = CBS (St0 ; K, t1 − t0 ; r, Σ(St0 ; K, t1 − t0 )), such that applying the chain rule we have
Definition 1.7.1 (Kolmogorov-Compatibility) Any future deterministic conditional density, or smile surface, such
that
Property 1.7.1 Given a current admissible5 smile surface, if the future smile surface is Kolmogorov-compatible no
model-independent strategy can generate arbitrage profits.
5 i.e. ∂Call(t,K,T −t) ∂ 2 Call(t,K,T −t) ∂Call(t,K,T −t) ∂P ut(t,K,T −t)
such that the associated call prices sarisfy ∂K
< 0, ∂K 2
> 0, ∂T
> 0, ∂K
> 0,
Call(t, K, T − t)|K=0 = St and limK→∞ Call(t, K, T − t) = 0
78
σL (S) = σ0 + βS , ∀t
and integrating, the implied volatility becomes
β
Σ(S, K) ≈ σ0 + (S + K)
2
which can be rewritten in terms of the local volatility as
β
Σ(S, K) ≈ σL (S) +
(K − S)
2
More formally, we can use the local volatility in Equation (1.7.44) to understand its relation with the implied volatility.
First, factorising the numerator with Σ(T, K)(T − t) and re-arranging the denominator, the local volatility becomes
Σ(T,K)
2 T −t + 2 ∂T Σ(T, K) + (rT − qT )K∂k Σ(T, K)
σL (T, K) =
1 2d1 √ d1 d2
K2 Σ(T,K)(T −t)K 2 + ∂ Σ(T, K)
KΣ(T,K) T −t K
+ ∂KK Σ(T, K) + Σ(T,K) (∂K Σ(T, K))
2
Noting that
6 Here it is understood as (T, K) → Σ̃(T, η).
79
1 1 2d1 1 1 2
2
+ √ ∂K Σ(T, K) + d1 d2 (∂K Σ(T, K))2 = √ + d1 ∂K Σ(T, K)
Σ(T, K) (T − t)K K T −t Σ(T, K) K T − t
√
− d1 T − t(∂K Σ(T, K))2
we recover
Σ(T,K)
2 T −t + 2 ∂T Σ(T, K) + (rT − qT )K∂K Σ(T, K)
σL (T, K) = 2 √
1 √1
K2 Σ(T,K) K T −t + d1 ∂K Σ(T, K) − d1 T − t(∂K Σ(T, K))2 + ∂KK Σ(T, K)
Assuming no interest rates and no repo rates, we let the implied volatility be indenendent from the strike K. The local
volatility simplifies to
2
σL (T, K) = Σ2 (T, K) + 2Σ(T, K)(T − t)∂T Σ(T, K)
2
Setting τ = T − t, it can be written as σL (τ ) = ∂τ Σ2 (τ )τ , so that
Z τ
τ Σ2 (τ ) = 2
σL (u)du
0
which expresses the total variance as an average of forward variances.
Next, we let the implied volatility be independent from maturity, and assume a weak linear dependence of the skew
on K, such that ∂KK Σ(T, K) and ∂K Σ(T, K) are negligible. The local volatility is approximated by
2 Σ2 (K)
σL (T, K) ≈ √ 2 (1.7.49)
1 + d1 K T − t∂K Σ(K)
such that
Σ(K)
σL (K) ≈ √
1 + d1 K T − t∂K Σ(K)
If we now assume that the strike is close to the money, K = S + ∆K, then
1 S ∆K ∆K
d1 ≈ √ ln ≈− √ ≈− √
Σ T −t K S(Σ T − t) K(Σ T − t)
and the local volatility becomes
Σ(K) ∆K
σL (K) ≈ ∆K
≈ Σ(K) 1 + ∂K Σ(K) ≈ Σ(K) + (∆K)∂K Σ(K)
1− Σ(K) ∂K Σ(K)
Σ(K)
As a result, we can write the local volatility as a function of S + ∆K as follow
80
Σ(K)
σL (T, K) = K S
1+ Σ(K) ln K ∂K Σ(K)
which we can rewrite in the log-moneyness metric η, with implied volatility Σ̃(η), as
Σ̃(η)
σL (η) = η
1 − Σ̃(η) ∂η Σ̃(η)
dK
since dη = −K. The solution satisfies
Z η
1 1 1
= du
Σ̃(η) η 0 σL (u)
and for very short time to maturity, the implied volatility in the log-moneyness metric is the harmonic mean of the
local volatility.
Note, ν(t) is a blending parameter taking value in the range [0, 1] and controlling the slope of the local volatility
function ψ(t, x) = ν(t)x + (1 − ν(t))x0 . In the special case where rates are deterministic, setting ν = 0 we recover
the Gaussian distribution, while when ν = 1 we recover the lognormal distribution. Also, since the local volatility
ψ(t, x) is linear in x the model can be transformed into an Affine model and classical pricing formula of call options
can be applied, provided that we modify the characteristic function of the logarithm of the forward price and that we
shift the strike of the option. That is, the underlying price seen at time t is given by
S0
St = M (t) − (1 − ν(t))
ν(t)
where the process M (t) follows the dynamics
dM (t)
= σM (t)dWS (t) with M (0) = 1
M (t)
with σM (t) = ν(t)σD (t). Note, the underlying asset can become negative even if ν(t) > 0 unless ν(t) = 1. Therefore,
to generate some skew the displaced diffusion model must allow for the underlying price to become negative. One can
81
calibrate the model parameters ν(t) and σ(t) to the European call price Ct seen at time t with a strike K and maturity
T satisfying
St +
C(t, St , T, K) = P (t, T )E Q
M (T ) − K |Ft
ν(T )
where K = 1−ν(T )
ν(T ) St + K. The displaced diffusion model became popular among market practitioners since it is
the simplest way to introduce an implied volatility skew and vanilla option are priced with the standard Black-Scholes
formula with appropriate input parameters. In the presence of a non-zero drift, the dynamics of the underlying price
become
dSt
= µt dt + γ(t, St )dW (t)
St
where the local volatility function is
F (0, t)
γ(t, x) = σD (t) ν(t) + (1 − ν(t))
x
with F (0, t) being the initial forward price with delivery date t. In that setting, St + 1−ν(t)
ν(t) F (0, t) is not a geometric
Brownian motion due to the drift term and the Black-Scholes formula can no-longer be used to price vanilla options.
One possibility is to set the interest rates to zero, to multiply the strike with the discount factor and to diffuse the
St
process Sbt = Re(0,t) which is obtained by reinvesting the dividend in the stock price. When rates are stochastic,
one should diffuse the forward price under the forward measure. However, contrary to deterministic local variance
(DLV), the instantaneous variance of the forward price is not a function of the forward price but it is a function of
d
the underlying asset St = F (t, T ) PP f (t,T )
(t,T )
with stochastic zero-coupon bond prices. It is therefore difficult to solve
analytically, but approximations can be made. In the special case where the instantaneous volatility of the underlying
asset is deterministic (CEV) and the rates are Gaussian, Piterbarg [2005] introduced the autonomous representation
of the underlying process F̃ (t, T ) and approximated its local volatility function.
82
and produces vanilla option prices similar to those produced by the CEV model. Later, Svoboda-Greenwood [2007]
proved that the parameters transformation were the result of a simple linearisation of the CEV dynamics around the
initial value S0 . Therefore, considering the simple displaced diffusion model in Section (1.7.4.1), in order to recover
the vanilla option prices produced by the CEV model, we must impose
σD = σβ S0β−1 and ν = β
That is, given the elasticity parameter β and the elasticity volatility σβ , the displaced diffusion volatility depends in
a non-linear way on the initial value S0 . Muck [2005] tested the approximated displaced diffusion against the CEV
model on swaption prices and barrier swaption prices with β = 0.5, σβ = 0.12 and initial forward rate S0 = 0.06
which gives a displaced diffusion volatility of σD = 0.4898. However, it is not directly applicable to the Equity market
where the index can be as high as S0 = 20000 on the Japanese market, leading to an almost zero displaced diffusion
volatility.
+
V (St , T, K, σt ) = P (t, T )EQt ω(ST − K)
Z ∞ N
X
ω(y − K)+ λi fTi (y)dy
= P (t, T )
0 i=1
N
X Z ∞
ω(y − K)+ fTi (y)dy
= P (t, T )λi
i=1 0
N
X
= λi Vi (St , T, K, σti )
i=1
83
where P (t, T ) = e−r(T −t) . Brigo et al. noted that the use of densities leading to explicit options prices makes
extremely tractable models, whilst the potentially unlimited number of parameters in the asset price dynamics can
help achieving satisfactory fits to market data. For example, setting σ(t, St ) = σ(t)St to generate a log-normal
density model for each of the asset prices processes (St )N i
i=1 , each stock St has a marginal density conditional on S0
given by
2 !
i 1 1 y 1 2
ft (y) = √ exp − 2 ln − µt + Σi (t)
yΣi (t) 2π 2Σi (t) S0 2
qR
t 2
where Σi (t) = σ (s)ds is the time-dependent volatility. Clearly, the resulting option prices, representing a
0 i
weighted sum of expectations of log-normal densities, will reduce to a weighted sum of Black-Scholes prices
N
" ! !#
V (St , T, K, σt ) X ln St + (µ + 21 ηi2 )(T − t)) ln St + (µ − 21 ηi2 )(T − t)
= ω λi St eµ(T −t) Φ ω K √ − KΦ ω K √
P (t, T ) i=1
ηi T − t ηi T − t
N
X
= λi VBS (Sti , K, T, r, σti ) (1.7.51)
i=1
Σi (T −t)
where ηi := √
T −t
.
84
Y = f (X1 , X2 , ..., Xn )
where the random variables X1 , X2 , ..., Xn represent small and independent actions on the system under considera-
tion. Similar situations take place when analysing observational errors arising in experiments. For instance, Laplace
[1774] and Gauss [1809] associated the error distribution with the scheme of summation of random variables (see
details in Uchaikin et al. [1999]). The theory they developed assumes that the random variables inPthe sum are small
n
and independent. Originally, J. Bernoulli [1713] considered the sequence of normalised sums n1 i=1 Xi where in-
dependent random variables Xi take the value 1 with probability p and the value 0 with probability 1 − p. Bernoulli’s
therorm states that for any arbitrary small but fixed > 0 we have
1
P( Sn − p > ) → 0 , n → ∞
n
Pn
where Sn = i=1 Xi . It is the Bernoulli’s form of the law of large numbers. The law of large numbers and its various
generalisations connect together the theory and practice. This is a special case of the Central Limit Theory (CLT) of
probabilty theory. We now consider a sequence of independent random variables X1 , X2 , ... possessing one and the
same distribution function. Assuming that the mathematical expectation a = E[Xi ] and the variance σ 2 = V ar(Xi )
of these variables are finite, we construct the corresponding sequence of the normalised sums Z1 , Z2 , ...
Sn − na
Zn = √
σ n
Then for any x1 < x2
85
Z x2
P (x1 < Zn < x2 ) ⇒ pG (x)dx , n → ∞
x1
where
1 x2
pG (x) = √ e− 2
2π
is the density of the standard normal law. The aim of the generalised CLT was the use of not only the normal law
as limiting approximation, but also some other distributions of a certain analytical structure. Considering the i.i.d,
sequence of random variables X1 , X2 , ... witout any preliminary assumptions about their distribution, and using the
real-valued constants a1 , a2 , ... and positive constants b1 , b2 , ..., we introduce
Sn − an
Zn =
bn
We want to find the constants an and bn in such a way that the distribution functions of Zn weakly converge to some
limit distribution function G(x), that is,
1. Exponential decay : it decreases at a rate proportional to its current value. That is, Xt = X0 e−λt where λ is the
exponential decay constant and τ = λ1 is the mean lifetime.
2. Power-law decay : one quantity varies as a power of another. Power laws are scale-invariant, that is, for f (x) =
ax−α we get f (cx) = c−α f (x) ∝ f (x). A power law x−α has a well-defined mean over x ∈ [1, ∞] only if
α > 2 and it has a finite variance only if α > 3.
3. The behaviour of distributions with finite endpoint of their support.
More generally, a power-law probability distribution is a distribution whose density function has the form
p(x) ∝ L(x)x−α
where α > 1 and L(x) is a slowly varying function at infinity 1 . Thus, p(x) is asymptotically scale invariant.
Heavy-tailed distributions are probability distributions whose tails are not exponentially bounded. That is, they have
heavier tails than the exponential distribution. The distribution of a random variable X with distribution function F is
said to have a heavy right tail if
1 L(ux)
limx→∞ L(x)
= 1, for any u > 0.
86
A fat-tailed distribution is a probability distribution exhibiting large skewness or kurtosis relative to the normal distri-
bution. Some fat-tailed distributions have power law decay in the tail of the distribution, but do not necessarily follow
a power law everywhere. The distribution of a random variable X is said to have a fat tail if
A stochastic process Y = (Y (t), t ≥ 0) is called self-similar if there is H such that for all c > 0 we have
N = sD
ln (N )
where D = ln (s) is the dimension of the scaling law, known as the Hausdorff dimension.
87
The stability parameter α define the existence, or not, of the variance. That is, the normal random variable is
unbounded, and can take arbitrarily large (absolute) values, but can not take infinite values in the sense that
Z ∞ Z x
0 0 0 0
lim p(x )dx = lim p(x )dx = 0
x→∞ x x→−∞ −∞
Thus, a random variable distributed by the normal law (or any other stable) law takes finite values with probability
one. They are called proper. Any non-degenerate stable distribution has a smooth (infinitely differentiable) density
function, but for α < 2 the density has an asymptotic behaviour of a heavy-tail distribution, leading to infinite variance.
The meaning of infinite variance can be expressed as follow
Z ∞ Z −x
0 0 0 0
p(x )dx + p(x )dx ∝ x−α , x → ∞ , 0 < α < 2
x −∞
Hence, the existence of finite variance of the normal law is connected with just a faster decrease of tails as compared
with the others. So, all the members of the stable law compete on equal terms, and only the common habit of using
the variance to characterise distributions makes the normal law attractive.
To understand the relationship between power laws and moments we provide a brief discussion of the properties of
the tails of non-Gaussian (α < 2) stable laws:
• For all α < 2 and −1 < β < 1, both tail probabilities and densities are asymptotically powers laws.
• When β = −1, the right tail of the distribution is not asymptotically a powers law.
• When β = 1, the left tail of the distribution is not asymptotically a powers law.
The smaller α is, the heavier the tails of the density. One consequence of heavy tails is that not all moments exits.
The first two moments are not generally useful for heavy tailed distributions since the integral
R∞ expressions for these
expectations may diverge. Instead, we can use fractional absolute moments E[|X|p ] = −∞ |x|p f (x)dx, where p is
any real number. One can show that for α < 2,
Thus,
• when 0 < α < 2, then E[|X|2 ] = E[X 2 ] = +∞ and stable distributions do not have finite second moments
(variances).
Definition 2.1.1 The random variable X with values in R has a stable distribution if the following condition holds:
Pnn ∈ N+ and X1 , ..., Xn is a sequence of independent variables, each with the same distribution as X, then Sn =
If
i=1 Xi has the same distribution as an + bn X for some an ∈ R and bn ∈ (0, ∞). X is strictly stable if and anly if
an = 0 for all n.
88
The problem of summing stable random variables, can be reduced to finding an and bn . For a strictly stable random
variable, an = 0, it simplifies to
n
X
Sn = Xi = bn X
i=1
which is easlily solved for the normal distribution (only case with finite variance). In the general case of strictly stable
random variables, we get
n
1
X
Sn = Xi = n α X
i=1
1
One can show that the only possible choice for the scaling constants is bn = n α for some α ∈ (0, 2]. All stable
distributions remain stable under linear transformations, and one can choose some standard values of the shift and
scale parameters.
Two important properties of the stable distributions are
1. the stability property, and
2. the generalisised central limit theorem.
A basic property of stable laws is that sums of α-stable random variables are α-stable. Using characteristic functions,
we can show
Definition 2.1.2 The stability property states that the random variables X1 , ..., Xn are independent and symmetrically
Pn with the same characteristic exponent, α, if and only if for any constants a1 , ..., an , the linear combination
stable
i=1 ai Xi is symmetric α-stable.
Remark 2.1.1 When the summands in Definition (2.1.2) are dependent, the sum is stable, but the precise statement is
more difficult and depends on the exact dependence structure.
Remark 2.1.2 In the case where the summands in Definition (2.1.2) do not all have the same α, the sum will not be
stable.
One can show that the sum of two independent stable random variables with different αs is not stable.
The central limit theorem (CLT) states that the sum of a number of independent and identically distributed (i.i.d.)
random variables with finite variances will tend to a normal distribution as the number of variables grows. To be more
2
PnX1 , X2 , ... be i.i.d. random variables with mean µ and variance σ . CLT states that the sample mean
precise, let
1
X n = n i=1 Xi will have
n
X
an Xi − bn → Z ∼ N (0, 1) as n → ∞
i=1
√
where an = σ√ 1
n
and bn = σnµ . The generalisied CLT shows that if the finite variance assumption is dropped,
the only possible resulting limits are stable. That is, the sum of a number of random variables with a power-law tail
distribution decreasing as |x|−(α+1) where 0 < α < 2 (having infinite variance) will tend to a stable distribution as the
number of summands grows. If α > 2, then the sum converges to a stable distribution with α = 2, that is, a Gaussian
distribution (see Gnedenko [1943] and Gnedenko et al. [1954]). It is mathematically formalised as follow:
89
1 − FX (x) ∼ cs−µ , x → ∞
FX (x) ∼ d|x|−µ , x → ∞
with µ > 0. Then there exist sequences an and bn > 0 such that the distribution of the centered and normalised sum
n
1 X
Zn = Xi − an
bn i=1
weakly converges to the stable distribution with parameters
µ,µ≤2
α=
2,µ>2
and
c−d
β=
c+d
as n → ∞:
FZn (x) ⇒ GA (x : α, β)
When discussing convergence of normalised sums, we can use the following definition:
Definition 2.1.3 A random variable X is in the domain of attraction of Z if there exists constants an > 0, bn ∈ R
with
n
X
an Xi − bn → Z
i=1
where X1 , X2 , ... are i.i.d. copies of X. DA(Z) is the set of all random variables that are in the domain of attraction
of Z.
1
Thus, the only possible nondegenerate distributions with a domain of attraction are stable. When an = n− α , X is
1
said to belong to the normal domain of attraction of Z. Generally, an = n− α h−1 (n) where h(x) is a slowly varying
function at infinity 2 .
2 h(ux)
limx→∞ h(x)
= 1, for any u > 0.
90
h(x) h(x)
P (X > x) ∼ p α
and P (X ≤ −x) ∼ q α , x → ∞
x x
This relation is called tail balance condition. Even though the limit in Equation (2.1.2) is a benchmark for weakly
dependent stationary sequences with regularly varying marginal distribution, in the presence of dependence, conditions
for the convergence of the partial sums towards a stable limit require special structure.
Going further, Davis et al. [1995] assumed the stronger condition that the strictly stationary sequence Xt is regularly
varying with index α ∈ (0, 2). That is, the finite-dimensional distributions of Xt have a jointly regularly varying
distribution in the following sense: for every d ≥ 1, there exists a non-null Radon measure µd on the Borel σ-field of
d
R \ {0} (µd is finite on sets bounded away from zero), R = R ∪ {±∞}, such that
1 v
nP ( (X1 , ..., Xd ) ∈ ·) −
→ µd (·) (2.1.3)
an
v
where −
→ denotes vague convergence and an satisfies
lim nP (Sd > an ) = b+ (d) and lim nP (Sd ≤ −an ) = b− (d) , d ≥ 1 (2.1.5)
n→∞ n→∞
91
1
Hence, if b+ (d) + b− (d) > 0, the regular variation of an with index α is equivalent to regular variation of Sd with
index α.
Regular variation (RV) of a stationary sequence Xt is well accepted in applied probability theory. Davis et al.
[1995] proposed several conditions that we are going to briefly discuss.
where m = mn → ∞ is the block size used in the definition of the mixing condition A(an ).
Davis et al. proved convergence of the normalised partial sums by showing that the limiting distribution is infinitely
divisible with a Levy triplet corresponding to an α-stable distribution. They needed conditions to ensure that the sum
of the small values in the sum a−1n Sn does not contribute to the limit. The vanishing small values conditions (VSM)
is as follow
exists, where b+ (d) and b− (d) are the tail balance parameters given in Equation (2.1.5).
3 [x] denotes the integer part of x.
92
Bartkiewicz et al. [2011] showed that c+ and c− are non-negative and a1n Sn converges in distribution to an α-stable
random variable and provided the characteristic function. They applied their results to some standard time series
models, such as
Xt = σt Zt
where the volatility sequence σt is strictly stationary independent of the i.i.d. noise sequence Zt .
• the stochastic recurrence equation
Xt = At Xt−1 + Bt , t ∈ Z
where (At , Bt ) constitutes an i.i.d. sequence of non-negative random variables At and Bt . Examples of such
process are the ARCH(1) process and the volatility sequence of a GARCH(1, 1) process.
• the stochatic volatility model
Xt = σt Zt
where Zt is an i.i.d. sequence with E[Z] = 0 and var(Z) = 1 and
The mathematical notion of independence of random variables is a fundamental part of probability theory since it mod-
els situations where the events do not cause one another. However, even though the concepts of linearly independent,
orthogonal, and uncorrelated variables indicate lack of relationship between variables, the mathematical distinctions
between them are confusing. Rodgers et al. [1984] discussed these concepts on fixed variabes, and proposed the
following algebriac definition:
Definition 2.1.4 Let X and Y be vector observations of the variables X and Y . Then
1. X and Y are linearly independent iff there exists no constant a such that aX − Y = 0.
93
Note, linear independence and orthogonality are properties of the raw variables, while zero correlation is a property of
the centered variables.
In a geometric framework, uncorrelated implies that once each variable is centered, then the vectors are perpendicular.
That is, centering variables will change the angle between the two vectors. Thus, orthogonal denotes that raw variables
are perpendicular, while uncorrelated denotes that the centered variables are perpendicular. For linearly independent
variables we get the following situations
1. two variables that are perpendicular can become oblique once centered: orthogonal but not uncorrelated.
2. two variables not perpendicular can become perpendicular once centered: uncorrelated but not orthogonal.
In probability theory, the property of mutual independence of a set of random variables is some special property of
its joint distributions. It is important to note that correlation is only a measure of linear dependence, but it does not
necessarily imply something about other kinds of dependence. In the special case where X or Y has an expected value
of 0, then the covariance is the expectation of the product, and X and Y are uncorrelated if and only if E[XY ] = 0.
For example, we let X be normally distributed with E[X] = 0 and we let Y = X 2 be a quadratic function of X.
Thus, X and Y are dependent (quadratic dependence), but Cov(X, Y ) = E[XY ] = E[X 3 ] = 0. Therefore, X and
Y are uncorrelated, but not independent (situationR(2)). Note, X does not have to be normally distributed, and any
density function symmetric about 0 and for which |x|3 dP exists, will do.
Many variables in complex natural and engineered systems are actually correlated, or, have some nonlinear in-
terdependence. Even though assuming independent variables is mathematically convenient in presence of a lack of
evidence about dependence, it is not correct. Ferson et al. [2005] reviewed the dangers of assuming all variables in
an assessment are independent, and he showed how different dependencies can lead to quantitatively different results.
Some important facts leading to confusion are
• independence between X and Y and between Y and Z implies nothing at all about the dependence between X
and Z.
• independence implies that the correlation will be zero, but not vice versa.
• uncorrelatedness does not imply there is no relationship between the variables.
94
Finite variance In the case of finite variance, there are many different measures of correlation among which are the
Pearson’s product-moment correlation and Spearman’s rank correlation coefficient. See Hutchinson et al. [1990] for
a review. One can use the Pearson’s product coefficient for computing the correlation between normally distributed
variables since it only measures the degree of linear dependence. Otherwise, a more general measure of dependency
between variables, such as the Spearman’s rank correlation coefficient, can be used. In fact, it takes an entire depen-
dence function, or copula, to fully specify the dependence between two random variables (see Clemen et al. [1999],
Li [2000]). We introduce copulas in Section (2.1.3.3). There are many ways for dependence between variables, such
as nonlinear relationships, clusters, subgroupings, impossible regions, and other complexities. As a result, copula
function can not be completely characterised by a single dimension such as correlation coefficient. Nonetheless, there
exists a few special cases where correlation completely determine the dependence, such as the Bernoulli marginals or
when the correlation is extreme, ±1. The above result naturally leads to the following fact
• varying correlations is insufficient to explore the possible nonlinear dependencies between variables.
Thus, a sensitivity analysis based on varying correlation gives an incomplete picture of uncertainty that is far too tight,
even if correlation is varied in the range [−1, 1].
Infinite variance In the case of infinite variance, these measures can be classified as
• using moments with orders less than two, like the coefficient of covariation, or
• not using any moment, like the rank-based correlation coefficients.
Note, the former is not bounded and not symmetric. To remedy this problem, Garel et al. [2005] proposed the
symmetric coefficient of covariation which gives good results. Further, when estimating the multivariate extreme
dependence of two series we can use the bivariate EVT method, but we still need to check if the two series are
asymptotically independent or not (see Bekiros et al. [2008]). In a limiting sense, two stituations are possible, when
quantifying dependence of multivariate extreme values:
However, asymptotic independence is not equivalent to pure statistical independence, as it allows a certain level of
dependence between two series of finite samples. Non-parametric measures exist to describe the levels of asymptotic
dependence or independence, but the data must be sampled independently (see Coles et al. [1999], Poon et al. [2003]
[2004]). Thus, one can use a vector autoregressive model to filter for any serial correlation, and multivariate Garch
models can be used to filter for any heteroskedasticity.
95
While copulas are relatively simple structures compared to joint distributions, Sklar’s theorem (see Sklar [1959])
explains how to compute the joint distribution function H(x, y) from specified marginal distributions (CDFs) and a
dependence function represented by a copula.
N (N −1 (a), N −1 (b))
which is computed with the method of inversion. Numerous parametric families of copula exist (see Nelsen [2010]),
such as
Taking the second cross-partial derivative of Equation (2.1.6) with respect to x and y, we get
h(x, y)
c(FX (x), FY (y)) = (2.1.8)
fX (x)fY (y)
Thus, the dependence function is the ratio of marginal and joint PDFs, but it is not scale-free. The dependence function
under the product copula is 1, and under the normal copula density it is
96
n(x, y)
c(N (x), N (y)) =
n(x)n(y)
which is a particular example of Equation (2.1.8).
Hurst measured how a reservoir level (Nile River Dam project) fluctuated around its average level over time, and
found that the range of the fluctuation would change, depending on the length of time used for measurement. If the
series were random, the range would increase with the square root of time
1
R=T2
but it was not the case. To standardise the measure over time, he created a dimensionless ratio by dividing the range
by the standard deviation, S, of the observation, obtaining the Rescaled Range
R
= k × TH
S
where k is a constant depending on the time series. Thus, by rescaling the data, the so called rescaled range analysis
(R/S analysis) can compare diverse phenomena and time periods. He found that most natural phenomena follow a
biased random walk, that is, a trend with noise which could be measured by how the rescaled range scales with time,
or, how high the exponent H is above 21 . This phenomenon became known as the Hurst phenomenon (in the sense of
H > 12 ). The value of the Hurst exponent varies between 0 and 1, with
1
• H= 2 implying a random walk, or an independent process.
• For 0 ≤ H < 12 we have anti-persistence (or ergodicity) where the process covers less distance than a random
walk (mean reverting).
97
• For 12 < H ≤ 1 we have persistence (or trend-reinforcing) where the process covers more distance than a
random walk (long memory effects).
The fractal dimension D of a time series measures how jagged the time series is. In order to calculate the fractal
dimension of a time series we count the number of circles of a given, fixed diameter needed to cover the entire time
series. We then increase the diameter and count again the number of circles required to cover the time series. Repeating
that process, the number of circles has an exponential relationship with the radius of the circle satisfying
N × dD = 1
where N is the number of circles, and d is the diameter. Transforming the above equation 4 , the fractal dimension is
given by
log (N )
D=
log ( d1 )
Thus, we get s = d1 , and we see that the fractal dimension of a time series is a function of scaling in time. As a result,
the connection between the two types of scaling (time and space) is determined by the Hurst exponent. Mandelbrot
showed that the Hurst Exponent, H, is related to the fractal dimension D for a self-similar surface in n-dimensional
space by the relation
D =n+1−H
where 0 ≤ H ≤ 1. Further, noises can be characterised by their power spectra, which follow simple inverse power
laws, P (f ) ∼ f −βs , where βs ∈ [0, 4] is the spectral exponent (see Fourier analysis). For white noise βs = 0,
meaning that it is not related to frequency. When white noise is integrated. we get βs = 2, the power spectrum for
brown noise. If 0 < βs < 2, we have pink noise. Beyond brown noise, there is black noise where βs > 2. There is a
relationship between fractional noises and the Hurst exponent given by
2H + 1 if H 6= 21
βs =
2H − 1 if H = 12
such that
98
• unusually heavy tails via infinite variance. The Noah effect: extreme incidents of precipitation.
• unusually long memory. The Joseph effect: long stretches of time higher than average and lower than average
precipitation.
• fractional stable processes: self-similar processes with LRD and heavy tails.
However, the Joseph effect (long memory) can be taken to indicate non-stationarity. Hence, the difficulty to distinguish
between stationary long memory models and other non-stationary models. Bhattacharya et al. [1983] showed that the
Hurst phenomenon could as well be explained by a simple non-stationary model. The model proposed is as follow
Xi = Yi + (a + i)−β , i = 1, 2, ...
where Yi are i.i.d. random variables with a finite variance σ 2 , a ≥ 0 and 0 < β < 21 . It has the same R/S statistic than
the fBm with H = 1 − β. It can be generalised to the class of Regime Switching models, which is a very important
class of non-stationary models resembling empirically long memory stationary models. They have break points with
location changing with the sample size, in either a random or non-random manner. Such a change can affect the mean
and the variance of the process after break points, so that to many sample statistics it will look like long memory.
In fact, the sationary long memory processes form a layer among the stationary process that is near the boundary
with non-stationary processes. Therefore, since most stationary processes do not have long memory, non-stationary
processes can provide an alternative explanation to the notion of LRD.
1. the Hurst exponent of the time series is related to the characteristic exponent.
1
2. the characteristic exponent (α = H) is the fractal dimension of the probability space.
3. D = 2 − H is the fractal dimension of the time series.
• For 0 < α ≤ 1 the distribution has infinite mean and infinite variance.
• For H = 1, we get α = 1.
• Values of 12 < H < 1 implies 1 < α < 2, indicating undefined or infinite variance leading to the non-existence
of a martingale process. We get the Pareto-Levy distributions.
1
• Values of H = 2 implies α = 2 and we get the Gaussian distribution.
• Values 0 < H < 21 implies 2 < α < ∞ and the distribution is not stable, implying that the process is not
independent with identically distributed innovations. However, the distributions still have finite variance.
99
1. in the time domain, LRD occurs at a high degree of correlation between distantly separated data points,
2. in the frequency domain, LRD occurs at a significant level of power at frequencies near zero,
Xt = tH X1 , ∀t > 0
so that the distribution of Xt is completely determined by the distribution of X1
x
Ft (x) = P (tH X1 ≤ x) = F1 ( )
tH
Differentiating the above equation, we get the density ρt of Ft as
1 x
ρt (x) = ρ1 ( H )
tH t
and the kth moment is
100
1. Multifractality due to a broad probability density function (pdf) for the values of the time series. One can not
remove the multifractality by shuffling the series.
2. Multifractality due to different long-range (time-) correlations of the small and large fluctuations. In this case,
the pdf of the values can be a regular distribution with finite moments. The corresponding shuffled series will
exhibit nonmultifractal scaling, since all long-range correlations are destroyed by the shuffling procedure.
While multifractality, or anomalous scaling, allows for a richer variation of the behaviour of a process across different
scales, there are various ways of defining multifractality. For instance,
Cq (∆t) ∼ (∆t)qH(q)
with H(q) changing continuously with q at least for some region of the q values (see Barabasi et al. [1991]). As
a result, multiscaling in empirical data is typically identified by differences in the scaling behaviour of different
(absolute) moments
101
Holder exponent and depends on both time and the sample path of X. Researchers focused on a global description
of the regularity of the function of f in form of multifractal spectrum (also called the singularity spectrum) D(h)
reflecting the size of the set of points for which the Holder exponent takes a certain value h. The main idea being
that the relative frequency of the local exponents can be represented by a renormalised density called the multifractal
spectrum. Thus, multifractal analyses estimate a multifractal spectrum of scaling exponents. Under some assumptions
on the homogeneity and isotropy of the statistics of local singularities, it is possible to derive a relation between self-
similarity exponents τ (q) and the singularity spectrum D(h). Various tools exist to define the scaling exponents of
higher order statistical moments and the temporal modulation of a local scaling exponent.
102
Eveven though it has been assumed little evidence of fractional integration in stock returns, long memory has been
identified in the first differences of many economic series a long time ago. For instance, Mandelbrot [1974] argued
against martingale models and Maheswaran et al. [1993] suggested potential applications in finance for processes
lying outside the class of semimartingales. Econophysics developed to study the herd behaviour of financial markets
via return fluctuations, leading to a better understanding of the scaling properties based on methods and approaches
in scientific fields. To measure the multifractals of dynamical dissipative systems, the generalised dimension and
the spectrum have effectively been used to calculate the trajectory of chaotic attractors that may be classified by the
type and number of the unstable periodic orbits. Even though a time series can be tested for correlation in many
different ways (see Taqqu et al. [1995]), some attempts at computing these statistical quantities emerged from the
box-counting method, while others extended the R/S analysis (see Mandelbrot et al. [1979]). The moment-scaling
properties of financial returns have been the object of a growing physics literature confirming that multiscaling was
exhibited by many financial time series (see Vandewalle et al. [1998], Schmitt et al. [1999], Pasquini et al. [2000]).
Since scaling analysis and multifractal analysis developed, various authors performed empirical analysis to identify
anomalous scaling in financial data (see Calvet et al. [2002]).
103
Since the absence of long range dependence (LRD) in returns is still compatible with its presence in absolute
returns (or volatility), several authors suggested models such as the Fractionally Integrated GARCH models, where
returns have no autocorrelation but their amplitudes have LRD (see Baillie [1996]). Many continuous multifractal
models, such as the multifractal model of asset returns (MMAR), have been proposed to capture the thick tails and
long-memory volatility persistence exhibited in the financial time series (see Mandelbrot et al. [1997]). Such models
are consistent with economic equilibrium, implying uncorrelated returns and semimartingale prices, thus precluding
arbitrage in a standard two-asset economy. Returns have a finite variance, and their highest finite moment can take
any value greater than 2. However, the distribution does not need to converge to a Gaussian distribution at low
frequencies and never converges to a Gaussian at high frequencies, thus capturing the distributional nonlinearities
observed in financial series. These multifractal models have long memory in the absolute value of returns, but the
returns themselves have a white spectrum. That is, there is long memory in volatility, but absence of correlation in
returns. Subsequent literature moved from the more combinatorial style of the multifractal model of asset returns
(MMAR) to iterative causal models of similar design principles, such as the Markov-switching multifractal (MSM)
model proposed by Calvet et al. [2004] and the multifractal random walk (MRW) introduced by Bacry et al. [2001],
constituting the second generation of multifractal models.
Examining the stability of the Hurst exponent, H, on financial time series on the basis of characteristic values, such
as rescaled ranges or fluctuations analysis, some authors (see Vandewalle et al. [1998c], Costa et al. [2003], Cajueiro
et al. [2004], Grech [2005]) observed that the values of H could be significantly higher or lower for a specific
scale. Using detrended fluctuation analysis (DFA), or detrended moving average (DMA), to analyse asset returns on
different markets, various authors observed that the Hurst exponent would change over time indicating multifractal
process. They found that the exponent values and the range over which the power law holds varied drastically from
one underlying asset to another one, obtaining three categories, the persistent behaviour, the antipersistent behaviour,
and the strictly random one. In addition, they showed in some stocks and some exchange rate that the Hurst exponent
was changing with time with successive persistent and antiperisistent sequences (see Costa et al. [2003], Kim et
al. [2004]). Several authors proposed to use methods of long-range analysis such as DFA or DMA to determine the
local correlation degree of the series by calculating the local scaling exponent over partially overlapping subsets of the
analysed series (see Vandewalle et al. [1998c], Costa et al. [2003], Cajueiro et al. [2004], Carbone et al. [2004],
Matos et al. [2008]).
Later, measuring multifractality with either DFA, DMA, or wavelet analysis, and computing the local Hurst ex-
ponent on sliding windows, a large number of studies confirmed multifractality in stock market indices, commodities
and FX markets, such as Matia et al. [2003], Kim et al. [2004], Matos et al. [2004], Norouzzadeh et al. [2005].
Further studies confirmed multifractality in stock market indices such as Zunino et al. [2007] [2008], Yuan et al.
[2009], Wang et al. [2009], Barunik et al. [2012], Lye et al. [2012], Kristoufek et al. [2013], Niere [2013], to
name but a few. Other studies confirmed multifractality on exchange rates such as Norouzzadeh et al. [2006], Wang
et al. [2011b], Barunik et al. [2012], Oh et al. [2012], while some confirmed multifractality on interest rates such as
Cajueiro et al. [2007], Lye et al. [2012], as well as on commodity such as Matia et al. [2003], Wang et al. [2011].
Then methods for the multifractal characterisation of nonstationary time series were developed based on the gen-
eralisation of DFA, such as the MFDFA by Kantelhardt et al. [2002]. Consequently, the multifractal properties as a
measure of efficiency (or inefficiency) of financial markets were extensively studied in stock market indices, foreign
exchange, commodities, traded volume and interest rates (see Matia et al. [2003], Ho et al. [2004], Moyano et al.
[2006], Zunino et al. [2008], Stosic et al. [2014]). For instance, Zunino et al. [2008] used MFDFA to analyse
the multifractality degree of a collection of developed and emerging stock market indices. Gu et al. [2010] analysed
the return time series of the Shanghai Stock Exchange Composite (SSEC) Index with the one-dimensional MFDMA
model within the time period from January 2003 to April 2008, and confirmed that the series exhibits multifractal
nature, not caused by fat-tailedness of the return distribution. Lye et al. [2012] used MFDFA coupled with the rolling
window approach to scrutinise the dynamics of weak form efficiency of Malaysian sectoral stock market, and showed
that it was adversly affected by both Asian and global financial crises.
104
It was also shown that observable in the dynamics of financial markets have a richer multifractality for emerg-
ing markets than mature one. As a rule, the presence of multifractality signalises time series exhibiting a complex
behaviour with long-range time correlations manifested on different intrinsic time scales. Considering an artificial
multifractal process and daily records of the S&P 500 index gathered over a period of 50 years, and using multifractal
detrended fluctuation analysis (MFDFA) and multifractal diffusion entropy analysis (MFDEA), Jizba et al. [2012]
showed that the latter posses highly nonlinear, and long-ranged, interactions which is the manifestation of a number of
interlocked driving dynamics operating at different time scales each with its own scaling function. Such a behaviour
typically points to the presence of recurrent economic cycles, crises, large fluctuations (spikes or sudden jumps), and
other non-linear phenomena that are out of reach of more conventional multivariate methods (see Mantegna et al.
[2000]).
1. fractal analysis
2. statistical self-similarity analysis (using various statistical methods)
However, these statistical approaches are based on the moment properties of stochastic processes and must be re-
stricted to second-order stationary processes. In addition, fractal properties are only verified in the infinitesimal limit.
Nonetheless, a large number of statistical studies on market prices checked for the existence of long-range and short-
range power law correlation in financial data. Most of these studies did not find temporal correlations present in the
system for price changes, but they did for absolute price changes, the average of absolute price changes, the square
root of the variance, and the interquartile range of the distribution of price changes. We have seen in Section (2.1.4)
that it was difficult to test for a particular parametric model, such as α-stable Levy processes or fractional Brownian
motions, since self-similarity can have very different origins. For instance, we have seen in Section (2.1.2.2) that stud-
ies estimating the tail exponent directly from the tail observations tends to overestimate the characteristic exponent α.
Also, the true tail behaviour of L-stable laws is only visible for extremely large data sets, leading to strongly mislead-
ing results when the sample is not large enough, such as the rejection of the L-stable regime (see McCulloch [1997]).
Thus, we should use high-frequency data and consider the most outlying observations when estimating the tail index.
In addition, if the time series of asset returns possess the two features of heavy tails and long range dependence, then
many standard statistical tests will fail to work (see Resnick et al. [1999]). Also, Resnick et al. [2000] give examples
of such processes where sample autocorrelations converge to random values as the sample size grows. Further, we
have seen that slow decay of sample autocorrelation functions could indicate non-stationarity, making it hard to dis-
tinguish between stationary long memory models and other non-stationary models. Mikosch et al. [2003] observed
that the non-stationarity of returns may generate spurious effects easily mistaken for LRD in the volatility. Granger et
al. [2004] considered the interaction of LRD with non-stationarity by combining an underlying long memory process
with occasional structural breaks.
More generally:
• in the frequency domain, at frequencies near zero, measurement errors are largest
105
Yet, stock returns and FX rates suffer from systematic effects mainly due to the periodicity of human activities, and
can not be considered as processes with stationary increments. Clegg [2006] showed that LRD was a very difficult
property to measure in real life because the data must be measured at high lags/low frequencies where fewer readings
are available, and all estimators are vulnerable to trends in the data, periodicity and other sources of corruption.
When eliminating these problems, academics found correlations present in systems and highlighted the multifractal
nature of financial time series. It was shown that self-similar models were too restrictive to explain financial time
series, as they were unable to capture more fully the complex dynamics of the series. While all these models assume
that the response times are trial-independent random variables, financial series exhibit multifractal scaling behaviour
since a multitude of scaling exponents are necessary to fully describe the series. Simply put, financial time series
exhibit
• non-Gaussian distribution,
• the presence of extreme values (outliers),
• long-range dependent dynamics.
1. that the variations can not be exclusively described by the scaling of the variance alone, but that the scaling of
higher order statistical moments must be considered.
2. intermittent changes in the magnitude of response time variation which might be due to feedback effects, or
changes in investor’s behaviour. These changes provide temporal modulation of both the width and shape of the
response time distribution, and consequently, temporal modulation of the scaling exponent.
The Black-Scholes model [1973] for pricing European options assumes a continuous-time economy where trad-
ing can take place continuously with no differences between lending and borrowing rates, no taxes and short-sale
constraints. Investors require no compensation for taking risk, and can construct a self-financing riskless hedge which
must be continuously adjusted as the asset price changes over time. A major assumption of the BS model is that the
106
underlying returns are normally distributed with a variance proportional to the length of time over which the asset
trades. Even though the BS-model of option pricing is based on a normal distribution, if the distribution is actually
a fat-tailed one, then the model will under-price options that are far out of the money, since a 5 or 7-sigma event
is much more likely than the normal distribution would predict. The problem with the distributional assumption of
the BS model means that it generally underestimates equity, or FX, option values because the likelihood of having
an extreme price movement is greater than the model expects. Non-normal skewness and kurtosis in option-implied
distributions have been found to contribute significantly to the phenomenon of volatility smile. For instance, Sterge
[1989] found that very large (three or more standard deviations from the norm) price changes could be expected to
occur two or three times as often as predicted by normality. Hence, given sufficiently long record of stock market, the
observed largest historical price changes would be so unlikely under the normal law that one would have to wait for
horizons beyond at least the history of stock markets to observe them occur with non-negligible probability. In reality,
since the volatility is not constant it has a major impact on the values of certain options, especially those options that
are away from the money, because the dynamics of the volatility process rapidly change the probability that a given
out-of-the-money (OTM) option can reach the exercise price.
In order to justify the use of the Central Limit Theorem, it has been argued that financial asset returns are the
cumulative outcome of a vast number of pieces of information and individual decisions arriving almost continuously
in time (see McCulloch [1996]). Thus, returns have been modelled by the Gaussian distribution. However, financial
markets are populated with interacting agents making only imperfect forecast and directly influencing each other.
The uneven assimilation of information causes a biased random walk, with bias abruptly changing in direction or
magnitude. These intermittent changes in the magnitude of response time variation provide temporal modulation of
both the width and shape of the response time distribution, leading to temporal changes in the Hurst exponent. The
presence of multifractality signalises time series exhibiting a complex behaviour with long-range time correlations
manifested on different intrinsic time scales. Such a behaviour typically points to the presence of recurrent economic
cycles, crises, large fluctuations, and other nonlinear phenomena that are out of reach of more conventional methods.
As a result, the multifractal nature of financial markets contradicts the EMH, and thus the Black-Scholes pricing model
and its associated assumptions. For instance, Bergman et al. [1996] showed that dependence of the volatility on a
second Brownian motion, or jumps in the stock price, could lead to non-increasing, non-convex European call prices.
In a complete market, El Karoui et al. [1998] showed that when the volatility of the underlying stock was allowed to
be random in a path-dependent way, the value of a European call could decrease with increasing volatility, and even
decrease with increasing stock price.
Nonetheless, it seems that the academics who developed the option pricing theory (OPT) focused on the empirical
results that demonstrated thick tails and long-memory volatility persistence exhibited in the financial time series,
but did not account for the evidence of long memory in raw returns. Such models of uncorrelated returns (white
spectrum) and semimartingale prices are consistent with economic equilibrium and the EMH of Samuelson [1965].
It led to the well known stylised facts on volatility showing volatility clustering, asymmetry and mean reversion,
comovements of volatilities across assets and financial markets, stronger correlation of volatility compared to that of
raw returns, (semi-) heavy-tails of the distribution of returns, anomalous scaling behaviour, changes in shape of the
return distribution over time horizons. For example, Engle [1982] proposed to model volatility as conditional upon
its previous level, that is, high volatility levels are followed by more high volatility, while low volatility is followed by
more low volatility. Engle [1982] and LeBaron [1992] found supportive evidence of the autoregressive conditional
heteroskedastic (ARCH) model family, such that standard deviation is not a standard measure. This led jump-diffusion
models to develop in continuous time, such as the Merton model (see Merton [1976]), the Heston model (see Heston
[1993]), or a combination of both.
107
• jumps to the process, where jumps means significant unexpected discontinuous changes in prices.
• stochastic volatility, meaning volatility changing at random over time.
For example, assuming an expected frequency of jumps per year, the process jumps by adding independent normal
random variables. Usually, the jump arrival process follows a Poisson process. Fat tails can also be produced with a
stochastic volatility σt changing at random, with persistence. Persistence means that relatively high recent volatility
implies a relatively high forecast of volatility in the near future, while recent low volatility is associated with a predic-
tion of lower volatility in the near future. Hence, a jump-diffusion model can also be approximated in a discrete time
setting by an extreme version of a stochastic volatility model having non-persistent random volatility. That is, every
day the volatility is drawn at random independently of the previous day.
We are now going to illustrate this approach with a very simple model. At a meeting of the Financial Research Initia-
tive at Stanford University in March 1996, Robert Litterman proposed to simulate a random variable X of zero mean
and unit variance but with a given degree of tail fatness (see Duffie et al. [1997]). To do so, we first draw a random
variable Y whose outcomes are 1 and 0 with respective probabilities p and 1 − p, and then independently we draw a
standard normal random variable Z, Z ∼ N (0, 1). We let α and β be the standard deviations of the two normals to be
mixed. The random variable X is defined as follow
αZ if the outcome of Y is 1
X=
βZ if the outcome of Y is 0
that is, X = αZI{Y =1} + βZI{Y =0} . From independence we get E[ZI{f (Y )} ] = E[Z]E[I{f (Y )} ], such that E[X] =
0. The standard deviation α and β must be chosen such that the variance of X is 1, that is,
108
a local measure of volatility. On the other hand, as discussed in Section (2.2.3.1), we can formulate the IV as an
additional stochastic process entering the pricing equation of derivatives. It is well known that both the crash fears
and the volatility evolution are explanatory factors for the negatively skewed implicit distribution, and that each of
them implies a different relationship between the option maturity and the implicit skewness. Stochastic diffusion
models imply a direct relationship between the option maturity and the magnitude of the implicit skew while that
relationship is inversed in a finite variation jump model. This is because jump components address moneyness biases,
while having stochastic latent variables allows distributions to evolve stochastically over time. Hence, the BS model
consistently underestimates the value of an option to the extent that volatility is stochastic rather than constant as
assumed. However, it appears that stochastic volatility smiles are too shallow, while jump diffusions imply the smile
only for short maturity options. Moreover, when fitted to the substantial negative skewness and leptokurtosis implicit
to the short maturity options prices, these models assign a huge probability of getting large weekly movements in
the underlying returns which are not observed in practice. Nonetheless, according to Bakshi et al. [1997] only
a combination of jump and stochastic volatility models is capable of capturing the IVS. Similarly, Bates [2000]
studied empirically the impact of each explanatory factors on the shifted distribution and concluded that one need a
combination of both to recover a good fit to the market distribution.
109
As discussed in Section (1.5.2), a European call price can be decomposed into a Digital Stock, ∆S , and a Digital Bond,
∆K . In the Black-Scholes formula these terms simplifies to functions of the cumulative distribution function (CDF).
The values of the CDF, N (x), may be approximated by a variety of methods such as numerical integration, Taylor
series, asymptotic series and continued fractions. As a result, numerous models have been proposed to generate
the smile, and a considerable body of literature exist proposing methods for extracting the risk-neutral distribution
from option prices (see Bahra [1997]). However, any successful generalisation of the Gaussian assumption must
achieve flexibility and analytical tractability. We are now going to describe a few tractable models generating the
smile. Assuming a time-homogeneous Markov process Xt valued in an open subset D ∈ Rn , we let V (Xt ) be the
instantaneous variance process of the stock price {St ; t ∈ [0, T ]} over the life of the option price, and define the mean
variance as
Z T
2 1
I (t, T ) = V (Xs )ds
T −t t
In that setting, we are going to show that most models generating the smile can be decomposed as
dK(t, T )
= νγσ(t)dZK (t)
K(t, T ) + a(T )
110
where a(T ) is a constant in [0, T ] sometime given by a(T ) = 1−νν K(0, T ). Note, ν is a blending parameter taking
value in the range [0, 1] and controlling the slope of the local volatility function ψ(x) = νx + (1 − ν)x0 . In the
special cases where ν = 0 the forward price is normally distributed, while when ν = 1 it is log-normally distributed
and both of them belong to the class of the Affine models. Moreover, since the local volatility ψ(x) is linear in x
the model can be transformed into an Affine model. We now assume that calculation time is t0 = 0 and denote
KL (t, T ) = ln K(t, T ) + a(T ) and then apply Ito’s lemma to get its dynamics
1
dKL (t, T ) = − ν 2 γ 2 σ 2 (t)dt + νγσ(t)dZK (t)
2
We then integrate this SDE between [0, t] and take the exponential, getting
dU (t, T )
= νγσ(t)dZ(t) , U (0, T ) = 1
U (t, T )
We let F (t, T ) = (K(0, T ) + a(T ))U (t, T ) be the modified forward price seen at time t with the fixed maturity T
and dynamics
dF (t, T )
= νγσ(t)dZ(t) , F (0, T ) = K(0, T ) + a(T )
F (t, T )
and we re-express the European call price C(t; K, T ) seen at time t with a strike K and maturity T in terms of the
logarithm of the forward price Y (t, T ). It satisfies the equation
C(t; K, T ) = P (t, T )E Q [((K(0, T ) + a(T ))U (T, T ) − K)+ |Ft ] = P (t, T )E Q [(F (T, T ) − K)+ |Ft ]
1 a(T ) p
d1 = d1 − p log (1 + ) and d2 = d1 − σF (T − t)
σF (T − t) K
111
1 F (t0 , T ) 1 p p
d1 = p log + σ (T − t0 ) and d2 = d1 − σ (T − t0 )
σ (T − t0 ) K 2
1 a(T )
which we can write as d1 = d1 − √ log (1 + µT ). Setting µT = K , this model is similar in spirit to the
σ (T −t0 )
displaced diffusion described in Section (3.1.1.1). Bloch [2012a] showed that the call price could be approximated as
1 N (d ) 1
1 1 2
α(K, T ) = − √ V egaBS (t0 , T ) µT + 2
log (1 + µT ) d1 − (1 + µT )d2
T − t0 n(d1 ) 2 σ (T − t0 )
1
− µT p log (1 + µT )
σ (T − t0 )
In the special case where the strike is at-the-money forward, we can use Equation (1.2.12), so that the call price
simplifies to
p
C(t0 ; K, T ) ≈ (1 + µT )0.4St0 Re(t0 , T ) ω(T − t0 )
K=F (t0 ,T )
(3.1.5)
1 N (d ) 1
1 1 2 p
− √ V egaBS (t0 , T ) µT + 2
log (1 + µT ) σ (T − t0 ) − µT
T − t0 n(d1 ) 2 σ (T − t0 )
1
− µT p log (1 + µT )
σ (T − t0 )
p p
with d1 = 21 ω(T − t0 ) and d2 = −d1 for ω(T − t0 ) = σ (T − t0 ).
Remark 3.1.1 Contrary to the BS-formula, the ATM-forward call option price is not a linear function of the spot
price and the square root of the total variance, as it depends in a complex way on the shift parameter µT .
112
2
I 2 (t, T ) = (1 + λVb )Σ(T − t)
(3.1.6)
We then let C(t, St , K, T ; Vb ) be the price at time t of a contingent claim on climate with maturity T and strike K.
Given the call price C(T, ST , K, T ; Vb ) at maturity, the discounted call price must be a Q-martingale so that its value
at time t satisfies
RT
C(t, T ; x, y) = EtQ [e− t
rs ds
C(T, ST , K, T ; Vb )|St = x, Vb = y] (3.1.7)
which is solved numerically. However, the volatility Vb being a random variable, we do not know with certainty its
value at time t. Conditional on the mean variance I 2 (t, T ), the process St is lognormally distributed. Therefore, from
the properties of conditional expectation 1 , the value of a call option in the stochastic local volatility model becomes
More generally, if B is a sufficiently nice subset of R then, for the variable X we get
Z
P (X ∈ B) = fX (x)dx
B
where fX (x) is the probability density function of X. We can then think of the pdf fX (x)dx as the element of the
probability P (X ∈ dx) since
P (X ∈ dx) ≈ fX (x)dx
For example, to get plausible values for the mean variance in Equation (3.1.6) that guarantee its positivity, we must
draw Vb in the range [−1, 1]. Beta and Kumaraswamy distributions are the most popular models to fit continuous
bounded data. For ease of computation, we consider the Kumaraswamy’s double bounded distribution defined on the
interval [zlow , zhigh ] with the shape parameters a and b (see details in Appendix ()). For λ = 0 the BS-volatility
corresponds to the term structure σBS = Σ(t), while for λ =]0, 1] the variable Vb is drawn in the range [−λ, λ]
automatically resizing the bounds zlow and zhigh . By varying the parameters a and b we obtain a range of distributions
1 E[E[X|Y ]] = E[X]
113
for the variable Vb which are either centered around zero (symmetric), or having positive or negative skew, generating
skew and curvature to the smile. Then, the value of the call option is approximated as
n
X
C(t, T ; St ) ≈ C(t, T ; St , vi )φVb (vi )dvi
i=1
where n is a positve integer, and φVb (•) is the Kumaraswamy’s density. Note, the value of n is numerically chosen
such that
n
X
φVb (vi )dvi = 1
i=1
114
In any case, the probability distribution must be sufficiently close to a normal distribution to be approximated by a
Gram-Charlier expansion. Given the cumulant generating function
1 2 1 1
ψ(s; w) = logE[esw ] = s + γ1n s3 + γ2n s4
2! 3! 4!
we can differentiate it to recover the coefficients
∂ψ(s; w)
= kjn
∂sj s=0
Using that relation between the density and its higher moments, and applying the no-arbitrage condition
γ1n γ2n
1 − d2 + 3dσn − 2σn2 − 3d(1 + 2σn2 ) + 4d2 σn − d3 − 4σn + 3σn3
β(K, nt) = −φ(d)
3! 4!
Letting σBS (d, n) be the implied volatility for the modified moneyness d, it is approximated by
1 1
σBS (d, n) ≈ σn 1 − γ1n d − γ2n (1 − d2 )
(3.1.9)
3! 4!
115
This approximation is obtained by discarding terms involving powers two and higher order of σn in the function
α(K, nt). It is approximately a quadratic function of moneyness with coefficients related to skewness and excess-
kurtosis. For |d| < 32 the differences with the true implied volatility are small, but for larger values the true IV
is smaller than the approximated one. In all cases, the call price, its associated delta, and the implied volatility
correspond to the conventional BS-results plus additional terms involving skewness and excess-kurtosis.
Remark 3.1.2 The additional term α(K, nt) is a deterministic function of the spot price St , so that the Gram-Charlier
option price is associated to a local volatility model. Further, that dependency is expressed via the moneyness, d, in
its associated implied volatility, corresponding to the sticky delta regime of Derman (see Section ((4.1.1.2))).
As discussed by Corrado et al. [1997], the S&P 500 index have historical rates of return with mean in the range
[0.009, 0.13], standard deviation in the range [0.11, 0.35], skewness in the range [−1.12, 0.15], and kurtosis in the range
[0.22, 8.92]. However, the Gram-Charlier density yields negative values for certain skewness and kurtosis parameters
because it is a polynomial approximation. Rubinstein [1998] provided approximate kurtosis values for which the
Gram-Charlier and Edgeworth expansions do not provide negative values. Jondeau et al. [2001] characterised the
boundary delimiting the domain in the skewness-kurtosis space over which the Gram-Charlier expansion is positive.
They identified the region D in the (γ1 , γ2 )-plane for which f (w) is positive definite by requiring that the polynomial
p4 (w) be positive
1 3 1
p4 (w) = 1 − γ1n D φ(w) + γ2n D4 φ(w) ≥ 0 , ∀w
3! 4!
and they used notions of analytical geometry. They found that the excess-kurtosis must be inside the iterval [0, 4], and
that for each acceptable excess-kurtosis value there is a symmetrical interval for the skewness which must be solved
numerically. The acceptable values for skewness were in the range [−1.05, 1.05]. To examine the range of applicability
of the Gram-Charlier approximation, Backus et al. [1997] considered distributions generated by a Merton model (see
Section (3.2.1)). In that model, the density of the n-period log-price change is a countable mixtutre of normals given
by
∞
X
f (x) = pj φ(x; µ0 n + jµj n, σ02 n + jσj2 n)
j=0
j
where pj = e−λn (λn) 2
j! is the probability of j jumps, and (µ0 , σ0 ) are the mean and standard deviation of the diffusion
such that
σ02 σj2
+ λ eµj + 2 − 1
rnt − qnt = µ0 +
2
The jump-diffusion density exhibits greater kurtosis than the normal one, and non-zero skewness if µj 6= 0. The first
four cumulants are
k1 = (µ0 + λµj )n
k2 = σ02 n + λ(σj2 + µ2j )n
k3 = λ(µ3j + 3µj σj2 )n
k4 = λ(3σj4 + 6µ2j σj2 + µ4j )n
In their example, the moneyness vector, d, corresponds to N (d) = 0.1, 0.2, ..., 0.9, annualised jump intensities range
in [1, 50]. In the benchmark case, rt = qt = 0, σ = 0.1 (annualised), λ = 10, γ1 = 0 and γ2 = 1, and maturity
is 1 month. In Panel A, the approximated volatility smile produced γ1 = −0.006 and γ2 = 0.898, while using the
116
more accurate price equation they obtained γ1 = −0.013 and γ2 = 0.917. Modifying the jump intensity in Panel B,
they got γ2 = 1.046 with λ = 15, and γ = 0.748 with λ = 5. Note, for λ < 5, impying larger less frequent jumps,
the estimated γ2 were well below those of the Merton model. For six-month options the Gram-Charlier’s model was
not capable of reproducing γ2 = 1 with the benchmark value of λ, which had to be lowered to γ2 = 0.5 to get a
match. Adding skewness was even more difficult as the model would underestimate the absolute amount of skewness,
and for larger values underestimates the kurtosis as well. The great difficulties of the Gram-Charlier approximation to
reproduce the jump-diffusion model highlited the differences between the two models.
Corrado [2007] imposed a martingale restriction in the pricing formula by using a simple parameter space reduc-
tion for the Gram-Charlier expansion coefficients. The pricing equation is expressed in terms of Hermite polynomials
defined as follow:
Definition 3.1.1 The Hermite polynomials Hei (•) are defined by the identity
where the coefficients are derived from the Hermite polynomials as follow
Z ∞
1
cr = f (x)Her (x)dx = Ef [Her (x)]
r! −∞
1 0 r[2] 0 r[4] 0
= µr − µr−2 + 2 µr−4 − ...
r! 2 × 1! 2 × 2!
0
r!
where r[k] = (r−k)! is a partial factorial, and µr represents the rth raw moment of a target density function. In general,
0 0 0
the function f (x) is standardised to get µ2 = 1 by setting c0 = µ0 = 1, c1 = µ1 and c2 = 0. Then the coefficients
become
0 0 0
(µ3 − 3µ1 ) (µ − 3)
c3 = , c4 = 4 , ...
6 24
0 0
µ µ
and so on. In that setting, we get the skew γ1 = 3!3 and excess-kurtosis γ2 = 4!4 , which correspond to γ3 and γ4 in
Corrado. Further, the martingale restriction imposes that P (0, t)E m [St ] = S0 under the risk-neutral measure. This
is not the case when deriving the call option price formula with the Gram-Charlier expansion. To recover the call-put
parity, we must impose the constraint
m
X
cj σ j = 0
j=1
Corrado proposed to fix one degree of freedom to recover the constraint. The local coefficient c1 is modified as follow
117
m
X
c1 = − cj σ j−1
j=3
leaving the coeffiecients cj , j ≥ 3 free to determine the shape of the density function. The polynomial accounting for
the hidden martingale restriction is
3
p4 = 1 + γ3 (He3 (x) − xσ 2 t) + γ4 (He4 (x) − xσ 3 t 2 )
Note, Backus et al. [1997] used δnt as an explicit martingale restriction, while Corrado [2007] considered an implicit
martingale restriction, the latter being a particular case of the former. Schlogl [2010] proposed an option pricing
formula in terms of the full series and presented a fitting algorithm ensuring that a series truncated at an arbitrary
moment represents a valid probability density. The following assumption describe a sufficient condition to obtain
no-arbitrage at a fixed maturity T .
Assumption 1 The standardised risk-neutral distribution, seen at time t for the maturity T , of the logarithm of the
asset price ST = X(T ) is given by the Gram-Charlier expansion
∞
X
f (x) = cj Hej (x)φ(x)
j=0
where
Z ∞
1
cr = f (x)Her (x)dx
r! −∞
Proposition 6 Under the previous assumption, the risk-neutral expected value µ of the logarithm of the asset price
S(T ) must satisfy
∞
S(t) X 1
µ = ln − ln cj σ j − σ 2
P (t, T ) j=0
2
where S(t) is the current asset value and σ is the standard deviation of ln S(T ).
Given the previous Assumption, the price of a call option at time t on the asset S(T ) with expiry T is given by
where
µ − ln K + σ 2
d∗ =
σ
To make sure that the calibrated expansion coefficients yield a valid density, the author adapted an unconstrained non-
linear optimisation algorithm. He let c0 = 1, standardised f (x) by setting c1 = c2 = 0, and truncated the series by
setting cj = 0 ∀j > k for some choice of k ≥ 4. The set Ck for k = 3, .., k defines a convex set. The boundaries
of Ck along any given direction d ∈ Rk−2 is obtained by solving two one-dimensional searches. Then a multiple
dimensions minimisation algorithm is used to calibrate the model to market prices.
118
1
∂K Σ(K, T ) = − √ (b + 2cd1 )
ST Σ T − t
d2 d1 1
and since dK 2 = 2 Σ T −t ,
ST
√ the curvature satisfies
1 1
∂KK Σ(K, T ) = ( √ )2 2c + 2 √ (b + 2cd1 )
ST Σ T − t ST Σ T − t
Putting terms together, the risk-neutral density in Equation (1.7.46) becomes
1 2d1 1 2 1
(b + 2cd1 )2
√
p(d1 ) = √ + √ (b + 2cd1 ) − T − td1 − d √
ST Σ T − t ST Σ2 T − t Σ 1 ST Σ T − t
1 1
+ √ 2c + (b + 2cd1 )
2
ST Σ T − t ST Σ
which√is a polynomial in d1 of the fourth order. In the special case where we consider the Gram-Charlier approximation
Σ(d1 ) T − t in Equation (3.1.9), we get
J = eµ+γZt
where Zt follows a standard normal law. We apply Ito’s lemma to the logarithm of the stock price and integrate in the
range [0, T ] to get
119
RT R PNT
r−λE[J−1]− 12 σ 2 (s) ds+ 0T σ(s)dW (s)+ i=1 log(J)
ST = S0 e 0
If we allow for n fixed jumps between [0, T ], so that the number of jumps is no longer random, we get
RT RT
1 2
Pn
ST = S0 e 0 r−λE[J−1]− 2 σ (s) ds+ 0 σ(s)dW (s)+nµ+ i=1 γi
W
where √t
t
and i are independent standard normal variables. The variance of the stock price is given by
Z T Z T Z T
n 2
σn2 (T )ds = σ 2 (s)ds + nγ 2 = γ + σ 2 (s) ds
0 0 0 T
so that the stock price becomes
RT R √
γ +σ 2 (s)) ds+ 0T
n 2
rn (t)− 12 ( T n 2
(T γ +σ 2 (s))dW (s)
ST = S0 e 0
where rn (t) = r − λE[J − 1] + Tn log(E[J]). Using the dynamics of the underlying stock price previously defined,
we define the call price of a vanilla option in the Merton framework as
RT
CM (t, St , T, K, r; ΘM ) = E[e− t
rs ds
(ST − K)+ |Ft ]
RT
= e− t
rs ds
E[E[(ST − K)+ |Ft , NT − Nt ]]
where ΘM are the model parameters of the Merton model. Using the property of independent increments of a Poisson
process, the number of jumps between dates t and T , that is NT − Nt , has the same law as NT −t , namely a Poisson
law with intensity, or frequency, λ(T − t). The call price can be re-written as
RT
CM (t, St , T, K, r; ΘM ) = e− t
rs ds
E[E[(ST − K)+ |Ft , NT −t ]]
Solving the conditional expectation with the density of the Poisson process, we get
∞
RT X (λ(T − t))n
CM (t, St , T, K, r; ΘM ) = e− t
rs ds
e−λ(T −t) E[(ST − K)+ |Ft , NT −t = n]
n=0
n!
∞
RT X (λ(T − t))n RT
= e− t
rs ds
e−λ(T −t) CBS (t, St , T, K, rn , σn )e t rn (s)ds
n=0
n!
0
RT
since CBS (t, St , T, K, rn , σn ) = e− t
rn (s)ds
E[(ST − K)+ |Ft , NT −t = n]. Recall that λ = λE[J], so that
0
λ
E[J] = λ . Replacing in the equation and simplifying the exponentional terms, we get
0
RT RT RT RT RT 0
n
e− t rs ds− t λds+ t rn (s)ds = e t −λ + T −t log(E[J]) ds = (E[J])n e t −λ ds
so that the call price becomes
∞ 0
X RT
λ ds (λ
0 (T − t))n
CM (t, St , T, K, r; ΘM ) = e− t CBS (t, St , T, K, rn , σn )
n=0
n!
This is a weighted sum of Black-Scholes option prices each of which assuming n jumps. The weights correspond to
the probability that there is n jumps before maturity T . In this framework the implied volatility surface obtained by
inverting the Black and Scholes formula must be computed numerically. It is well known that a diffusion model is
convexity preserving, that is, when the payoff of option prices is convex with respect to the underlying then the option
price is convex. It implies that for two call prices PA and PB with the same expiration date and identical convex
120
payoff function such that σB (s, t) ≥ σA (s, t) then PA (s, t) ≥ PB (s, t). However, Bergman et al. [1996] showed
that when the underlying price follows a discontinuous process then call prices can have properties very different from
those of a diffusion model. Ekstrom and Tysk [2006] gave necessary condition for convexity to be preserved in jump-
diffusion models in arbitrary dimensions. Using that condition they showed that the only higher-dimensional convexity
preserving models are the ones with linear coefficients. The Affine models are therefore convexity preserving and the
Merton model belongs to the class of Affine models. We can then assume in our setting that if the option prices are
convex at all time then they are increasing in the volatility. Bellamy and Jeanblanc [2000] showed that in a jump-
diffusion model the option prices model were bounded by the underlying and were dominating the diffusion prices.
We can therefore assume the implied volatility surface inferred from a jump-diffusion model dominate the one inferred
from a diffusion model.
Assumption 2 We assume there exists a time-homogeneous Markov process Xt valued in an open subset D ∈ Rn
such that the market value at time t of an asset maturing at t + τ is of the form
f (Xt , τ )
where Wt is an Ft standard Brownian motion on Rn under the risk-neutral probability measure Q. The drift µ : D →
Rn and diffusion σ : D → Rn×n are regular enough to have a unique strong solution valued in D.
In the Yield factor model, the asset f (Xt , τ ) in assumption (2) is the zero coupon bond of maturity t + τ (see Duffie
et al. [1996]).
Assumption 3 We assume that there exists a measurable function R : D → R such that the short rate at time t is
rt = R(Xt ) with
log f (x, τ )
R(x) = lim − for x ∈ D
τ →0+ τ
Given the definition of the risk-free zero coupon bond, its price is given by
RT
f (Xt , T − t) = E Q [e− t
R(Xs )ds
|Ft ] (3.2.11)
We consider Affine models and assume
Assumption 4 For an affine process Xt , we assume there exists A : [0, ∞[→ R and B : [0, ∞[→ Rn such that τ ≥ 0
and x ∈ D such that
f (x, τ ) = eA(τ )+B(τ )x
121
Proposition 8 Under mild non-degeneracy conditions the exponential-affine assumption (4) implies that µ(·) and
σσ > (·) are affine functions on D.
We can re-think the variance-covariance matrix σσ > (x) = (αi,j + βi,j x)1≤i,j≤n in terms of Bessel processes and get
the following proposition
Proposition 9 For σσ > (x) affine in x then under non-degeneracy conditions we can take σ(x) as
p
v1 (x) ... 0
σ(x) = . ... p . Σ
0 ... vn (x)
• for all x such that vi (x) = 0, βi> (ax + b) > 12 βi> ΣΣ> βi .
• for all j, if (βi> Σ)j 6= 0 then vi = vi .
If the constraints are satisfied, there exists a unique strong solution (Xt )t≥0 to the stochastic differential equation
(3.2.10) with vi (x) = αi + βi x and vi (Xt ) > 0. Therefore, it is possible to define a non-negative variance process
V (Xt ) such that
X
V (Xt ) = γi2 vi (Xt ) for γi ≥ 0 (3.2.12)
i
V (Xt ) = α + βXt
We denote VBS (t) as the square of the Black-Scholes local volatility satisfying
s
Z T
1
Σ(t; KAT M , T ) = VBS (s)ds
T −t t
with Σ(·) being the market implied volatility. If we differentiate the last expression with respect to time t, we can
∂
re-express the local variance in terms of the market implied volatility, that is VBS (t) = Σ(·) − (T − t) ∂t Σ(·).
For example, one can use a spline to fit the Black-Scholes at-the-money implied volatility Σ(·) and then deduce the
variance VBS (t). Therefore, by construction, we can impose that the Black-Scholes variance term structure be fitted
exactly
122
In some cases we can express the characteristic function of a stock price as the price of a bond. In order to price
exotic equity products within the Yield Factor approach, we need to calculate the characteristic function φY (u, T ) of
the process sτ = ln SSTt with τ = T − t, and where the stock price St under the risk-neutral measure is given by
n
dSt X q
= rt dt + γj vj (X(s))dZj (t)
St j=1
for j = 1, ..., n. In that setting, the characteristic function of the process sτ becomes
RT Pn RT √
γj2 vj (X(s))ds+iu
Pn
rs − 21
φY (u, T ) = E[eiusτ |Ft ] = E[eiu t j=1 j=1 γj t
vj (X(s))dZj (s)ds
]
We first assume that the process X and the Brownian motions Zj are independent, and then we condition the expecta-
tion in the characteristic function with respect to the variance V (X(t)), to get
h RT Pn 1 2
Pn RT √ i
rs −
φY (u, T ) = E E[eiu t j=1 2 γj vj (X(s)) ds+iu j=1 γj t
vj (X(s))dZj (s)ds
]|V (X(t))
RT Pn RT
γj2 vj (X(s))ds
= E[eiu t
rs ds−λ j=1 t ]
with λ = 21 (iu + u2 ). By analogy to the Yield factor model in interest rates, and given the price of the zero coupon
bond in Equation (3.2.11), we obtain the measurable function R as
n
X
R(Xt ) = −iurt + λ γj2 vj (X(t))
j=1
The solution to the characteristic function φY (u, T ) satisfies the assumption (4), where A(τ ) and B(τ ) are obtained
by solving a system of Ricatti equations (see Duffie et al. [2000]).
dSt p
= v(X(t))dZ(t)
St
dX(t) = µ(X(t))dt + σ(X(t))dWX (t)
with < dZ, dWX >t = ρdt. For instance, Hull and white [1987] considered the instantaneous variance to be given by
v(X(t)) = X(t), and they assumed a log-normal volatility process with dynamics
123
1 2
bV∞ > ξ
2
√
Stein et al. [1991] let the volatility process Yt = Xt be an arithmetic Ornstein-Uhlenbeck process
dY (t) = a − bY (t) dt + σdWY (t)
and derived explicit closed-form solution to the price of a call option. Even though the volatility Yt can become
negative, it is used in a squared fashion so that this formulation is equivalent to putting a reflecting barrier at Yt = 0 in
the volatility process. Further, for a wide range of relevant parameter values, the probability of actually reaching the
point Yt = 0 is very small.
In a qudratic volatility model, we let the process Xt be the OU-process above and define the variance process as
Zt = Xt2 with dynamics given by
In any case, we then integrate the stock price SDE in the range [t, T ] and take the exponential, getting the solution
1
RT RT √
ST = e− 2 t v(X(s))ds+ t v(X(s))dZX (s)
The European call price C(t; K, T ) seen at time t with a strike K and maturity T satisfy C(t; K, T ) = P (t, T )E Q [(ST −
K)+ |Ft ].
Following Hull et al. [1987], we let I 2 (t, T ) be the mean variance over the life of the option price defined as
Z T
2 1
I (t, T ) = V (Xs )ds
T −t t
where V (Xt ) is variance process, and such that the distribution function of ST can be written as
Z
p(ST |V (Xt )) = g(ST |I 2 (t, T ))h(I 2 (t, T )|V (Xt ))dI 2 (t, T )
We can then condition the expectation with respect to the mean variance I 2 (t, T ) and get the option price
Hence, conditional on the mean variance I 2 (t, T ) the process St is lognormally distributed and we can re-write the
option price as
+
P (t, T )E Q E[ ST − K ]|I 2 (t, T )
C(t; K, T ) =
Z
CBS ST , K, I 2 (t, T ) h(I 2 (t, T )|V (Xt ))dI 2 (t, T )
=
124
where V ar(I 2 (t, T )) and Skew(I 2 (t, T )) are the second and third moments of the mean variance I 2 (t, T ). The series
converges very quickly for small values of k = ξ 2 (T − t). That is, if the variance of Xt is small the distribution of the
process log SST0 is close to a normal distribution and few extra moments are necessary.
Alternatively, to price exotic equity products within the Yield Factor approach we need to calculate the characteristic
function φY (u, T ). The process of interest to us is s(t, T, ν) = ln SSTt
Z T Z T
1 p
s(t, T, ν) = − ν 2 X(s)ds + ν X(s)dZ(s)
2 t t
In that case the measurable function R becomes
R(Xt ) = λX(t)
where λ = λν 2 .
RT 1
RT √ PNT
E E[eiu t (rs −λm− 2 v(X(t)))ds+iu t v(X(s))dZ(s)+iu Nt log(J)
φT (u) = ]|V (X(t)), NT −t
RT RT PNT
(rs −λm− 12 v(X(t)))ds− 21 u2
= E[eiu t t
v(X(s))ds+iu Nt log(J)
]
Then we condition the expectation in the characteristic function only with respect to the variance V (X(t)) and get
RT 1 1 2 T
R R T µu+ 1 u2 γ 2
= E eiu t (rs −λm− 2 v(X(t)))ds− 2 u t v(X(s))ds+λ t [e 2 −1]ds
φT (u)
1 2 γ2
eµu+ 2 u
RT RT RT RT
(rs − 21 v(X(t)))ds− 12 u2
= E[eiu t t
v(X(s))ds
]E[e−iu t
λmds+λ t
−1 ds
]
RT
(rs − 12 v(X(t)))ds− 12 u2 tT v(X(s))ds
R
We recognise that E[eiu t ] can be written as eA(τ )+B(τ )x where A and B are so-
RT RT 1 2 2
lutions of the Ricatti equations from above. Let A0 (τ ) = A(τ ) − iu t λmds + λ t [eµu+ 2 u γ ], and then we
have
125
We saw in Section (1.2.3) that when the stock price is log-normally distributed with constant model parameters, the
European call prices satisfy C(t, x) = CBS (t, x, K, T ) for t ∈ [0, T ]. We also know that when the model parameters
are general Markov processes we can no-longer use the Black-Scholes Equation (1.2.6). However, we still want to
express the solution to the European call options in terms of the Black-Scholes formula, which can then be seen as a
function of the stochastic spot rate, repo rate and volatility. Based on the work of Alos [2004] and Alos et al. [2007],
Bloch [2010a] have described a general pricing approximation technique for call options in a jump-diffusion model
with stochastic interest rates and stochastic repo rates. At maturity, the European option prices must satisfy
s s s
Z T Z T Z T
1 2
1 1
C(T, x) = CBS t, x, K, T ; σs ds, rs ds, qs2 ds
T −t t T −t t T −t t t=T
Rt
and since e− 0
rs ds
C(t, x) is a Q-martingale, we get
Rt RT
e− 0
rs ds
C(t, x) = E Q [e− 0
rs ds
C(T, x)|Ft ]
s s s
Z T Z T Z T
RT 1 1 1
= E Q [e− 0
rs ds
CBS t, x, K, T ; 2
σs ds, rs ds, qs2 ds |Ft ]
T −t t T −t t T −t t t=T
126
T
C(t, ST ) = E Q [F (T, T )N (d1 (T − t, F (T, T ), K; I(T, T ), D(T, T )))|Ft ]
T
− KE Q [N (d2 (T − t, F (T, T ), K; I(T, T ), D(T, T )))|Ft ]
which we can express in terms of the Black Digital Bond DB (S, t, T ; ξ) and Black Digital Share DS (S, t, T ; ξ) as
T T
C(t, ST ) = E Q [S(F, T, T ; ξ; I(T, T ), D(T, T ))|Ft ] − KE Q [D(F, T, T ; ξ; I(T, T ), D(T, T ))|Ft ]
We can therefore approximate the Digital Bond and the Digital Share and combine them to price other path-independent
contingent claims.
Z T
C(t, K) T 1 T
= E Q [CB (t, YT (t), I(t, T ))|Ft ] + E Q [ H(s, YT (s), I(s, T ))Λs ds|Ft ]
P (t, T ) 2 t
Z T
QT
− λ(θY (1) − 1)E [ ∂2 CB (s, YT (s), I(s, T ))ds|Ft ]
t
Z T Z
T
EQ
+ [ λ CB (s, Y (s− , T ) + z, I(s, T )) − CB (s, Y (s− , T ), I(s, T )) FY (ω, dz)ds|Ft
t R2
127
where
RT √
• Λs = s DsW Vr dr Vs .
Following Hull and White [1987], we can approximate this price by approximating the future average volatility with
T
its expectation I0 (t, T ) = E Q [I(t, T )|Ft ]. So the approximated call price becomes
Z T
C(t, K) 1 T
= CB (t, YT (t), I0 (t, T )) + H(t, YT (t), I0 (t, T ))E Q [ Λs ds|Ft ]
P (t, T ) 2 t
Z T
− λ(θY (1) − 1) ∂2 CB (s, YT (s), I0 (s, T ))ds
t
Z T Z
+ λ CB (s, Y (s− , T ) + z, I0 (s, T )) − CB (s, Y (s− , T ), I0 (s, T )) FY (ω, dz)ds
t R2
The computation of the call option price is now as simple as the Black formula.
Assuming deterministic rates, Alos et al. [2007] tested, within the Heston model, the quality of their approximation
against the original implementation given by Heston. They chose a very short maturity of T = 0.1 where skew is
most pronounced and found fairly accurate results. They also found that the larger part of the error was produced by
replacing E[CBS (t, T ; I(t, T ))|Ft ] with the approximation CBS (t, T ; I0 (t, T )). It is well known that this approxima-
tion corresponds to the first term of the Taylor expansion of E[CBS (t, T ; I(t, T ))|Ft ] around I0 (t, T ) (see Equation
(3.2.13)). So, we can improve the accuracy of the approximation of the call option price by improving the computation
of E[CBS (t, T ; I(t, T ))|Ft ] by adding extra terms from the Taylor expansion. Doing so, we get
1 ∂2
E[CBS (t, T ; I(t, T ))|Ft ] = CBS (t, T ; I0 (t, T )) + var(I 2 (t, T )) CBS (t, T ; I(t, T )) (3.3.14)
2 ∂(I 2 (t, T ))2 I0 (t,T )
It is not difficult to calculate explicitly the derivative expression of the second term of the expansion as
C(t, K)
= CB (t, YT (t), I0 (t, T )) + α(K, T ) (3.3.15)
P (t, T )
where
128
1 ∂2
α(K, T ) = var(I 2 (t, T )) CB (t, T ; I(t, T ))
2 ∂(I 2 (t, T ))2 I(t,T )=I0 (t,T )
Z T Z T
1 T
+ H(t, YT (t), I0 (t, T ))E Q [ Λs ds|Ft ] − λ(θY (1) − 1) ∂2 CB (s, YT (s), I0 (s, T ))ds
2 t t
Z T Z
+ λ CB (s, Y (s− , T ) + z, I0 (s, T )) − CB (s, Y (s− , T ), I0 (s, T )) FY (ω, dz)ds
t R2
p
dSt = St rdt + Vt dW (t)
p
dVt = κ(θ − Vt )dt + ν Vt dZ(t)
with < dW, dZ >t = ρdt, and κ > 0, θ > 0 and ν > 0. Alos et al. [2007] give a proof of the Malliavin differentiability
of the Heston volatility as well as its square root process and gives explicit expression for their derivatives. The
Malliavin derivative of the instantaneous variance is
Rt 2
−κ κθ ν 1
2 −( 2 − 8 ) V ds
p
Dr Vt = νe r t Vt (3.3.16)
RT
Then, the pricing of a call price is considered. Letting I 2 (t, T ) = T 1−t t Vs ds be the square of average future
volatility starting at time t and maturing at time T , the call price at maturity T is given by CBS (T, T ; I(T, T )) =
C(T, T ). Since the discounted call price must be a Q-martingale, we get
e−rt C(t, T ) = E[e−rT C(T, T )|Ft ] = E[e−rT CBS (T, T ; I(T, T ))|Ft ]
We then apply the modified Ito’s formula to the process e−rt CBS (t, T ; I(t, T )), getting
Z T
1
e−r(s−t) H(s, Xs , I(s, T ))dUs |Ft
C(t, T ) = E[CBS (t, T ; I(t, T ))|Ft ] + E
2 t
where
Z T Z T p
Ut = ( DsW Vr dr) Vs ds
t s
and
∂3 ∂2
H(t, Xt , I(t, T )) = 3
− CBS (t, T ; I(t, T ))
∂x ∂x2
Note that Alos et al. expanded the derivative with respect to the process Ut in the integral and approximated the option
formula by considering only a first order approximation to the option price.
1
E[CBS (t, T ; I(t, T ))|Ft ] + E[H(t, Xt , I(t, T ))Ut |Ft ]
2
Obviously, the first term of this approximation corresponds to the non-correlated case which was approximated by
Hull et al. [1987] by replacing I(t, T ) with I0 (t, T ) = E[I(t, T )|Ft ]. In the same spirit they used this approximation
in the calculation of the function H, so that the approximated pricing formula becomes
129
1
C(t, T ) = CBS (t, T ; I0 (t, T )) + H(t, Xt , I0 (t, T ))E[Ut |Ft ]
2
RT W √
Now, denoting Λs = s Ds Vr dr Vs , the call price becomes
Z T
ρ
C(t, T ) = CBS (t, T ; I0 (t, T )) + H(t, Xt , I0 (t, T ))E[ Λs ds|Ft ]
2 t
RT
So, in order to evaluate the option price we need to compute two quantities, that is, I0 (t, T ) and E[ t
Λs ds|Ft ]. In
the Heston model, the mean variance I02 (0, T ) seen at time t0 = 0 is given by
T
1 − e−κT
Z
1
I02 (0, T ) = E[Vs ]ds = θ + (V0 − θ)
T 0 κT
−κT
and the total variance is ω(T ) = θT + (V0 − θ) 1−eκ with derivative with respect to maturity being
1 ∂2
C(t, T ) = CBS (t, T ; I0 (t, T )) + var(I 2 (t, T )) CBS (t, T ; I(t, T )) (3.3.17)
2 ∂(I (t, T ))2
2 I(t,T )=I0 (t,T )
Z T
ρ
+ H(t, Xt , I0 (t, T ))E[ Λs ds|Ft ]
2 t
130
Knowing the price of the call option in a fast and robust way we can then invert the Black and Scholes formula to
recover the equivalent implied volatility surface. Using the implied volatility surface applied to the Heston model, we
get
131
dF = σL (F )F dW , F (0) = f
Using singular perturbation methods, they showed that European call and put prices were given by the Black-Scholes
formula with implied volatility
132
00
n 1 σL (fav ) o
σB (K, f ) = σL (fav ) 1 + (f − K)2 + ...
24 σL (fav )
with fav = 21 (f + K), so that K = 2F − f . The first term dominates the solution and the second term provides a
much smaller correction. Analysing the dynamics of the local volatility, Hagan et al. [2002] calibrated the model to
market prices, obtaining
0
σL (F ) = σB (2F − f0 ){1 + ...}
0
where σB (K) is the implied volatility seen at time t0 with forward price f0 . Assuming that the forward price changes
from f0 to f , the new implied volatility becomes
0
σB (K, f ) = σB (K + f − f0 ){1 + ...}
Thus, if the forward price increases from f0 to f , the IV curve moves to the left, and if it decreases, the IV curve
moves to the right. In presence of negative skew, local volatility models predict that the skew moves in the opposit
direction as the price of the underlying asset. They illustrated their findings by considering the quadratic smile
0
σB (K) = a + b(K − f0 )2
obtaining the local volatility
σL (F ) = a + 3b(F − f0 )2 + ...
and they showed that when the forward moves from f0 to f , then the implied volatility becomes
3 1 2 3
σB (K, f ) = a + b K − ( f0 − f ) + b(f − f0 )2 + ...
2 2 4
where ∂f σB (K, f ) = b[K + (2f − 3f0 )]. The implied volatility shows that the curve also shift upward regardless of
whether f increases or decreases. This is only true because the smile is quadratic, which is rarely the case in equity.
1. sticky delta : the strike of the option is rescaled according to how the current spot evolved with respect to the
spot at inception
S0
Σ(t, St ; K, T ) = Σobs
0 (K St , T − t)
2. absolute floating : the future implied volatility is obtained from the original smile surface by simply reducing
time to maturity and linearly offsetting the strike by how much the spot has moved
Σ(t, St ; K, T ) = Σobs
0 (K + (S0 − St ), T − t)
3. absolute sticky (sticky strike) : any dependence on the current spot level or calendar time is ignored
Σ(t, St , K, T ) = Σobs
0 (K, T − t)
133
where Σobs is the current smile. These regimes of volatility were slightly extended by Derman [2008], who for
simplicity of exposition, considered the linear smile (only true when the strike is near the money)
1. sticky moneyness : The dynamics of option prices derive entirely from the moneyness
K
Σ(t, St ; K, T ) = a − b( − 1)S0 , b > 0
St
It shifts the skew as the stock price moves by adjusting for moneyness, assuming that the market mean reverts to
the value a independently from market level. When the skew is nearly linear we get Σ ≈ Σ(St − K), so that the
implied volatility rises when S rises. In the BS-delta (see Equation (B.1.3)), the dependence on K and S occurs
only through the moneyness η, such that for ATM option we get |∆| = 12 . However, the delta depends on the
scaled moneyness √ 1 log η, which should be the fixed point. In presence of negative skew, the IV increases
ω(t)
with increasing spot, leading to a greater delta than BS-delta. Stochastic volatility models and jump-diffusion
models have this property (assuming the other variables do not change).
2. sticky implied tree : In the implied tree, local volatility increases as the underlying decreases. Assuming that
the IV is approximately the linear average of local volatilities between the initial spot price and the strike price,
we can extract the future local and implied volatilities from current implied volatilities. We get the linear
approximation to the skew
Σ(t, St ; K, T ) = a − b(K + St − 2S0 ) , b > 0
so that the implied volatilities decrease as K or S increases, leading to a smaller delta than BS-delta. Hence,
when linearly approximating the local volatility we get Σ ≈ f (K + St ) and volatilities are inversly correlated
with the underlying spot price. The sticky implied tree regime corresponds to a fear of higher market volatility
in the case of a fall of the spot price. Local volatility models have this property.
However, these simplified regimes of volatility have a couple of major drawbacks. First of all, they are not consistent
with market observable data, and secondly the way they are used to price forward call options is not derived from the
theory of No Arbitrage.
134
dynamics of the smile he had to re-introduce inhomogeneity in the stochastic model. On the contrary, for Hagan et al.
[2002] local volatility models (LVMs) predict smile dynamics opposite of observed market behaviour. LVMs predict
that the smile shifts to higer prices when the underlying asset decreases, and that the smile shifts to lower price when
spot price increases. Observing interest rate smiles, where asset prices and market smiles move in the same direction,
they devised the SABR model, which is space inomogeneous for β 6= 1 and allows to capture the correct dynamics of
the smile (in the rates market). Thus, we see that the specific model chosen characterise a particular type of dynamics
for the IVS, which in turn impose a particular delta hedging ratio.
∂Σ(t, St , K, T )
∆(t, St ) = ∂x C(t, St , K, T ) = ∂x CBS (t, St , K, T ; Σ) + ∂Σ CBS (t, St , K, T ; Σ) (4.1.1)
∂x
135
where ∂Σ(t,S∂xt ,K,T ) is the sensitivity of the implied volatility with respect to the spot price St = x,
∂Σ CBS (t, St , K, T ; Σ) is the Vega given in Equation (B.1.4), and ∂Σ CBS (t, St , K, T ; Σ) ∂Σ(t,S∂xt ,K,T ) is called the
shadow delta.
Remark 4.1.1 The new delta will be either bigger or smaller than the Black-Scholes delta, depending on the shape
of the implied volatility with repect to the spot price. If ∂Σ ∂Σ
∂x > 0, the new delta will be bigger, but If ∂x < 0, the new
delta will be smaller.
The difference between the two deltas is treated as a new source of risk, called the vega risk which can be hedged by
making the portfolio vega-neutral. Hence, given a portfolio of European
PN options, the sum of the derivatives of each
option with respect to its implied volatility should be null, that is, k=1 ∂σ CBS (t, St )|σ=Σk = 0. To be fully hedged,
we must also assume that the implied volatility Σ(t, St ; K, T ) for different strikes and different maturities changes
in the same way with respect to the underlying, that is, ∂Σ ∂x does not depend on K. Again, this assumption is rarely
satisfied.
Another approach taking into consideration the shadow delta is to make an assumption on the dynamics of the implied
volatility with respect to the underlying.
• For instance, Derman [1999] proposed the sticky delta regime also called the sticky moneyness regime. In that
regime, it is assumed that the dynamics of the implied volatility depends on the ratio m = K x (moneyness) for
K
x = St , but not on the two variables separately. Letting Σ = Σ( x ) and using the chain rule, we get
∂ 0 dm
Σ(m) = Σ (m)
∂x dx
0
∂
where Σ (m) = ∂m Σ(m). Thus, the implied delta becomes
K 0
∆(t, St ) = ∂x C(t, St , K, T ) = ∂x CBS (t, St , K, T ) − ∂Σ CBS (t, St , K, T ; Σ) Σ (m)
S2
0
Since in general the volatility decreases with increasing strikes and the Vega is positive, then Σ (m) < 0 and
the sticky delta regime implies
Σ(m) = a − bm , b > 0
d ∂
then dm Σ(m) = −b so that ∂x Σ(m) = b xK2 (which is positive).
• Alternatively, considering the sticky implied tree regime, the implied volatility satisfies Σ ≈ f (K + x), such
0
d
that dx f (K + x) = f (K + x) and the delta becomes
0
∆(t, St ) = ∂x C(t, St , K, T ) = ∂x CBS (t, St , K, T ) + ∂Σ CBS (t, St , K, T ; Σ)f (K + x)
Assuming the linear smile
Σ(t, St ; K, T ) = a − b(K + St − 2S0 ) , b > 0
0
we get Σ (K + x) = −b, such that the sticky implied tree regime implies
136
When delta-hedging a portfolio, the best hedge ratio is the one minimising the variance of the PnL. Comparing delta
hedging between the Black-Scholes model and the local volatility model, Crepey [2004] showed that BS hedging
strategy performed worst in fast selloffs or slow rise, while local volatility hedging strategy performed worst in slow
selloffs and fast rises. The equity index markets being negatively skewed, Crepey argued that the BS hedging strategy
is worse than the local volatility strategy.
∂Σ
Γ(t, St ) = ∂xx C(t, St , K, T ) = ∂xx CBS (t, St , K, T ) + 2∂xΣ CBS (t, St , K, T ; Σ) (4.1.2)
∂x
∂Σ 2 ∂2Σ
+ ∂ΣΣ CBS (t, St , K, T ; Σ) + ∂Σ CBS (t, St , K, T ; Σ) 2
∂x ∂x
where ∂xΣ CBS (t, St , K, T ; Σ) is the Vanna given in Equation (B.1.7) and ∂ΣΣ CBS (t, St , K, T ; Σ) is the Volga given
in Equation (B.1.8). Note, the gamma also depends on the second derivative of the implied volatility with respect to
the spot price St . In the sticky delta regime, using the chain rule, we get
∂2 ∂2 dm 2 ∂ d2 m
2
Σ(m) = 2
Σ(m)( ) + Σ(m) 2
∂x ∂m dx ∂m dx
∂2
Applying that result to the example above, the second derivatives becomes ∂x2 Σ(m) = −2b xK3 . However, in the
simple example of the sticky tree, the second derivative is null.
1
x x 1
K =K =K
x+α x(1 + ) 1+
137
and the new Black-Scholes implied volatility corresponds to a translation in the forward moneyness of the initial
implied volatility. In fact, its dynamics follow the sticky delta regime of volatility described in Section (4.1.1.2) with
strike K StS+α
t 1
= K (1+) .
Note, from the property of homogeneity of the BS-prices given in Equation (1.2.13) we can set K = K(1 + ), such
that η recovers the initial forward moneyness η, and the call price becomes
CT V (t, x, η(1 + ), T ; ) = x(1 + )e−q(T −t) N (d1 (T − t, η)) − ηN (d2 (T − t, η)) = (1 + )CT V (t, x, η, T )
∂Σ(t, St ; K, T ) ∂Σ(t, St ; K, T )
= kS (4.1.3)
∂St ∂K
where
0 if sticky strike
1 if sticky implied tree
kS = K
− St if sticky delta
K
St if minimum variance
where the minimum variance (MV) was introduced by Lee [2001]. Further, using the chain rule and making some
approximations 2 , we can relate the second derivative of the implied volatility with respect to the spot price St to that
with respect to the Curvarture as follow
∂ 2 Σ(t, St ; K, T ) ∂ 2 Σ(t, St ; K, T )
≈ k C (4.1.4)
∂St2 ∂K 2
where
0 if sticky strike
kC = 1 if sticky implied tree
K 2
( St ) if sticky delta and minimum variance
2 d2 m = 2 xK3 << 1.
dx2
138
where
1 1 1 √ √
d2 (t, ηt ) = √ log − Θ t and d1 (t, ηt ) = d2 (t, ηt ) + Θ t
Θ t ηt 2
Note, given the total variance ω(t) = Θ2 t, we recover the scaled price in Equation (1.2.9). Thus, the delta in Equation
(4.1.1) becomes
∂Θ(t, St , ηt , T ; Θ)
∆(t, St ) = ∂x C(t, St , ηt , T ) = ∂x CBS (t, St , ηt , T ) + ∂Θ CBS (t, St , ηt , T ; Θ)
∂x
Using the chain rule, we have
∂Θ(t, St ; ηt , T ) ∂Θ(t, St ; ηt , T )
= kS (ηt )
∂St ∂η
where
0 if sticky strike
1 P (t,T ) = ηt if sticky implied tree
St Re(t,T ) K
kS (ηt ) = P (t,T ) ηt
− K2 Re(t,T ) = − St if sticky delta
ηt St
St if minimum variance
P (t,T ) ηt
That is, we move from kS to kS (ηt ) by a factor of S1t Re(t,T ) = K . Note, the minimum variance (MV) delta, given
ηt
in the moneyness metric by kS (ηt ) = St , is the exact opposit of the term in the sticky delta regime (see Lee [2001]).
∂Σ(t,St ;K,T )
In any case, to compute the adjusted delta we need to estimate the market skew by computing either ∂K or
∂Θ(t,St ;ηt ,T )
∂η .
For example, Alexander et al. [2010] chose to fit the smile with the cubic polynomial
∂Θ(t, St ; ηt , T )
=b a2 (t)ηt2 + 3b
a1 (t) + 2b a3 (t)ηt3
∂η
To account for the fact that the implied volatility is non-Markovian, Alexander et al. modified the IV sensitivity to
the spot price, ∂x Θ(t, St , ηt , T ; Θ), by letting the correction term kS (ηt ) be dependent on the regime of volatility
observed over a recent period of historical data. At each time t, they fixed kS (t, St ; ηt , T ), computed ex-post the delta
hedging error on a sample of option prices, and chose the value for kS (t, St ; ηt , T ) minimising the resulting standard
deviation. The delta obtained has a regime dependent (RD) smile adjustment. To improve the smile dynamics further,
they added an extra term, independent from the moneyness, to the sensitivity of the IV with respect to the spot price,
getting
139
∂Θ(t, St ; ηt , T ) ∂Θ(t, St ; ηt , T )
= α(t, St ; T ) + kS (ηt )
∂St ∂η
Performing a linear regression of daily changes in ATM volatility on the daily log return of the underlying asset, they
set α
b(t, St ; T ) equal to the estimated slope coefficient, divided by the spot price. The delta obtained has a regime-
dependent shift (RS).
φ(St2 ∈ dx|Ft1 )
where Ft1 can be considered all available information at time t1 . Yet, the knowledge of this function is of fundamental
importance if we want to price forward start call options. This is the problem with Dupire’s local volatility model,
that is, while assuming the evolution of the volatility surface to be Markovian and considering an infinity of fixing
dates 3 , one can not control its evolution between the start date (previously denoted t1 ) and the maturity of the forward
start option, that is, its conditional density. However, when the number of fixing dates is finite, there is an infinity of
conditional densities, which is not the case in Dupire’s model. In fact, Rebonato [2002] explained that in a discret-
time model if the no-arbitrage violation is to be allowed there is an infinity of solutions for the future conditional
deterministic density, which means that the future is unknown and can not be derived from today’s information. One
solution is to add some structure by choosing a particular model with extra state variables other than the underlying
spot price. These extra processes need to be calibrated to forward starting options, which are directly linked to the
forward smiles (see Ayache et al. [2004]). This way, we let the market decide on the structure of the conditional
distributions. Put another way, we let the market decide on the dynamics of the volatility surface. However, the market
of forward start options is an OTC business which is model dependent and reflects the views of the parties but not
the market as a whole. As a result, volatility trading is about taking a view on the smile dynamics that will prevail,
where traders take a bet with or against the future smile. As explained by Derman [1999], depending on the regime
you think the market is, you adjust accordingly your BS-hedge. An alternative approach is to satisfy this infinity of
solutions (conditionals) by giving the forward smile a shape consistent with its historical evolution.
140
if we look at forward start call option with absolute pay-off 4 (ST − kSt1 )+ . By conditioning the forward start option
with information available at time t1 i.e. by the filtration Ft1 we get
Recall that the implied volatility for a given strike and maturity is the number that plugged in Black-Scholes formula
fit the European call market prices. So, from now on we consider a general form of implied volatility surface (IVS)
and assume that it is driven by a factor process, X, such as the one in Equation (3.2.10). Then, we get the following
hypothesis
Assumption 4.1.1
As there exists a one-to-one relation between the implied volatility surface Σ and the pdf φ with respect to the Lebesgue
measure λ, defined as the function such that for any set S the probability that Xt1 ∈ S at t1 given Xt0 at t0 is
Z
P (Xt1 ∈ S|Xt0 ) = φ(x, t1 , Xt0 , t0 )λ(dx)
S
The current factor evolution directly influences its future increment pdf, which means that economic agent anticipa-
tions are dynamic. Note that these dynamics neither appear in the Black-Scholes model nor in Dupire model, where
agent anticipations are assumed to be static. Linking the factors and smile together is to say that the smile effect, which
characterises agent beliefs for future evolution of the stock, is dynamically modified according to factors realisations.
This is a way of modelising dynamic, i.e. rational anticipations.
A consequence of such an hypothesis (4.1.5) is that call prices are all determined by factor levels and consequently
S risk. Basically, the σ-algebra Ft , which is a priori the σ-algebra
that the factor process X holds the whole market
generated by {ηu , u ≤ t} where ηu = {Xu ∪ K,T C(u, Su , K, T − u)} coincides with σ(Xu , u ≤ t) for ηu is fully
determined by Xu :
Ft = σ{ηu , u ≤ t}
Moreover the dependency of Σt on Xt conserves the Markov property for X. Therefore, considering the payoff of a
call option in Section (1.7.1), the forward starting call option becomes
d.XT
e
C F (t0 , t1 , k, T ) = e−r(T −t0 ) E ed.Xt1 E ( d.Xt − k)+ | Xt1 | Xt0
e 1
Since the stock price at time t1 is St1 = ed.Xt1 , we will keep this notation for clarity when appropriate. Contrary
to the lognormal hypothesis where the increments of the process log S are stationary and independent (see Section
(1.2)) this is no longer the case in the jump-diffusion model and one must rely on taking the stock price at time t1 as
numeraire where QS is a martingale measure equivalent to Q such that
4 There exists forward start call options which pay their owner the difference, if positive, between the performance and the relative strike k i.e.
( SST − k)+
t1
1 σ 2 )(T −t)
x−log K+(r− 2
5 In the Black & Sholes formula, we get CBS (t, x, σ) = ex N [d1 ] − Ke−r(T −t) N [d2 ] with d2 = √
σ T −t
.
141
dQS 1
RT RT √
= e− 2 t Vs ds+ t Vs dWY (s)
dQ
and performing the change of measure
p
dWYS (t) = dWY (t) − Vt dt
which is a Brownian motion under the QS probability measure. When the Brownian motions of the forward price and
that of its instantaneous variance Vt are independent under the Q probability measure then the process Vt is unchanged
when changing probability measures. However, when the Brownian motions are correlated then the process Vt is
modified but only the order of magnitude of its parameters is changed. Therefore, for clarity of exposition we will not
change our notation when describing the instantaneous variance. Examples can be found in Zhu [2007] in the case of
the libor process and in Kruse [2003] in the case of the Heston model. The forward starting call option under the QS
probability measure becomes
B(t1 ) 1
C F (t0 , t1 , k, T ) = St0 ES E (ST − kSt1 )+ | Xt1 | Xt0
B(T ) St1
where B(·) is the money market. Introducing the pdf φS we get
Z
B(t1 )
C F (t0 , t1 , k, T ) = e−d.xt1 E (ST − ked.xt1 )+ | Xt1 = xt1 φS (xt1 , t1 , Xt0 , t0 )λ(dxt1 )
St
B(T ) 0
for some d ∈ Rn . Now, the integrand is
Z
−d.xt1 d.xt1 + −d.xt1
(ed.xT − ked.xt1 )+ φ(xT , T, xt1 , t1 )λ(dxT )
e E (ST − ke ) | Xt1 = xt1 = e (4.1.7)
which is independent from the stock price St1 . As we linked the implied volatility surface Σ to the factor level Xt and
the time to maturity T − t, and as there exists a one-to-one relation between the implied volatility surface Σ and the
pdf φ, we can consider φ a function φ(., T, Xt , t) of factor level and time to maturity. It actually means that, given
Xt1 , the density φ(., T, Xt1 , t1 ) is known.
We can now compute the quantity E (ST − ked.xt1 )+ | Xt1 = xt1 for any moneyness k. In our Jump-Diffusion
model, we have :
e−d.xt1
Z
C F (t0 , t1 , k, T ) = P (t1 , T )St0 CJD (t1 , xt1 ; ked.xt1 , T )φS (xt1 , t1 , Xt0 , t0 )dλ(xt1 ) (4.1.8)
P (t1 , T )
1
= St0 ES
CJD (t1 , Xt1 ; kSt1 , T )|Xt0
St1
which implies that the forward start call option price is given by the discounted expectancy under the pricing measure
QS of future European calls prices starting at t1 and fixing at T . Note that these products do not exist and cannot
be priced today unless we make some assumptions on future anticipations. This is actually what we are doing here.
There are different methods for computing such an expectancy. For example, we can consider pricing the forward start
option by using the characteristic function of the call price, we can also consider using the density of the underlying
process at the determination time of the strike, or we can consider an approach consisting in applying Ito’s lemma to
the approximated call price at time t1 .
142
In order to understand the performance of a hedging portfolio derived from misspecified volatility, we consider
an option on a stock whose volatility is unknown and stochastic. That is, observing the true stock price, the hedger
incorrectly computes the price of the contingent claim and the corresponding hedging portfolio. Given deterministic
interest rates, and assuming a volatility dependent only on time and the current stock price, El Karoui et al. [1998]
showed that the price of a European and American contingent claim with convex payoffs is a convex function of
the price of the stock. Examining the performance of a hedging portfolio derived from misspecified volatility, they
found that if the misspecified volatility dominates (or is dominated by) the true volatility, then the self-financing value
of the misspecified hedging portfolio exceeds (or is dominated by) the payoff of the contingent claim at expiration.
Alternatively, assuming that they know the true forecasted constant volatility in the Black-Scholes world, Ahmad et
al. [2005] built a replicating portfolio by buying an underpriced option and delta hedging it to expiry with the actual
volatility.
We are going to illustrate these two approaches by considering a trader expecting the underlying stock price S to go
down, and choosing to sell a call option with payoff h(ST ) at maturity T . Assuming the Black-Scholes world, to hedge
himself the trader can use the option premium π0 to build a replicating portfolio V (t) = π∆ (t) for 0 ≤ t ≤ T . He is
then going to dynamically buy a quantity of the underlying stock to hedge the risk of paying the payoff at maturity.
That is, he is going to monitor the option price P (t) for t ∈ [0, T ] and risk-manage the self-financing portfolio V (t)
with initial value being the option premium x = π0 , making sure that his tracking error (T ) = V (T ) − h(ST ) is null
at maturity. While the market prices vγ (t, St ) with misspecified volatility γ correspond to the expectected value of
the future financial flux to be paid, the replicating portfolio is directly observed on the market. Hence, the remaining
question is that of the choice of the volatility when computing the delta-hedging portfolio.
143
Assumption 4.2.1 We define the money market M (t) and the stock price S(t) on a probability space (Ω, F, P ) and
adapted to a filtration {Ft ; 0 ≤ t ≤ T }. In addition their dynamics are given by
Rt
r(s)ds
M (t) = e 0
dSt
= r(t)dt + σ(t)dW (t)
St
where {W (t); 0 ≤ t ≤ T } is a one-dimensional Brownian motion adapted to Ft for 0 ≤ t ≤ T , σ is a non-negative
RT
volatility process satisfying E[ 0 σ 2 (t)dt] < ∞, and r is a deterministic interest rate process.
2
S S (t)
Letting M be a martringale and assuming E[ M 2 (t) ] < ∞ for 0 ≤ t ≤ T , we consider a bounded adapted portfolio
Definition 4.2.1 Given a non-random initial portfolio value π∆ (0), the self-financing value of a portfolio process ∆
is the solution of the linear SDE
dπ∆ (t) = r(t) π∆ (t) − ∆(t)S(t) dt + ∆(t)dS(t) (4.2.9)
with solution
Z t
S(u)
π∆ (t) = M (t) π∆ (0) + ∆(u)d( )
0 M (u)
S(t)
Since M (t) is assumed to be a square-integrable martingale, and ∆ is bounded, the discounted portfolio value πM∆(t)
(t)
is a square integrable martingale. We now define a payoff convex function h on (0, ∞), having bounded one-sided
derivatives as
0
h (x±) ≤ C , ∀x > 0
for a positive constant C. In that setting, a price process for a European option is any adapted process {P (t); 0 ≤ t ≤
T } satisfying the terminal condition P (T ) = h(S(T )) a.s.. In a complete market the arbitrage price of a European
option is
h(S(T ))
PE (t) = M (t)E |F (t) , 0 ≤ t ≤ T (4.2.10)
M (T )
We assume that we short the option P and hedge it by being long a quantity ∆ of the stock and some cash. We now
define our measure of error, the Tracking Error associated with (P, ∆), which is the process
144
Remark 4.2.1 A hedger who incorectly estimates the volatility of the underlying will incorectly compute the European
option price and hedging portfolio.
Let (P, ∆) be the result of such a computation. If the hedger starts with a portfolio with initial value π∆ (0) = P (0)
e
and uses the portfolio process ∆, at maturity, the portfolio will be π∆ (T ). If (P, ∆) is a superstrategy, then M is
non-decreasing, and since e(0) = 0 we have
π∆ (T ) = P (T ) + e(T ) ≥ h(S(T ))
and
• the hedger has successfully hedged a short position in the contingent claim.
• A substrategy (the negative of the portfolio process of a substrategy) hedges a long position, and the above
inequality is reversed.
• If the market is complete, there exists a unique replicating strategy for the European contingent claim with price
process given in Equation (4.2.10).
Remark 4.2.2 Allowing for the misspecified volatility γ to be only dependent on the current stock price, comparisons
between σ and γ lead to comparisons of contingent claim prices and performance of hedging portfolios.
Assumption 4.2.2 The functions r : [0, T ] → R and γ : [0, T ] × (0, ∞) → [0, ∞) are Holder continuous.
We write the dynamics of the misspecified underlying asset price process as
dSγ (u) = Sγ (u) r(u)du + γ(u, Sγ (u))dW (u) , t ≤ u ≤ T
with initial condition Sγ (t) = x for x > 0. The misspecified option value at time t is
RT
vγ (t, x) = E e− t r(u)du h(Sγ (T )) , 0 ≤ t ≤ T , x > 0
(4.2.11)
Remark 4.2.3 If the stock prices Sγ (t) were governed by the above SDE, the market would be complete and vγ (t, x)
would be the arbitrage price of the contingent claim.
The option prices satisfy the PDE in Equation (1.6.30) which we rewrite as
∂vγ 1 ∂ 2 vγ
(t, S(t)) = r(t)vγ (t, S(t)) − r(t)S(t)∆γ (t) − γ(t, S(t))2 S(t)2 (t, S(t))
∂t 2 ∂x2
∂v
where ∆γ (t) = ∂xγ is bounded. Observing the true stock price S(t), the hedger (incorrectly) computes the contingent
claim price Pγ (t) = vγ (t, S(t)) and uses the hedging portfolio ∆γ (t) = ∂x vγ (t, S(t)). Starting with the initial value
vγ (0, S(0)), the dynamics of the self-financing replicating portfolio is given in Equation (4.2.9). Alternatively, we can
145
express the option price process with respect to the true stock process as Pγ (t) = vγ (t, S(t)) and find its SDE using
Ito’s lemma
∂vγ ∂vγ 1 ∂ 2 vγ
dPγ (t) = (t, S(t))dt + (t, S(t))dS(t) + σ 2 (t)S 2 (t) 2 (t, S(t))dt
∂t ∂x 2 ∂x
∂vγ
Now, replacing ∂t (t, S(t)) in the above equation with the value previously calculated with the misspecified volatility,
we get
1 ∂ 2 vγ
dPγ (t) = r(t)Pγ (t)dt + ∆γ [dS(t) − r(t)S(t)dt] + [σ 2 (t) − γ 2 (t, S(t))2 ]S 2 (t) 2 (t, S(t))dt
2 ∂x
Given the tracking error eγ (t), with SDE
1 ∂ 2 vγ
deγ (t) = r(t)eγ (t)dt + [γ 2 (t, S(t)) − σ 2 (t)]S 2 (t) 2 (t, S(t))dt
2 ∂x
Using the relation between the gamma and the vega in Equation (B.1.6), we can rewrite the dynamics of the tracking
error as
1 ∂ 2 vγ
d(e−rt eγ (t)) = e−rt [γ 2 (t, S(t)) − σ 2 (t)]S 2 (t) 2 (t, S(t))dt
2 ∂x
Integrating in the range [0, t], the tracking error eγ (t) is given by
Z t
1 1 ∂ 2 vγ
eγ (t) = M (t) [γ 2 (u, S(u)) − σ 2 (u)]S 2 (u) 2 (u, S(u))du (4.2.12)
2 0 M (u) ∂x
such that to be over-hedged we require
π∆γ (T ) ≥ h(S(T ))
Inversly, if σ(t) ≥ γ(t, S(t)), then (Pγ , ∆γ ) is a substrategy.
6 ert d(e−rt eγ (t)) = deγ (t) − r(t)eγ (t)dt
146
This theorem shows that successful hedging is possible even under significant model error. It depends on the rela-
tionship between the misspecified volatility γ(t, S(t)) and the true volatility σ(t). In presence of a superstrategy, the
trader makes a profit with probability one even though the true price model is different from the assumed one as long
∂2v
as Γt = ∂x2γ (t, S(t)) ≥ 0. Further, since the hedging error depends on the option convexity gamma, for small gamma
the hedging error is small, irrespective of the model. The expected PnL is maximum when the stock price is close to
the money at maturity, where Gamma is largest, and when the instantaneous volatility is large at maturity. That is,
the PnL depends on the path taken by the underlying asset. Two trajectories with the same average volatility ending
at-the-money can lead to two very different PnL.
1 ∂ 2 vγ
dCγ (t) = r(t)Cγ (t)dt + ∆γ [dS(t) − r(t)S(t)dt] + [σ 2 (t) − γ 2 (t, S(t))]S 2 (t) 2 (t, S(t))dt
2 ∂x
where γ(t, S(t)) is the misspecified volatility. The Tracking Error associated with (Cγ , ∆a ) is the process
1 ∂ 2 vγ
d(t) = r(t)dt + (∆a (t) − ∆γ (t))[dS(t) − r(t)S(t)dt] + [γ 2 (t, S(t)) − σ 2 (t)]S 2 (t) 2 (t, S(t))dt
2 ∂x
which is equal to
1 ∂ 2 vγ
ert d(e−rt (t)) = (∆a (t) − ∆γ (t))[dS(t) − r(t)S(t)dt] + [γ 2 (t, S(t)) − σ 2 (t)]S 2 (t) 2 (t, S(t))dt
2 ∂x
and correspond to the Tracking Error from time t to t + dt. Discounting with P (t0 , t) = e−r(t−t0 ) to get the Tracking
Error at time t0 , we get
1 ∂ 2 vγ
ert0 d(e−rt (t)) = P (t0 , t)(∆a (t)−∆γ (t))[dS(t)−r(t)S(t)dt]+ P (t0 , t)[γ 2 (t, S(t))−σ 2 (t)]S 2 (t) 2 (t, S(t))dt
2 ∂x
So, the total Tracking Error from time t0 to maturity T is
147
Z T Z T
ert0 d(e−rt (t)) = P (t0 , T )(T ) = P (t0 , u)(∆a (u) − ∆γ (u))[dS(u) − r(u)S(u)du]
t0 t0
T
∂ 2 vγ
Z
1
+ P (t0 , u)[γ 2 (u, S(u)) − σ 2 (u)]S 2 (u) (u, S(u))du
2 t0 ∂x2
since (t0 ) = 0.
In the case of time-dependent volatility, both the dynamics of the call price CBS (t, St ; σ(t)) with true forecasted
volatility σ(t) and the dynamics of the call price CBS (t, St ; γ(t)) with misspecified volatility γ(t) satify the Equation
(1.2.5). Further, the two BS prices are solutions to the parabolic equation (1.6.30) with flat volatility σ(t) and γ(t)
respectively. As we assumed that our forecasted volatility was correct, the dynamics of the former price is
1 2
dCBS (t, St ; γ(t)) =(σ − γ 2 (t))St2 ∂xx CBS (t, St ; γ(t)) + rCBS (t, St ; γ(t)) + ∆γ (t)(dSt − rSt dt)
2 t
Now, taking the difference between these two theoretical option values with different volatilities, we get
dCBS (t, S; σ(t)) − dCBS (t, S; γ(t)) = r(CBS (t, S; σ(t)) − CBS (t, S; γ(t))dt + (∆a (t) − ∆γ (t))(dSt − rSt dt)
1 2
+ (γ (t) − σt2 )St2 ∂xx CBS (t, St ; γ(t))
2
which is equal to
ert d e−rt (CBS (t, S; σ(t)) − CBS (t, S; γ(t))) = (∆a (t) − ∆γ (t))(dSt − rSt dt)
1 2
+ (γ (t) − σt2 )St2 ∂xx CBS (t, St ; γ(t))
2
and correspond to the Tracking Error from time t to t + dt. Discounting with the zero-coupon bond P (t0 , t) =
e−r(t−t0 ) , to get the Tracking Error at time t0 , we get
ert0 d e−rt (CBS (t, S; σ(t)) − CBS (t, S; γ(t))) = P (t0 , t)(∆a (t) − ∆γ (t))(dSt − rSt dt)
1
+ P (t0 , t)(γ 2 (t) − σt2 )St2 ∂xx CBS (t, St ; γ(t))
2
So, the total Tracking Error from time t0 to maturity T is
Z T
ert0 d e−rt (CBS (t, S; σ(t)) − CBS (t, S; γ(t))) = CBS (t0 , S; γ(t)) − CBS (t0 , S; σ(t))
t0
Z T Z T
1
= P (t0 , u)(∆a (u) − ∆γ (u))(dSu − rSu du) + P (t0 , u)(γ 2 (u) − σ 2 (u))Su2 ∂xx CBS (u, Su ; γ(u))
t0 2 t0
148
In the case of forecasted volatility, when integrating the present value of all of the profits over the life of the option
to get a total profit, the result is always positive, but highly path dependent. Hence, maximising the total profit will
depend on the trajectory of the stock price, and more precisely its historical drift term. In that setting, the PnL is
∂2v
always increasing in value but the end result is random, and we are left with maximising S 2 (t) ∂x2γ (t, S(t)). That
is, we want maximum fluctuations in the trajectory of the stock price. However, when hedging with a delta based on
implied volatility, the profit made each day is deterministic (with constant volatility), but the present value of total
profit up to expiration is path dependent, and given by the tracking error (T ). Ahmad et al. [2005] considered
P (t, S, ) to be the real expected value of the PnL with known terminal condition P (T, S, ) = (T ), and derived its
partial differential equation, assuming a solution of this equation of the form
P (t, S, ) = + F (t, S)
Obtaining analytical solutions, they analysed the expected profit versus the growth rate µ of the stock price. They
found that to maximise the expected profit, the growth rate must ensure that the stock ends up close to, or, at the
money at expiration, where gamma is largest. Further, for most realistic parameter regimes the maximum expected
profit hedging with implied volatility is similar to the guaranteed profit hedging with actual volatility. Practically,
considering the expected profit versus the strike K and the drift µ, the higher the growth rate the larger the strike price
at the maximum. Introducing the skew effect, when the actual volatility is ATM there is no maximum, and profitability
increases with distance away from the money. Similarly, they calculated the variance in the final profit V ar((T ))
using implied volatility by solving the PDE of v(t, S, ).
h(S(τ ))
ess sup E[ |F(t)] , 0 ≤ t ≤ T
τ ∈Tt M (τ )
where Tt is the set of stopping times τ satisfying t ≤ τ ≤ T almost surely. Bensoussan [1984] and Karatzas [1988]
showed that in a complete market, the arbitrage price process PA of an American option with payoff h satisfies
h(S(τ ))
PA (t) = M (t) ess sup E[ |F(t)] , 0 ≤ t ≤ T
τ ∈Tt M (τ )
An optimal exercise time is
149
h(S(T ))
PA (t) = M (t)E[ |F(t)] , 0 ≤ t ≤ T
M (T )
and the price processes for the European and American options agree.
Theorem 4.2.2 Working under the Hypothesis (4.2.1), in a complete market, if r(t) ≥ 0 for all t ∈ [0, T ] and
h(0) = 0, then the European and American contingent claim price processes coincide.
When shorting the American option, the hedger must be prepared to hedge it all the way to maturity. However, if he
is long the option the optimal exercise time is
Theorem 4.2.3 Given the Assumptions (4.2.1), (4.2.2) and the optimal exercise time Dγ , if
150
where the first part corresponds to the price variation of the option, the second one to the stock price move, and the
last one is the risk-free return of the amount of cash. Setting V = V (t, S, σ) and applying Ito’s lemma on dVt , we get
1 2 ∂2V ∂V ∂V ∂V
σimp St2 2 + rSt
d(t) = − − rVt dt + dSt + dσt
2 ∂S ∂S ∂S ∂σ
1 ∂2V ∂2V ∂2V
+ 2
< dS, dS >t + 2
< dσ, dσ > t +2 < dS, dσ >t − δdSt + (δSt − πδ (t))rdt
2 ∂S ∂σ ∂S∂σ
∂V
Assuming the trader delta-hedge himself with the quantity δ = ∂S , the PnL simplifies to
1 2 dSt 2 2
∂V 1 ∂2V ∂2V
πV (t, t + dt) = ΓSt ( ) − σimp dt + dσt + < dσ, dσ > t + σt St < dW, dσ >t
2 St ∂σ 2 ∂σ 2 ∂S∂σ
where πV (t, t + dt) = ert d(e−rt (t)). In trading terms, it can be expressed as
1 2 dSt 2 2
1
πV (t, t + dt) = ΓSt ( ) − σimp dt + Vega dσt + Volga < dσ, dσ >t + Vanna σt St ξρS,σ dt (4.2.14)
2 St 2
where ρS,σ is the correlation between the stock price and the volatility, and ξ is the volatility of volatility.
151
Note, in the literature some authors use only Equation (4.2.16) to represent the PnL of an option, where Θ > 0 when
we short the option and Θ < 0 when it is bought. When using a stochastic volatility model, we can approximate the
PnL for being long the option and short the replicating portfolio as
dSt 2 1
πV (t, t + dt) ≈ Θ 1 − ( √ ) dt + Vega dσ + Volga < dσ, dσ >t + Vanna σt St < dW, dσ >t (4.2.17)
St σ dt 2
152
153
A multifractal market
CS (t; K, T ) = P (t, T )a(t, T )Et [(YT − k)+ ] = P (t, T )a(t, T )CY (t; k, T ) = C(t, Yt ; k, T )
where a(t, T ) and b(t, T ) are deterministic functions to be specified, and where k = K−b(t,T
a(t,T )
)
and CY (t; k, T ) =
Et [(YT − k)+ |Yy = y]. Hence, a discrete set of option prices on S determines a discrete set of option prices on Y . If
we reverse the process we get K = a(t, T )x + b(t, T ), for some x > 0, and the call price becomes
1
CY (t; x, T ) = CS (t; a(t, T )x + b(t, T ), T )
P (t, T )a(t, T )
However, since the market only quotes strike K, we should use the price
1
CY (t; k, T ) = CS (t; K, T ) (5.1.2)
P (t, T )a(t, T )
expressed in terms of (k, T ) when stock price has discrete dividends. As explained by Buehler [2008], in presence
of discrete dividends the no-arbitrage conditions no-longer hold for S, but it does for the pure process Y . As a result,
one must satisfy the no-arbitrage conditions in the Y-space and have positive butterflies ∂kk CY (t; k, T ) ≥ 0 and
154
positive calendar spreads ∂T CY (t; k, T ) ≥ 0 for all maturities T ≥ 0 and all strikes k ≥ 0. Hence, the Black-Scholes
implied volatility defined in Equation (1.4.14) is not the appropriate number for observing pure volatility risk in order
to compare option prices. When using this mapping device, traders have no tools for identifying the inherent violation
of arbitrage bounds implied by the presence of discrete dividends and can no-longer properly manage their books.
Since the no-arbitrage constraints are only defined in terms of the pure process Y , one should use a mapping device on
Y by directly considering the implied volatility of Y . Hence, given the option price CY (t; k, T ) at time t for a strike
k and a maturity T , the pure implied volatility Σ(t, Yt ; K, T ) satisfies
Re(t, T ) RT
C(t, T ) = = e t µs ds
P (t, T )
be the capitalisation factor from time t until time T . Further, we let Dt be the dividends paid till time t given by
∞
X
Dt = H(t − tdi )di C(tdi , t)
i=0
D(t, T ) DT
DP V (t; t, T ) = = − Dt
C(t, T ) C(t, T )
Having made an assumption on discrete dividends, for simplicity of exposition, we are going to consider the dynamics
of the stock price in the Spot model (see details in Appendix (B.2.3)). The stock price dynamics in the spot model and
under the risk-neutral measure Q become
St = C(0, t)Yt − Dt
dYt = Yt σY,t dWt
Y0 = S0
with a(t) = C(0, t), b(t) = −Dt and where σY,t is the instantaneous volatility which is independent from the discrete
dividend Dt . Using Ito’s calculus we can derive the process for the stock price dynamics as
0
dSt = µt St + (St + Dt )σY,t dWt − Dt dt (5.1.4)
In that setting the price of a European call option with strike K and maturity T is
155
00 00
RT
C(t, Yt , K , T ) = EtQ [e− t
qs ds
(YT − K )+ ]
00
where K = K+D(t,T )
C(t,T ) .
For notational purpose we let Zt = C(0, t)Yt be the asset price St plus the forward value of all dividends paid from
inception up to time t. Thus, the spot price becomes
St = Zt − Dt
and Zt is a continuous process with dynamics given by
with a correlation structure between the n-dimensional Brownian motion (Wti )1≤i≤N,t>0 given by
156
As explained in Appendix (B.2), in the spot model the forward price is given by
Re(t, T ) X Re(ti , T )
F (t, T ) = St C(t, T ) − D(t, T ) = St − di (5.1.7)
P (t, T ) P (ti , T )
ti ∈[t,T ]
P Re(ti ,T )
where D(t, T ) = ti ∈[t,T ] di P (ti ,T ) . For indices, we usually use an approximation by projecting the discrete
dividends to a continuous dividend yield, that is, St pays continuously qSt dt. Note, since the share price St can not
become negative, it implies that the forward price must be positive. Merton [1973] first noted that reducing a call
option by the amount of any dividend paid on the stock could reduce the price of the stock and the value of the option
to zero no matter what adjustment was made in the exercise price of the option. He pointed out that there maybe
was no adjustment in terms of the option capable of providing adequate protection against a large dividend. From the
definition of the forward price above, we can get the condition
D(t, T )
St ≥ = DP V (t; t, T )
C(t, T )
meaning that the price process (St )[t,T ] is floored by the deterministic process DP V (t; t, T ). Hence, the stock price
must always exceed the discounted expected value of all forthcoming dividends. As a result, the stock price can not
be modelled with a random process which can be arbitrarily close to zero. When modelling the stock price, we must
separate the volatility risk of the stock from the other charactersistic of the equity, which is what we did in the spot
157
model. Alternatively, Buehler [2008] proposed to modify the spot price as S0∗ = S0 − DP V (0; 0, T ) and obtained the
formula
C(t, St , K, T ) − P (t, St , K, T ) = P (t, T ) St C(t, T ) − D(t, T ) − KP (t, T )
= St Re(t, T ) − P (t, T )D(t, T ) − KP (t, T ) (5.1.8)
Dividing Equation (5.1.8) by the zero-coupon bond P (t, T ), we get the put-call parity expressed as
Re(t, T )
C(t, St , K, T ) − P (t, St , K, T ) = St − D(t, T ) − K (5.1.9)
P (t, T )
Re(t, T ) 1 Re(t, T )
St −K ≤ C A (t, St , K, T ) − P A (t, St , K, T ) ≤ St −K (5.1.10)
P (t, T ) P (t, T ) P (t, T )
In presence of discrete dividends the inequality becomes
Re(t, T ) 1 Re(t, T )
St − D(t, T ) − K ≤ C A (t, St , K, T ) − P A (t, St , K, T ) ≤ St −K
P (t, T ) P (t, T ) P (t, T )
158
T, the theoretical value of a forward contract between times t and T is given by Equation (5.1.7), where ti = tdi
represents the ex dividend date of the ith dividend. We assume a series of forward contracts, increasing in maturity,
such as that seen in Figure (5.1). We assume that there exist N forward contracts, each with expiry Ti , i = 1, ..., N
between times t and TN . Further, between each forward maturity Ti , the equity in question may also pay dividends.
However, the theoretical price of each forward contract does not allow for clear identification of the repo rate in each
period. We therefore seek an alternative representation to that given in Equation (5.1.7) by allowing for separation of
the repo factors. Note initially that the value of the first forward, expiring at date T1 remain unchanged. Then, due to
the multiplicative nature of the repo and discount factors when rates are deterministic, and by splitting the dividend
payments, we can rewrite the value of the second forward contract as
Re(T1 , T2 ) X Re(tdi , T2 )
= F (t, T1 ) − di
P (T1 , T2 ) P (tdi , T2 )
tdi ∈[T1 ,T2 ]
Hence we are able to express the T2 maturity forward in terms of the T1 maturity forward, and in doing so separate
the repo factors in the two time periods. Similarly, we can iterate this process through to time TN , resulting in the
recursive relationship
Re(TN −1 , TN ) X Re(tdi , TN )
F (t, TN ) = F (t, TN −1 ) − di (5.1.11)
P (TN −1 , TN ) P (tdi , TN )
tdi ∈[TN −1 ,TN ]
the repo factor satisfies Re(0, t) = e−q(t)t . We set F s (t, T ) = F (t, T, q), and find qi = q(Ti ) such that
and setting
159
we can approximate the repo rate qTi−1 ,Ti for the period [Ti−1 , Ti ] as
F s (t, Ti ) − Fb(t, Ti )
qTi−1 ,Ti ≈ −
Fb0 (t, Ti )
0
St P di
where Fb (t, T ) = P (t,T ) (T − t) − ti ∈[t,T ] P (ti ,T ) (T − ti ).
Definition 5.2.1 We define the American implied volatility as the level of volatility we need to plug into a basic pricer
(bp) to recover the market prices of the option.
Hence, given the American option price CA (t, St , K, T ) at time t for a strike K and a maturity T , the market implied
volatility ΣA (t, St ; K, T ) satisfies
160
where Cbp (t, St , K, T ; σ) is the basic pricer for an American option under the univariate lognormal model (UL).
As explained in Section (5.1.2.3), the put-call parity exists for European vanilla option prices, where the call option
price can be calculated from the put option price with the same parameters, but it does not hold for American options.
Thus, one can not infer the market forward price from American options. Further, the American smile for call options
does not equal the one for put options. We give an idea of how to prove that result. Given the decomposition
of the American call option in Equation (5.2.12), if we assume that the early exercise premium A is small with
respect to CA (t, St ), then we can perform a Taylor expansion on prices (see Appendix (B.3.1 )) to approximate
the American implied volatility. Given the volatility decomposition in Equation (B.3.14), we can approximate the
American volatility as follow
1
ΣA (K, T ) ≈ ΣE (K, T ) + A + ...+ (5.2.14)
V ega(ΣE (K, T ))
where V ega(σ) = ∂σ CA (t, St ). Since it may be optimal to exercise immediately American put options, which is not
the case for American call options, the early exercise premium for call options differ from that for put options. Thus,
the American smile for call options is not the same as that for put options.
161
That is, the put-call parity in the BS-formula holds if and only if dcall
1 = dput call put
1 which leads to σBS = σBS . Even though
different implementations of the BS-formula, due to different methods of handling dividends, result in different implied
volatility surfaces, we expect the volatility surfaces to be very similar in shape (see Haugh [2009]). The seminal work
of Breeden et al. [1978] on option-implied risk-neutral distribution paved the way to implied trees, model-free
implied volatility and implied skewness and kurtosis. However, this option-implied approach requires market prices
of European options across a wide range of strike prices. This is because the absence of butterfly arbitrage, which
corresponds to non-negative density (see Section (1.7.2.4)), only applies to market prices of European options across
a wide range of strike prices. In addition, as explained in Section (5.1.2.3), the put-call parity exists for European
vanilla option prices, where the call option price can be calculated from the put option price with the same parameters,
but it does not hold for American options. As a result, when considering American options there is no guarantee that
the implied volatility (IV) call equals the IV put, leading to two different smiles of volatility for a given maturity with
the largest discrepancy being around-the-money forward. As an example of this discrepancy, we plot in Figure (5.2) a
European smile versus an American smile. It is therefore pointless to fit an American implied smile with a polynomial
model. However, when systematicall trading options we rely on an option-implied model to fit the implied volatility
surface. We must therefore modify the American prices to recover a single, smooth, smile maturity per maturity. A
similar problem arise when one has to infer the option-implied risk-neutral distribution and requires the European
option prices in the estimation of these moments. Hence, the inability to handle American options may prevent the
application of the option-implied methodology to a broader range of option classes.
162
for a given maturity and strike, and back out the corresponding American implied volatility using Equation (5.2.13).
We then use this American implied volatility in the same CRR tree, but without the American feature, to determine the
price of the corresponding European option. The only assumption made in this approach is that the American implied
volatility is identical to the European implied volatility. An alternative approach is to use an analytical approximation
to extract the early exercise premium. Even though these methods are simple to implement, they do not work when
the options are deep in-the-money, and they treat each option independently. In the former approach we use the CRR
binomial tree as a tool to map out the implied volatility smile across American options. However, in the latter approach
we rely on the lognormality assumption, which is not consistent with the market. Tian [2010] proposed an accurate
method for extracting risk-neutral density and its moments from American option prices by using an iterative Implied
Binomial (iIB) tree. Calculating the early exercise premiums on the American options, they are then removed from
the market prices of the American options until the extracted early exercise premiums converge. We are now going to
present that method.
CA (t, St ) = CE (t, Xt )
= P (t, T )E Q [h(XT )|Ft ]
Z ∞
= P (t, T ) h(x)φX (t, x)dx
−∞
where CE (t, Xt ) is a synthetic European option with St is replaced by Xt . We are therefore interesed in finding the
density φX (t, ·), called the implied risk-neutral density (IRND) of the American option, such that
163
Note, given the market anomalies, many different densities φX (t, ·) can satisfy Equation (5.2.15). That is, the implied
risk-neutral density is model dependent.
Remark 5.2.1 In the Black-Scholes framework, the only degrees of freedom we have to find a process Xt satisfying
Equation (5.2.15) is to either play with the repo rates or with the volatility.
Assuming that the European options CE (t, St ) are known, several authors proposed to use the European density
φS (t, ·) to approximate the IRND density φX (t, ·). Using the multivariate extension of the Edgeworth expansion
(MGEE) (see Jarrow et al. [1982]), Arismendi [2013] proposed to approximate the density as follow
2
X (−1)j ∂j
φ X ≈ φS + M[l1 ,[l2 ]] φS (5.2.16)
j=1
j! ∂s[l1 ] ∂s[l2 ]
where Ml1 and Ml1 , l2 are the differences between the first and second-order moments of the risk-neutral densities
φX (t, ·) and φS (t, ·). There are two ways of considering Equation (5.2.16):
D1 The density φS (t, ·) is known, we can then estimate the difference of the cumulants M [l1 , [l2 ]] and compute the
implied density φX (t, ·).
D2 The density φS (t, ·) is not known, but the density φX (t, ·) and the difference of the cumulants M [l1 , [l2 ]] are
known. We can then approximate the density φS (t, ·).
Tian [2010] considered the problem D2 in the univariate case. Arismendi et al. [2014] considered the problem D1
in the multivariate case. As discussed in Section (5.2.2.1), single stock options are American, while European options
are OTC. Thus, in practice one do not know the density φS (t, ·) and can not solve the problem D1.
164
money forward. That is, the inequality in Equation (5.1.10) is not strong enough to guarantee the equality of implied
volatility smile. Nonetheless, practitioners assume that the put-call parity holds for ATM American options within
reasonable approximation. Ignoring discrete dividends, the implied dividend dimp is computed as follow
1 h P (t, T ) i
dimp (t, T ) = − ln C A (t, St , KAT M , T ) − P A (t, St , KAT M , T ) + KAT M
T −t St
Difficulty arises when we can not find American options in the market with strike close enough to KAT M . In the
single stock market, there does not exist a continuum of quoted option prices for all possible strike prices, meaning
that it is often unlikely to find the relevant option prices. One way forward is to linearly interpolate the option prices
to achieve a synthetic price at-the-money. Although linear interpolation is a rudimentary technique, the option prices
should behave almost linearly at-the-money. Doing so, we should expect some appreciable improvement in the smile
when implemented, but this is not the case because the non-existence of the put-call parity applies to all strikes and
not just the ATM one. Thus, the European options that we would recover from this smile would not satisfy the put-call
parity and we would not be able to apply the option implied methodology to infer the risk-neutral distribution, or
equivalently, the volatility surface.
where int(x) is the integer part of x 1 . If the error between the estimated forward and the model forward, =
Fb(t, T ) − F (t, T ), is greater than a threshold, the listed dividends are modified such that the model forward matches
the estimated one. Having a single forward price per maturity for European options, we get the modified implied
volatility Σ b E (T ∗ , K). This approach does not guarantee that the put-call parity is matched at all strikes. Further, if
we need to interpolate or extrapolate the European volatility surface for a pair (T, K), it becomes very difficult to
recover the associated American price. For example, given listed options with expiries T1 < T2 < ... < TN , we can
use the model of our choice to interpolate the modified European implied volatility Σ(T b ∗ , K) for the pair (T ∗ , K)
∗
with Ti < T < Ti+1 . However, we can not perform reverse engineering to recover the price of the equivalent
American option, since we can interpolate the forward price Fb(t, T ∗ ) but we do not know the synthetic forward prices
F s (t, T ∗ , Ki ) composing that price. Thus, we can not convert the modified implied volatility Σ b E (T ∗ , K) back to the
American implied volatility ΣA (T ∗ , K), hence, we can not recover the associated American price.
1
bxc for x ≥ 0
int(x) =
dxe for x < 0
where bxc and dxe are the floor and ceiling functions.
165
(T, K) → q(T, K)
If we want to implement path-dependent options on single stocks we need to devise a stochastic process for the repo
rates. One solution would be to assume that the repo rates follow the local function q(t, St ) (in the sense of the local
volatility), and derive that function to match the synthetic forwards prices. However, since the respective call and put
smiles in American options result from the inclusion of discrete dividends, with the discrepancy appearing only after
the first ex-dividend date, adjusting the continuous repo rates is not an ideal solution due to the impact it also has on
the theoretical forward through the multiplying factor on the stock price.
where di (t, ω) is a stochastic discrete dividend. Thus, we now need to infer the implied cumulative dividends
D(t, T, Ki ) for all strikes Ki such that the put-call parity is satisfied. Specifically, given the definition of the the-
oretical forward price in Equation (5.1.7), we get
Re(t, T )
D(t, T, Ki ) = St − F (t, T, Ki ) , ∀i (5.2.18)
P (t, T )
where D(t, T, Ki ) is the implied cumulative dividends between time t and T . The model forward price F (t, T, Ki )
is calculated by using the synthetic forward price computed at strike Ki using Equation (5.2.17). Thus, we infer the
implied cumulative dividend for each strike present in the market, and on each expiration date, getting the surface
(T, K) → D(t, T, K)
Since we do not have to diffuse the process St , we do not need to compute the stochastic discrete dividends di (t, ω).
Thus, given the diffusion process Zt and the implied cumulative dividends D(t, T, Ki ), we can use Equation (5.1.6)
for each strike, Ki , shifted by the amount D(t, T, Ki ) as follow
166
0
Ki = Ki + D(t, T, Ki ) , ∀i (5.2.19)
to estimate the modified American implied volatility Σ
e A (T, Ki ). The modified implied volatility is obtained by
solving
e A (T, Ki )) , ∀i
CA (t, St , Ki , T ) = Cbp (t, Zt , Ki , T ; D(t, T, Ki ); Σ
where Cbp (t, Zt , K, T ; D(t, T, K); σ) is the basic pricer for the Spot model with strike shifted by D(t, T, K). Fol-
lowing this method, we are able to ensure that our model matches the synthetic forward prices, enforcing the put-call
parity relationship at each strike Ki and at each maturity T , and yielding a smooth market volatility smile. We are left
with decomposing our model for American prices into European options and early exercise premiums. To do so, we
need to define our forward model. We choose to use the initial list of discrete dividents and to modify the dividend
St
yield such that the synthetic forward F s (t, T, Ki ) around the forward moneyness Ki ≈ η = P (t,T ) is matched. Since
the modified implied volatility is now identical for American call and put options, and assuming that Σ e A (T, Ki ) is the
same for American and European options (in the case of OTM), we can then decompose the American price as
where D(t, T ) is the cumulative dividend computed from the list of discrete dividents, Re(t, T ) is the repo rates
matching the forward price, and A f is a modified early exercise premium. Plugging Σ
e A (T, Ki ) in a CRR tree, without
the American feature, we can then calculate the synthetic European options CE (t, Zt ). An estimate for the early
exercise premium (EEP) is obtained by taking the difference between the American price and the synthetic European
price. Then, we can implement the option-implied approach on the synthetic European options. For example, given
listed options with expiries T1 < T2 < ... < TN , we can use the model of our choice to interpolate the European
implied volatility for the pair (T ∗ , K) with Ti < T ∗ < Ti+1 . We perform reverse engineering to recover the price of
the equivalent American option. We interpolate the surface of calibrated implied cumulative dividend at (T ∗ , K), set
the repo rates and the discrete dividends to zero, and compute the price as follow
As an example, we consider options on BASF with evaluation date 18/01/16, and we graph in Figure (5.3) the
European smile at maturity 15/12/17 and its associated American prices.
167
168
where the interest rate, ν(t), is a Lipschitz continuous function. Expanding the portfolio, we get
µ(t) − ν(t)
λ(t) =
σ(t)
The dynamics of the price process X under Q become
Definition 5.3.1 The measure of intrinsic risk in a time interval dt is defined by dG(t, T ) = ξ(t, T )X(t)dt, where
ξ(t, T ) is a continuous adapted process representing a rate of intrinsic risk.
That measure implies adding, or removing, capital in a time interval dt, leading the trading strategy to be adaptable,
that is,
ξ(t, T )
dZ(t) = λ∗ dt + dB(t) = dt + dW (t)
σ(t)
with
169
where the misspecified volatility σ is a Lipschitz continuous function. It is an implied price process with misspecifi-
cation in the underlying asset as in El Karoui et al. [1998]. Le showed that Z was an S-Brownian motion and that Y
could be a replacement of the true price process X with dynmics
dX(t) = ν(t) − ξ(t, T ) X(t)dt + σ(t)X(t)dZ(t)
where
ν(t)
σ 2 (t, T ) − σ 2 (t)
ξ(t, T ) =
σ 2 (t, T )
We see that the hedging error introduced by El Karoui et al. [1998] in the complete market becomes an intrinsic price
of risk, presented as traded asset, in the incomplete market theory. The measure S is subjective since the valuation of
the claim depends on the exogenous measure of risk ξ. That is, it becomes explicit in the pricing equation
1
∂t V (t, x) + σ 2 (t)x2 ∂xx V (t, x) + (ν(t) − ξ(t, T ))x∂x V (t, x) − ν(t)V (t, x) = 0
2
with X(t) = x and V (T, x) = h(x). That measure is called the risk-subjective measure, and it implies that possible
arbitrage exists in the market. Nonetheless, the growth of the portfolio value is still at the risk-free rate ν. While
forward contracts are associated with the underlying asset, their prices should be obtained by assuming complete
market and using Equation (). However ... such that they can be priced in incomplete market where the measure
of intrinsic risk ξ corresponds to the convexity adjustment. Similarly, in presence of dividends, the put-call parity is
not necessarily satisfied, suggesting that we can price vanila options in incomplete market by letting the measure of
intrinsic risk be the dividend yield.
We identify the increasing concave functions Ub : R → R ∪ {−∞} and Us : R → R ∪ {−∞} as the buyer and the
seller utility functions, respectively. We let wb ≥ 0, ws ≥ 0 be their initial wealth and P (X) be the claim’s discounted
payoff on the underlying X. We define Πb (P ) ≥ 0 and Πs (P ) ≥ 0 as the indifference prices of the buyer and seller,
given by the equations:
170
U (x) := 1 − e−λx , ∀x ∈ R
where λ > 0 is a known parameter corresponding to the coefficient of risk aversion representing the counterpart view
on financial risk. This utility function is widely used in the literature and has no constraint on the sign of the cash-flow.
Moreover, the indifference prices over this utility are independent of the initial wealth, which simplifies the number
of parameters to include in the computation of the prices (see Barrieu et al. [2009]). Using the exponential utility
function with the coefficient of risk aversion λb from (5.3.20) we have:
1 B
Πb (X B ) = − log EP [e−λb P (X ) ] (5.3.22)
λb
Similarly for the seller with an exponential utility function with a coefficient of risk aversion λs from (5.3.21) we have:
1 S
Πs (X S ) = log EP [eλs P (X ) ] (5.3.23)
λs
2
(λb BT U )2 2
K − mX σX mX − σX λb − K
E[e−λb P (X) ] = Φ + exp + λb (K − mX ) Φ
σX 2 σX
We can then deduce the buyer indifference price as
Πb (X) = (5.3.25)
2
σX (λb BT U )2 2
1 K − mX mXT − σX λb BT U − K
− log Φ + exp + λb BT U (K − mX ) Φ
λb σX 2 σX
Similarly, we consider the indifference price of the seller having an exponential utility function with risk aversion
0
parameter λs in Equation (5.3.23). Since the computation of E[e−λ P (X) ] in Equation (5.3.25) does not depend on the
sign of λb , we can have λb = −λs ∀λs and X = X S . Then, we get
2
σX (λs BT U )2 2
λs P (X) K − mX mX + σX λ s BT U − K
E[e ]=Φ + exp − λs BT U (K − mX ) Φ
σX 2 σX
We can then deduce the seller’s indifference price Πs (X) as
Πs (X) = (5.3.26)
2 2
2
1 K − mX σX (λs BT U ) mX + σX λs BT U − K
log Φ + exp − λs BT U (K − mX ) Φ
λ σX 2 σX
171
(D1) The law of one price: the same asset must trade at the same price on all markets.
(D2) Two assets with identical cash flows in the future must have the same price to start with.
(D3) An asset with a known future price must trade today at the price discounted at the risk free rate.
The law of one price is a less demanding requirement than the non-existence of dominant trading strategies or no-
arbitrage. It can be proved that
• if there is no arbitrage there is no dominant trading strategy, but there may be arbitrage opportunities even if
there are no dominant trading strategies.
• if there is no dominant trading strategy then the law of one price holds, but the law of one price may hold even
when trading strategies exist.
• if there are no dominant trading strategies then it can be shown that there must exist a linear pricing measure.
1. Futures: they must satisfy rule number (D3), any deviation from this equality leads to arbitrage.
2. Derivaties: buy and sell the same asset on two different markets, namely the spot market and the derivative
market. Hence, they must satisfy rule number (D1).
The No-Dominance law is most easly expressed in terms of relative value which is the attractiveness of one instrument
relative to another instrument measured in terms of risk, liquidity, and return. Given that options are a derivative
instrument, meaning they derive their value from an underlying security, options themselves have value relative to
other options. Hence, when comparing two options, one option’s value can be deduced from or defined relative to
another option’s value. In practice, even though options are quoted on the basis of price, option traders assess relative
value on the basis of volatility. That is, knowing the price of the option, one can solve for the IV of the underlying
stock by inverting the Black-Scholes formula (see Section (1.4.4)). More generally, all of financial modelling is based
upon comparison between assets. No one would buy the more expensive portfolio if the cheaper one would produce
the same cash flows. If these rules are not satisfied arbitrage opportunities will arise and traders will take advantage of
market’s imbalance.
172
where P (t, T ) is a zero-coupon bond with maturity T . Hence, the put-call parity posits a risk neutral equivalence
relationship between a call, a put and some amount of the underlier. If there is a difference between the two assets
173
there is an arbitrage which must be corrected. That is, arbitrageurs can make a sure profit by shorting the securities in
the expensive portfolio and buying the securities in the cheaper one. For instance, assuming no discrete dividends, in
the case where C(t, St , K, T )−P (t, St , K, T ) < St Re(t, T )−KP (t, T ), one should short both the put option and the
stock and use the proceeds to buy the call option and invest the difference P (t, St , K, T )+St Re(t, T )−C(t, St , K, T )
at the risk-free rate. At expiration, one of the two options is ITM and the other one is OTM. In either case, the
arbitrageur ends up buying one share at the price K, leaving him with a profit of P (t, T ) P (t, St , K, T )+St Re(t, T )−
C(t, St , K, T ) − K. In the case of American options the put-call parity does not exit because it can be exercised
at any time prior to the expiration date. Only an inequality holds given in Equation (5.1.10). Still, by adjusting for
dividends and rates, in the case of at-the-money call and put options, some authors assume the existance of such a
relationship.
As an example of such synthetic position is the call/put parity described in Section (5.4.2.1). However, there is a
dividend difference between the two type of assets. In that setting, we distinguish two cases:
• conversion: buy a put and sell a call, while buying the stock.
• reverse conversion (a synthetic long position): sell a put and buy a call, while shorting the stock.
which leads to
As long as the put and call have the same underlying, strike price and expiration date, a synthetic long position will
have the same risk/return profile as owning an equivalent amount of the underlying stock. However, it can be affected
by the borrowing cost of shorting the stock and the put option. Unlike the conversion, where you have to pay for the
position with a net cash outlay, a reversal involves selling short the stock and the put, which brings in a credit to the
account.
174
realised volatility will be low. Due to the put-call parity, being long a delta-hedged call results in the same returns
as being long a delta-hedged put. We let the price of a call option C = f (S, σ, T − t, K, ...) be a function of the
underlying S, the volatility σ, the time to maturity T − t, the strike K. We consider a ∆-neutral portfolio (options
plus a quantity of the underlying), and look at the difference between the implied volatility (IV) and the future realised
volatility (FRV). The trade is based on the assumption that if IV is far away from FRV it will mean revert to FRV at a
certain speed called the speed of mean reversion. Hence,
• we need to predict the future direction of the IV, and we can face Black Swan (abrupt changes in IV).
• we need to forecast the FRV.
1. Bullish options strategies are employed when the options trader expects the underlying stock price to move
upwards. It is necessary to assess how high the stock price can go and the time frame in which the rally will
occur in order to select the optimum trading strategy. The simplest of unlimited profit bullish options strategies is
the Long Call strategy where you buy a longer term call option and hold on to it to maturity. Being moderately
bullish means that you expect the price of the underlying stock to increase to a certain pre-determined price
instead of being bullish for an unknown extended period of time to an unknown high price. Bullish options
strategies with limited risk and limited profit are typically debit spread strategies that profit only if the price of
the underlying stock rises beyond a certain breakeven point.
2. Neutral strategies in options trading are employed when the options trader does not know whether the underlying
stock price will rise or fall. Also known as non-directional strategies, they are so named because the potential to
profit does not depend on whether the underlying stock price will go upwards. Rather, the correct neutral strategy
to employ depends on the expected volatility of the underlying stock price. Neutral trading strategies that are
bullish on volatility profit when the underlying stock price experiences big moves upwards or downwards. They
include the long straddle, long strangle, short condor and short butterfly (see Section (1.5.2.3) and (1.5.3)).
Neutral trading strategies that profit when the underlying stock price experience little or no movement include
the short straddle, short strangle, ratio spreads, long condors and long butterflies.
3. A volatility strategy means profiting from the market no matter if the price of the underlying stock breaks out
to upside or downside. Volatile options strategies are options strategies designed to profit under such conditions
of uncertainty and are commonly used ahead of major news or earnings releases in order to profit from either
direction depending on how the release turn out.
175
Despite the fact that trading strategies based on such option combinations possess many features distinguishing them
from plain assets strategies, the main determinant of their performance is the accuracy of price forecasts. In the case
of options, however, position profitability depends not only on the direction of the price movement, but on many other
factors such as volatility and time left until the expiration. While directional trading requires the trader to have a strong
conviction about the market or security’s near-term direction, the trader also needs to have a risk mitigation strategy in
place to protect investment capital in the event of a move to the opposite direction. For instance, market-neutrality is a
trading strategy where small changes in the underlying price do not lead to a significant change in the position value,
and given larger price movements, the position value changes by approximately the same amount regardless of the
direction of the underlying price movement. The main analytical instrument used to create market-neutral positions
is the delta of an option. The position of a portfolio is market-neutral if the sum of the deltas of all its components is
equal to or close to zero. If we consider strategies containing certain directional elements, the reduction to zero of the
delta is not an obligatory requirement. Forecasts of the directions of future spot price movements represent an integral
part of such a strategy. These forecasts can be incorporated into the strategy structure in the form of biased probability
distributions or asymmetrical option combinations. Directional strategies are often used to engineer a particular risk
profile to movements in the underlying security. Buying a butterfly spread (long one K1 call, short two K2 calls, and
long one K3 call) allows a trader to profit if the stock price on the expiration date is near the middle exercise price K2 ,
and does not expose the trader to a large loss (see Section (1.5.2.3)). One well-known strategy is the covered call, in
which a trader buys a stock (or holds a previously-purchased long stock position) and sells a call option. If the stock
price rises above the strike price, the call will be exercised and the trader will get a fixed profit. If the stock price falls,
the call will not be exercised, and any loss incurred to the trader will be partially offset by the premium received from
selling the call. Overall, the payoffs match the payoffs from selling a put due to the put-call parity. A bull Call Spread
is constructed by buying a call option with a low exercise price KL , and selling another call option with a higher
exercise price KH with the same maturity. Often the call with the lower exercise price will be at-the-money while the
call with the higher exercise price is out-of-the-money. It is designed to profit from a moderate rise in the price of the
underlying security. A short straddle is a non-directional options trading strategy that involves simultaneously selling
a put and a call of the same underlying security, strike price and expiration date. The profit is limited to the premiums
of the put and call, but it is risky because if the underlying security’s price goes very high up or very low down, the
potential losses are virtually unlimited. One of the advantage of selling an ATM straddle as opposed to a call option is
that its sensitivity with respect to a change in spot price is nearly null at inception. It is a volatility product as long as
the spot price does not deviate from the strike price, otherwise it becomes very risky and one should buy it back. That
is, in the case where the spot price St deviates too much from the strike K, one of the two legs of the straddle brings
non-necessary risk to the seller of the contract who should buy it back. Selling a straddle (selling both a put and a call
at the same strike price K) would give a trader a greater profit than a butterfly if the final stock price is near the strike
price, but might result in a large loss. Similar to the straddle is the strangle which is also constructed by a call and a
put, but whose strikes are different, reducing the net debit of the trade, but also reducing the risk of loss in the trade.
176
approaches by providing the traders with tools capable of properly quantifying market arbitrages to option prices,
together with tools modeling dynamically the agents rational anticipations.
177
178
underlying stock, same strike prices, but different expiration months. They can be created with either all calls or all
puts. Using calls, the calendar spread strategy can be setup by buying long term calls and simultaneously writing an
equal number of near-month at-the-money or slightly out-of-the-money calls of the same underlying security with the
same strike price. The investor hopes to reduce the cost of purchasing a longer-term call option. He hopes that the
price of the underlying remains unchanged at expiration of the near month options so that they expire worthless. As
the time decay of near month options is at a faster rate than longer term options, his long term options still retain much
of their value. The options trader can then either own the longer term calls for less or write some more calls and repeat
the process. If the options trader is bullish for the long term and is selling the near month calls with the intention to ride
the long call for free, he is implementing the bull calendar spread strategy. The maximum loss would occur should the
two options reach parity, which happen if the underlying stock declined enough that both options became worthless,
or if the stock rose enough that both options went deep in-the-money and traded at their intrinsic value. In either case,
the loss would be the premium paid to put on the position. At the expiration of the near-term option, the maximum
gain would occur should the underlying stock be at the strike price of the expiring option. If the stock were any higher,
the expiring option would have intrinsic value, and if the stock were any lower, the longer-term option would have
less value. An increase in implied volatility, all other things equal, would have an extremely positive impact on this
strategy since longer-term options have a greater sensitivity to changes in market volatility. The difference in time to
expiration of these two call options results in their having a different theta, delta and gamma. The near-term call suffers
more from time decay (greater theta), but it has a lower delta and a higher gamma (if the strike is at-the-money). This
means that if the stock moves sharply higher, the near-term call becomes much more sensitive to the stock price and
its value approaches that of the more expensive longer-term call.
1. Volatility Surface Relative Value: It optimises a portfolio of long and short option positions, on the same under-
lying security, to harvest gains from changes in the shape of the volatility surface.
2. Implied Correlation / Dispersion: It trades the relative value relationship between the implied volatility of an
Index option and the implied volatilities of the Component Stocks that comprise the Index.
3. Capital Structure Arbitrage: It trades the relative value relationship between options on various segments of the
capital structure of an individual company, or the options embedded in the same company.
4. Generic Volatility Long/Short: It trades the relative value relationship between options on a full array of securi-
ties, and is not limited to one underlying security or issuer of securities.
5. Directional Volatility: It creates an option portfolio that exhibits a continuous short or long exposure to changes
in implied volatility.
179
Then the trades are implemented and executed based on some targeted opportunities. Note, trades can also be identified
for risk management purposes by offsetting aggregated portfolio sensitivities (Greeks). In general the market volatility
surface is analysed relative to a model volatility surface and an electronic eye is used to identify and display trade
opportunities. Using quantitative tools, traders should be able to assess opportunities, identify hedges and execute
orders. Therefore, one can identify potential opportunities by profiling a smile of IV generated by proprietary modeling
against the IV of options actually observed in the market place. For instance, one can consider cheap IV versus model
as well as the expensive one against model. For that relation to exist the model must be arbitrage-free, not just in
space but also in time. This is the case with the parametric MixVol model described in Section (6.2). In addition, the
market surface can also be described statistically by looking at the relationship between each point. For example, one
can look at the ratios and spreads of 10% OTM puts for various maturities, placed in one-year historical context via
a percentile calculation. A pair with a Vol Ratio Percentile of less than 15% would be coloured blue or green, while
high ratios would be coloured red.
The growth of the variance swap (VS) market and the success of the VIX raised the profile of volatility trading
as an asset being negatively correlated to the underlying equity market. Rather than solving the difficult questions of
knowing which volatility to own, and how to mitigate the expense of decay, the VS is a liquid, non-strike dependent
hedge (see Section (7.1)). One strategy consists in owning a basket of single stock volatility, assuming that volatilities
would increase when the underlying shares sell off, and then take advantage of the elevated implied correlation.
Rather than selling the basket of single volatility, the relatively more elevated index volatility is sold against the
basket of single stock volatilities. Based on empirical studies, it is assumed that a basket of higher beta single stock
volatilities may have a larger absolute move up than index volatility during a sell-off. However, single stock volatility
having wider bid/offer spreads, exhibit a greater expected cost over the course of a year, and potentially unmanageable
180
bid/offer spreads. In addition, the relationship between single stock volatilities and index volatility may not perform as
historically expected, leading to basis risk. Nonetheless, some opportunities each year to sell elevated index volatility
present the possibility of significantly mitigating the expense.
181
)κT ( (Th−T
Pn
i=1 κF ( (F (t,T )−F
hF
(ti ,T ))
)κK ( (K−K
hK
i)
T
i)
)σi
σ
b(F (t, T ), K, T ) = Pn
i=1 κF ( (F (t,T )−F
hF
(ti ,T ))
)κK ( (K−K
hK
i)
)κT ( (Th−T
T
i)
)
The risk-neutral density function is obtained by taking the second derivative of the call option pricing formula with
respect to the strike price. Considering a centered finite difference approximation, the density is computed as follow
182
where the processes XtT −t , YtT −t and ZtT −t represent respectively the at-the-money volatility, the Skew and the
Curvature of the smile at time t for call options with maturity T i.e. time to maturity T − t. Using historical data,
some authors studied the dynamics of these processes. For instance, assuming an Ornstein-Uhlenbeck dynamic for the
processes, Cont et al. [2002] explained the deformation of the volatility surface. Alternatively, Bloch et al. [2002]
assumed the parameters to be led by the spot process holding the whole market risk. When it comes to generating
a volatility surface, the evaluation time t is fixed and practitioners estimate one set of parameters X, Y and Z per
trading maturities. Then, they rely on some interpolation and extrapolation of parameters in time with no guarantee of
satisfying the calendar spread. Further, in the presence of discrete dividends it is not an easy task to satisfy the time
constraint given by the calendar spread.
As an example, setting η(t, T ) = ln SKP (t,T )
t Re(t,T )
, we can consider the functional form
2
f (t, T − t, X, Y, Z; K) = X − Y η(t, T ) + Z (η(t, T )) (6.1.1)
2
t ;K,T −t) t ;K,T −t)
with P (t, T ) = e−r(T −t) and X = Σ(t, KP (t, T ); KP (t, T ), T − t), Y = ∂Σ(t,S∂K and Z = ∂ Σ(t,S
∂K 2 .
Note, whatever the shape we take, the smile needs to be caped and floored in the lowest and highest strikes in order to
∂Σ
avoid any arbitrage opportunity. If the slope of the smile ∂K is too high, the implicit pdf can be negative (see Equation
(1.7.47)).
Among the polynomial models describing the whole volatility surface with one equation are the Cubic model and
the Spline model. In the former the IV is a cubic function of moneyness η = ln K S (or the time adjusted moneyness
ηb = √1τ ln K
S ) and a quadratic function of time to expiry τ
183
with
0 if η < 0
D=
1 if η ≥ 0
together with the constraints
a5 + a6 .0 + a7 .02 + a8 τ + a9 τ 2 = 0 , Da6 = 0
for the volatility function to be continuous and differentiable.
In the case of the SABR model, it can either be used as a model for a whole volatility surface, or for the skew.
Under the first approach, the parameters α, β, ρ and ν are calibrated for all given times to expiration τi for i = 1, 2, ...
In the second approach, one fit a SABR skew for each observed time to expiration, and then interpolate the values
of implied volatility for any arbitrary τ . The piecewise SABR (PSABR) parameters αi , βi , ρi and νi are calculated
separately for each time to expiration τi , and the IV surface is built as a linear approximation of separate skews.
The SVI is smooth in the strike direction, and it has five parameters at each maturity with intuitive interpretations in
terms of implied volatility changes. It is a slice parametrisation of the implied variance, which should be relatively easy
to fit liquid listed option prices. However, as noticed by Zeliade Systems [2009], the least-square fit of the SVI to the
market is not trivial, as there are often local minima. The same smile can be very well calibrated with sets of parameters
that are totally different one from the other. To overcome this problem, they decomposed the minimisation problem
into an analytical one and a numerical one, which improved the situation by reducing the number of dimensions of
the numerical optimisation. That is, they downsised the minimisation problem from dimension five to dimension two,
while the optimisation over the remaining three is performed explicitly. Another problem with the SVI model is that
parameter set cannot be directly comparable between smile with different maturities. Intuitively, we would like to
rescale the parameters related to the skew so that we can compare the skewness of smile between different maturities.
In other words, we would like to normalise the three skew related parameters by certain time factor. Gurrieri [2011]
derived a set of sufficient conditions on the parametric forms of the SVI in order to satisfy the no-calendar spread
arbitrage constraint, while preserving the condition of no-strike arbitrage. Adapting the strategy of Bloch [2010b] to
the case of SVI, he allowed the model parameters to be functions of time and derived a set of sufficient conditions on
these functional forms such that no calendar spread arbitrage can exist. He further proved the existence of solutions,
and proposed a strategy to find explicit examples.
184
moves in order to match market prices. This is due to the fact that today’s market prices do not provide us with the
right future anticipations of the stock price process, because the implied volatility surface is neither stationary nor
Markovian but stochastic. As a result, it stresses the necessity to take into consideration the dynamics of the volatility
surface when pricing and risk-managing a portfolio of options. Practitioners use rules of thumb to compute the Greeks
by considering some standard evolutions of the implied volatility surface providing an indication of some possible
behaviour for the smile that one might expect (see Section (4.1.1.2)).
While a model with a large number of parameters may calibrate well the volatility surface on a given day, the
same model parameters may give poor results on the next day. On the other hand, any risk management system tries
to estimate the future (short term forecast) behaviour of the volatility surface. As a result, one need a model that
accurately fit market prices with stable and robust parameters. Ladokhin [2009] focused on two different approaches
to model the dynamics of the IV surfaces, one applicable to the Cubic and the Spline model and the other used for
SABR models. In the first approach, the dynamics of the surface is treated as dynamics of these parameters. To reduce
the dimensionality of the problem, he applied Principal Component Analysis (PCA) to the values of the parameters
ai for i = 0, .., n. This is to switch to another space of the no-correlated factors, that fully describe the dynamic of
the implied volatility surface. Since the first few principal components explain most of the variance of the calibrated
parameters, the dynamics of the volatility surface over time is explained by the dynamics of the first two principal
components modelled with an Autoregressive moving average model (ARMA). In the second approach, the SABR
model already assumes certain dynamics of the volatility and the underlying asset expressed by a system of stochastic
differential equations. Consequently, given the calibrated parameters, the spot price can be simulated with the Monte
Carlo method, and for each path, the IV surface generated. The forecast of the implied volatility surface is an average
surface over the simulated paths.
Strong of these dynamics, Ladokhin [2009] used the rolling horizon technique to build a 1 and 5 day forecasts of
the volatility skews. Setting N = 100, he used observations from days t − N till t − 1 to calibrate the dynamic models
in order to build a forecast of the skew for day t. Then, at time t + 1 the horizon is rolled so that days t − N + 1 till
t are used for calibration. An equivalent technique is used for the five days ahead forecast. He then tested how the
models can hold the volatility skew pattern by performing a static test. Calibrating the models to observed IV surface
at date t − 1 or t − 5 he calculated the weighted mean square error (WMSE) between the model results and the IV
surface on day t. Cubic and Spline models approximate the implied volatility surface with the function of a certain
form. The Spline model is perhaps, the most effective to minimise the fitting error. Good performance of the dynamic
version of the Spline model is an empirical evidence of the dependence of the dynamics of the surface of two principal
components. Both of these models use much less parameters, than the PSABR model. Even though the SABR and the
PSABR assume a certain model for the joint dynamics of the volatility and the underlying asset, their fitting results
have a higher error than the polynomial models. Nonetheless, the PSABR tends to model the skew rather effectively
in case of insufficient or bad data. Because these models assume some shape of the IV surface, they are predisposed
to give a more theoretical shape of the skew resulting in lower relative forecasting error. As a result, models relating
prices in time and space perform better on an incomplete market or with missing data. To conclude, no single method
exhibits superior accuracy in the analysis of every data set. Some methods perform better for certain underlying assets,
while other methods are more suitable for the other.
185
though the SVI model is consistent with Lee’s moment formula for extreme strikes (see Lee [2004]), it is well-known
that the SVI smiles may be arbitrageable. Roper investigated well known models such as SVI, Avellaneda’s SABR,
Quadratic parametrisation, and showed that they still admit arbitrage under certain parameter classifications. Setting
K
x = ln ( F (t,T ) ) and given the time scaled implied volatility Ξ defined in Equation (1.4.18), the parametrisation of the
SVI follows
p
Ξ2SV I (x, τ ) = a + b ρ(x − m) + (x − m)2 + σ 2
and the parameters chosen are a = 0.04, b = 0.8, σ = 0.1, ρ = −0.4 and m = 0. The parametrisation of the SABR
model proposed by Avellaneda [2005] is
k|x|
ΞSABR (x, 1) = q
ln k|f (x)| + 1 + k 2 f 2 (x)
where
1 − e−βx
f (x) =
σ0 β
with parameters σ0 = 0.2, β = −4.0 and k = 0.5. At last, the Quadratic parametrisation is given by
To remedy the problem of arbitrageable smiles, Gatheral et al. [2012] considered a class of SVI volatility surfaces
with a simple closed-form representation, for which absence of static arbitrage is guaranteed. They found a class of
SVI smiles for which the absence of butterfly arbitrage is guaranteed which is an extension of the natural parametri-
sation on total variance given by ω(η, t) = θt SV Iρ (ηφ(θt )) where η is the logarithm of the forward moneyness and
SV Iρ is the classical SVI parametrisation. Note that this representation amounts to considering the volatility surface
in terms of ATM variance time θt , instead of standard calendar time. In their new class of SVI volatility surfaces,
they defined the absence of calendar spread arbitrage and butterfly arbitrage by the non-negativity of the numerator
and denominator of Dupire’s equation in implied volatility. However, in light of the Fokker-Plank equation, these
properties are necessary, but not sufficient for no-arbitrage. As discussed by Guo et al. [2012], the right statement is
that there exists an inhomogeneous diffusion process associated to the implied volatility through Dupire’s equation. To
this end, the local volatility defined through Dupire’s equation should have adequate continuity and growth properties.
Alternatively, one can consider the additional conditions defined by Roper [2010]. Guo et al. [2012] proposed a gen-
eralisation of the work by Gatheral et al. [2012] to volatility surfaces parametrised as ω(η, t) = θt Ψ(ηφ(θt )) for some
general function Ψ, obtaining necessary and sufficient conditions such that the corresponding implied volatility sur-
face is free of arbitrage. To facilitate the calibration of the entire volatility surface, they preconised to parametrise the
set of functions φ to get easy-to-implement calibration algorithms among the whole admissible class. They provided
an example of non-SVI parametric family, but the relation between the model parameters and the implied volatility is
no-longer intuitive.
186
volatility surfaces, it must be flexible and fast to compute. Among the different techniques proposed for obtaining
from market prices a smooth volatility surface, Rebonato et al. [2004] argued that modelling directly the density
was the most desirable approach. They extended the mixture of normals approach proposed by Alexander [2001],
obtaining a density with non-zero skew and satisfying the risk-neutral forward condition while retaining an uncon-
strained numerical search. However, in markets with long maturity products and discrete dividends, it is important for
model pricing to obtain a reliable volatility surface satisfying the no-arbitrage constraint not only in space but also in
time. So, we intend to generate a surface without arbitrage in time and in space as closely as possible to market data.
Interpolation techniques to recover a globally arbitrage-free call price function have been proposed by Kahalé [2004]
where he considered a piecewise convex polynomials, and by Wang et al. [2004] who suggested the use of a cubic
B-spline interpolation. Later, Fengler [2009] considered smoothing call prices with a natural cubic splines by choos-
ing to minimise a penalised sum of squares resulting in an iterative quadratic minimisation problem under constraints.
Similarly, Bloch [2010b] proposed to impose smoothness and value constraints directly on the market prices and their
resulting implied volatility surface. He imposed the market prices to satisfy the no-arbitrage conditions defined by
Durrleman [2003] and Roper [2010], and smoothed the implied volatility surface by fitting a special functional form
to the observed market prices. A parametric representation of the market call prices under constraints was considered
in order to smooth the data and get nice probability distribution functions (pdf). The Differential Evolution algorithm
described by Bloch et al. [2011] was used to calibrate the model’s parameters to a finite set of option prices.
We let the time function t → Σi (t) be regular enough, and choose to model directly the square-root of the average
variance defined as
1 t 2
Z
2
Σi (t) = σ (s)ds
t 0
where σ(t) is the instantaneous volatility of the underlying process. To guarantee the positivity of the local volatility
given in Equation (1.4.20), the average variance must verify
187
a0i
µi (t) = µ0i f (t, βi ) and ai (t) =
f (t, βi ) × norm
Pn a0i
where norm = i=1 f (t,βi ) . As a result, with seperable functions of time, we can prove that the no-free lunch
constraints simplify.
a0i ≥ 0 (6.2.4)
n
X
ai (t) = 1
i=1
Xn
a0i µ0i = 0
i=1
µ0i ≥ −1
188
(A1) Monotonicity of total variance is guaranted provided that Equation (6.2.3) is satisfied. This will be dependent
upon the choice of the volatility functions Σi (t), and will be discussed in Section (6.2.2.3).
(A2) Taking the partial derivative of the call price with respect to the strike K, we get
n
∂ 1 X ∂ ∂ K̃
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = ai (t) CallBS (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t))
∂K norm i=1 ∂ K̃ ∂K
∂ K̃
where ∂K = P (t0 , t)(1 + µi (t)). Substituting for the partial derivative of the BS-formula with respect to the
strike yields
n
∂ 1 X
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = − ai (t)P (t0 , t)(1 + µi (t))N (di2 (t − t0 , St , K̃(K, t)))
∂K norm i=1
Differentiating one more time the call price with respect to the strike, and using Equation (B.1.9), we get
n
∂2 1 X ∂2 ∂ K̃ 2
2
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = ai (t) CallBS (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t))( )
∂K norm i=1 ∂(K̃)2 ∂K
n
1 X 1
= ai (t)[P (t0 , t)(1 + µi (t))]2 V ega(K̃(K, t), t; Σi (t))
norm i=1 K̃ 2 (K, t)Σi (t)(t − t0 )
which is greater or equal to zero if and only if a0i ≥ 0 and µ0i ≥ −1, ∀i, given Σi (t) > 0.
(A3) The ith forward moneyness is defined in the model as
1 1
lim d1 (ηM,i , ω(ηM,i , t)) = − p lim ln (ηM,i ) − ωi (t) = −∞
ηM,i →∞ ωi (t) η M,i →∞ 2
p
Since d2 (ηM,i , ω(ηM,i , t)) = d1 (ηM,i , ω(ηM,i , t)) − ωi (t), the second result follows immediately.
(A4) We are going to use the equivalent condition of Small-Moneyness-Behaviour on the call price surface given in
Equation (1.4.23). Given the partial derivative of the model price with respect to the strike in (A2), taking the
limit K → 0, we get
189
n
∂ 1 X
limCM (t0 , S0 , Pt , Rt , Dt ; K, t) = − ai (t)P (t0 , t)(1 + µi (t)) lim N (di2 (t − t0 , St , K̃(K, t)))
K→0 ∂K norm i=1 K→0
n
X n
X
= −P (t0 , t) ai (t) + ai (t)µi (t)
i=1 i=1
n n
X X a0i
= −P (t0 , t) ai (t) + Pn a0j
µ0i f (t, βi )
i=1 i=1 f (t, βi ) j=1 f (t,βj )
n
1 X
a0i µ0i
= −P (t0 , t) 1 + P a0j
n
j=1 f (t,βj ) i=1
= −P (t0 , t)
Pn Pn
if and only if ai (t) = 1, i=1 a0i µ0i = 0 and µ0i ≥ −1, ∀i.
i=1
Pn
By construction we set i=1 ai (t) = 1 to ensure a normalised risk-neutral probability density function (pdf), whilst
P n 0 0 0
i=1 ai µi = 0 safeguards the martingale property of the induced pdf. Further, ai ≥ 0 is used to guarantee convexity
0
of the price function, and µi ≥ −1 is included to obtain non-degenerate Black-Scholes functions.
n
1 X
CM (t0 , S0 ; K, t) − PM (t0 , S0 ; K, t) = ai (t) CBS (t0 , S0 ; K i (K, t), t) − PBS (t0 , S0 ; K i (K, t), t)
norm i=1
0
where K i (K, t) = K (K, t)(1 + µi (t)). From the classical call-put parity we must have
∀i , ai (t) CBS (t0 , S0 ; K i (K, t), t) − PBS (t0 , S0 ; K i (K, t), t) = ai (t)(S0 Re(t0 , t) − K i (K, t)P (t0 , t))
n
X
ai (t)K i (K, t)P (t0 , t) = KP (t0 , t) + DP V (t0 ; t0 , t)
i=1
which simplifies to
190
n
0 0 X
K (K, t)P (t0 , t) + K (K, t)P (t0 , t) ai (t)µi (t) = KP (t0 , t) + DP V (t0 ; t0 , t)
i=1
0
Given K (K, t) = K + D(t0 , t) and since D(t0 , t)P (t0 , t) = DP V (t0 ; t0 , t), the constraint becomes
n
X
ai (t)µi (t) = 0
i=1
a0i
Now, given µi (t) = µ0i f (t, βi ) and ai (t) = f (t,βi )×norm , the constraint simplifies to
n
X
a0i µ0i = 0
i=1
∂t ωi (t) ≥ 0 , ∀i (6.2.5)
In order to satisfy the constraint in Equation (6.2.5) we can directly model the derivative of the total variance with
respect to maturity, Fi (t) = ∂t ωi (t), as a positive function. Following this route, Gurrieri [2011] considered the
function
t
F (t) = s2∞ + Bt + s20 − s2∞ e− τ
where s∞ , s0 , B, and τ are parameters satisfying the constraints τ > 0 and
B≥0
or
B < 0
s2
0 −s∞
2
−1+
s2∞ + Bτ e Bτ ≥ 0
for the function F (•) to be positive. We get the limit behaviour limt→0 F (t) = s20 and limt→∞ F (t) = s2∞ . Integrat-
ing that function in the range [0, t], we get the total variance term structure
−t
ωi (t) = − Bi τi + s2∞,i t − τi Bi t + Bi τi + s20,i − s2∞,i (e τi − 1)
which corresponds to the total variance of the Heston model described in Section (3.3.2.1).
Since we chose to model directly the square-root of the average variance, to guarantee the positivity of the local
volatility we must verify
191
1
q −t
Σi (t) = √ − Bi τi + s2∞,i t − τi Bi t + Bi τi + s20,i − s2∞,i (e τi − 1)
t
which simplifies to
r
τi −t
Σi (t) = − Bi τi + s2∞,i − Bi t + Bi τi + s20,i − s2∞,i (e τi − 1)
t
and derivative with respect to time t as
0 1 τi −t − 21
Σi (t) = − Bi τi + s2∞,i − Bi t + Bi τi + s20,i − s2∞,i (e τi − 1)
2 t
τ 1 −t
i 2 2
− τt 2 2
× Bi τi + s0,i − s∞,i (e i − 1) + B i t + B i τi + s0,i − s∞,i e τi
t2 t
which we can rewrite as
0 1 τi −t 1 −t
Σi (t) = 2
Bi τi + s20,i − s2∞,i (e τi − 1) + Bi t + Bi τi + s20,i − s2∞,i e τi
2Σi (t) t t
Another approach is to specify the time-dependent volatility Σi (t) that capture the term structure of the implied
volatility surface, and derive the constraints in Equation (6.2.3) that it must satisfy. In the special case where the user
has no information on the term-structure of the implied volatility surface, we set Σi (t) = di where di > 0. On all
the other cases, the user can choose among different term-structures based on his information of the implied volatility
surface. To make sure that the IV surface flattens for infinitely large expiries, we impose the limit behaviour
Σi (t) = ai + bi t e−ci t + di
where one must impose bi ≥ 0 for Σi (.) to remain positive, generating only upward humps. Its derivative with respect
0
to time is Σi (t) = −ci ai + bi t e−ci t + bi e−ci t and its second derivative with respect to time is
00
Σi (t) = c2i ai + bi t e−ci t − 2ci bi e−ci t
However, if there are humps in the volatility surfaces they tend to be downwards humps. One possibility would be
to let bi be negative and to introduce a penalty in the target function by for instance letting the Black-Scholes price
be arbitrarly high when Σi (t) < 0. Alternatively, we can solve an optimisation problem under constraints. The
no-arbitrage constraint in Equation (6.2.3) must satisfy
1
which is true at t = 0 and t = ∞. For t > ci and ci sufficiently large, the term di should dominate (ai + bi t). Hence,
at time t = c1i we get
bi −1
di ≥ ai −
e
ci
To get a general volatility function capable of generating both an upward hump or a downward one, Bloch [2012a]
proposed the volatility function
Σi (t) = ai + bi ln (1 + ei t) e−ci t + di
192
where ci > 0, di > 0 and ai ∈ R, bi ∈ R and ei ∈] − 1t , ∞[. The derivative of the function with respect to time t is
0 ei
− bi ci ln (1 + ei t) e−ci t
Σi (t) = − ai ci + bi
1 + ei t
and the no-arbitrage constraint must satisfy
ei
e−ci t (1 − 2tci ) ai + bi ln (1 + ei t) + 2tbi
+ di ≥ 0
1 + ei t
Since ai ∈ R, at time t = 0 the left hand side of the inequality can becomes negative. Consequently, we must impose
the constraint
ai + di > 0
to get the constraint satisfied at t = 0. Further, when t > c1i then for ci sufficiently large the constant di will dominates
ai + bi ln (1 + ei t) ensuring positivity of the left hand side. Hence, at time t = c1i we must impose
bi e i ei −1
di ≥ ai − 2 ei + bi ln (1 + ) e
ci 1 + ci ci
193
Sigma
0.35
0.3
0.25
0.2
Sigma
0.15
0.1
0.05
0
0 0.5 1 1.5 2 2.5
194
Total Variance
0.09
0.08
0.07
0.06
0.05
Total Variance
0.04
0.03
0.02
0.01
0
0 0.5 1 1.5 2 2.5
195
Fct a b λ σD σA σC σB σE
Expo2 0 0 0 0.2 -1.7 8.919 0.15 0
Expo3 0 0 0 0.2 -0.375 8.088 0.15 7.4
KumaExpo2 1.3 3 0.45 0.22 -2.144 9.015 0.179 0
KumaExpo3 1.3 3 0.45 0.23 -0.43 8.9 0.179 8.8
T Expo2 Const Expo3 Const Heston Gurrieri KE2 Const KE3 Const
0 0.35 0.35 0.35 0.35 0.35 0.35 0.336 0.399 0.344 0.409
0.006667 0.327 0.285 0.322 0.270 0.319 0.320 0.314 0.322 0.313 0.304
0.0133 0.307 0.229 0.297 0.205 0.294 0.294 0.293 0.257 0.287 0.222
0.02 0.288 0.182 0.276 0.153 0.274 0.272 0.275 0.201 0.263 0.157
0.02667 0.271 0.142 0.256 0.111 0.258 0.251 0.258 0.155 0.243 0.107
0.033 0.256 0.108 0.239 0.077 0.245 0.234 0.243 0.117 0.225 0.070
0.04 0.242 0.081 0.224 0.052 0.235 0.218 0.229 0.085 0.210 0.042
0.046667 0.229 0.058 0.210 0.032 0.226 0.205 0.217 0.060 0.196 0.022
0.0533 0.217 0.040 0.198 0.018 0.219 0.194 0.205 0.040 0.184 0.010
0.06 0.207 0.026 0.188 0.009 0.212 0.184 0.195 0.024 0.174 0.002
0.066667 0.197 0.015 0.178 0.003 0.207 0.176 0.186 0.013 0.165 0.00007
0.0733 0.189 0.008 0.170 0.0003 0.203 0.169 0.178 0.006 0.157 0.0008
0.08 0.181 0.003 0.163 0.00002 0.199 0.163 0.170 0.001 0.151 0.004
0.086667 0.174 0.0006 0.157 0.001 0.196 0.158 0.164 0.00006 0.145 0.010
0.0933 0.167 0.00001 0.151 0.005 0.193 0.154 0.158 0.0006 0.140 0.018
0.1 0.162 0.0010 0.146 0.010 0.190 0.151 0.152 0.003 0.136 0.026
196
0.35
0.3
Expo2
0.25 Expo3
Heston
Gurrieri
KumaExpo2
KumaExpo3
0.2
0.15
0.1
0 0.2 0.4 0.6 0.8 1 1.2
197
0.35
0.3
Expo2
Expo3
0.25
Heston
Gurrieri
KumaExpo2
KumaExpo3
0.2
0.15
0.1
-0.04 0.01 0.06 0.11 0.16 0.21
198
199
36%
34%
32%
30%
28%
26%
24%
22%
0.575 0.675 0.775 0.875 0.975 1.075 1.175 1.275 1.375 1.475
Bid Vol(No Arb) Bid Vol(Arb) Ask Vol(No Arb) Ask Vol(Arb) DE Vol SVI Vol
200
Price - 2013/06/27
0.5266049
0.4266049
0.3266049
0.2266049
0.1266049
0.0266049
3.335 3.835 4.335 4.835 5.335 5.835 6.335
Model Price Bid Price(No Arb) Bid Price(Arb) Ask Price(No Arb) Ask Price(Arb)
respectively, for maturities ranging from 1 week till 10 years. The weighted sum of term structures generated by
Expo3 and ExpoTV are given in Figure ( 6.9) and in Figure ( 6.10), respectively, for maturities ranging from 0 to 2
years. We clearly see that the ExpoTV model is not capable of generating a steep downward hump as exhibited in the
market.
201
Date Cash
14/05/2015 2.73
16/05/2016 2.73
15/05/2017 2.73
15/05/2018 2.73
15/05/2019 2.73
Fct µ0 a0 β0 σD σA σC σB σE
1 0.2376 0.1449 0.7612 0.3929 -0.3000 54.3117 1.5994 45.1605
2 -0.0404 0.8550 0.4959 0.2094 0.1653 56.2481 -0.1742 49.5995
Fct µ0 a0 β0 σD σA σC σB σE
1 -0.1663 0.7488 3.5245 0.2093 0.2370 0.2915 -0.0158 0
2 0.4957 0.2511 0.4111 0.2567 0.9674 19.9976 -6.9933 0
202
0.6
0.55
0.5
1W
1M
0.45
2M
3M
6M
0.4 1Y
2Y
3Y
4Y
0.35 5Y
7Y
10Y
0.3
0.25
0.2
-80 -60 -40 -20 0 20 40 60 80 100 120
203
0.65
0.6
0.55
0.5 1W
1M
2M
3M
0.45
6M
1Y
2Y
0.4 3Y
4Y
5Y
0.35 7Y
10Y
0.3
0.25
0.2
-80 -60 -40 -20 0 20 40 60 80 100 120
204
SigmaCap
0.35
0.3
0.25
SigmaCap
0.2
0.15
0.1
0 0.5 1 1.5 2 2.5
0 0 0
α(K (K, t), t; σ) = St0 D(t0 , t) N (d1 ) − N (d1 ) − K (K, t)P (t0 , t) N (d2 ) − N (d2 ) − K (K, t)µt P (t0 , t)N (d2 )
with
205
SigmaCap
0.28
0.26
0.24
0.22
0.2
SigmaCap
0.18
0.16
0.14
0.12
0.1
0 0.5 1 1.5 2 2.5
Re(t0 , t) 0 Re(t0 , t) 0 1
d1 (t − t0 , x , K (K, t)) = d1 (t − t0 , x , K (K, t)) − p log (1 + µt )
P (t0 , t) P (t0 , t) σ (t − t0 )
In this model, limµt →0 α(σ) = 0, so that the parameter µt control the skew and the curvature of the volatility surface.
Combining terms together, the parametric price model is approximated by
206
n n
1 X 0 1 X 0
CM (t0 , S0 , Pt , Rt , Dt ; K, t) ≈ ai (t)CBS (t0 , St0 , K (K, t), t; Σi (t))+ ai (t)αi (K (K, t), t; Σi (t))
norm i=1 norm i=1
a0i
where ai (t) = f (t,βi ) for i = 1, .., n. Note, the first term of the parametric model
n
1 X 0
y(0) = ai (t)CBS (t0 , St0 , K (K, t), t; Σi (t))
norm i=1
is a weighted sum of lognormal prices generating a symetric smile. We define the pure implied volatility for the price
y(0) with strike K and maturity t as
0 0
σ b(K (K, t), t)) − y(0) = 0
b : CBS (K (K, t), t; σ (6.2.7)
0
There are various ways of estimating σb(K (K, t), t)), and for simplicity of exposition we use the property of lineartity
of the Black-Scholes formula with respect to the volatility when the spot price S0 is close to the at-the-money forward
strike (see Equation (1.2.11)), by assuming
n
X
b) ≈
CBS (t0 , t; σ ai CBS (t0 , t; σi )
i=1
Pn
where σb = i=1 ai σi . That is, the implied volatility σ b is no-longer a function of the strike but only a function of
time. We will call it the MixVol term structure. As a result, the approximated parametric model simplifies to
0 0
CM (t0 , St0 , Pt , Rt , Dt ; K, t) ≈ CBS (t0 , St0 , K (K, t), t; Σ(t))
b + α(t0 , St0 , K (K, t), t)
where
n
0 1 X 0
α(t0 , St0 , K (K, t), t) = ai (t)αi (K (K, t), t; Σi (t))
norm i=1
and
n
1 X
σ
b = Σ(t)
b = ai (t)Σi (t) (6.2.8)
norm i=1
By performing a Taylor series expansion around the weighted Black-Scholes formula y(0), we deduce the approxi-
mated implied volatility for strike K and maturity t as follow
0 1 0 1 d1 (bσ )d2 (b
σ) 1 0
Σ(K (K, t), t) ≈ σ
b+ α(t0 , St0 , K (K, t), t) − α2 (t0 , St0 , K (K, t), t)
vega(b σ) 2 σ
b σ )]2
[vega(b
1 d1 (b
σ )d2 (b σ ) + d21 (b
σ ) + d22 (b σ ) + 2d21 (b
σ )d22 (bσ) 1 0
+ α3 (t0 , St0 , K (K, t), t) + ...+ (6.2.9)
6 b2
σ σ )]3
[vega(b
From this representation of the IVS we see that it is a highly non-linear function of the term structures Σi (t) as well
as the shift parameters µi (t). Further, given the properties of the Vega in Section (B.1.2), this IVS is not defined for
short maturities (T − t) << 0 and for the strike range where the Vega is too small. One possibility, if one wants to
0
directly use Σ(K (K, t), t), is to cap the inverse Vega
1
min ( , cap)
vega(b
σ)
207
with a positive constant. Since the Black-Scholes Vega reaches its highest values at-the-money where the BS-formula
is given in Equation (1.2.12), we choose
√
cap = 0.4x T − t
n
1 X
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = xRe(t0 , t) ai (t)N (di1 (t − t0 , x, K̃(K, t))
norm i=1
n
0 1 X
− K (K, t) ai P (t0 , t)(1 + µi (t))N (di2 (t − t0 , x, K̃(K, t)))
norm i=1
Given the derivative of the parametric model with respect to the strike estimated in Equation (6.3.13), and combining
terms together, the parametric model for a call option becomes
n
1 X 0 ∂
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = xRe(t0 , t) ai (t)N (di1 (t−t0 , x, K̃(K, t))+K (K, t) CM (t0 , S0 , Pt , Rt , Dt ; K, t)
norm i=1 ∂K
Since the Digital Bond can be expressed in terms of a digital option as D(S, t0 , t; ξ) = −∂K C(t0 , St0 , t, K), its value
in the parametric model is
n
1 X
DM (S, t0 , t; ξ) = ai (t)P (t0 , t)(1 + µi (t))N (di2 (t − t0 , x, K̃(K, t))) (6.2.11)
norm i=1
which we can write as a shifted weighted sum of digital options on a shifted strike
n
1 X
DM (S, t0 , t; ξ) = ai (t)(1 + µi (t))D(S, t0 , t; ξi )
norm i=1
We can easily show that the Small-Moneyness Behaviour (SMB) defined by Roper [2010] is satisfied by construction
in the MixVol model. Letting the strike tends to zero in the model, and using Equation (1.4.23) on each Black-Scholes
formula, we get the limit
n n n
P (t0 , t) X X X
a0i µ0i = −P (t0 , t)
lim DM (S, t0 , t; ξ) = − ai (t)(1 + µi (t)) = −P (t0 , t) ai (t) +
K→0 norm i=1 i=1 i=1
208
from the no-free lunch constraints in Equation (6.2.4). Similarly, by analogy to the call price in Equation (1.5.27), the
Digital Share is
n
1 X
SM (S, t0 , t; ξ) = xRe(t0 , t) ai (t)N (di1 (t − t0 , x, K̃(K, t))
norm i=1
so that the call option becomes
0 0 0
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = SM (S, t0 , t; Zt > K ) − K (K, t)DM (S, t0 , t; Zt > K )
∂ xRe(t0 ,t) ∂ dd2 √1
Given ∂d2 N (d2 ) = KP (t0 ,t) ∂d1 N (d1 ) and dx = xσ t−t0
the delta of a Digital option is
n
∂ 1 X 1
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = − ai (t)P (t0 , t)(1 + µi (t))n(di2 (t − t0 , x, K̃(K, t))) √
∂KS norm i=1 xΣi (t) t − t0
n
1 1 X
Skew(K, t) = DBS (S, t0 , t; ξ) − ai (t)(1 + µi (t))D(S, t0 , t; ξi ; Σi ))
V ega(K, t; ΣBS (K, t)) norm i=1
1
Pn
When the shift terms are set to zero, µi (t) = 0 for i = 1, .., n, the Digital Bond simplifies to norm i=1 ai (t)D(S, t0 , t; ξ; Σi )
which is a weighted sum of digital option on a GBM. Note, it still has a skew but much less prononced.
209
∂KK Σ(K, t) =
∂KK CM (t0 , S0 ; K, t) 1 h √ i
− 2 1 + 2Kd1 t − t0 Skew(K, t) + K 2 d1 d2 (t − t0 )(Skew(K, t))2
V ega(K, t; ΣBS (K, t)) K Σ(K, t)(t − t0 )
Given the optimum vector Ψ∗ of model parameters, we can compute analytically the European call and put prices for
all maturity t and strike K. Inverting the Black-Scholes formula, we recover the implied volatility surface Σ(K, t).
We can then use that surface to compute exactly the skew and curvature of the parametric IV surface. Alternatively,
one can approximateP the skew and curvature of the IV surface around the money by considering the ATM volatility
1 n
σ
b = Σ(t)
b = norm i=1 ai (t)Σi (t). As a result, given a volatility function Σ(K, t) obtained with a parametric
function such as a SVI or SABR, one can directly fit our parametric model to volatilities without inverting the Black-
Scoles formula.
n
X m
X
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = ai (t) CallBS (t0 , S0 , Rt , Pt , K(K, t), t, Σi (t) + vj )φVb (vj )dvj (6.2.12)
i=1 j=1
0 0
where ai (t) for i = 1, .., n are the weights, K(K, t) = K (K, t)(1 + µi (t)) with K (K, t) = K + Dt , and φVb (•) is
the lognormal density. Letting Σij (t) = Σi (t) + vj be the new term structure of volatility, the total variance must be
an increasing function of time. From the constraint in Equation (6.2.3), the volatility Σij (t) must satisfies
CM (t0 , S0 ; K, t) − PM (t0 , S0 ; K, t) =
n m
1 X X
ai (t) CBS (t0 , S0 ; K i (K, t), t, Σij (t)) − PBS (t0 , S0 ; K i (K, t), t, Σij (t)) φVb (vj )dvj
norm i=1 j=1
n m
1 X X
ai (t) S0 Re(t0 , t) − K i (K, t)P (t0 , t) φVb (vj )dvj = S0 Re(t0 , t) − DP V (t0 ; t0 , t) − KP (t0 , t)
norm i=1 j=1
Since S0 Re(t0 , t) − K i (K, t)P (t0 , t) does not depend on vj , we can rewrite the equation as
n m
1 X X
ai (t) S0 Re(t0 , t) − K i (K, t)P (t0 , t) φVb (vj )dvj = S0 Re(t0 , t) − DP V (t0 ; t0 , t) − KP (t0 , t)
norm i=1 j=1
210
Pm
Further, j=1 φVb (vj )dvj = 1, such that the equation simplifies to
n
1 X
ai (t) S0 Re(t0 , t) − K i (K, t)P (t0 , t) = S0 Re(t0 , t) − DP V (t0 ; t0 , t) − KP (t0 , t)
norm i=1
and we recover the constraint for the MixVol model in Section (6.2.2.2).
n
∂ 1 X ∂ ∂ K̃
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = ai (t) CallBS (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t)) (6.3.13)
∂K norm i=1 ∂ K̃ ∂K
211
∂ K̃
where ∂K = P (t0 , t)(1 + µi (t)). Differentiating one more time the call price with respect to the strike, we get
n
∂2 1 X ∂2 ∂ K̃ 2
C M (t 0 , S0 , Pt , Rt , Dt ; K, t) = ai (t) CallBS (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t))( )
∂K 2 norm i=1 ∂(K̃) 2 ∂K
(6.3.14)
Differentiating the call price with respect to time t0 , we get
∂ d 1
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = CM (t0 , S0 , Pt , Rt , Dt ; K, t)norm
∂t0 dt0 norm
n
1 X d
+ ai (t)CallBS (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t))
norm i=1 dt0
n
X d ∂CallBS
+ ai (t) K̃(K, t) (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t))
i=1
dt0 ∂ K̃
n
X d ∂CallBS
+ ai (t) Σi (t) (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t))
i=1
dt0 ∂Σi
n
X ∂CallBS
+ ai (t) (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t))
i=1
∂t0
where
d d d
K̃(K, t) = D(t0 , t)P (t0 , t)(1 + µi (t)) + rt0 K̃(K, t) + P (t0 , t)(K + D(t0 , t)) (1 + µi (t))
dt0 dt0 dt0
with dtd0 µi (t) = µ0i dtd0 f (t, βi ). Note, when there is no dividends between t0 and t0 + then the cumulative dividends
term D(t0 , t) = Dt does not depend on t0 and the derivative becomes dtd0 K(K, t) = rt0 K̃(K, t) + P (t0 , t)(K +
D(t0 , t)) dtd0 (1 + µi (t)).
212
Hence, for a Fixed Delta with an upwind finite difference and a bumped spot St = S0 +α, where α = S0 , multiplying
the strike with (1 + ), the shifted option price becomes
n
1 X
CM (t0 , S0 , Pt , Rt , Dt ; K, t; ) = ai (t)CallBS (t0 , S0 (1 + ), Rt , Pt , K(K, t)(1 + ), t, Σi (t))
norm i=1
= (1 + )CM (t0 , S0 , Pt , Rt , Dt ; K, t)
Similarly, for a Floating Delta, the shifted option price in our parametric model becomes
n
1 X
CM (t0 , S0 , Pt , Rt , Dt ; K, t; ) = ai (t)CallBS (t0 , S0 (1 + ), Rt , Pt , K(K, t), t, Σi (t))
norm i=1
1
= (1 + )CM (t0 , S0 , Pt , Rt , Dt ; K , t)
(1 + )
In that setting, given the strike K at time t, the old strike is K SS0 +
0
, or vice versa, given K at inception t0 = 0, the
St S0 +α
new strike is K S0 = K S0 . It is interesting to note that the Floating Delta obtained with finite difference is an
approximation to the weighted sum of Black-Scholes deltas
n
∆ 1 X ∆
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = ai (t)CallBS (t0 , S0 , Rt , Pt , K(K, t), t, Σi (t))
norm i=1
Further, assuming that the discrete dividend impact is negligable between t0 and t0 + , we bump the initial time with
a constant to compute the shifted price. It is an approximation to differentiating the parametric model with respect
to time t0 in order to get the annualised analytic Theta. One can show that
Θ Θ
∂CM (t0 , S0 , Pt , Rt , Dt ; K, t) ∂CM (t0 , S0 , Pt , Rt , Dt ; K, t) dτ
=
∂t0 ∂τ dt0
where τ = t − t0 . Hence, since our parametric model is homogeneous in τ , when t0 = 0, we can use the derivative of
the price with respect to the maturity t with formula given in Section (6.3.1.1).
213
∂Σ
∂xx CM (t, St , K, T ) = ∂xx CBS (t, St , K, T ; Σ) + 2∂xΣ CBS (t, St , K, T ; Σ)kS (6.3.16)
∂K
2
∂Σ 2 ∂ Σ
+ ∂ΣΣ CBS (t, St , K, T ; Σ) kS + ∂Σ CBS (t, St , K, T ; Σ)kC
∂K ∂K 2
where kS and kC satisfy
0 if sticky strike
0 if sticky strike
1 if sticky implied tree
kS = kC = 1 if sticky implied tree
− K if sticky delta K 2
K St ( St ) if sticky delta and minimum variance
St if minimum variance
n
∂ 1 X ∂ ∂Σi
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = ai (t) CallBS (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t))
∂Σb norm i=1 ∂Σi ∂Σ
b
∂Σi norm
Since ∂Σ
b = ai (t) , the Vega simplifies to
n
∂ X ∂
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = CallBS (t0 , S0 , Rt , 1, K̃(K, t), t, Σi (t))
∂Σb
i=1
∂Σi
Further, from the expansion of the IVS, we were capable of understanding how to modify our parametric model price
in order to generate a parallel shift, a skew shift, or a curvature shift of the implied volatility surface. Hence, in view of
adding control to the generated volatility surface, we propose to modify the model in Equation (6.2.10) with an extra
term as follow
n
1 X
CM (t0 , S0 , Pt , Rt , Dt ; K, t) = ai (t)CallBS (t0 , S0 , Rt , Pt , K(K, t), t, Σi (t)) (6.3.17)
norm i=1
0 0
+ vegaBS (K (K, t), t; σ
b(K (K, t), t)))h(K, t; AT M , S , C )
0
where σ
b(K (K, t), t)) satisfy Equation (6.2.7) and where
0 0
K (K, t) K (K, t) 2
h(K, t; AT M , S , C ) = AT M − S ln + C (ln )
x x
214
with AT M << 1, S << 1, and C ∈ R+ . The parameter AT M produces a parallel shift of the volatility surface
∂
and should be of the order of the percentage (for example 1%). The skew S is given by ∂K Σ(t, St ; K, T − t), and
using finite differences, it is expressed as a few percents per 10% change in the strike. For example, a skew of 2%
∂2
per 10% change in the strike is 0.2. Similarly, using finite differences to discretise the curve ∂K 2 Σ(t, St ; K, T − t)
4% 0 0
in the previous example, gives C = (10%) 2 = 4. One can calibrate the model parameters ai , µi , and Σi (t) to the
market and then as time evolves modify manually AT M to generate parallel shift to the implied volatility surface, S
to generate skew shift to the implied volatility surface, and C to generate curve shift to the implied volatility surface.
The only additional constraint is for the price in Equation (6.3.17) to remain positive. Note, the parameters AT M , S
and C can also be used to generate analytically the new local volatility when computing the Vegas of an option.
n
1 X 0
CM (t0 , S0 , Pt , Rt , Dt ; K, t; ) = ai (t)CallBS (t0 , S0 , Rt , Pt , K (K, t)(1 + (µ0i + )fi (t)), t, Σi (t))
norm i=1
where is a constant of the order of magnitude of µ0i , and such that µ0i + ≥ −1. In that scenario, the martingale
property of the induced risk-neutral pdf given in Equation (6.2.4) becomes
215
n
X
a0i µ0i =
i=1
n
1 X 0
CM (t0 , S0 , Pt , Rt , Dt ; K, t; ) = ai (t)CallBS (t0 , S0 , Rt , Pt , K (K, t)(1 + µ0i (fi (t) + )), t, Σi (t))
norm i=1
Rather than modifying the function fi (t), we can directly modify the parameter βi in the previous parametric price
CM (t0 , S0 ; K, t), getting f (t, βi (1 + )) with << 1. In addition, multiplying each parameter µ0i with the same
constant (1 + ), rather than adding a constant, will preserve the no-arbitrage constraints. Hence, under that scenario,
the shifted option price becomes
n
1 X 0
CM (t0 , S0 , Pt , Rt , Dt ; K, t; ) = ai (t)CallBS (t0 , S0 , Rt , Pt , K (K, t)(1 + (1 + )µ0i fi (t)), t, Σi (t))
norm i=1
where << 1. Alternatively, we can multiply each parameter µ0i with the constant (1 + i ) if we force one degree of
liberty as
n
1 X 0 0
1 = − a µ (1 + i )
a01 µ01 i=2 i i
One can also bump each µ0i with 2 for i = 1, .., n, but in order to satisfy the no-arbitrage constraint we also need to
bump each weight a0i with 1 such that
2
1 = − Pn 0
i=1 i + n2
µ
As the representation of the IVS in Equation (6.2.9) is a highly non-linear function of the term structures Σi (t) as well
as the shift parameters µi (t), adding
Pn a positive constant to each term structure Σi (t) will produce a parallel shift in
the MixVol term structure σ b = i=1 ai (t)Σi (t) but not in the IVS itself. That is, for t1 > t0 the movement of the
0
new implied volatility surface Σ(t1 ; K , t) produced by that scenario depends on the shape of the IVS at inception
0
Σ(t0 ; K , t) which is in accordance with the result found by Rogers et al. [2010] given in Equation (1.4.17). The
smaller the skew, the larger the impact of σ
b on the IV surface. As a result, we can shift each BS function Σi (t) with
a positive constant to generate a parallel shift type of movements of the IVS around the money, getting the shifted
parametric price
n
1 X
CM (t0 , S0 , Pt , Rt , Dt ; K, t; ) = ai (t)CallBS (t0 , S0 , Rt , Pt , K(K, t), t, Σi (t) + )
norm i=1
where = 1% to be consistent with the Black-Scholes vega. We can also define another type of volatility scenario
dependent on the maturity with
216
Z t
1
(1 + )Σ2i (t) = (1 + ) σ 2 (s)ds
t 0
where << 1, so that the shifted option price becomes
n
1 X
CM (t0 , S0 , Pt , Rt , Dt ; K, t; ) = ai (t)CallBS (t0 , S0 , Rt , Pt , K(K, t), t, Σi (t)(1 + ))
norm i=1
Combining all these bumps together, we obtain a no-arbitrage deformation of the IV surface generated by the shifted
option price
CM (t0 , S0 , Pt , Rt , Dt ; K, t; µ , β , σ )
n
1 X 0
= ai (t)CallBS (t0 , S0 , Rt , Pt , K (K, t)(1 + (1 + µ )µ0i f (t, βi (1 + β ))), t, Σi (t) + σ )
norm i=1
For risk management purposes, we can therefore compute the modified vega as
217
We are going to describe a few complex strategies on single stock options and we will then discuss some well known
strategies on multi-asset options. Multi-asset options is an example of contingent claims necessitating the analysis of
sequences of random variables requiring two types of input:
A direct consequence of the Efficient Market Hypothesis (EMH) is that the most important concepts in theoretical and
empirical finance developed around the assumption that asset returns follow a normal distribution. This includes the
classical portfolio theory, the Black-Scholes-Merton option pricing model and the RiskMetrics variance-covariance
approach to Value at Risk (VaR). One consequence of the capital asset pricing model (CAPM), BS-formula and Var
is that statistical approaches must be restricted to second-oder stationary processes. Thus, the portfolio risk becomes
a weighted sum of covariation of all stocks in the portfolio and one is left with modelling the variance and correlation
of Gaussian random variables. However, financial time series exhibit multifractal scaling behaviour indicating a
complex behaviour with long-range time correlations manifested on different intrinsic time scales. Hence, variance
and covariance are not the proper measures of risk when analysing financial derivatives. These phenomena can easily
be observed in the market via the existence of an implied volatility surface for single stock options and an implied
correlation matrix for multi-asset options. Moreover, since the variance and covariance of financial time series are not
necessarily defined, their historical dynamics are stochastic with erratic jumps. Thus, in order to account for these
dynamics when pricing path-dependent options, practitioners and academics focused on modelling the variance and
covariace with jump-diffusion processes.
218
The variance swaps have a large number of applications such as hedging, volatility trading and relative value
strategies, portfolio diversification and structured products. They can be used in association with a view, to speculate
on future levels of volatility, or for relative value trades including volatility pairs and dispersion trades. They can also
be used as a way to diversify a portfolio when considered as an asset class. To conclude they can be used to structure
products as they can be embedded into certificates, notes and other structured products.
Proposition 11 If we take the limit and let n tends to infinity, then the realised annualised variance V n is approxi-
mated by the quadratic variation of the stock price, that is,
1 T
Z Z T
1 2
V T = [L, L]T = σ(ω, s) ds + dJbL (s)
T T 0 0
∆L 6
= 0 N
|∆Ls |2 = j t |ZL (j)|2 is a pure jump process with E[JbL (t)] = V ar[JL (t)] = [0,t] λ R z 2 FL (dz)ds.
P s
P R R
where JbL (t) = s∈[0,t]
Definition 7.1.1 We let the price of the variance swap contract (VA (t, T ))t≥0 , seen at time t with maturity T , satisfies
under the risk-neutral measure Q
RT
Z T Z T
VA (t, T ) = E Q [e− rs ds
σ 2 (ω, s)ds +
t dJbL (s) |Ft ]
0 0
1 JL is a pure jump process with intensity λL and whose distribution FL has the transform θL associated to the logarithm of the stock price.
219
where the payoff of the contingent claim is the quadratic variation of a Lévy process.
We denote W T the total variance over the period [0, T ] and get
Z T Z T
dSt ST 2
WT = |σ(t, ω)| dt = 2
− log
0 0 St St0
The first term in the brackets is the net outcome of continuously rebalancing a stock position so that it is always
instantaneously long S1t shares of stock worth 1. So the strategy of holding a portfolio of shares constantly equal to
one unit of currency defines the payoff, that is the cost or profit of the strategy. The second term in the brackets is
a static short position in a log-contract which can be replicated with a weighted combination of European options.
Therefore, following this strategy we can replicate the total variance W T in a model-independent way as long as the
stock price is a diffusion. Taking the expectation of the total variance and using the Breeden and Litzenberger [1978]
results, we get
Z T Z T
dSt ST
Et0 [ |σ(t, ω)|2 dt] = 2Et0 [ ] − 2Et0 [log ]
0 0 St St0
Z T Z St0 Z ∞
1 1
rt dt − (P (0, T )−1 − 1) +
= 2 P (T, K)dK + C(T, K)dK
0 0 K2 St0 K 2
where C(T, ·) and P (T, ·) are call and put prices. In the case where we center the expansion on the forward price
(with deterministic rates) we get
Z T Z T
ST dSt 1
log = − |σ(t, ω)|2 dt + log P (0, T )
FT (0) 0 St 2 0
and the total realised variance over the period [0, T ] becomes
Z T Z T
dSt ST St0
WT = |σ(t, ω)|2 dt = 2 − log + log
0 0 St FT (0) FT (0)
Taking the expectation of the total variance we get
Z T Z FT (0) Z ∞
2 ST 1 1
Et0 [ |σ(t, ω)| dt] = −2Et0 [log ]=2 P (T, K)dK + C(T, K)dK (7.1.1)
0 FT (0) 0 K2 FT (0) K2
where C(T, ·) and P (T, ·) are undiscounted call and put prices. From the properties of the law of iterated expectation
for random variables 2 we can recover the variance swap from the local volatility of the underlying price. That is, we
have the approximation in space
2 E[X] = E[E[X|Y ]]
220
Kmax
X
Et0 [|σ(T, ω)|2 |ST = K]P (ST = K) ≈ Et0 [|σ(T, ω)|2 ]
K=Kmin
So that the weighted sum in time and in space of the local variance of a diffusion process with deterministic rates gives
back the variance swap.
Remark 7.1.1 Since the instantaneous volatility σ(t, ω) is not a function of time T , differentiating the variance swap
(or the expected log-contract) with respect to maturity T and conditioning the resulting expected value with ST = K
we can express the forward variance swap in terms of the local variance.
where the Laplace transform of the jump size is θ(c) = E[ec.z ] = R ec.z FL (dz) and which is such that the discounted
R
stock price is a martingale. We then compute the variation of the process St = eLt , corresponding to the exponential
of the Lévy process defined above. Applying Ito’s lemma, we get its variation as
Z t Z t Z t X
eLs− +∆Ls − eLs−
St = S0 + Ss bs ds − λ Ss (θ(1) − 1)ds + σs Ss dW (s) +
0 0 0 0≤s≤t,∆Ls 6=0
Using the theory on Lévy processes, we can express the relationship between the dynamic of the process St and
the dynamic of its logarithm Lt , that is, transform the exponential of a Lévy process into a stochastic Doleans-Dade
exponential. Integrating that equation between [0, T ], we get
Z Z T Z Z
dSt
σs2 ds + 2 ez − 1 − z JL (ds × dz)
2 − LT + L0 =
[0,T ] St− 0 [0,T ] R
Taking the expectation of the price of the variance swap expressed in terms of the price of the logarithm contract, we
get
Z Z T Z Z
dSt
|σs |2 ds] + 2λ ez − 1 − z FL (dz)ds
2 E[ ] − E[LT ] + L0 = E[
[0,T ] St− 0 [0,T ] R
We know from Proposition (11) that the fair price of the variance swap is
Z t Z Z
2
z 2 FL (dz)ds
E [L, L]t = E[ |σs | ds] + λ
0 [0,t] R
We see that the logarithm contract is a bias estimator of the variance swap as the right order of magnitude of the
jump term is O(z 2 ) when computing the quadratic variation of the logarithm of the stock price, while when using the
logarithm contract it is of the order O(z). The error term at maturity T is calculated by taking the difference between
the two approaches, getting
221
Z Z
1 2
z − ez + 1 + z FL (dz)ds
E [L, L]T − E[< U >T ] = 2λ
[0,T ] R 2
Thus, replicating the price of the variance swap with a continuum of call and put prices does not approximate correctly
the quadratic variation of the logarithm of the stock price in presence of jumps. It incurs an error term which is model
dependent (depending on the distribution of the jump size) and varies depending on the maturity considered. We
therefore need a model (incorporating stochastic volatility and jumps) to correctly value the price of the variance swap
for all time t.
where
dS(t) = S(t) rdt + σdWt
The price of an Asian call option is given by
Z T
1 +
CA (t) = e−r(T −t) E[ S(u)du − K ] (7.2.2)
θ T −θ
In the special case where all parameters are constants, the stock price simplifies to
r−d− 12 σ 2 ti +σ(ti )Wti
S(ti ) = S(0)e
We are interested in the law of a weighted sum of Geometric Brownian Motion
n
X
Y = αi S(ti )
i=1
222
Assuming the call option payoff to be convex, the call option is convex and we can apply Jensen’s inequality, getting
Therefore, the law of Y is approximated by the law of Y |Z, where we consider the process Ui = Wti normalised, to
give the state variable
U ' N (0, 1)
This can be written as
n
X
Y |Z = µ̃i eγ(ti )U (7.2.3)
i=1
where
qi = ρσ(ti )
There is no more process, and we are only left with the manipulation of the terminal distribution. We are not concerned
by a possible evolution of a given process. We have mapped the weighted sum of n Geometric Brownian Motions into
a one dimensional world where the volatilities are multiplied by their respective correlation. Note that as Z gets closer
to Y , its distribution tends to the Dirac distribution. We are therefore trying to reduce Y |Z to a deterministic function.
We have actually obtained a variance reduction, since
223
Applying Fubini’s theorem and interchanging the expectation with the integral operator, we get
T
e−r(T −t)
Z
X(t) = E[S(u)du]
θ T −θ
E[S(u)|Ft ] = S(u)
so that the process becomes
t∨(T −θ)
e−r(T −t)
Z
X(t) = S(u)du
θ T −θ
X(t) = ρ(t)S(t)
where
X(0) = ρ(0)S(0)
where
1 − e−rT
ρ(0) =
rθ
Now, we can write the process as
t∨(T −θ)
e−r(T −t)
Z
X(t) = ρ(t)S(t) + S(u)du
θ T −θ
224
σρ(t)S(t)
(7.2.6)
X(t)
Since we have the relation
ρ(t)S(t)
<1 (7.2.7)
X(t)
it can be seen that the true volatility is dominated by the misspecified volatility σ. We now consider Theorem (4.2.1)
and write
σρ(t)S(t)
• the true volatility as σ(t) = X(t)
+
CA (0) = E[e−rT X(T ) − K ]
≤ BS(T, X(0); r, σ)
Therefore, the price of an asian option is dominated by a Black-Scholes price. However, this Black-Scholes price is
not a tight upper bound. This is because among all possible choice of the misspecified volatility γ(t, S(t)), we chose
RT
the average volatility to be a constant. Remember that our underlying replicate T −θ S(u)du at maturity, and that at
maturity, it is a well known event with volatility equal to zero. Thus, our Black-Scholes price over estimate the true
price as time tends to maturity. To overtake this problem, we need to take a well chosen time dependent misspecified
volatility, with the property of being zero at maturity. In that case, we obtain a good approximation of the stochastic
volatility by a deterministic one. In terms of hedging, the pair process
is a superstrategy.
225
In the case of the Black-Scholes with stochastic volatility, we take the Log-Normal distribution of the true forward
with the dominated misspecified volatility
1 2
Y = X(0)eσWT − 2 σ
We see that the problem is to choose the best possible misspecified time dependent volatility which always satisfies
where φB (·, ·) is the joint density of S1,t and S2,t under the risk-neutral measure. However, in the case where we can
directly buy and sell options on a traded index Itx , like stock index options (SIO), we can directly price these options
as
where φI (·) is the density of the index It under the risk-neutral measure.
(K, T ) → Σ(K, T )
226
as the implied volatility surface (IVS). Rebonato once stated that implied volatility was the wrong number to put in the
wrong formula to get the right price. Nonetheless, it contains additional information about expected future volatility
not included in the historical volatility. After 1981, a market for stock index options (SIO) developed to manage
and hedge portfolio exposure, where the Black-Scoles formula was used to price these complex options. Similarly
to single stock options, after the market crash of 1987, we observed an implied volatility surface when pricing stock
index options. Further, combining the constituent single stock options with a Gaussian copula to price an index
option would not fully explain the volatility skew of that index option, leading to the notion of implied correlation
(IC). Thus, the market attempted at inferring an implied correlation from option prices on multiple assets. It would
require liquid markets for such products in the multi-asset case and the existence of a closed-form solution to the
multivariate lognormal model, equivalent to the BS-formula in single asset. However, neither of them exist. It did
not stop practitioners from defining different implied correlation indicators based on the different derivative products
considered. For example, they defined an implied correlation for spread options, one for worst-of options, one for
index dispersion, and one for correlation swaps. Obviously, these implied correlations are not consistent with one
another. Moreover, they observed a non-flat implied correlation surface, indicating that returns are non-Gaussian and
most likely that their increments are not independent. By analogy with the implied volatility, the implied correlation
can be viewed as the wrong number to put in the wrong pricer given a wrong volatility model to get the right price
(see Delanoe [2014]). Rather than modifying the Browninan motions, they chose to consider stochastic and local
correlation models to explain the correlation skew.
(n, 1) and σt is a matrix of dimension (1, n) with element σi (t). We assume that σt , rt and qt are second-order
stochastic processes adapted to the filtration generated by B. That is, we have assumed a structure of correlation
modelled with a finite number of independent Brownian motions and a vector of volatility σt . When all the ele-
ments σi (t) are constants, the model reduces to the classic multivariate lognormal model (generalisation of Samuelson
lognormal model
Pn described in Section (1.2.2)). Hence, we get a family of random functions of volatility, such that
σt dB(t) = j=1 σj (t)dBj (t), and the dynamics of the process rewrite as
n
X
dXt = µt dt + σj dBj (t)
j=1
Using the properties of the stochastic integral and the Brownian motion, we can re-express the above equation in terms
of the volatility volS (t) and a unidimensional Brownian motion Z(t) as
227
n
X
V (t) = volS2 (t) = σj2 (t) = ||σt ||2 = σt σt>
j=1
˜ T ) = (T − t)I 2 (t, T )
I(t,
The stochastic differential equation (SDE) of the process becomes
p
dXt = µt dt + V (t)dZ(t)
with
Pn
j=1 σj dBj (t)
dZ(t) = qP
n 2
j=1 σj (t)
volS1 S2 (t) = ||σt1 + σt2 || = ||σt1 ||2 + ||σt2 ||2 + 2ρ12 (t)||σt1 ||||σt2 ||
228
7.3.4.1 Type 1
In a multidimensional Black-Scholes model with N stocks S1 , .., SN , the risk-neutral dynamics are given by
n
dSi (t) X
= r(t)dt + σij (t)dBj (t) , i = 1, ..., N
Si (t) j=1
where Bj for j = 1, .., n are independent standard Brownian motions. Further, the interest rate and the volatilities
Rt
σij (t) can be deterministic functions of time. For simplicity, we denote σij = 0 σij (s)ds the time dependent volatility
function. Correlations among different stocks are captured through the matrix (σij ). The solutions to the above SDE
are
Pn 2
1
)t+ n
P
Si (t) = Si (0)e(r− 2 j=1 (σij ) j=1 σij Bj (t) , i = 1, ..., N
In a general framework, we let X be the random variable
N √
t− 12 V ar(Gi )t
X
X= i xi eGi
i=0
where (Gi )i=0,...,N is a mean zero Gaussian vector of size (N + 1) and covariance matrix Σ, i = ±1, and xi > 0 for
all i = 0, ..., N . Note, Σ is symmetric positive semi-definite but not necessarily definite.
The risk-neutral dynamics of the forward price Fi (t, T ) are given by
n
dFi (t, T ) X
= σij (t)dBj (t) , i = 1, ..., N
Fi (t, T ) j=1
PN
Then, given the forward basket price B(t, T ) = i=1 ωi Fi (t, T ), with weights ωi ≥ 0, its dynamics satisfy the SDE
N
X n
X
dB(t, T ) = ωi Fi (t, T ) σij (t)dBj (t)
i=1 j=1
where
ωi Fi (t, T ) ωi Fi (t, T )
ω
bi (t) = PN = > , i = 1, ..., N (7.3.10)
ω
i=1 i i F (t, T ) ω F (t, T )
PN
with 0 ≤ ω
bi (t) ≤ 1 and i=1 ω bi (t) = 1.
We can define the matrix of volatility of the basket ΓB (t, T ) of size (1, n) as
229
dB(t, T )
= ΓB (t, T )dZ(t)
B(t, T )
with solution
Rt
1
VB (s)ds+ 0t n
R P
B(t, T ) = B(0, T )e− 2 0 j=1 ΓB,j (t,T )dZj (s)
7.3.4.2 Type 2
Alternatively, following the notation in Section (7.3.3), we can rewrite the dynamics of the underlying forward price
as follow
dFi (t, T )
= σti dZ(t)
Fi (t, T )
Pn
where σti is a matrix of dimension (1, n) such that σti dZ(t) = j=1 σij (t)dZj (t) and the stock’s instantaneous
variance is given by Vi (t) = ||σti ||2 . Then, the dynamics to the forward basket price B(t, T ) satisfy
N
X
dB(t, T ) = ωi Fi (t, T )σti dZ(t)
i=1
N
X
T r(Ωt Xu ) = Ωi,j (t)Xi,j (t)
i,j=1
N
X
= ω ωj (t)σui σuj
bi (t)b
i,j=1
230
and note that σtB is Ft -measurable. Hence, we can rewrite the dynamics of the basket as
dB(t, T )
= σtB dZ(t)
B(t, T )
PN Pn
where σtB dZ(t) = i=1 ω
bi (t) j=1 σij (t)dBj (t) and with instantaneous variance given by
7.3.4.3 Type 3
An alternative approach is to consider the dynamics of the basket as
N
dB(t, T ) X dFi (t, T )
= ω
bi (t) (7.3.11)
B(t, T ) i=1
Fi (t, T )
where the instantaneous return of the basket is a weighted sum of the instantaneous returns of single stocks with
stochastic weights. We can compute the instantaneous variance of the basket as
N
dB(t, T ) X dFi (t, T )
VB (t) = V ar( ) = V ar( ω
bi (t) )
B(t, T ) i=1
Fi (t, T )
From independence of the Brownian motions, the instantaneous variance simplifies to
N N
X dFi (t, T ) X
VB (t) = V ar(b
ωi (t) )= ωi (t)σti ||2
||b
i=1
Fi (t, T ) i=1
In the case where we have not projected the Brownian motions W in a basis of independent Brownian motions B, the
instantaneous variance of the basket becomes
N
X dSi (t) dSj (t)
VB (t) = ω
bi (t)b
ωj (t)Cov( , )
i,j=1
Si (t) Sj (t)
Since the Brownian motions are correlated, each stock has a single Brownian motion instead of a vector of Brownian
motions, and the instantaneous variance of the basket becomes
N
X N
X
VB (t) = ω
bi (t)b
ωj (t) < σi (t), σj (t) > dt = ω
bi (t)b
ωj (t)ρij (t)σi (t)σj (t)dt (7.3.12)
i,j=1 i,j=1
231
N
1 X
VB (t) = ωi ωj ρij (t)Fi (t, T )Fj (t, T )σi (t)σj (t)dt (7.3.13)
B 2 (t, T ) i,j=1
Since the weights ωi are positive, the maximal covariance at time t is achieved when ρij (t) = 1 for all i and all j.
Thus, we must have
N
X
VB (t) ≤ ω
bi (t)b
ωj (t)σi (t)σj (t)dt (7.3.14)
i,j=1
Remark 7.3.1 The relation in Equation (7.3.14) between the instantaneous variance of a basket and that of its con-
stituents derives from stochastic calculus and can not be used to relate basket options to individual options.
Definition 7.4.1 We define the implied correlation of a spread option as the level of correlation we need to plug into
a basic pricer to recover the market prices of the option.
Hence, given the option price C(t, S1,t , S2,t , K, T ) at time t for a strike K and a maturity T , the market implied
correlation ρS (t, S1,t , S2,t ; K, T ) satisfies
C(t, S1,t , S2,t , K, T ) = Cbp (t, S1,t , S2,t , K, T ; Σ1 (K, T ), Σ2 (K, T ); ρS ) (7.4.15)
232
where Cbp (t, S1,t , S2,t , K, T ; σ1 , σ2 ; ρ) is the basic pricer of a spread option under the multivariate lognormal (ML)
model. When K = 0 we use Magrabe’s formula, and the price Cbp is monotonically decreasing in the correlation
ρS . Otherwise, for K 6= 0, the monotonicity depends on the choice of the approximation pricer. Kirk’s formula, for
K close to zero, preserves that property. The implied correlation is obtained by inverting the basic pricer formula
−1
Cbp (C(t, S1,t , S2,t , K, T ); K, T ). Consequently, for a fixed maturity T , we refer to the two-dimensional map
(α, K) → ρS (α, K)
as the implied correlation surface. We can therefore define the 3d-volume of implied correlation as
(α, K, T ) → ρS (α, K, T )
The trading volume of spread options is much smaller than that of options on single stock making it very difficult
to calibrate implied correlation on market spread options prices. It has been suggested to calibrate part of the IC
coefficients to single stock options, fitting the individual implied volatility surfaces (IVS). Then, the correlation ρ is
seen as an additional input that can be calibrated to recover the impled correlation curve. However, as discussed in
Section (??), nearly all stock and equity options are American options. Nonetheless, mostly all the authors discussed
the calibration of their model to single stock European options. To overcome this problem, we suggested to first
extract synthetic European option prices from American option prices, and then apply the usual implied method to the
extracted European option prices.
Ma [2009b] used the Jacobi diffusion model (for details see Delbaen et al [2002]) to bound the stochastic correlation
p
dρt = κ ρ∞ − ρt dt + ξ (1 − ρt )(1 + ρt )dW
where W is a Brownian motion independent from W1 and W2 . Further, κ > 0 is the speed of mean reversion, ξ > 0
is the volatility of the instantaneous correlation, −1 < ρ0 < 1 and −1 < ρ∞ < 1. Under the constraint
ξ2 ξ2
− 1 < ρ∞ < 1 −
κ κ
the correlation ρt does not exit the interval [−1, 1]. Generalising the multiscale model of Fouque et al. [2003] to
options on multi-assets, Carmona et al. [2011] developed a two factor multiscale stochastic volatility model, and used
asymptotic methods to approximate the prices of spread options. In Fouque’s model, two factors Zt and Vt are used to
233
capture fast and slow scale volatilities respectively. The former is fast mean-reverting with > corresponding to the
fast time scale of the process. The latter is a slow-varying factor with small parameter δ. In that model, the implied
volatility is approximated by
1 K
Σ(K, T ) ≈ β(T ) + α(T ) ln
T x
1
which is linear with respect to the log-moneyness to maturity T ln K
x . More precisely, the implied volatility satisfies
a K K
Σ(K, T − t) ≈ σ + b + ln + aδ ln + b (T − t)
T −t x x
where σ is the average implied volatility, aδ and b are explicit function of the model. Allowing for more flexibility
than the multivariate lognormal model, they can account for non-Gaussian returns in a multi-assets framework, but the
market becomes incomplete.
Definition 7.4.2 Given f (x, y) an integrable continuous function defined on R2 , the Radon transform of f is the
function Rf defined by the line integral
Z ∞
Rf (α, K) = f (K + αy, y)dy
−∞
In the special case where the volatilities of the two processes are constant, σ1 (t, x1 , x2 ) = σ1 and σ2 (t, x1 , x2 ) = σ2 ,
the local correlation becomes
σ1 ασ2
R−1 ∂KK C 2ασ 1
2
+ 2σ1 − ∂t C ασ1 σ2
ρ(t, x1 , x2 ) =
R−1 (∂KK C)
When the volatilities σ1 and σ2 are not constants the local correlation formula is more involved as it requires computing
the density function
Problems We saw in Section (1.7.2.1) that Gyongy [1986] gave sufficient conditions for an n-dimentional Ito
process to have the same one dimensional marginal distributions as a Markov diffusion process with drift and volatility
being deterministic functions of time and the underlying process. Since the marginal distributions of the original model
are matched to the simple Markov process, this approach is used to calibrate the parameters of a high-dimensional
dynamics to European option prices. That is, the theoretical foundation of projecting an n-dimentional Ito process into
a Markov process only applies to European options and not to American options because the joint distributions are
not matched. However, in practice spread options are not liquid and practitioners calibrate their models to single stock
options, which are American options.
234
IO1 the irregular pattern of discrete dividends on the index. It impacts the valuation of American options.
IO2 the effect of interest rate volatility. It has an effect on the probability of early exercise of American options.
IO3 the fact that the stock index is not directly traded, only the component stocks are. It creates a hedging problem for
options written on large indices. Thus, there is no arbitrage relationship guaranteeing a particular link between
the options and the index. The use of futures contracts, when they exist, can improve the hedge. However, it is
problematic in the case of American options as futures price can stronly deviate from its theoretical price.
Brenner et al. found that in the case of narrow index the discrete dividend had an impact on the valuation formula.
When futures price can not reproduce its theoretical level, basis risk is introduced which can not be hedged away.
Further, stock index options are settled with cash, so that sellers are facing the risk of early cash settlment.
Note, the weights ωi are known at the outset so that they stay the same over time.
235
Proposition 12 The value of a portfolio of European/American call/put options, with common strike and maturity,
always exceeds the value of the corresponding basket option.
By sublinearity of the maximum, we get the following relation on the intrinsic value
N
X
max (B(T ) − K, 0) ≤ ωi max (Si (T ) − K, 0)
i=1
If the portfolio of options is exercised on the optimal excercise date of the option on the portfolio, the payoff of the
former is never less than that on the latter. Hence, by the dominance principle (see Proposition 10) we see that a
basket option is cheaper than the corresponding portfolio of plain vanilla options. In addition, a basket option takes
into account the imperfect correlation between the asset components, and minimise the transaction costs. However,
if we assume that the i-th stock price follows the dynamics of an Affine model, such as the Heston one factor model,
PN
and apply Ito’s Lemma to the forward basket price B(t, T ) = i=1 ωi Fi (t, T ) where Fi (t, T ) = PS(t,T i (t)
) , then its
dynamics are no-longer affine because of the processes Fi (t, T ) in the diffusion terms. In general, there is no explicit
analytical expression available for the distribution of the weighted sum of the assets, and using the Black-Sholes for-
mula on a collection of underlying stocks will not produce a closed-form solution to the price of a basket option. As a
result, determining the price of a basket option is not a trivial problem.
The most straightforward extension of the univariate BS-model is the Gaussian copula model, where the stocks com-
posing the basket are lognormally distributed and a Gaussian copula connetcs the marginals (see Hull et al. [1993],
Milevsky et al. [1998]). When asset prices evolve according to a multivariate binomial tree, the multivariate risk-
neutral density converge to a multivariate lognormal in the limits as the time step approaches zero (see Boyle [1988],
Boyle et al. [1989]). Rosenberg [1998] developed a parametrised density function for the valuation of multi-asset
options which was extended to the case of nonparametric multivariate density (see Rosenberg [2003]). El Karoui et
al. [1998] provided upper and lower bounds on a European call on the arithmetic average of two independent stock
price processes. In order to obtain bounds to the price of a basket option several authors applied methods used for
Asian options combined with the notion of comonotonicity (see Nielsen et al. [2003]). That is, Rogers et al. [1995]
proposed a method of conditioning to price Asian options. Kaas et al. [2000] developed a general technique based on
comonotonic risks to derive upper and lower bounds for stop-loss premiums of sums of dependent random variables.
Since the problem of pricing arithmetic basket options is equivalent to calculating stop-loss premiums of a sum of
dependent risks, Deelstra et al. [2002] combined these results with the conditioning technique of Rogers et al. to get
upper and lower bounds to the price of basket options. Alternatively, using a moment-matching method, Brigo et al.
[2004] found an equivalent log-normal random variable having the same mean and variance as the weighted basket of
its constituents. Thus, the BS-formula could be applied.
where φB (·, ·) is the joint density of S1,t and S2,t under the risk-neutral measure. From Skalar’s theorem, we can
express that joint density in terms of the marginal densities φ1 and φ2 of S1,t and S2,t as follow
236
φB (x1 , x2 ) = c12 F1 (x1 ), F2 (x2 ) f1 (x1 )f2 (x2 )
where Fi (·) is the cumulative distribution function (cdf) of Si,t (see details in Section (2.1.3.3)). Note, c12 =
∂y1 ,y2 C(y1 , y2 ) where C(·, ·) is the copula between S1,t and S2,t under the Q-measure. Thus, all we need to compute
the option price is the marginal distributions Fi , i = 1, 2, and the copula C(·, ·). For simplicity, we now describe
the Gaussian copula. Assuming a continuum of prices on the basket option market and single stock option market,
we use Equation (1.5.25) to compute the cumulative distribution P (B(T ) < K) of the basket and Equation (1.5.29)
to compute its density φB (t0 ; T, K). Similarly, we can compute the cumulative distribution P (Si (T ) < K) of the
ith single stock and its associated density φSi (t0 ; T, K). We let ψ be a n-dimensional standard normal correlated
Gaussian variate with
E Q [ψi ψj ] = ρij , i 6= j
and
E Q [ψi ] = 0 , E Q [ψi2 ] = 1
for a given correlation matrix ρij . Note, the random variables ψi are driven by a systematic factor W (ρ) and a stock
specific factor Wi (1 − ρ), where W and Wi are standard Brownian motions. The ith stock price at maturity T satisfies
√
Si (T ) = Si (t0 )eµi (T −t0 )+σi T −t0 ψi
such that the log returns are modelled by Normal marginals and a Gaussian copula. We can construct the joint
distribution of the basket from its constituents by inverting
φB (t0 ; S1 (T ), ..., SN (T ); T, K)
Thus, we can approximate the price of the basket option as follow
Z ∞ N
X +
BC(N,
d T, K) = P (0, T ) ωi Si − K φB (t0 ; S1 , ..., SN ; T, K)dS1 ...dSN
0 i=1
237
dSi (t)
= r(t)dt + σi (t)dZiQ (t) , i = 1, ..., N
Si (t)
with independent Brownian motions ZiQ for i = 1, ..., N where the interest rate and the volatilities σi (t) are deter-
PN
ministic functions of time. Then, the dynamics to the forward basket price B(t, T ) = i=1 ωi Fi (t, T ) satisfy
N
X
dB(t, T ) = ωi Fi (t, T )σi (t)dZiQ (t)
i=1
where
ωi Fi (t, T ) ωi Fi (t, T )
ω
bi (t) = PN = > , i = 1, ..., N
i=1 ωi Fi (t, T )
ω F (t, T )
PN
with 0 ≤ ω
bi (t) ≤ 1 and i=1 bi (t) = 1. Assuming independend Brownian motions ZiQ for i = 1, .., N , we define a
ω
new Brownian motion
Z t PN Q
Q i=1 ωi Fi (u, T )σi (u)dZi (u)
Z (t) = q PN
0 2 2 2
i=1 ωi Fi (u, T )σi (u)
dB(t, T )
= σB,t dZ Q (t)
B(t, T )
238
1
where σB,t is a stochastic volatility. In the special case of an arithmetic mean, the weights become ωi = N, and we
can rewrite the Brownian motion as
Z t PN Q
Q i=1 Fi (u, T )σi (u)dZi (u)
Z (t) = q PN
0 2 2
i=1 Fi (u, T )σi (u)
1
α = qP
N 1
i=1 σi2 (t)
such that we have α ≤ σt ≤ β for all t ∈ [0, T ], almost surely. Hence, we can bound the price of a basket option as
Alternatively, we can obtain a lower bound to the basket option by noting that the geometric mean G(t) =
QN
(Si (t))ωi , with ωi = N1 , always lies below the arithmetic basket B(t). Taking the logarithm, we get LG(t) =
Pi=1
N
i=1 ωi log Si (t), so that the dynamics of LG(t) becomes
N N N
X 1 X 2 1 X
dLG(t, T ) = ωi d log Fi (t, T ) = − σi (t)dt + σi (t)dZiQ (t)
i=1
2N i=1 N i=1
239
1
since ωi = N. Again, we set the Brownian motion as
Z t PN Q
Q i=1 σi (u)dZi (u)
Z̃ (t) = q PN 2
0
i=1 σi (u)
Note, this volatility corresponds to the denominator of the lower bound α of an option on the arithmetic basket.
(r− 21 σi2 )t
where αi (t) = ωi Si (0)e and Yi (t) = σi Wi (t) ∼ N (0, σi2 t). That is, Xi (t) = ωi Si (t). Hence, the process
Xi (t) is lognormally distributed
1
Xi (t) ∼ LN ωi Si (0) + (r − σi2 )t, σi2 t
2
and the stop-loss premium E Q [(Xi −di )+ ] for some retention di can be computed. Given ln (Xi (t)) ∼ N (µi (t), σY2 i (t) )
√
with µi (t) = ln (αi (t)) and σYi (t) = σi t, we get
1 2
E Q [(Xi − di )+ ] = eµi (t)+ 2 Yi (t) N (di,1 (t)) − di N (di,2 (t))
where
i=1
for any uniformaly distributed random variable U distributed on the unit interval. The upper bound for any K > 0 is
given by
N
X √
BC(N, T, K) ≤ ωi Si (0)N (σi T − N −1 (FBc (K))) − e−rT K(1 − FBc (K))
i=1
240
Following Simon et al. [2000] we can rewrite this upper bound as a combination of Black-Scholes prices. As a result
of the lognormality of Si , we get
Fω−1
i Si
(p) = ωi FS−1
i
(p) for all p ∈ [0, 1]
Remark 7.5.1 The comonotonic upper bound is the smallest linear combination of European BS-prices dominating
the basket option price.
We can write it in terms of BS-price with strike Ki = FS−1
i
(FBc (K)) as follow
N
X
BC(N, T, K) ≤ P (0, T ) ωi E Q [(Si (T ) − FS−1
i
(FBc (K)))+ ]
i=1
N
X
= ωi Si (0)N (di,1 ) − P (0, T )Ki N (d2,1 )
i=1
where Ki satisfies
√
1 2
T N −1 (FBc (K))
Ki = FS−1
i
(FBc (K)) = Si (0)e(r− 2 σi )T +σi
and
ln SK
i (0)
+ (r + 12 σi2 )T √
di,1 = i
√ and di,2 = di,1 − σi T
σi T
Remark 7.5.2 This comonotonic upper bound does not dependent on the instantaneous correlations ρij , i = 1, ..., N .
Hence, we still have an upper bound to the price of the basket option even if the correlations are not known. Deelstra
et al. [2002] improved this upper bound by taking into account the instantaneous correlations.
d K, T ) ∀K, T
CI (t; K, T ) = BC(t;
However, as discussed in Section (7.5.1), most of the major index options are European type, and there is no trading
going on in the underlying index itself, creating a hedging problem for options written on large indices. On the other
241
hand, single stock options are American type and one can not enter the American volatility in the pricing equation of
the basket option BC(t;
d K, T ). Thus, the stock index options and the basket options seems hardly comparable. Still,
various authors analysed the volatility skew of the index options against that of its constituents.
For example, Langnau [2009] [2010] compared the index option prices of the Euro Stoxx 50 against theirs
constituents prices. The distribution of the Euro Stoxx 50 index together with that of all the constituent of the basket
are computed and an approximated option price with Gaussian copula is derived. It is observed that half the volatility
skew of the index is explained by the skew of the index constituents. It is concluded that the correlation depends on
the strike of the option impacting the distribution of the index on the downside. The market impled volatility smile
is compared with the one generated with a Gaussian copula at maturity T = 2 and T = 4 years. Defining the skew
as skew = ∆Σ ∆η , in the former, the skew around the money (η ∈ [0.9, 1]) is about skew ≈
0.28−0.26
0.1 , and in the latter
0.285−0.265
it is about skew ≈ 0.1 . When anaysing the market skew of index options, the choice of these maturities is
rather strange considering that the liquidity ranges from one week to one year, and that the skew is most prononced
for short maturities and decreases over time. Further, the one year ATM spread of volatility on the Euro Stoxx 50 is
about 0.004, the two year ATM spread is about 0.006 and that a four year ATM spread is above 0.01 (1 vol = 100 basis
points). In the case of single stock options, these maturities are not quoted and the ATM spread could easily reach a
few vol. In addition, the spread of volatility increases as we move away from the money, even if we account for the
vega. However, in that example, the slope of the smile at maturity T = 2 and T = 4 is roughly constant over the
moneyness. It would be more reasonable to conclude that the difference of volatility smile between the index option
and the one obtained by combining its constituents with a Gaussian copula is absorbed by market liquidity.
Similarly, Delanoe [2014] considered the evolution of the SP500 and its associated 1Y sliding implied correlation,
and concluded that the implied correlation tended to increase for decreasing index. On the graph in Figure (7.1), the
index is slowly increasing from below 1000 on the 6/07/09 to below 2000 around the 06/14, with a jump occuring
towards the end of 2011. However, the 1Y IC is moving randomly around a level of 60 with increasing behaviours
both when the index increases as well as when it decreases. The process is quite eratic, with jumps, and nothing can
be concluded about a possible relationship between the implied correlation and the index. He also analysed the SP500
index smile for a one year maturity with a stock model having local volatilities and constant correlation calibrated to
match ATM index option, and concluded that additional correlation needed to be added. For the correlation matrix to
remain positive semi-definite he used convexity property of the matrix obtaining
242
ρbij = (1 − $)ρH
ij + $ (7.5.19)
where ρbij is the bumped correlation matrix, ρH
ij is the historical correlation matrix and $ ≈ 15%.
1 2 2
∂T C = K ∂KK CσH,LV (K, T )
2
N
1 Q X
∂T C = E [ ∂xi xj hI{HT >K} + ∂xi h∂xj hI{HT =K} Si,T Sj,T σi σj ρij ]
2 i,j
2
where σH,LV (K, T ) is the local variance of the multi-asset option. Equating the two models, we must have
N N
∂K C Q X X
K 2 σH,LV
2
(K, T ) = − E [ Q
∂xi xj hσi σj ρij |HT > K] + E [ ∂xi h∂xj hSi,T Sj,T σi σj ρij |HT = K]
∂KK C i,j i,j
with
ρij (S1,t , ..., SN,t ) = 1 − $(t, h(S1,t , ..., SN,t )) ρ0ij + $(t, h(S1,t , ..., SN,t ))
In the case of the basket option, we get Ht = B(t), and the above equation simplifies to
N
X
K 2 σB,LV
2
(K, T ) = E Q [ ωi ωj Si,T Sj,T σi σj ρij |B(T ) = K]
i,j
Thus, the local volatility σB,LV (K, T ) of a basket option is a condtitional expectation (see Avellaneda et al. [2002b]).
243
Remark 7.5.3 This equation corresponds to expectation of the instantaneous variance of the basket VB (T ) at matu-
rity T , given in Equation (7.3.13), conditional on the basket being equal to the strike at maturity. That is,
2
σB,LV (K, T ) = E Q [VB (T )|B(T ) = K]
Given the definition of the correlation matrix, we obtain
PN
K 2 σB,LV
2
(K, T ) − E Q [ i,j ωi ωj Si,T Sj,T σi σj ρ0ij |B(T ) = K]
$(K, T ) = PN
E Q [ i,j wi wj Si,T Sj,T σi σj (1 − ρ0ij )|B(T ) = K]
For computation stability this equation can be rewritten as follow
PN
2 ωi ωj Si,T Sj,T σi σj ρ0ij
σB,LV (K, T ) − E Q [ i,j
PN |B(T ) = K]
i,j ωi ωj Si,T Sj,T
$(K, T ) = PN
ωi ωj Si,T Sj,T σi σj (1−ρ0ij )
EQ[ i,j
PN |B(T ) = K]
i,j ωi ωj Si,T Sj,T
7.5.3.3 Problems
Again, the theoretical foundation of projecting an n-dimentional Ito process into a Markov process only applies to
European options and not to American options because the joint distributions are not matched. However, European
options on the constituents of indices, such as the Euro Stoxx 50 index or the SP500, are not listed, and the liquid
options are only American options. Hence, one should question the relevance of a local correlation model.
In the problem of portfolio selection, Markowitz [1952] introduced the mean-variance approach as a simple trade-
off between return and uncertainty, where one is left with the choice of one free parameter, the amount of variance
acceptable to the individual investor. The mean-variance efficient portfolios are obtained as the solution to a quadratic
optimisation program, with theoretical justification requiring either a quadratic utility function or some fairly restrictive
assumptions on the class of return distribution, such as the assumption of normally distributed returns. For instance, we
assume zero transaction costs and a portfolio with price Vt taking values in R and following the geometric Brownian
motion with dynamics under the historical probability measure P given by
dVt
= µdt + σV dWt (7.6.20)
Vt
244
where µ is the drift, σV is the volatility and Wt is a standard Brownian motion. Markowitz postulated that while
diversification would reduce risk, it would not eliminate it, so that an investor should maximise the expected portfolio
return µ, while minimising portfolio variance of return σV2 . The argument follows from the relation between the
variance of the return of the portfolio σV2 and the variance of return of its constituent securities σj2 for j = 1, 2, ..., N
given by
X XX
σV2 = wj2 σj2 + wj wk ρjk σj σk (7.6.21)
j j k6=j
P
where the wj are the portfolio weights such that j wj = 1, and ρjk is the correlation of the returns of securities j
and k. Therefore, ρjk σj σk is the covariance of their returns. So, holding various assets not perfectly correlated in a
portfolio should offer a reduced risk exposure to a specific asset. Thus, the decision to hold any security would depend
on what other securities the investor wants to hold. That is, securities can not be properly evaluated in isolation, but
only as a group, resulting in the notion of efficient portfolios.
One consequence of the capital asset pricing model (CAPM) is that statistical approaches must be restricted to
second-oder stationary processes. Thus, the portfolio risk becomes a weighted sum of covariation of all stocks in
the portfolio. Using volatilities and correlations calculated on any desired historical time interval for price data and
different forecast times, we can then compute the risk of the portfolio, or the index, for different terms. Rather than
using volatilities and correlations statistically forecasted in Equation (7.6.21), it has been suggested to use implied
volatility from option prices because they are biased estimators of the future volatility. Since implied volatility on
options on an index as well as implied volatilities on options on single stocks are assumed easy to estimate, it has
been proposed to use them directly in Equation (7.6.21). In addition, practitoners defined Correlation as the statistic
ρimp ≈ 1 and Dispersion as the statistic ρimp ≈ −1, leading to a new type of correlation products called dispersion
trading (DT). Yet, since several indices are actively trading in the option market, a market on basket options developed.
In that market, the dynamics of the basket are assumed to follow a geometric Brownian motion and the BS-formula is
used to price and risk manage basket options. However, we saw in Section (7.5) that there was no explicit analytical
expression available for the distribution of the weighted sum of the assets, such that using the Black-Sholes formula
on a collection of underlying stocks will not produce a closed-form solution to the price of a basket option. That is,
due to the non-linearity of option prices with respect to volatility, no such relationship exists in the option pricing
theory (OPT). As a result, Markowitz equation has been wrongly used to relate the implied volatility of an option on
an index to the implied volatilities of its components. Moreover, practitoners wrongly used this pseudo relation to
calculate a value expressing the correlation level between the implied volatilities of the stocks and the index implied
volatility observed on the market. Nonetheless, from the no-dominance principle (see Proposition (10)), we must get
consistancy between the two markets, and if it is not the case arbitrage opportunities will arise. As a results, traders
need a computationally fast and robust model to relate the two markets together and to devise indicators measuring
the occurrence of arbitrage oppotunities.
245
N
X
V ar(I) = σI2 = wi wj Cov(Xi , Xj )
i,j=1
N
X N X
X
= wi2 V ar(Xi ) + 2 wi wj Cov(Xi , Xj )
i=1 j=1 i<j
N
X N X
X N
X N X
X
= wi2 σi2 + 2 wi wj σi σj ρij = wi2 σi2 + 2 wi wj σi σj ρij
i=1 j=1 i<j i=1 i=1 j>i
where σI is the actual index volatility (also noted σP for portfolio volatility), wi is the weight of the ith-stock in the
basket, and ρij is the instantaneous correlation between the ith-stock and the jth-stock in the basket.
Remark 7.6.1 The portfolio variance σI2 is not to be confused with the instantaneous basket variance VB (t) defined in
Equation (7.3.12). The former is derived from a probability formula (see Equation (A.3.2)) and has constant weights,
while the latter is derived from stochastic calculs within infinitesimal time and has stochastic weights.
In order to relate portfolio theory (PT) to option pricing theory (OPT), several authors played with the volatilities of the
ith component, σi , and that of the jth component, σj , in the above equation by allowing them to be either historically
estimated or implied from option prices.
Remark 7.6.2 From the moment we are plugging implied volatility to both the LHS and RHS of the portfolio variance,
we are trying to relate an option on a basket with a sum of options on its component. We enter the option pricing theory
(OPT), where the no-dominance principle apply, and this relationship is no-longer valid.
Nonetheless, over time this approach constituded a market standard. For instance, assuming σi and σj for the ith and
jth component of an index I are implied volatilities, and letting ρ = ρij for i 6= j, practitioners defined the so-called
Implied Correlation (IC) of this portfolio, or average level of correlation, as follow
PN
σI2 − i=1 wi2 σi2
ρimp = PN P (7.6.22)
2 i=1 j>i wi wj σi σj
which we can also write as
N X
X wi wj σi σj
ρimp = PN P ρij
i=1 j>i i=1 j>i σi σj
The Implied Index Correlation (IC) attempts at defining the correlation level between the actual implied volatility
of the index and the implied volatilities of its stock components. This is in itself absurd. The greater the implied
correlation ρimp , the stronger the correlation between the index implied volatility and that of its constituent stocks,
and therefore the more suitable the market conditions
PN for deploying a dispersion strategy. Under some reasonable
conditions, Bossu [2006] assumed that the term i=1 wi2 σi2 was close to zero, and showed that a good proxy for the
implied correlation was
σI2
ρ∗imp = PN 2 (7.6.23)
i=1 wi σi
PN 2 PN
wi2 σi2 +
P
since i=1 wi σi = i=1 i6=j wi wj σi σj . We can call
246
N
X
σAvg = wi σi
i=1
the average implied volatility of the components. Thus, we can approximate the index volatility as follow
q
σI ≈ ρ∗imp σAvg
for ρ∗imp > 0.15 and N > 20.
The Realised Correlation (RC) was proposed as an alternative to the implied correlation. Denoted ρb, it is defined as
PN
i6=j wi wj ρij
ρb = PN (7.6.24)
i6=j wi wj
A different definition, which is not market practice, is given by
PN
i6=j wi wj σ
bi σ
bj ρij
ρb = PN
i6=j wi wj σ
bi σ
bj
where (b
σI , σ
b1 , ..., σ
bN ) represent realised volatilities. It can be approximated as
bI2
σ bI2
σ
ρb = PN 2 =
wi σ
bi σAvg )2
(b
i=1
Alternatively, we can define the implied correlation with the help of Markov Functional modelling. In that setting,
the price of a call option is given by
+
C(t, St ; T, K) = E[ F (t, T )eφ(WT −t ,T −t) − K |Ft ]
where F (t, T ) is the forward price seen at time t. The function φ is estimated in terms of the implied volatility
Σ(K, T ) as follow
√ √
φ−1 (x, τ ) = τ N −1 N (−d2 ) + F (t, T )n(d1 ) τ ∂K Σ(F (t, T )ex , T − t)
where τ = T − t and x is the log forward moneyness. It depends on the market skew ∂K Σ(K, T − t). In a multi-asset
framework we define the Multidimensional Gaussian Markov Functional (MGMF) as the Markov Functionals together
with Gaussian copula to capture the dependence of the marginal distributions. We can therefore define the implied
correlation as follow
247
Definition 7.6.1 The implied correlation is the (Gaussian) corelation number we plug in the MGMF to recover the
market price.
Volatility dispersion trading is essentially a hedged strategy designed at taking advantage of relative value differences
in implied volatilities between an index and a basket of component stocks. The idea being to profit from price differ-
ences in volatility markets using index options and options on individual stocks due to temporary shifts in correlations
between assets, idiosyncratic news on individual stocks, etc. It is achieved by looking for a high degree of dispersion.
Typical strategies involve short option position on an index, against which long option positions are taken on a set of
components of the index. There are multiple instruments available to implement dispersion trading. Since DT is a non-
directional strategy exposed to the volatility of the underlying components, when using vanilla options it is necessary
to delta-hedge each option. Straddles and strangles (see Section (1.5.2)) are two basic volatility spreads suitable for
these requirements since they have volatility exposure with limited delta. Thus, it is market practice to devise straddles
and strangles strategies when building dispersion exposure. It is common to consider a short position of a straddle, or
near-ATM strangle, on the index and long positions of straddles or strangles on 30% to 40% of the stocks that make
up the index. This is because at-the-money straddle, or out-of-the-money strangle positions, have delta exposures very
close to zero. Hence, a dispersion strategy that buys index straddles/strangles and sells straddle/strangle positions on
individual components is hedged against large market movement and has low volatility risk. If maximum dispersion
is realised, the strategy will make money on the long options on the individual stocks and will lose very little on the
short option position on the index, since the latter would have moved very little. The strategy is typically a very low-
premium strategy, with very low initial Delta and typically a small net long vega. Rather than using vanilla options
to build dispersion trading, Ganatra [2004] considered using variance swaps. However, this approach is difficult to
implement in period of low liquidity for single stock variance swaps. Marshall [2008] studied the performance of
dispersion trading on the S&P 500 index and underlying stocks by using implied volatility obtained from synthetic
VIX-indexes calculated on instruments weighted to 30-days time period.
The former argues that the index options are more expensive relative to individual stock options because they bear
some risk premium, such as volatility risk and correlation risk, that are absent from individual stock options. Driessen
et al. [2005] argued that index options have a risk premium which is absent in stock options because the former
hedge correlation risk, which is pronounced in index option puts. The latter argues that options market demand and
supply drive option premiums to deviate from their theoretical values. Garleanu et al. [2009] developed a model
248
showing that option premium increase with market demand. They showed that end investors are net short single
stock options and net long index options, translating into a net demand for index options and a net supply of single
stock options. Taking advantage of market structural changes during late 1999 and 2000, which reduced the costs
of arbitraging any differential pricing of individual equity and index options via dispersion trading, Deng [2008]
found that the dispersion strategy was quite profitable through the year 2000, after which it disappeared. These results
provided support for the market inefficiency hypothesis against the risk-based explanation. Marshall [2008] developed
empirical evidence for dispersion trading in the US by showing that S&P 500 index-option-implied volatility (both
for calls and puts) tended to exceed Markowitz implied volatility (MIV), and assumed that the difference came from
the correlation risk premium. This is because correlation tends to rise in stressful times so that a seller of the option
requires extra premium to hedge this risk. Lozovaia et al. [2005] explained that the best timing to execute the direct
strategy (short index, long constituents) was when implied index volatility exceeded the realised (or historical) one
and when the implied index correlation was close to its maximum registered value, since the strategy works better if
implied volatility of the index is highly correlated with implied volatilities of its components. Lisauskas [2010] found
that the difference between index option implied volatility and the theoretical implied volatility was largely dependent
on the time period used to calculate the correlation between stocks.
We must then decide on the number of stocks selected. As it is not possible to place all the legs of this trade simultane-
ously, the trade is exposed to lag risk. Dispersion trading being a delta-neutral strategy, it is not affected by directional
price moves in the underlying asset. Since both straddle and strangle options are almost delta-neutral when they are
ATM, or perfectly centred around ATM for strangle, they are good candidates for DT. However, as the stocks and index
move, these options may no longer be close to ATM and delta exposure will arise, leading to possible readjustment of
the options position. It is also necessary to delta hedge the portfolio of options in order to realise the actual volatility
for which straddles and/or strangles are bought cheaper and sold more expensive against. In general, dynamic delta
hedging is necessary for dispersion trading since no other suitable instruments exist with cash flows equal to that gen-
erated from the delta exposure to be hedged. Since the dispersion trade is a bet that implied correlation will return to
normal levels, if it continues to increase ( due to all stocks moving in the same direction) the trade may lose money.
Further, with many stocks in the index, some options on its components may not exist, or be illiquid, such that we can
not uniquely determine an implied correlation matrix. One approach consists in using the historical correlation and
the implied volatility and compare the resulting volatility with the actual implied volatility of the index. This method
implies the use of some tools to assess the relative implied volatility differential. Moreover, as the stocks move up
and down, their relative weightings in the index change, impacting on the index implied volatility. If a stock moves
down dramatically, then its weight in the index drops, but its implied volatility increases. However, these effects tend
to partially cancel each other.
All these risks can be attenuated by carefully choosing the weights of the portfolio. We distinguish three main weight-
ing strategies
• Vega-hedging weighting: the dispersion is built such that the vega of the index equals the sum of the vega of the
constituents. This way the trader will be immune against short moves in volatility.
249
• Gamma-hedging weighting: the dispersion is built such that the gamma of the index is worth the sum of the
gammas of the constituents. This way the trader will be immune against large move in the stocks.
• Theta-hedging weighting: this strategy results in both a short vega and a short gamma position.
and applying Jensen’s inequality, we get the following relation on the intrinsic value
N
X
max (B(T, T ) − K, 0) ≤ ωi max (Fi (T, T ) − Ki , 0) (7.6.25)
i=1
which states that the premium from an option on the index is less or equal to that from its components. Taking the
expectation of the payoff under the risk-neutral measure and discounting with a zero-coupon bond, we can rewrite this
relation in terms of prices as
N
X
CB (t, B(t, T ), K, T ) ≤ ωi Ci (t, Fi (t, T ), Ki , T ) (7.6.26)
i=1
Hence, we see that an option on a basket and a weighted sum of options on its components are two different products,
the latter being an upper bound to the former. As a result, the Equation (7.6.26) defines a no-arbitrage condition
beteween an option on a basket and a weighted sum of options on its components.
We know that a simple way of understanding the properities of linear combination of European option prices in
the Black-Scholes world is to assume options are at-the-money (ATM) forward and to linearise them (see Equation
250
(1.2.12)). In that setting, given K = B(t, T ) and Ki = Fi (t, T ), the implied volatility of each European option
no-longer depend on the strike, and the price relation above simplifies to
N
√ X √
B(t, T )e−q(T −t) σB T − t ≤ ωi Fi (t, T )e−qi (T −t) σF,i T − t
i=1
where σB is the ATM implied volatility of an option on a basket and σF,i is the ATM volatility of its ith component.
From the definition of the stochastic weight ω
bi (t) in Equation (7.3.10), and assuming zero dividend yield, we can
rewrite the relation as
N
X
σB ≤ ω
bi (t)σF,i
i=1
Note, σF,i is the implied volatility of the ith component, that is, the constant which is plugged in the BS-formula to
recover the market option price. Even though there are instantaneous correlations in the portfolio’s theory, there is
no explicit notion of implied correlation in the option pricing theory. At most, the instantaneous correlations of the
Brownian motions in the dynamics of the basket have been implicitely embedded in the BS-volatility σB . Further,
since an option on a basket depends on the stochastic weights ω bi (t), i = 1, .., N , the relation between an option on
a basket and a weighted sum of options on its components expressed in terms of implied volatilities also depends on
these stochastic weights.
Assuming that the stochastic weights ω bi (t) for i = 1, .., N are stable, we can approximate them with their time
zero values, getting ω i , i = 1, .., N . We can then approximate the no-arbitrage relation for ATM implied volatility of
an option on a basket as
N
X
σB ≤ ω i σF,i = σW B
i=1
PN
where σW B = i=1 ω i σF,i is the approximated average implied volatility of the basket components. Since volatilities
and weights are positive, we can square both sides, getting
N N N
2
X 2 X X
σB ≤ ω i σF,i = (ω i )2 σF,i
2
+ ω i ω j σF,i σF,j
i=1 i=1 i6=j
We can therefore define an ATM indicator of arbitrage between an option on a basket and a weighted sum of options
on its components as follow
2
σB
Mρ = 2 ≤1 (7.6.27)
σW B
If Mρ is close to 1, then the implied volatility of a basket option, σB , is very expensive, and it is likely to move down.
Further, if this indicator is greater than one, there is an arbitrage and we should sell the option on the basket. However,
this indicator does not represent the market’s expectation of the future realised correlation. Only a correlation swap
can be seen as such. Note, the indicator Mρ is similar to Bossu’s indicator ρ∗imp in Equation (7.6.23), but with different
weights.
Remark 7.6.3 One can only relate Markowitz’s portfolio theory to option pricing theory when the options have been
linearised and the stochastic weights have been frozen.
251
where σ
b is the implied volatility of a weighted sum of lognormal prices generating a symmetric smile approximated
by
n
1 X
b(K, T ) ≈ σ +
σ ai CBS (t, x, K, T ; σi ) − CBS (t0 , t; σ) − ...
vega(σ) i=1
Pn
and σ = i=1 ai σi .
Remark 7.6.4 For a fixed maturity, the weighted sum of BS-formula with constant volatility σi generates a symmetric
smile centered around σ.
In the special case where the strike is ATM-forward, the approximate implied volatility simplifies to (see Equation
(B.3.17))
n
1 X
Σ(K, T ) K=F (T )
≈σ+ ai αi (K, T )
vega(σ) i=1
Going further and assuming that the IVS of each constituent of the basket option is flat, we set αi (K, T ) = 0, ∀i, and
we get
Σ(K, T ) K=F (T )
≈σ
which gives an idea of the over-simplification made by pratitioners when using Markowitz’s portfolio theory with
ATM volatilities.
Remark 7.6.5 Linearising the BS-formula to express the implied volatility of a basket option in terms of that of
its constituents, via Markowitz’s equation, only applies around the ATM-forward strike. We see that the notion of
Correlation Skew is meaningless.
252
N
X n
X
C
bB (t0 , K, t) = ωi aij (t)CallBS (t0 , Fi (0), Ri (t), Pi (t), K(K, t), t, Σij (t))
i=1 j=1
where aij (t) for j = 1, .., n are the weights for the ith underlying, Σij (t) is the volatility for the ith underlying. Given
0 0
the strike Ki , we get K(K, t) = K (K, t)(1 + µij (t)) with K (K, t) = Ki + Dt . We calibrate independently the
MixVol model’s parameters aij (t), µij , Σij (t) to the liquid market prices of each underlying Fi (0) for i = 1, .., N .
From linearity of the model, we can interchange the summation operators and rewrite the super-replication portfolio
as
n X
X N
C
bB (t0 , K, t) = ωij (t)CallBS (t0 , Fi (0), Ri (t), Pi (t), K(K, t), t, Σij (t))
j=1 i=1
where ωij (t) = ωi aij (t). Since that model is a weighted sum of modified BS-formula, we can easily compute the
Greeks of the super-replicating portfolio. Differentiating with respect to the initial stock price, the delta becomes
n X
N
X ∂
∆CbB (t0 , K, t) = ωij (t) CallBS (t0 , Fi (0), Ri (t), Pi (t), K(K, t), t, Σij (t))
j=1 i=1
∂Fi (0)
Differentiating one more time with respect to the initial stock price, the gamma is given by
n X
N
X ∂2
ΓCbB (t0 , K, t) = ωij (t) Call BS (t 0 , Fi (0), Ri (t), Pi (t), K(K, t), t, Σ ij (t))
j=1 i=1
∂Fi2 (0)
We can use the approximation to the implied volatility generated by the MiVol model given in Equation (6.2.9) to get
a feel of what the implied volatility of the replicating portfolio looks like. Assuming first order expansion, we get
N
X 0
ΣBb (K, t) ≈ ωi Σi (K (K, t), t)
i=1
such that
253
where
n
1 X
σ
bi = aij (t)Σij (t)
normi j=1
n
0 1 X 0
αi (t0 , St0 , K (K, t), t) = aij (t)αij (K (K, t), t; Σij (t))
normi j=1
However, as explained in Section (7.6.3) we can not compute the theoretical volatility σP with implied volatilities
since that relation does not hold.
One method for calculating an index volatility is to consider it as a weighted sum of volatilities of its components
2
PN 2
σW C = i=1 wi σi , which is called weighted components implied volatility, or the weighted volatility of index.
That is, it expresses overall implied volatility of the index components, but ignores correlation between component
stocks. The ratio of the components implied volatility to the actual implied index volatility, σσWIC , is called a first
volatility level coefficient. Given
N
X N X
X
σI2 = wi2 σi2 + 2ρimp wi wj σi σj
i=1 i=1 j>i
2
PN 2 PN
wi2 σi2 +
P
and since σW C = i=1 wi σi = i=1 i6=j wi wj σi σj , we get
N X
X
σI2 = σW
2
C + 2(ρimp − 1) wi wj σi σj
i=1 j>i
254
so that for ρimp = 1 the actual implied volatility of index is exactly the weighted sum of the individual stock con-
stituent implied volatilities. When ρimp < 1 then σW C > σI and the first volatility level coefficient is greater than
1, and when ρimp > 1 then σW C < σI and the first volatility level coefficient is less than 1. Hence, the main factor
affecting the first volatility level coefficient is the changing correlation between stock and index volatility. Note, this
interpretation is eroneous and leads to misleading conclusions.
When the index is defined as a portfolio of component stocks with the corresponding weights, the risk of the index is
given by the risk of portfolio σP (volatility of the portfolio). Replacing the volatility of each component σi with the
implied volatility index of a component stock σI,i , we obtain the theoretical correlated implied variance of an index
calculated on a correlation adjusted basis
N
X N X
X
σP2 I = wi2 σI,i
2
+2 wi wj σI,i σI,j ρI,ij
i=1 i=1 j>i
where ρI,ij is the correlation between implied volatility indexes of two stocks. The ratio of theoretical correlated
implied volatility of the index to the actual implied volatility σσPII is calculated to estimate the difference between
theoretical and real prices of index options. It is called a second volatility level coefficient. One can show that the
theoretical correlated implied volatility of an index can not be greater than the weighted implied volatility, which
ignores correlations. As a result, the second volatility level coefficient is always less or equal to first coefficient level.
Further, if the second volatility level is less than one, and relatively low, then theoretically, index options are too
overpriced and it is profitable to sell index options. Conversly, if the second volatility level coefficient is greater than
one, and relatively high, then theoretically, index options are conservatively priced and it is profitable to buy them.
The theoretical historical volatility of index can be calculated from the portfolio volatility on the basis of historical
volatilities of each component and correlations between stock prices. The ratio of theoretical historical volatility of
index to actual volatility is called the third volatility level coefficient and it indicates how much theoretical historical
volatility differs from actual volatility. If the third coefficient is less than 1 and low, then theoretical performance of
the index is less then actual volatility and we can generate profit by selling index options.
where Si,T is the ith stock price at maturity T , Ki,t is the ith strike price, and νi,t is the number of individual straddles
traded at t. The payoff from the short side of the straddle is
255
the strategy is protected by construction against large stock market movements. The return of the strategy over the
risk-free rate is given by
(
VT −Vt 1
Vt − P (t,T ) if Vt ≥ 0
R(t, T ) VT −Vt 1
− Vt + P (t,T ) if Vt < 0
where VT = HL (t, T ) − HS (t, T ) is the payoff from the portfolio at expiration, and
n
X
Vt = νi,t Ci (t, T, Ki ) + Pi (t, T, Ki ) − CI (t, T, KI ) + PI (t, T, KI )
i=1
is the initial price paid for the portfolio. Since the index options are European-style and the individual options are
American-style, assuming the option portfolio is hold till expiration may underestimate the resulting returns since we
are selling index options and buying individual options. The rate of return net of transaction costs is given by
δt
Rnet (t, T ) = R(t, T ) −
Vt
where δt is the transaction costs at time t (half the bid-ask spread).
256
where < dWi , dWj >= ρij dt. Assuming perfect correlation, ρij = 1 for all i, j then all the stocks have the same
Brownian motion. In that setting we know how to compute analytically a bond option in a 1-factor model (see El
Karoui et al. [1989]). Applying the same method, we can solve for X in
N
X 1 2
K= ωi Fi (0, T )e− 2 σi t+σi X
i=1
√
where X = Y t with Y ∼ N (0, 1). Once we know X we recover the strike Ki as
1 2
Ki = Fi (0, T )e− 2 σi t+σi X
Under the assumption of perfect correlation, Equation (7.6.25) simplifies, and we get equality of the intrinsic values
N
X
max (B(T, T ) − K, 0) = ωi max (Fi (T, T ) − Ki , 0) , ∀T
i=1
257
so that the profit and loss for the index option becomes
N
X σi 2
πB (t, t + dt) ≈ ΘB ω
bi (t) ni − 1
i=1
σB
Further, we can decompose the standardised move of the basket as
N
X σ i 2
n2B = ω
bi (t) ni
i=1
σB
N
X σi2 2 X σi σj
= bi2 (t)
ω 2 ni + ω
bi (t)b
ωj (t) 2 ni nj
i=1
σB σB
i6=j
Assuming that we can use Markowitz Equation (7.6.21) to relate an option on the index to options on single stocks,
we can replace the implied volatilities in the PnL equation with historical volatilies. The instantaneous variance of the
basket in Equation (7.3.12) is
N
X
2
σB = ω
bi (t)b
ωj (t)σi σj ρij
i,j=1
N N
X σi2 2 X σi σj X σ2 X σi σj
πB (t, t + dt) ≈ ΘB bi2 (t)
ω 2 ni + Θ B ω ωj (t) 2 ni nj − ΘB
bi (t)b bi2 (t) 2i − ΘB
ω ω
bi (t)b
ωj (t) 2 ρij
i=1
σB σB i=1
σ B σB
i6=j i6=j
258
N
X σi2 2 X σi σj
πB (t, t + dt) ≈ ΘB bi2 (t)
ω 2 (ni − 1) + ΘB ω ωj (t) 2 (ni nj − ρij )
bi (t)b (7.7.30)
i=1
σB σB
i6=j
Recall, in the dispersion trade we short the index option and buy the options on the stocks. Combining the PnL for all
the options, where the PnL for being long the ith option is πFi ≈ −Θi (n2i − 1), with that of the index option, we get
the Gamma PnL for the dispersion trade as
N
X
πD (t, t + dt) ≈ πFi (t, t + dt) + πB (t, t + dt)
i=1
N
X σi2 X σi σj
bi2 (t) ΘB (n2i − 1) + ΘB
≈ −Θi + ω 2 ω ωj (t) 2 (ni nj − ρij )
bi (t)b
i=1
σB σB
i6=j
• the correlation swap can be dynamically quasi-replicated with a variance dispersion trade.
Pn
• the PnL of a variance dispersion trading is worth i=1 wi σi2 (1 − ρb), where ρb is the realised correlation.
The problem with dispersion trading is that the correlations between the stocks vary with the market volatility. One
way of looking at it is to study the derivative of the variance of the index with respect to the correlation, that is,
∂V ar(I) XX
=2 w i w j σi σj
∂ρij j i<j
which depends on the volatility of the i-th stock and the j-th stock. It is not a pure correlation product. On the contrary,
a correlation swap with the payoff at maturity being (ρ − K) plays directly with the realised correlation. It is a pure
correlation product since its derivative with respect to the correlation is 1, that is ∂RS(t,T
∂ρ
)
= 1.
In the context of dispersion, we assume that the correlations ρij are equal to an average one, denoted ρ. Since this
correlation makes the implied variance of the index and the implied variance of the weighted sum of the components
equal, it represents the implied correlation (IC). We observe empirically a spread of a few basis points between the
strike of a correlation swap and the dispersion implied correlation. That spread was studied more formally studied by
Jacquier et al. [2007]. Following their arguments, we can express the relation between a dispersion trade in terms of
variance swaps and a correlation swap as follow
259
N
X X
2 2
bi2 (t)(b
σi2 − σi2 ) +
σ
bB − σB = ω ω
bi (t)b
ωj (t) σ bi ρb − σi σi ρ
bi σ
i=1 i6=j
N
X X
2 2
bi2 (t)(b
σi2 − σi2 ) +
σ
bB − σB = ω ω
bi (t)b ρ − ρ) + (b
ωj (t) σi σi (b bi − σi σi )b
σi σ ρ
i=1 i6=j
N
X X X
= bi2 (t)(b
ω σi2 − σi2 ) + ω
bi (t)b ρ − ρ) +
ωj (t)σi σi (b ρb(b bi − σi σi )
σi σ
i=1 i6=j i6=j
which leads to
N
X X X
2 2
(b
σB − σB )− bi2 (t)(b
ω σi2 − σi2 ) − ω
bi (t)b ρ − ρ) =
ωj (t)σi σi (b ρb(b bi − σi σi )
σi σ
i=1 i6=j i6=j
It is the PnL for being short a dispersion trade through variance swap and long a correlation swap.
N
1 X ∆Fi (t, T ) 2
− B 2 (t, T )ΓB bi2 (t) ( ) − σi2 ∆t
πB (t, t + dt) ≈ ω
2 i=1
Fi (t, T )
1 2 X ∆Fi (t, T ) ∆Fj (t, T )
− B (t, T )ΓB ω
bi (t)bωj (t)σi σj − ρij ∆t
2 Fi (t, T )σi Fj (t, T )σj
i6=j
We assume that the correlation ρij between two components are all equal to an average correlation ρ, which is in
i (t,T ) ∆Fj (t,T )
fact the implied correlation in the dispersion trade. Hence, ∆F
Fi (t,T ) Fj (t,T ) is the instantaneous realised covariance
∆F (t,T )
between two underlyings Fi (t, T ) and Fj (t, T ). Therefore, F∆F i (t,T ) j
i (t,T )σi Fj (t,T )σj
is the instantaneous realised correlation
between the two forward prices. Thus, assuming the same implied and realised correlation for all the pairs of forward
prices, ρb, and expending the stochastic weights ωbi (t), then the PnL for the index option becomes
N
1 X ∆Fi (t, T ) 2
ωi2 Fi2 (t, T ) ( ) − σi2 ∆t
πB (t, t + dt) ≈ − ΓB
2 i=1
Fi (t, T )
1 X
− ΓB ωi ωj Fi2 (t, T )Fj2 (t, T )σi σj (b
ρ − ρ)∆t
2
i6=j
260
That PnL is made of a spread between the implied and the realised correlation over the period of time [t, t+dt] together
with a volatility exposure.
We can also consider a position in a dispersion trade with variance swaps (αi represents the proportion of variance
swaps for the ith stock), the Gamma PnL becomes
N
X
Γ
πD (t, t + dt) = αi πFi (t, t + dt) − πB (t, t + dt)
i=1
N
1X 2 ∆Fi (t, T ) 2
) − σi2 ∆t αi Γi − ωi2 ΓB
≈ Fi (t, T ) (
2 i=1 Fi (t, T )
1 X
− ΓB ρ − ρ)∆t
ωi ωj Fi (t, T )Fj (t, T )σi σj (b
2
i6=j
The dispersion trade PnL is equal to the sum of a spread between the implied and the realised correlation over the
period of time [t, t + dt] (pure correlation exposure) together with a volatility exposure. Setting Γi = T F 22(t,T ) and
i
2
ΓB = T B 2 (t,T ) , the Gamma PnL becomes
N
1 X ∆Fi (t, T ) 2 F 2 (t, T )
Γ
) − σi2 ∆t αi − ωi2 i2
πD (t, t + dt) ≈ (
T i=1 Fi (t, T ) B (t, T )
1 X
− ρ − ρ)∆t
ωi ωj Fi (t, T )Fj (t, T )σi σj (b
T B 2 (t, T )
i6=j
N
Γ 1 X ∆Fi (t, T ) 2
) − σi2 ∆t αi − ω
bi2 (t)
πD (t, t + dt) ≈ (
T i=1 Fi (t, T )
1X
− ω
bi (t)b ρ − ρ)∆t
ωj (t)σi σj (b
T
i6=j
1
P
Setting βV = T i6=j ω
bi (t)b
ωj (t)σi σj , which only depends on the components of the index, we rewrite the PnL as
N
Γ 1 X ∆Fi (t, T ) 2
) − σi2 ∆t αi − ω
bi2 (t) − βV (b
πD (t, t + dt) ≈ ( ρ − ρ)∆t
T i=1 Fi (t, T )
Choosing αi = ω bi2 (t), the Gamma PnL of the dispersion trade corresponds to the spread between the implied and the
realised correlation, multiplied by the factor βV
N
X
Γ
πD (t, t + dt) = αi πFi (t, t + dt) − πB (t, t + dt) ≈ −βV (b
ρ − ρ)∆t
i=1
Thus, the dispersion trade is close to a pure correlation trade as it is multiplied by the quantity βV , which is a weighted
average variance of the components of the index.
Considering the delta-hedged PnL with stochastic volatility in Equation (4.2.14), we can add the volatility terms
when dσ 6= 0, obtaining
261
N
V ol
X 1
πD (t, t + dt) = αi Vega(i) dσi,t + Volga(i) < dσi , dσi >t + Vanna(i) σi,t Si,t ξi ρSi ,σi dt
i=1
2
1
− Vega(B) dσB,t + Volga(B) < dσB , dσB >t + Vanna(B) σB,t SB,t ξB ρB,σB dt
2
Thus, the total PnL of the dispersion trade becomes
Γ V ol
πD (t, t + dt) = πD (t, t + dt) + πD (t, t + dt)
Hence, the difference between the implied correlation of a dispersion trade and the strike of the correlation swap with
the same characteristics is due to the volatility terms (vega, volga, and vanna).
∆Fi (t, T ) √
RFi = = ni σi ∆t
Fi (t, T )
PN
Further, since ∆B(t, T ) = i=1 ωi ∆Fi (t, T ), we can deduce that the return of the basket is
N N
∆B(t, T ) X √ X
RB = = bi (t)ni σi ∆t =
ω ω
bi (t)RFi
B(t, T ) i=1 i=1
N
2
X √ 2
RB = ω
bi (t)ni σi ∆t
i=1
N
X X
= bi2 (t)n2i σi2 ∆t +
ω ω
bi (t)b
ωj (t)σi σj ni nj ∆t
i=1 i6=j
262
N
X X
RF2 i + RB
2
− 2RFi RB = n2i σi2 ∆t + bi2 (t)n2i σi2 ∆t +
ω ω
bi (t)b
ωj (t)σi σj ni nj ∆t
i=1 i6=j
X
− ωi (t)n2i σi2 ∆t − 2ni σi
2b ω
bj (t)nj σj ∆t
i6=j
From the definition of the standardised move of the basket n2B , we have
N
X X
2 2
σB nB = bi2 (t)σi2 n2i +
ω ω
bi (t)b
ωj (t)σi σj ni nj
i=1 i6=j
we can rewrite it as
N
X
πD ≈ Θi n2i + ΘB n2B − Θs
i=1
PN PN σ2
with Θs = i=1 Θi + ΘB . Adding and substracting the quantity ΘB i=1 bi (t) σ2i n2i , we get
ω
B
N N N
X X σ2 X σ2
πD ≈ Θi n2i + ΘB bi (t) 2i n2i
ω − ΘB bi (t) 2i n2i
ω + ΘB n2B − Θs
i=1 i=1
σB i=1
σB
and putting terms together, it simplifies to
N N
X σi2 2 ΘB X
bi (t)σi2 n2i − σB
2 2
πD ≈ Θi + Θ B ω
bi (t) 2 n i − 2 ω nB − Θs
i=1
σB σB i=1
Replacing with the dispersion statistic, the PnL of the dispersion becomes
263
N
X σi2 2 ΘB 2
πD ≈ Θi + Θ B ω
bi (t) 2 n i − σ 2 D − Θs
i=1
σB B
where the first term is the Idiosyncratic gamma, the second term is the Dispersion gamma, and the last term is the
time-decay.
σi → σi + ∆σ , ∀i
2 2
σB,σ −σB
and compute σB2 . Replacing the single option variance in the basket instantaneous variance, we get
N
X
2
σB,σ → ω
bi (t)b
ωj (t)(σi + ∆σ)(σj + ∆σ)ρij
i,j=1
which gives
N
X N
X N
X
2
σB,σ → ω
bi (t)b
ωj (t)σi σj ρij + ω
bi (t)b
ωj (t)(σi + σj )(∆σ)ρij + ω ωj (t)(∆σ)2 ρij
bi (t)b
i,j=1 i,j=1 i,j=1
2
The correlation ρij between two assets appears in the instantaneous variance of the basket σB . The variance can be
decomposed as
N
X X
2
σB = bi2 (t)σi2 +
ω ω
bi (t)b
ωj (t)σi σj ρij
i=1 i6=j
We want to bump the correlations by a constant according to
ρij → ρij + ∆ρ , i 6= j
2 2
σB,ρ −σB
and compute σB2 . Replacing the correlation in the instantaneous variance, we get
N
X X
2
σB,ρ → bi2 (t)σi2 +
ω ω
bi (t)b
ωj (t)σi σj (ρij + ∆ρ)
i=1 i6=j
which gives
N
X X
2
σB,ρ → ω
bi (t)b
ωj (t)σi σj ρij + ω
bi (t)b
ωj (t)σi σj ∆ρ
i,j=1 i6=j
Writting
v
N
X
uN
uX
σ1 = ω
bi (t)σi and σ0 = t bi2 (t)σi2
ω
i=1 i=1
we get
X
[σ12 − σ02 ]∆ρ = ω
bi (t)b 2
ωj (t)σi σj ∆ρ = ∆ρ σB
i6=j
264
265
Appendices
266
Some probabilities
For details see text books by Grimmett et al. [1992], Oksendal [1998] and Jacod et al. [2004].
Definition A.1.2 An event is a property which can be observed either to hold or not to hold after the experiment is
done. In mathematical terms, an event is a subset of Ω.
We think of the collection of events as a subcollection F of the set of all subsets of Ω such that
1. if A, B ∈ F then A ∪ B ∈ F and A ∩ B ∈ F
2. if A ∈ F then Ac ∈ F
3. the empty set belongs to F
Any collection F of subsets of Ω which satisfies these three conditions is called a field. It follows from the properties
of a field F that
n
[
if A1 , A2 , .., An ∈ F then Ai ∈ F
i=1
so that F is closed under finite unions and hence under finite intersections also.
Definition A.1.3 A collection F of subsets of Ω is called a σ-field if it satisfies the following conditions
We consider a set E in Rd and let the measure µ on E associate to some measurable subsets A ⊂ E be a positive
number µ(A) ∈ [0, ∞] called the measure of A. The domain of definition of a measure on E is a collection of subsets
of E called a σ -algebra which contains the empty set,
P is stable under unions and contains the complementary of every
element. We define the counting measure µX = i δxi on a countable set of points X = {xi , i = 0, 1, ...} ⊂ E
267
where δx (A) = 1 if x ∈ A and δx (A) = 0 if x 6= A is a Dirac measure such that for any A ⊂ E, µX (A) counts the
number of points xi in A
X
µ(A) = #{i , xi ∈ A} = Ixi ∈A
i≥1
It is an integer valued measure. A finite measure with mass 1 is called a probability measure.
Definition A.1.4 Let E ⊂ Rd . A Radon measure on (E, B) is a measure µ such that for every compact measurable
set B ∈ B, µ(B) < ∞.
A measure µ0 which gives zero mass to any point is said to be diffusive or atomless, that is ∀x ∈ E, µ0 ({x}) = 0.
Any Radon measure can be decomposed into a diffusive part and a sum of Dirac measures
Proposition 13 Any Radon measure µ can be decomposed into a diffusive part µ0 and a linear combination of Dirac
measures X
µ = µ0 + bj δxj xj ∈ E, bj > 0
j≥1
Definition A.1.5 We consider two measurable spaces (E, E) and (F, F), then a function f : E → F is called
measurable if for any measurable set A ∈ F, the set
f −1 (A) = {x ∈ E, f (x) ∈ A}
is a measurable subset of E.
If the measure µ can be decomposed as in Proposition (13) then the integral of µ with respect to f denoted by µ(f ) is
Z X
µ(f ) = f (x)µ0 (dx) + bj f (xj )
j≥1
We let Ω be the set of scenarios equipped with a σ-algebra F and consider a probability measure on (Ω, F) which
is a positive finite measure P with total mass 1. Therefore, (Ω, F, P) is called a probability space and any measurable
set A ∈ F called an event is a set of scenarios to which a probability can be assigned. The probability measure assigns
value in [0, 1] to each event such that
P: F → [0, 1]
A → P(A)
An event A with probability P(A) = 1 is said to occur almost surely and if P(A) = 0 the event is impossible. We
will say that a property holds P-almost surely if the set of ω ∈ Ω for which the property does not hold is a null set
(subset of an impossible event). Two probability measures P and Q on (Ω, F) are equivalent if they define the same
impossible events
268
Definition A.2.2 The distribution function of a random variable X is the function F : R → [0, 1] given by
F (x) = P (X ≤ x)
Definition A.2.4 The probability mass function of a discrete random variable X is the function f : R → [0, 1] given
by f (x) = P (X = x).
The distribution and mass functions are related by
X
F (x) = f (xi )
i:xi ≤x
Let x1 , x2 , .., xN be the numerical outcomes of N repetitions of some experiment. The average of these outcomes is
1 X
m= xi
N i
In advance of performing these experiments we can represent their outcomes by a sequence X1 , X2 , .., XN of random
variables, and assume that these variables are discrete with a common mass function f . Then, roughly speaking, for
each possible value x, about N f (x) of the Xi will take that value x. So, the average m is about
1 X X
m≈ xN f (x) = xf (x)
N x x
where the summation is over all possible values of the Xi . This average is the expectation or mean value of the
underlying distribution with mass function f .
Definition A.2.5 The mean value or expectation of X with mass function f is defined to be
X
E[X] = xf (x)
x:f (x)>0
269
mk = E[X k ]
The kth central moment σk is
σk = E[(X − m1 )k ]
The two moments of most use are m1 = E[X] and σ2 = E[(X − E(X))2 ] called the mean and variance of X.
Definition A.2.8 The expectation of a continuous random variable X with density function f is
Z ∞
E[X] = xf (x)dx
−∞
Definition A.2.9 Random variables X and Y are independent if their joint distribution function factors into the
product of their marginal distribution functions
Theorem A.2.1 Suppose X and Y are jointly continuous random variables. X and Y are independent if and only if
given any two densities for X and Y their product is the joint density for the pair (X, Y ), that is,
Independence requires that the set of points where the joint density is positive must be the Cartesian product of the set
of points where the marginal densities are positive, that is, the set of points where fX,Y (x, y) > 0 must be (possibly
infinite) rectangles.
270
Definition A.3.1 The characteristic function of a Rd random variable X is the function ΦX : Rd → C defined by
Z
ΦX (z) = E[eiz.X ] = eiz.x dµX (x) for z ∈ Rd
Rd
mn = E[X n ]
The first moment of X called the mean or expectation measures the central location of the distribution. Denoting the
mean of X by µX , the nth central moment of X, if it exists, is defined as
mcn = E[(X − µX )n ]
2
The second central moment σX called the variance of X measures the variability of X. The third central moment
measures the symmetry of X with respect to its mean, and the fourth central moment measures the tail behaviour of
X. In statistics, skewness and kurtosis, respectively the normalised third and fourth central moments of X are used to
summarise the extent of asymmetry and tail thickness. The are defined as
mc3 (X − µX )3 mc4 (X − µX )4
S= = E[ 3 ] , K = = E[ ]
(mc2 ) 2
3
σX (mc2 )2 4
σX
Since K = 3 for a normal distribution, the quantity K − 3 is called the excess kurtosis. The moments of a random
variable are related to the derivatives at 0 of its characteristic function.
1 ∂ k ΦX
mk = E[X k ] = (0)
ik ∂z k
Proposition 15 X possesses finite moments of all orders iff z → ΦX (z) is C ∞ at z = 0. Then the moments of X are
related to the derivatives of ΦX by
1 ∂ n ΦX
mn = E[X n ] = n (0)
i ∂z n
If Xi with i = 1, .., n are independent random variables, the characteristic function of Sn = X1 + ... + Xn is the
product of characteristic functions of individual variables Xi
n
Y
ΦSn (z) = ΦXi (z) (A.3.1)
i=1
We see that ΦX (0) = 1 and that the characteristic function ΦX is continuous at z = 0 and ΦX (z) 6= 0 in the
neighborhood of z = 0. It leads to the definition of the cumulant generating function or log characteristic function of
X.
271
Definition A.3.2 There exists a unique continuous function ΨX called the cumulant generating function defined
around zero such that
ΨX (0) = 0 and ΦX (z) = eΨX (z)
The cumulants kn of a probability distribution are a set of quantities providing an alternative to the moments of the
distribution. It is defined via the cumulant-generating function ΨX (z), which is the natural logarithm of the moment
generating function
ΨX (z) = ln ΦX (z)
The cumulants kn are obtained from the power series expansion of the cumulant generating function
∞
X zn
ΨX (z) = kn
n=1
n!
so that the nth cumulant can be obtained by differentiating the above equation n-times and evaluating the result at zero
(n)
kn = ΨX (z) z=0
Definition A.3.3
Cov(X, Y )= E[XY ] − E[X]E[Y ] = E (X − E[X])(Y − E[Y ])
X
E[XY ] = xyfXY (x, y)
x,y
The covariance measures whether, or not, (X − E[X]) and (Y − E[Y ]) have the same sign. Further, the correlation
is scale invariant
ρ(aX + b, cY + d) = ρ(X, Y )
For random variables X, Y, Z and constants a, b, c, d ∈ R then
• Cov(aX + b, cY + d) = acCov(X, Y )
• Cov(X + Y, Z) = Cov(X, Z) + Cov(Y, Z)
• Cov(X, Y ) = Cov(Y, X)
272
Theorem A.3.1 For X and Y random variables, whenever the correlation ρ(X, Y ) exists, it must satisfy
−1 ≤ ρ(X, Y ) ≤ 1
The correlation ρ(X, Y ) is a measure of the strength and direction of the linear relationship between X and Y . Also,
If X and Y are independent, then ρ(X, Y ) = 0 and we must have Cov(X, Y ) = 0 which leads to
E[XY ] = E[X]E[Y ]
Remark A.3.1 Independence of X and Y implies Cov(X, Y ) = 0 but not vice versa. That is, some times ρ(X, Y ) =
0 and X and Y are dependent.
Moreover, if X and Y are independent, we get
Xn n
X n
X
V ar( Xi ) = V ar(Xi ) + Cov(Xi , Xj ) (A.3.2)
i=1 i=1 i6=j
273
Definition A.4.1 We let {Xt ; t = 1, 2, ...} be a sequence of random variables on a given probability space (Ω, F, P)
with E[Xt = 0] and {Ft ; t = 1, 2, ...} a current of σ-algebras on the measurable space (Ω, F), where Ω is the
complete universe of all possible events. Then {Xt } is a sequence of independent random variables with respect to
{Ft } if Xt is measurable with respect to Ft and is independent of Ft−1 for all t = 1, 2, ...
274
We saw that if X1 , X2 , ... is a stationary process, then the passage of time correspond to a shift process X1+k , X2+k , ...,
for some time shift k. Thus, the notion of memory is related to the connections between the process and its shifts.
Since the process is stationary, the shifts do not change the distribution of the process.
Definition A.4.2 A shift transformation T is ergodic if for every measurable function f ∈ L1 (Ω, F, P ) we have
n−1
1X
lim f (T j X) = E[f (X)] a.s.
n→∞ n
j=0
Proposition 16 A stationary stochastic process is not ergodic if and only if it can be represented as a non-trivial
mixture of two different stationary stochastic processes
Y with probability p
X=
Z with probability 1 − p
where 0 < p < 1 and Y and Z are stationary stochastic processes with different finite-dimensional distribution.
In general, we say that a non-ergodic stationary process X has infinite memory. However, the mixing property is not
sufficiently strong to say that a stationary process with this property has short memory. A stronger requirement is
needed, and it is given by the strong mixing conditions:
Definition A.4.3 Let X = (..., X−1 , X0 , X1 , ...) be a stationary process. Define for n ≥ 1
E[|Xt |] < ∞
and
275
N =H M +L
HdM , H ∈ L2loc (M ), where L is a continuous local martingale strongly orthogonal to M .
R
where H M =
Note, in some cases we can drop the continuity assumption on N (see Ansel et al. [1993]). We now provide the
definitions of a Markov process and that of an independent process.
Definition A.4.6 A random process (Xt )t>0 is called a a Markov process if, for each n and every i0 , .., in , then
Xt − Xt−1 = t
where t ≈ IID, standing for independent and identically distributed process.
Definition A.4.8 A geometric Brownian motion is a random walk of natural logarithm of the original process Xt ,
where Lt = ln (Xt ), so that
∆Lt = Lt − Lt−1 = t
where t ≈ IID.
Note, martingale is more general than random walk since semi-martingales allow for dependence in the process. Thus,
random walk implies martingale but martingale does not imply random walk in the process.
We state and prove the Conditional Jensen Inequality as presented by Shreve [2003].
Theorem A.5.1 (The Conditional Jensen Inequality) Let φ : R → R be a convex function and E|φ(X)| < ∞ then
E[φ(X)|F] ≥ φ(E[X|F])
Since φ is convex, we can express it as
Therefore
276
We give the special case of the Optional Sampling Theorem necessary for use in Section (1.1.4.2). The theorem and
proof that follow are given by Williams [1991].
Theorem A.5.2 (Optional Sampling Theorem) Let X be a supermartingale. Let τ be a stopping time such that for
some K ∈ N, τ (ω) ≤ K, ∀ω. Then Xτ ∈ L1 (Ω, Fτ , P) and
E[XK |Fτ ] ≤ Xτ
If X is a submartingale then
E[XK |Fτ ] ≥ Xτ
Let A ∈ Fτ . Then
X
E[XK ]1IA = E[XK ]1IA∩{τ =n}
n≤K
X
≤ E[Xn ]1IA∩{τ =n}
n≤K
= E[Xτ ]1IA
P
The fact that |Xτ | ≤ n≤K |Xn | guarantees the integrability of Xτ . If we consider instead the process −X, so that
−X is a submartingale, then analogously to above, we obtain the result in the second part of the theorem. We give
the special one-dimensional case of the Girsanov Theorem with Brownian Motion, necessary for the probabalistic
approach to the Black-Scholes model in Section (1.2.3). We also give the Levy Characterisation Theorem that is
needed during its proof. We follow the construction given by Zheng [2015].
Theorem A.5.3 (Girsanov’s Theorem) Let (Ω, F, (Ft )t≥0 ), P) be a filtered probability space, where T ∈ (0, ∞)
and assume for simplicity that F = FT . Let Wt be an adapted one-dimensional Brownian Motion and θ(t) an
RT
adapted process such that 0 θ2 (s)ds < ∞ almost surely. Then define
Rt Rt
θ 2 (s)ds)
Zt = e( 0
θ(s)dWs − 21 0
hR i
T dQ
If E 0 θ2 (s)Zt2 ds < ∞ then E[ZT ] = 1. Furthermore, define a measure Q on (Ω, F) by dP = ZT , then under the
measure Q
Z t
W̃t = Wt − θ(s)ds
0
is an Ft Brownian Motion.
Assuming that E[ZT ] = 1 ensures that Q will indeed be a probability measure. Now applying the product rule
277
Theorem A.5.4 (Levy Characterisation Theorem) Let Mt be a continious martingale on (Ω, F, (Ft )t≥0 ), P) and
the quadratic variation of M be [Mt ] = t. Then Mt is an Ft -Brownian Motion.
2t
We suppose that Mt is a continuous martingale with [Mt ] = t and consider the function g(t, x) = eiθx+θ 2 . By
applying Itô’s formula we obtain
1 2 1
dg(t, Mt ) = θ g(t, Mt )dt + iθg(t, Mt )dMt − θ2 g(t, Mt )d[M ]t
2 2
so that g(t, Mt ) is a martingale if [M ]t = t. Hence for t > s
1 2
E[eiθ(Mt −Ms ) |Fs ] = e− 2 θ (t−s)
Let X be any Fs -measurable random variable and ψX its characteristic function. Then by the Tower Property of
conditional expectation, the joint characteristic function of X and Mt − Ms is
So, X and Mt − Ms are independent, meaning (Mt − Ms ) is independent of Fs . Also from above, (Mt − Ms ) ∼
N (0, t − s). Hence, (Mt ) is an (Ft )-Brownian Motion.
1 − e−λx , x ≥ 0
F (x) =
0 otherwise
which is the continuous limit of the waiting time distribution. The probability density function is given by
278
λe−λx for x ≥ 0
f (x) =
0 otherwise
The mean of X is given by
Z ∞
1
E[X] = [1 − F (x)]dx =
0 λ
1
and the variance of X is given by V ar(X) = λ2 so that the standard deviation of X is equal to its mean. The moments
of X for n = 1, 2, .. are given by
n!
E[X n ] =
λn
An exponentially distributed random variable T follows the relation
279
xα−1 (1 − x)β−1
f (x) = cst × xα−1 (1 − x)β−1 = R 1
0
uα−1 (1 − u)β−1 du
Γ(α + β) α−1
= x (1 − x)β−1
Γ(α)Γ(β)
1
= xα−1 (1 − x)β−1
B(α, β)
where Γ(•) is the gamma function, and B(•, •) is a normalisation constant ensuring that the total probability integrates
to 1. Note, this definition includes both ends x = 0 and x = 1, but some authors choose to exclude them and consider
0 < x < 1 instead. Beta densities are symmetric (for α = β = 1), unimodal (α, β > 1), uniantimodal, increasing,
decreasing or constant depending on the values α and β relative to 1, and have many more attractive properties. The
beta density is U -shaped when α, β < 1, it has positive skew when α < β and negative skew when α > q. Since
the beta distribution approaches the Bernoulli distribution in the limit when both shape parameters α and β approach
zero, some authors denote the pair (α, β) by (p, q). The slope of the pdf is given by
0 (α + β − 2)x − (α − 1)
f (x) = f (x)
(x − 1)x
xα−2 (1 − x)β−2
= − (α + β − 2)x − (α − 1)
B(α, β)
and at x = 12 , for α = β, the slope of the pdf is zero. Further, we get the differential equation
0
(x − 1)xf (x) + α − 1 − (α + β − 2)x f (x) = 0
The cumulative distribution function is given by
B(x; α, β)
F (x) = = Ix (α, β)
B(α, β)
where B(•; α, β) is the incomplete beta function and I• (α, β) is the regularised incomplete beta function. The mode
of a beta distributed random variable X with α, β > 1 (corresponding to the peak in the PDF) is given by
α−1
α+β−2
The mean of X is given by
1 1
xα−1 (1 − x)β−1
Z Z
µ = E[X] = xf (x)dx = x dx
0 0 B(α, β)
α 1
= = β
α+β 1+ α
280
β 1
which only depends on the ratio α . For α = β, we get the mean µ = 2, which is at the center of the (symmetric)
distribution. We also get the following limits
For the former limit ratio, the beta distribution becomes a one-point degenerate distribution with a Dirac delta function
spike at x = 1 with probability 1 and zero probability elsewhere else. Similarly, for the latter limit ratio the spike is
at x = 0. Next, we consider the limit cases where one parameter is finite (non-zero) and the other one approaches the
limits
F (x) = 1 − (1 − xa )b
In a more general form, using linear transformation, the normalised variable x is replaced with the unshifted and
unscaled variable z where
z − zmin
x= , zmin ≤ z ≤ zmax
zmax−zmin
In a discrete setting where the density satisfies P (X ∈ dx) ≈ fX (x)dx, then when considering the unshifted variable
1
z we get dx = zmax−z . We can invert the distribution function to obtain the quantile function
min
11
Q(y) = F −1 (y) = 1 − (1 − y) b a , 0 < y < 1
281
We can then trivially generate random variable as for U ∼ U (0, 1), then X ∼ f if
1 1
X = (1 − U b ) a
The Kumaraswamy distribution has the same basic shape properties as the beta distribution (see Jones [2009])
• unimodal: a > 1, b > 1.
• uniantimodal: a < 1, b < 1.
• increasing: a > 1, b ≤ 1.
• decreasing: a ≤ 1, b > 1.
• constant: a = b = 1.
and the Kumaraswamy density also matches that of the beta density at the boundaries of their support
• f (x) ∼ xa−1 as x → 0.
• f (x) ∼ (1 − x)b−1 as x → 1
The raw moments of the Kumaraswamy distribution are given by
bΓ(1 + na )Γ(b) n
mn = = bB(1 + , b)
Γ(1 + b + na ) a
where B(•, •) is the Beta function. Similarly to the beta distribution, they exist for all n > −a. The variance,
skewness, and excess kurtosis can be computed from these raw moments. For instance, the variance is given by
σ 2 = m2 − m21
2 1 2
V ar(X) = bB(1 + , b) − bB(1 + , b)
a a
Assuming that Xa,b is a Kumaraswamy distributed random variable, then it is the a-th root of a suitably defined Beta
distributed random variable (see Jones [2009]). Let Y1,b denote a Beta r.v. with parameters α = 1 and β = b, then
1
Xa,b = Y1,b
a
with equality in distribution
Z x Z xa 1
P (Xa,b ≤ x) = abua−1 (1 − ua )b−1 du = b(1 − u)b−1 du = P (Y1,b ≤ xa ) = P (Y1,b
a
≤ x)
0 0
1
γ
Considering a generalised distribution with Beta distributed r.v. Yα,β with γ > 0, the raw moments are given by
Γ(α + β)Γ(α + nγ )
mn =
Γ(α)Γ(α + β + nγ )
where the original moments are recovered by setting α = 1,β = b and γ = a. However, in general the CDF does not
have a closed form solution. Note,
• if X ∼ K(1, 1) then X ∼ U (0, 1).
• if X ∼ B(1, b) then X ∼ K(1, b).
• if X ∼ B(a, 1) then X ∼ K(a, 1).
• if X ∼ K(a, b) then X ∼ GB1(a, 1, 1, b).
where GB1(a, 1, 1, b) is the generalised beta distribution of the first kind.
282
where we recover the classical beta distribution for a = 1 (1, p, q), and the Kumaraswamy distribution for p = 1
(a, 1, q). It is the distribution of the a1 power of a B(p, q) random variable or of the p-th order statistic of a sample of
size p+q−1 from the power function distribution B(a, 1). Jones [2004] introduced the general class of beta-generated
distibutions characterised by their density function
where we recover the beta-generated distribution for a = 1 and the Kumaraswamy distribution for p = 1. Further, the
GBG with parent F (x) is a standard beta-generated distribution with parent F a (x).
283
Expanding the exponential and collecting terms according to the order of the derivatives we obtain the Gram-Charlier
A series. Focussing on the first two correction terms to the normal distribution, we get
1 (x−µ)2 k x−µ k4 x−µ
f (x) ≈ √ e− 2σ2 1 + 3 3 He3 ( )+ 4
He4 ( )
σ 2π 3!σ σ 4!σ σ
where He3 (x) = x3 − 3x and He4 (x) = x4 − 6x2 + 3 are Hermite polynomials. Note, this expression is not
guaranteed to be positive, so that it is not a valid probability distribution. There are many cases of interest where the
x2
Gram-Charlier A series diverges. It converges only if the density f (x) falls at a faster rate than e− 4 at infinity (see
Cramer [1957]). Hence, when it does not converge the series is not a true asymptotic expansion because it is not
possible to estimate the error of the expansion.
k1Fn − γ1 = 0
k2Fn − γ2 = 0
kr λr
krFn − γr = r = r ,r≥3
σ n 2 −1
r n 2 −1
Following the same approach as the Gram-Charlier A series except that terms are collected according to powers of n,
we get
∞
X Pj (it) − t2
Ψn (t) = 1 + j e 2
j=1 n
2
where Pj (x) is a polynomial of degree 3j. After applying the inverse Fourier transform, the distribution function is
given by
∞
X Pj (−D)
Fn (x) = N (x) + j N (x)
j=1 n2
Letting N (j) (x) be the jth derivative of N (•) at point x, we can recover the first few terms of the expansion. Fur-
ther, since the derivatives of the density of the normal distribution are related to the normal density by φ(n) (x) is
(−1)n Hen (x)φ(x), then we obtain an alternative representation in terms of the density function. However, Edge-
worth expansions are not guaranteed to be a proper probability distribution since the integral of the density needs not
284
to integrate to one, and the probabilities can be negative. Further, they can be inaccurate, especially in the tails, be-
cause they are obtained under a Taylor series around the mean, and they guarantee (asymptotically) an absolute error
but not a relative one.
Definition A.6.1 X is called stable, if for any positive number a and b, there exists a positive number c and real
number d, such that
Proposition 17 For any stable random variable X, there exists α ∈ (0, 2], such that numbers in Equation (A.6.3)
satisfies
aα + bα = cα
√
In the Gaussian case, X ∼ N (µ, σ 2 ) and we get c = a2 + b2 and d = (a + b − c)µ with α = 2.
Definition A.6.2 X is called stable, if for any integer n ≥ 2, there exists positive numbers cn , and real numbers dn ,
such that
n
X
Xi = cn X + dn (A.6.4)
i=1
This is called re-normalisation group transformation, and {Tn } is a semi-group and satisfies Tn Tm = Tmn .
Proposition 18 An i.i.d. strictly α-stable random variable sequence is a fixed point for Tn at δ = α1 . For dependent
stable sequences, it is still a fixed point for Tn , but the corresponding δ is a function of α and the dependence.
285
Definition A.6.3 X is stable if it has a domain attraction, that is, there exists i.i.d. random variables Y1 , Y2 , ...,
dn > 0, an ∈ R, such that
n
1 X
Yi + an = X (A.6.5)
dn i=1
If X is Gaussian and Yi are i.i.d. with finite variance, then Equation (A.6.5) is just the Central Limit Theorem (CLT).
When dn = n1 α, then Y is said to belong to the normal domain of attraction X. In general, we have dn = n1 αh(n),
where h(x) is a slowly varying function at infinity 1 We are now going to define stable distributions in terms of
characteric functions.
Definition A.6.4 X is stable if there exists 0 < α ≤ 2, c ≥ 0, −1 ≤ β ≤ 1, and real number µ,such that the
characteric function for X is
−cα |θ|α 1−iβ(signθ) tan πα +iµθ
iθX e 2 if α 6= 1
E[e ] = 2
e−c|θ| 1+iβ π (signθ) ln θ +iµθ if α = 1
where α is the stability index, and c, β, µ are unique, except when α = 2, β is arbitrary. We denote that X ∼
S(α, β, c, µ).
Many properties can be derived from the characteric functions.
Proposition 21 If 0 < α < 2, and X ∼ S(α, β, c, 0), then −X ∼ S(α, −β, c, 0).
which means that β is a skewness parameter.
Definition A.6.5 S(α, β, c, µ) is called skewed to the right (left) if β > 0 (β < 0), and it is called totally skewed to
the right (left) if β = 1 (β = −1).
Note, the skewness here does not refer to the support of the distribution, but to the parameters P and Q in the Levy
measure.
286
Proposition 26 If X has distribution S(α, β, c, 0) with α < 2, then there exists two i.i.d. random variables Y1 and
Y2 , with common distribution S(α, 1, c, 0), such that
1+β 1 1
( 2 ) α Y1 + ( 1−β2 ) Y2 if α 6= 1
α
X=
( 1+β 1−β 1+β 1+β 1−β 1−β
2 )Y1 + ( 2 )Y2 + c π ln 2 − π ln 2 if α = 1
it means that those stable subordinators can be used as building blocks of more general stable random variables.
Proposition 29 Let X ∼ S(α, β, c, µ) with 0 < α ≤ 2. Then the location parameter µ equals to the mean.
If X is symmetric α-stable (SαS), then X ∼ S(α, 0, c, 0). Its characteristic function is
α
|θ|α
E[eiθX ] = e−c
When c = 1, X is called standard, and when α = 2, a standard SαS distribution is N (0, 2).
0 0 0
Proposition 30 Let X ∼ S(α , 0, c, 0) with 0 < α ≤ 2. If 0 < α < α , if we choose a random variable A ∼
0
α
S( αα0 , 1, (cos 2α
πα
0 ) α , 0), and assume X and A are independent, then
1
Z = A α0 X ∼ S(α, 0, c, 0)
0
α
In particular, if α = 2, for a zero mean Gaussian random variable X, and if A is a positive 2 -stable random variable
1
0
independent of X, then Z = A α X ∼ SαS. That is, every SαS random variable is conditionally Gaussian.
We can also express stable distributions in terms of random series. We let N (t) be the number of arrivals within
time [0, t], the process {Nt }t≥0 is a homogeneous Poisson process with rate λ, if the inter-arrival times τi+1 − τi ,
i ≥ 1 are independent exponentially distributed with mean λ1 .
287
Proposition 31 Let {τi } denote the arrival times of a Poisson process with rate 1, and {Ri } are i.i.d. random
variables, independent of thesequence {τi }. If the series
∞
1
X
τ − α Ri
i=1
288
Equity miscellaneous
0 0 0 0
CBS (t, x, K , T ; r = 0) = xRe(t, T )N (d1 (T − t, xR(t, T ), K )) − K N (d2 (T − t, xR(t, T ), K ))
with
0 1 xR(t, T ) 1 √
d2 (T − t, xR(t, T ), K ) = √ log − σ T −t
σ T −t K0 2
0
= d2 (T − t, xC(t, T ), K )
d1 (T − t, xC(t, T ), K) = d3 (T − t, xC(t, T ), K) + ξ
d2 (T − t, xC(t, T ), K) = d3 (T − t, xC(t, T ), K) − ξ
289
Factorising the call price in Equation (1.2.6) with respect to xRe(t, T ) and dividing both side of the equation with
1 − η, we get
1 CBS (t, x, K, T ) 1
= N (d3 (T − t, xC(t, T ), K) + ξ) − ηN (d3 (T − t, xC(t, T ), K) − ξ)
1−η xRe(t, T ) 1−η
or in a more concise form
PBS (t, x, K, T ) = −xe−q(T −t) N (−d1 (T − t, xe(r−q)(T −t) , K)) + Ke−r(T −t) N (−d2 (T − t, xe(r−q)(T −t) , K))
= Ke−r(T −t) N (d1 (T − t, K, xe(r−q)(T −t) )) − xe−q(T −t) N (d2 (T − t, K, xe(r−q)(T −t) ))
with delta
since
∂2 ∂
Γ(t, St ) =
2
CBS (t, x, K, T ) = Re(t, T ) N (d1 (T − t, x, K))
∂x ∂x
Using the chain rule, for x = St we get the gamma
∂ dd1 Re(t, T )
Γ(t, St ) = Re(t, T ) N (d1 (T − t, x, K)) = n(d1 (T − t, x, K)) √
∂d1 dx xσ T − t
and we see that Γ(t, St ) > 0. The vega in the Black-Scholes model is
√
V ega = xRe(t, T ) T − tn(d1 (T − t, x, K)) (B.1.4)
00 0
∂
where n(x) = ∂x N (x) and N (x) = n (x) = −xn(x), with the density given by
290
1 x2
n(x) = √ e− 2
2π
Further, we have
lim V ega = 0
(T −t)→0
lim V ega = 0
(T −t)→∞
Thus, the vega is positive or equal to zero. We can express the vega in terms of the gamma as follow
∂ √ 1 1
V ega = V olga = xRe(t, T ) T − t d1 d2 n(d1 ) = d1 d2 V ega (B.1.8)
∂σ σ σ
Again, differentiating the Volga with respect to volatility, we get
∂ 1
V olga = −d1 d2 − d22 − d21 + d21 d22 2 V ega
∂σ σ
d
Since dK di = − Kσ√1T −t for i = 1, 2, we differentiate the Vega with respect to the strike K, getting
∂ d
V ega = √1 V ega
∂K Kσ T − t
which we differentiate one more time with respect to the strike, getting
∂2 d1 d2 1 (d1 d2 − 1)
V ega = 2 2 V ega − 2 2 V ega = 2 2 V ega
∂K 2 K σ (T − t) K σ (T − t) K σ (T − t)
To conclude, we need to differentiate the Vega with respect to maturity T , getting
∂ 1 1 (r(T ) − q(T ))
V ega = −q(T )V ega + V ega + d1 d2 − √ V ega
∂T 2(T − t) 2(T − t) σ T −t
We now differentiate the call price with respect to the strike
∂
CBS (t, x, K, T ) = −P (t, T )N (d2 (T − t, x, K))
∂K
291
Differentiating the call price twice with respect to the strike we get
∂2 P (t, T )
CBS (t, x, K, T ) = √ n(d2 (T − t, x, k))
∂K 2 Kσ T − t
Using Equation (B.1.5), it can be rewritten as
∂2 xRe(t, T ) 1
2
CBS (t, x, K, T ) = 2 √ n(d1 (T − t, x, k)) = 2 V ega(K, T ; σ) (B.1.9)
∂K K σ T −t K σ(T − t)
Setting x = St , we differentiate the call price with respect to maturity T getting
∂ σ
CBS (t, St , K, T ) = V ega + rT KP (t, T )N (d2 ) − qT xRe(t, T )N (d1 )
∂T 2(T − t)
We define XBS (t, St , K, T ) as
σ
XBS (t, St , K, T ) = V ega + rT KP (t, T )N (d2 )
2(T − t)
which is always positive, and rewrite the above partial derivative as
∂
CBS (t, St , K, T ) = XBS (t, St , K, T ) − qT xRe(t, T )N (d1 )
∂T
Similarly, the partial derivative of the put price with respect to maturity T is
∂ σ
PBS (t, St , K, T ) = − V ega + rT KP (t, T )N (d1 ) − qT xRe(t, T )N (d2 )
∂T 2(T − t)
∂CBS (t,St ,K,T ) dτ
Since dτ
dt = −1 where τ = T −t, the partial derivative of the price with respect to time t is given by ∂τ dt .
From the relation
∂ σ
CBS (t, St , K, T ) = − V ega − rt x∆(t, St ) + rt CBS (t, St , K, T ) + qt xRe(t, T )N (d1 )
∂t 2(T − t)
From the Equation (B.1.6) relating the vega with the gamma, we can rewrite the theta as
∂ 1
CBS (t, St , K, T ) = − x2 σ 2 Γ(t, St ) − rt x∆(t, St ) + rt CBS (t, St , K, T ) + qt xRe(t, T )N (d1 ) (B.1.10)
∂t 2
Note, we can write the theta as
∂ 1
CBS (t, St , K, T ) = − x2 σ 2 Γ(t, St ) + qt xRe(t, T )N (d1 ) − rt KP (t, T )N (d2 )
∂t 2
and study the ratio
292
√ √
qt Re(t, T )N ( 21 σ T − t) − rt Re(t, T )N (− 21 σ T − t)
R= 1 2
2 xσ Γ(t, St )
Further assuming that qt = rt , then Re(t, T ) = P (t, T ) and the ratio simplifies to
√ √ √
qt Re(t, T ) N ( 12 σ T − t) − N (− 21 σ T − t)
qt Re(t, T )0.4σ T − t qt (T − t)
R= Re(t,T )
= Re(t,T )
= 1 −0.5d21
1 1
2 Ke
2
2 xσ n(d1 ) xσ T −t 2 xσn(d1 )
√ √
T −t
and the numerator is smaller than the denominator. However, when N (d1 ) and N (d2 ) are both close to 0 or close to 1
the gamma tends to zero. The question being to know which of the numerator or the denominator goes to zero faster.
We can then assume that qt xRe(t, T )N (d1 ) − rt KP (t, T )N (d2 ) is small compared to 12 x2 σ 2 Γ(t, St ) so that the theta
can be approximated as
∂ 1
CBS (t, St , K, T ) ≈ − x2 σ 2 Γ(t, St )
∂t 2
• The escrowed model which assumes that the asset price minus the present value of all dividends to be paid until
the maturity of the option follows a geometric Brownian motion.
• The forward model which assumes that the asset price plus the forward value of all dividends (from past dividend
dates to today) follows a geometric Brownian motion.
• The piecewise lognormal model which assumes that the stock price shows a jump downwards at dividend dates
and follows a geometric Brownian motion in between those dates.
Several approaches use the idea of an escrowed dividend process where the stock price process is decomposed into two
parts, a risky part and an escrowed dividend part. The idea being to make a simple adjustment to use the BS-formula
by replacing the stock price S0 by the stock price minus the present value of the dividend S0 −P (0, td )d, where d is the
cash dividend at time td . The main advantage of the escrowed model and the forward model is that we can use the BS-
formula with an adjusted value of the current stock price or strike, respectively. Moreover, for American options it is
easy to construct binomial trees which recombines. However, the escrowed approach undervalue call options and lead
to arbitrage opportunities (for details see Bos et al. [2003], Haug et al. [2003]). One solution is to consider a modified
escrowed model where we assume that the asset price minus the present value of all dividends to be paid in the future
follows a geometric Brownian motion. However, the option prices in that setting will depend on the dividends paid
after the options have expired. Modifying both the escrowed model and the forward model, Bos et al. [2002] defined
a mixture model where one part of the dividends is incorporated in a modified asset price, and the other part in a
modified strike price. From a theoretical point of view, the piecewise lognormal model should be preferred even if it
is computationaly slower. Considering a piecewise lognormal model, Wilmott et al. [1993] proposed a method based
on interpolation steps within a partial differential equation (PDE). Using conditional expectation, Haug et al. [2003])
replaced a multiple integration by a succession of single integrations over Black-Scholes like approximating functions.
293
Following the same approach as Wilmott et al. [1993], but using a stock-price dependent dividend, Vellekoop et al.
[2006] constructed a recombing tree. In any case, when modelling discrete dividends we must face a trade-off between
accuracy and time. We beriefly describe some popular models for discrete dividends where a closed-form solution can
be derived for European options. For notation purpose, we consider the definition of the discrete dividends given in
Section (5.1.1.2).
Remark B.2.1 It can be used to infer a distribution but one should not diffuse S for exotic pricing but only Z because
the local volatility is shifted downwards as maturity passes an ex-dividend dates.
In that model, the stock price process (St )t∈[0,T ] is given by the transformation
St = Zt + DP V (t; t, T )
P P (t,ti )
where, for T = N ∆t, the displaced coefficient is DP V (t; t, T ) = t<ti ≤T Re(t,t i)
dti . At time t0 = 0, the trajectory
of the process Zt starts at S0 to which we substract the present value of all the expected future dividend payments up
to maturity. The process ends its trajectory at maturity where it is equal the stock price, that is, ZT = ST . Note, we
can not pay a constant dividend d at arbitrarily low stock prices as we can get negative stock prices leading to arbitrage
issues. This said, we get the present value of all the future dividend payments up to maturity T described in Section
(5.1.1.2), differentiate that function with respect to time and get its dynamics as
0
b P V (t; t, T ) = µt DP V (t; t, T )dt − Dt dt
dD
It implies that at maturity we have ZT = ST and we can write the price of a call option as
294
x − DP V (t; t, T ) DP V (t; t, T )
σS (x, t) = σZ = σZ (1 − )
x x
In case where σZ is flat then the corresponding stock volatility σS is no-longer flat. Since DP V (t; t, T ) ≥ 0 and
usually DP V (t; t, T ) ≤ St then σS < σLN so that the average volatility of the stock S will be lower than the volatility
of a process log-normally distributed between dividend dates. Moreover, the entire local volatility surface is shifted
downwards as maturity passes an ex-dividend date. As explained by Bos et al. [2003], this behaviour is especially
undesirable for path-dependent options. Note, the volatility of the stock converge to that of the risky process when
time to maturity is reaching zero, lim(T −t)→0 σS = σZ . Obviously when this model is calibrated to market prices one
retrieve the volatility σZ of the risky process.
Re(t, T )
F (t, T ) = Zt = (St − DP V (t; t, T ))C(t, T ) = St C(t, T ) − D(t, T )
P (t, T )
Assuming that the forward price is log-normally distributed with dynamics
dF (t, T )
= σF dWt
F (t, T )
one can then apply Black-Scholes formula to get the price of a call option
C(t, St , K, T ) = P (t, T )F (t, T )N (d1 (t, F (t, T ), K)) − KP (t, T )N (d2 (t, F (t, T ), K))
St = C(t0 , t)Yt − Dt
dYt = Yt σY (t, Y )dWY (t)
Yt0 = St0
with solution
Rt
− 21 2
(s,Y )ds+ tt σY (s,Y )dWY (s)
R
σY
Yt = Yt0 EY (t0 , t) and EY (t0 , t) = e t0 0
In that setting the price of a European call option seen at time t0 with strike K and maturity T is
0
CS (t0 ; K, T ) = P (t0 , T )Et0 [(C(t0 , T )YT − K )+ ]
0
where K = K + DT . Note, for all time t < T the stock price at maturity is
ST = St C(t, T )EY (t, T ) − DT + C(t, T )Dt EY (t, T ) = C(t, T )Y (t, T ) − DT + C(t, T )Dt EY (t, T )
295
Note, since the spot price follows the transformation in Equation (5.1.1), we can rewrite the price of the call option as
00 00 00
CS (t; K, T ) = P (t, T )C(t, T )CY (t; K , T ) = Re(t, T )Et [(YT − K )+ ] = C(t, Yt , K , T ) (B.2.12)
00 0
K P (t,T )(K+D(t,T ))
where K = C(t,T ) = Re(t,T ) . Hence, to price a call option with strike K in the S-space domain, we need
0 00
K
the strike k = in the Y -space domain. Reversing the process, we get K = C(t, T )K − D(t, T ) and the call
C(t,T )
option price becomes
1
CY (t; k, T ) = CS (t; C(t, T )k − D(t, T ), T )
Re(t, T )
We can also rewrite the price of the call option as
b + ] = Et [(ZT − K)
CS (t; K, T ) = Et [(Re(t, T )YT − K) b +]
where K
b = P (t, T )(K + D(t, T )) and Zt = Re(t0 , t)Yt .
y = y(0) + α
where y(0) = CBS (t, x, K, T ; σ) is the Black-Scholes price around-the-money forward. Given the definition of the
implied volatility in Section (1.4.1.2), we let y = C(t, x, K, T ) be the call price for strike K and maturity T , and we
let g(y) = f −1 (y) be the inverse function.
296
Assumption B.3.1 We assume that y(0) dominates α, that is, α is small with respect to y.
As a result from Assumption (B.3.1), we can approximate the function g(y) with a Taylor series around y(0), getting
0 1 00 1
g(y) = g(y(0)) + (y − y(0))g (y(0)) + (y − y(0))2 g (y(0)) + ... + (y − y(0))n g (n) (y(0)) + Rn
2 n!
where Rn is the remainder term. The series becomes
0 1 00 1
g(y) = g(y(0)) + αg (y(0)) + α2 g (y(0)) + ... + αn g (n) (y(0)) + Rn
2 n!
Note, this approximation may not satisfies some market conditions for deep in-or-out of the money options. Defining
the implied volatility for the price y(0) as
0 1
(f −1 ) (y) =
f 0 (f −1 (y))
00
00 f (f −1 (y))
(f −1 ) (y) = −
[f 0 (f −1 (y))]3
000 00 0
f (f −1 (y)) − 3[(f −1 ) (y)]2 [f (f −1 (y))]5
000
(f −1 ) (y) = −
[f 0 (f −1 (y))]4
0000 −1 000 00 0 00 0
0000 f (f (y)) − 10(f −1 ) (y)(f −1 ) (y)[f (f −1 (y))]6 + 15[(f −1 ) (y)]3 [f (f −1 (y))]7
(f −1 ) (y) = −
[f 0 (f −1 (y))]5
0 d −1 1
g (y(0)) =f (y)|y=y(0) =
dy V ega(σ0 )
00 d2 −1 d1 d2 V ega(σ0 ) 1 d1 d2 1
g (y(0)) = 2
f (y)|y=y(0) = − 3
=−
dy σ0 [V ega(σ0 )] σ0 [V ega(σ0 )]2
2 2
d d
−d1 d2 − d22 − d21 + d21 d22 σ12 V ega(σ0 ) − 3 σ1 22 V ega(σ0 )
000
g (y(0)) = − 0 0
[V ega(σ0 )]4
d1 d2 + d22 + d21 + 2d21 d22 σ12
0
=
[V ega(σ0 )]3
6d1 d2 + 3d22 + 3d21 − 3d1 d32 − 3d31 d2 − 3d21 d22 + d31 d32 σ13 V ega(σ0 )
0000
g (y(0)) = −
[V ega(σ0 )]5
d31 d32
10 d21 d22 + d1 d32 + d31 d2 + 2d31 d32 σ13 V ega(σ0 )
σ3
V ega(σ0 )
− 5
0
+ 15 0
[V ega(σ0 )] [V ega(σ0 )]5
6d1 d2 + 3d22 + 3d21 + 7d1 d32 + 7d31 d2 + 7d21 d22 + 6d31 d32 σ13
= − 0
[V ega(σ0 )]4
297
2 2 1
2 2
1 1 2 d1 (σ0 )d2 (σ0 ) 1 1 3 d1 d2 + d2 + d1 + 2d1 d2 σ02
g(y) = g(y(0)) + α − α + α
V ega(σ0 ) 2 σ0 [V ega(σ0 )]2 6 [V ega(σ0 )]3
3 3 1
2 2 3 3 2 2
1 4 6d1 d2 + 3d2 + 3d1 + 7d1 d2 + 7d1 d2 + 7d1 d2 + 6d1 d2 σ03
− α + ...+
24 [V ega(σ0 )]4
√ 0 x
log K +(r−q)T √
where V ega(σ0 ) = xRe(0, T ) T N (db1 ) with db1 = √
σ0 T
+ 12 σ0 T . Replacing the terms in the g(y)
expansion with the appropriate Black-Scholes formula, we get the approximated implied volatility
As all the derivatives of inverse functions depend on f −1 (y), the Taylor series around y(0) depend crutially on the
computation of the ATM forward volatility σ0 .
B.3.1.2 An example
We are going to use the technique described in Appendix (B.3.1.1) to infer the implied volatility from the call option
price in Equation (3.1.4). As the first term of the parametric model generates a smile, we are first going to infer its
volatility. Given the price yb = (1 + µT )CBS (t0 , St0 , K, T ; σ) we get the decomposition
where K = K(1 + µi (T )) and Σi (K, PnT ) is a function of strike and maturity. In addition, the weights ai > 0 for
i = 1, .., n are positive and such that i=1 ai = 1. That is, the function CBS (t, x, K, T ; Σi (K, T )) is a modification
of the Black-Scholes formula capable of generating a skew and curvature. For instance, considering the modified
Black-Scholes price in Equation (B.3.13), we can rewrite the market price as
298
n
X
C(t, x, K, T ) = ai CBS (t, x, K, T ; Σi (K, T )) + αi (K, T )
i=1
yi = yi (0) + αi , i = 1, ..., n
where yi (0) = CBS (t, x, K, T ; σi ) is a price around-the-money forward, and such that αi accounts respectively for
the skew and the curvature. Given the definition of the implied volatility in Section (1.4.1.2), we let y = C(t, x, K, T )
be the call price and g(y) = f −1 (y) be the inverse function. Hence, in our framework we can write
n
X n
X
y(0) = ai yi (0) , α = ai αi
i=1 i=1
y = y(0) + α
Note, Assumption (B.3.1) holds.
σ b(K, T )) − y(0) = 0
b : CBS (K, T ; σ
When the spot price S0 is close to the at-the-money forward strike, we can use the property of linearity of the Black-
Scholes formula with respect to the volatility (see Equation (1.2.12)) to get the approximated price
n
X
CBS (t0 , t; σ) ≈ ai CBS (t0 , t; σi )
i=1
Pn
where σ = i=1 ai σi . That is, the implied volatility σ b is no-longer a function of the strike, but only a function of
time. For notational purpose we let y = CBS (t0 , t; σ) and rewrite the model price as
y(0) = y + β
where β is the error term
n
X
β = y(0) − y = ai CBS (t, x, K, T ; σi ) − CBS (t0 , t; σ)
i=1
Assumption B.3.2 We assume that y = CBS (t0 , t; σ) dominates β, that is, β is small with respect to y(0).
So, given the volatility expansion in Equation (B.3.14), the approximated implied volatility for the price y(0) satisfies
299
n
1 X
b(K, T ) ≈
σ σ+ ai CBS (t, x, K, T ; σi ) − CBS (t0 , t; σ)
vega(σ) i=1
n
1 d1 (σ)d2 (σ) 1 X 2
− ai CBS (t, x, K, T ; σi ) − CBS (t0 , t; σ)
2 σ [vega(σ)]2 i=1
n
1 d1 (σ)d2 (σ) + d22 (σ) + d21 (σ) + 2d21 (σ)d22 (σ) σ12 X
3
+ 3
ai CBS (t, x, K, T ; σi ) − CBS (t0 , t; σ) + ...+
6 [vega(σ)] i=1
Note, if the strike is at-the-money forward K = F (t, T ), which corresponds to η = 1, then the implied volatility
simplifies to
σ
b(K, T ) K=F (t,T )
≈σ
Knowing the volatility σb(K, T ) for the price y(0), and given the correction term α(K, T ), we can then use Equation
(B.3.14) to infer the implied volatility of the price y as
n n
1 X 1 d1 (b σ )d2 (bσ) 1 X 2
Σ(K, T ) ≈ σ
b(K, T ) + ai αi − 2
ai αi (B.3.16)
vega(b σ ) i=1 2 σ
b [vega(b σ )] i=1
n
σ ) σb12 X
1 d1 (b σ ) + d22 (b
σ )d2 (b σ ) + d21 (b
σ ) + 2d21 (b
σ )d22 (b 3
+ ai αi + ...+
6 [vega(b σ )]3 i=1
Given Equation (3.1.5), in the special case where the strike is ATM-forward, we can approximate the implied volatility
with
n
1 X
Σ(K, T ) K=F (T )
≈σ+ ai αi (B.3.17)
vega(σ) i=1
1 2
S1 = F1 eσ1 W1 − 2 σ1 T1
1 2
S2 = F2 eσ2 W2 − 2 σ2 T2
where Si = S(Ti ). We only observe in the market the cumulative variance σ12 T1 up to T1 and σ22 T2 up to T2 . We
therefore need to make an assumption on the shape of the instantaneous variance ν(t) considering the market constraint
RT
on the cumulative variance, that is 0 2 ν(s)ds = σ22 T2 . We assume piecewise constant instantaneous variance ν(t)
and that the cumulative variance decomposes as
300
R T2
we could not have chosen the instantaneous variance to be constant all the way to T2 , 0
νds = σ22 T2 so that we get
σ22 T2 = σ22 T1 + σ22 (T2 − T1 ) as variance decomposition.
S2
In the case of piecewise constant instantaneous variance, the value S1 satisfy the equation
S2 F2 σ1,2 (W2 −W1 )− 1 σ1,2
2
(T2 −T1 )
= e 2
S1 F1
Multiply by S1 , we get
S2 F2 1 2
S2 = S1 = S1 eσ1,2 (W2 −W1 )− 2 σ1,2 (T2 −T1 )
S1 F1
F2 1 2 1 2
= F1 eσ1 W1 − 2 σ1 T1 eσ1,2 (W2 −W1 )− 2 σ1,2 (T2 −T1 )
F1
1 2 1 2
Setting α = F2 e− 2 σ1 T1 − 2 σ1,2 (T2 −T1 ) , the stock S2 simplifies to
S1 = βeσ1 W1
The correlation between two variables x and y is
cov(x, y)
ρ(x, y) = p
var(x)var(y)
As we are only interested in the Brownians of the underlying stock, we assume independent increments between W1
and (W2 − W1 ). Hence, the variance and covariance of the logarithm of S1 and S2 are given by
301
R T1 R T1
ν(s)dW (s)− 12 ν 2 (s)ds
S1 = F1 e 0 0
R T2 R T2
ν(s)dW (s)− 12 ν 2 (s)ds
S2 = F2 e 0 0
Z T1 Z T2 Z T1 ∧T2
ν 2 (s)ds
cov ν(s)dW (s), ν(s)dW (s) =
0 0 0
Z T1 Z T1
var( ν(s)dW (s)) = ν 2 (s)ds
0 0
Z T2 Z T2
var( ν(s)dW (s)) = ν 2 (s)ds
0 0
B.4.2.2 Case 2
The underlying stock price at the two dates T1 and T2 satisfies
R T1 R T1
ν1 (s)dW (s)− 12 ν12 (s)ds
S1 = F1 e 0 0
R T2 R T2
ν2 (s)dW (s)− 21 ν22 (s)ds
S2 = F2 e 0 0
Z T1 Z T2
Z T1 ∧T2
cov ν1 (s)dW (s), ν2 (s)dW (s) = ν1 (s)ν2 (s)ds
0 0 0
Z T1 Z T1
var( ν1 (s)dW (s)) = ν12 (s)ds
0 0
Z T2 Z T2
var( ν2 (s)dW (s)) = ν22 (s)ds
0 0
302
Remark B.4.1 The matrix A is invertible. The symmetric, positive definite properties of the correlation matrix makes
it special to factorise and the Cholesky decomposition can be applied.
In particular, if we choose W1 , W2 and W3 such that αd1 = αd2 = αf 1 = 0, then 1
q
0 1 − ρ2f d ρf d
A= r
0 0 1
1−ρ2f d −ρ2sd −ρ2f s +2ρf s ρsd ρf d ρf s −ρf d ρsd
1−ρ2
√ 2 ρsd
fd 1−ρf d
www.math.wsu.edu/faculty/genz/homepage
1
P3
j=1 αsj αdj = ρsd
303
where one can get the normal distribution probabilities accurate to 1e − 15 proposed by Schonfelder [1978]. Getting
double precision accuracy on the cumulative normal approximation, we can refine to full machine precision the Moro
approximation of the inverse normal cumulative distribution function. The Gaussian function being easily differen-
tiable, one can use the Newton’s second order method or the Halley’s method (third order), see Acklam [2004].
• MRG32ka, it has two components of order three with period length ≈ 2191
• MRG32k5a, it has two components of order five with period length ≈ 2319
• MRG63k3a, it is a 64-bit integer arithmetic with two components of order three with period length ≈ 2377
Alternatively, one can use a Linear Feedback Shift (LFSR) or Tausworthe random number generators which are based
on linear recurrences modulo two with primitive characteristic polynomials. For details, see L’Ecuyer [1991] and
Panneton et al. [2006]. The implementation is available from the author’s website
http://www.iro.umontreal.ca/~lecuyer
L’Ecuyer [1996b] proposed ME-CF combined LFSR generators with length L = 32 and L = 64, whose com-
ponents have recurrences with primitive trinomials of the form Pj (z) = z kj − z qj − 1 with 0 < 2qj < kj , and
with step size sj satisfying 0 < sj ≤ kj − qj < kj ≤ L and gcd(sj , 2kj − 1). Since a large number of generators
had good properties, L’Ecuyer [1999b] performed extensive computer searches and introduced specific instances of
such generators. For L = 32, the generators have period lengths (231 − 1)(229 − 1)(228 − 1)(225 − 1) ≈ 2113 and
characteristic polynomials of degree 113. The procedure LFSR113 has a period ρ ≈ 2113 and the procedure LFSR258
has a period ρ ≈ 2258 .
Using the powers-of-2 decomposition method, L’Ecuyer et al. [2000b] proposed CMRGs that are faster for an
equivalent statistical quality. Considering a = ±2q ± 2r , the product of x by each power of 2 can be implemented by
a left shift of the binary representation of x, and the product ax is computed by adding /or subtracting. Combining
MRG with J = 2 components of order k = 3 with parameters defined such that each component has only two nonzero
coefficients, one of the form aij = 2q and the other one of the form aij = 2q + 1, they obtained the MRG31k3p. It
has two distinct cycles of length ρ = m12m2 ≈ 2185 and provided very good results in terms of speed compared to the
other MRGs having similar periods.
304
0 f (b) − f (a)
∃c ∈]a, b[ , f (c) =
b−a
Theorem B.5.3 Given f : I → R an application differentiable on I. Then the following statements are equivalent
1. f is convex.
0
2. f is increasing.
3. the function f is above of its tangents.
Theorem B.5.4 Given f (x) is a function twice continuously differentiable on an interval I. The function is
00
• convex if f (x) > 0 for all x ∈ I.
00
• concave if f (x) < 0 for all x ∈ I.
The first derivative informs us of the slope of the tangent of a function, and second derivative shows us how it is
0
curved. Given the function f (x) and x = x∗ a static point of the function. We obtain a static point when f (x) = 0.
Then, f (x∗ ) is
00
• a local minimum if the function is convex at x∗ , that is, f (x∗ ) > 0.
00
• a local maximum if the function is concave at x∗ , that is, f (x∗ ) < 0.
305
[1974] Abramowitz M., Stegun I.A., Handbook of mathematical functions, with formulas, graphs and mathematical
tables. Dover.
[2004] Acklam P.J., An algorithm for computing the inverse normal cumulative distribution function.
http://home.online.no/ pjacklam, University of Oslo.
[2012] Ahdida A., Alfonsi A., A mean-reverting SDE on correlation matrices. Cornell University Library.
[2005] Ahmad R., Wilmott P., Which free lunch would you like today, sir?: Delta hedging, volatility arbitrage and
optimal portfolios. Wilmott Magazine, 81, pp 64–79.
[1998] Ait-Sahalia Y., Lo A.W., Nonparametric estimation of state-price densities implicit in financial asset prices.
Journal of Finance, 53, pp 499–547.
[2001] Alexander C., Market models - a guide to financial data analysis. John Wiley and Sons, New York.
[2003] Alexander C., Normal mixture diffusion with uncertain volatility: Modelling short- and long-term smile
effects. ISMA Centre, School of Business, University of Reading, UK.
[2004] Alexander C., Nogueira L.M., Hedging with stochastic local volatility. ISMA, Centre Discussion Paper in
Finance, University of Reading 2004-11.
[2004b] Alexander C., Nogueira L.M., Bivariate normal mixture spread option valuation. Quantitative Finance, 4, (6),
pp 637–648.
[2010] Alexander C., Rubinov A., Kalepky M., Leontsinis S., Hedging with stochastic local volatility. ICMA,
Centre Discussion Paper in Finance, University of Reading.
[2004] Alos E., A generalization of Hull and White formula and applications to option pricing approximation. Work-
ing Paper, Universitat Pompeu Fabra, February 4.
[2007] Alos E., Ewald CO., Malliavin differentiability of the Heston volatility and applications to option pricing.
Working Paper, Universitat Pompeu Fabra, University of St.Andrews, May 14.
[1998] Andersen L., Andreasen J., Volatility skews and extensions of the libor market model. Working Paper, August.
[2002] Anh V.V., Heyde C.C., Leonenko N.N., Dynamic models of long memory processes driven by Levy noise.
Journal of Applied Probability, 39, pp 730–747.
[1993] Ansel J-P., Stricker C., Decomposition de Kunita-Watanabe. Seminaire de probabilites, 27, pp 30–32, Springer-
Verlag.
[2013] Arismendi J.C., Multivariate truncated moments. Journal of Multivariate Analysis, 117, pp 41–75.
306
[2014] Arismendi J.C., A multi-asset option approximation for general stochastic processes. Working Paper, ICMA
Centre, University of Reading.
[2014] Arismendi J.C., Prokopczuk M., An analytic approximation of the implied risk-neutral density of American
multi-asset options. Working Paper, ICMA Centre, University of Reading.
[2002] Avellaneda M., Empirical aspects of dispersion trading in US equity markets. Petit Dejeuner de la Finance.
[2002b] Avellaneda M., Boyer-Olson D., Busca J., Friz P., Reconstructing volatility. Risk, 39, pp 87–91.
[1996] Baillie R.T., Long memory processes and fractional integration in econometrics. Journal of Econometrics, 73,
pp 5–59.
[1997] Bakshi G., Cao C., Chen Z., Empirical performance of alternative option pricing models. Journal of Finance,
52, (5), pp 2003–2049.
[2003a] Bakshi G., Kapadia N., Volatility risk premium embedded in individual equity options: Some new insights.
Journal of Derivatives, pp 45–54.
[2003b] Bakshi G., Kapadia N., Madan D., Stock return characteristics, skew laws, and the differential pricing of
individual equity options. Review of Financial Studies, 16, pp 101–143.
[1991] Barabasi A-L., Vicsek T., Multifractality of self-affine fractals. Physical Review A, 44, (4), pp 2730–2733.
[1987] Barone-Adesi G., Whaley R.E., Efficient analytic approximation of American option values. Journal of Fi-
nance, 42, pp 301–320.
[2009] Barrieu P., El Karoui N., Indifference Pricing: Theory and Applications, Chapter 3: Pricing, Hedging, and
Designing Derivatives with Risk Measures. Princeton University Press, ed. Carmona R., Cinlar E., pp 77–146.
[2011] Bartkiewicz K., Jakubowski A., Mikosch T., Wintenberger O., Stable limits for sums of dependent infinite
variance random variables. Probability Theory and Related Fields, 150, (3), pp 337–372.
[2012] Barunik J., Understanding the source of multifractality in financial markets. Physica A, 391, (17), pp 4234–
4251.
[2000] Bates D., Post-87 crash fears in the S & P 500 futures option market. Journal of Econometrics, 94, pp 181–238.
[2001] Baude S., GRM smile model, Rapport de stage de fin d’etude de SUPELEC.
[2002] Baude S., Roubinet C-H., GRM smile model for the pricing of regular and exotic options on equities, Working
Paper, GRM, Credit Lyonnais, October.
307
[2008] Bekiros S.D., Dimitris A.G., The extreme-value dependence of Asia-Pacific equity markets. Journal of Multi-
national Financial Management, 18, pp 197–208.
[2000] Bellamy N., Jeanblanc M., Incompleteness of markets driven by a mixed diffusion. Finance and Stochastics, 4,
Number 2.
[2001] Beneder R., Vorst T., Options on dividends paying stocks. International Conference on Mathematical Finance,
Shanghai.
[2001] Benhamou E., Delta equivalent asset flows. Swaps Strategy, Goldman Sachs International, London.
[1984] Bensoussan A., On the theory of option pricing. Acta Applicandae Mathematicae, 2, pp 139–158.
[2002] Berestycki H., Busca J., Florent I., Asymptotics and calibration of local volatility models. Quant. Finance, 2,
pp 61–69.
[2004] Bergenthum J., Ruschendorf L., Comparison of option prices in semimartingale models. University of
Freiburg, Eckerstr.
[1996] Bergman Y.Z., Grundy B.D., Wiener Z., General properties of option prices. Journal of Finance, 51, pp
1573–1610.
[2006] Bermudez A., Buehler H., Ferraris A., Jordinson C., Lamnouar A., Overhaus M., Equity hybrid derivatives.
Wiley.
[1713] Bernoulli J., Ars conjectandi, opus posthumum. Accedit Tractatus de seriebus infinitis, et epistola gallice
scripta de ludo pilae reticularis, Basel: Thurneysen Brothers.
[1983] Bhattacharya R., Gupta V., Waymire E., The Hurst effect under trends. Journal of Applied Probability, 20, 3, pp
649–662.
[2003] Bingham N., Kiesel R., Risk-neutral valuation, pricing and hedging of financial derivatives. Springer Finance
Textbook, Second Edition.
[1993] Bjerksund P., Stensland G., Closed-form approximation of American options. Scandinavian Journal of Manage-
ment, 9, pp 87–99.
[2006] Bjerksund P., Stensland G., Closed form spread option valuation. NHH Dept. of Finance & Management
Science Discussion Paper No. 2006/20.
[2001] Blacher G., A new approach for designing and calibrating stochastic volatility models for optimal delta-vega
hedging of exotic options. Conference presentation at Global Derivatives and Risk Management, Juan-les-Pins.
[1973] Black F., Scholes M., The pricing of options and corporate liabilities. Journal of Political Economy, 81, 3, pp
637–659.
[1975] Black F., Fact and fantasy in the use of options. Financial Analysts Journal, pp 36–72.
[2002] Bloch D., Miralles P., Statistical dynamics of the smile. Technical Report, DrKW.
[2007] Bloch D., Fast calibration of the Affine and quadratic models. Working Paper, Universite Paris 6 Pierre et
Marie Curie.
[2008] Bloch D., Expanding forward starting options. Working Paper, Universite Paris 6 Pierre et Marie Curie.
[2010a] Bloch D., A note on calibration of Markov processes. Wilmott Journal, 2, (2), pp 61–96.
308
[2010b] Bloch D., A practical guide to implied and local volatility. Working Paper, Mizuho Securities.
[2011] Bloch D., Coello Coello C.A., Smiling at evolution. Applied Soft Computing, 11, (8), pp. 5724–5734.
[2012a] Bloch D., From implied to local volatility surface. Working Paper, University of Paris 6 Pierre et Marie
Curie.
[2014] Bloch D., A practical guide to quantitative portfolio trading. Working Paper, University of Paris 6 Pierre et
Marie Curie.
[2012a] Bloch D., From implied to local volatility surface. Working Paper, University of Paris 6 Pierre et Marie
Curie.
[2004] Bollen N.P., Whaley R.E., Does net buying pressure affect the shape of implied volatility functions? Journal
of Finance, 59, pp 711–753.
[2003] Bondarenko O., Estimation of risk-neutral densities using positive convolution approximation. Journal of
Econometrics, 116, pp 85–112.
[2005] Borak S., Hardle W., Weron R., Stable distributions. SFB 649 Discussion Paper 2005-008, Humboldt-
Universitat zu Berlin, Germany.
[2002] Bos R., Vandermark S., Finessing fixed dividends. Risk, 15, (9), pp 157–158.
[2003] Bos R., Gairat A., Shepeleva A., Dealing with discrete dividends. Risk, 16, (1), pp 109–112.
[2006] Bossu S., A new approach for modelling and pricing correlation swaps in equity derivatives. Global Deriva-
tives and Risk Management.
[2010] Bouzoubaa M., Osseiran A., Exotic options and hybrids: A guide to structuring, pricing and trading. Wiley
Finance, John Wiley & Sons.
[1988] Boyle P.P., A lattice framework for option pricing with two state variables. Journal of Financial and Quantitative
Analysis, 23, pp 1–12.
[1989] Boyle P.P., Evnine J., Gibbs S., Numerical evaluation of multivariate contingent claims. Review of Financial
Studies, 2, pp 241–250.
[1978] Breeden D, Litzenberger R., Prices of state contingent claims implicit in option prices. Journal of Business, 51,
pp 621–652.
[1987] Brenner M., Courtadon G., Subrahmanyan M.G., The valuation of stock index options. Working Paper, New
York University.
[1988] Brenner M., Subrahmanyan M.G., A simple formula to compute the implied standard deviation. Financial
Analysts Journal, 44, No. 5, pp 80–83.
309
[1996] Broadie M., Detemple J., American option valuation: New bounds, approximations, and a comparison of
existing methods. Review of Financial Studies, 9, pp 1211–1250.
[2008] Buehler H., Volatility and dividends: Volatility modelling with cash dividends and simple credit risk. Working
Paper, Institut fur Mathematik, MA 7-4, 10623 Berlin, Germany.
[2004] Cajueiro D.O., Tabak B.M., The Hurst exponent over time: Testing the assertion that emerging markets are
becoming more efficient. Physica A, 336, pp 521–537.
[2007] Cajueiro D.O., Tabak B.M., Long-range dependence and multifractality in the term structure of LIBOR
interest rates. Physica A, 373, pp 603–614.
[2001] Calvet L., Fisher A., Forecasting multifractal volatility. Journal of Econometrics, 105, pp 27–58.
[2002] Calvet L., Fisher A., Multifractality in asset returns: Theory and evidence. The Review of Economics and
Statistics, 84, (3), pp 381–406.
[2004] Calvet L., Fisher A., Regime-switching and the estimation of multifractal processes. Journal of Financial
Econometrics, 2, pp 44–83.
[2004] Carbone A., Castelli G., Stanley H., Time-dependent Hurst exponents in financial time series. Physica A, 344,
pp 267–271.
[2003] Carmona R., Durrleman V., Pricing and hedging spread options. SIAM Review, 45, pp 627–685.
[2009] Carmona R., Cinlar E., Indifference Pricing: Theory and Applications. Princeton University Press.
[2011] Carmona R., Sun Y., Implied and local correlations from spread options. Princeton University Press.
[1998a] Carr P., Madan D., Option valuation using the Fast Fourier Transform. Morgan Stanley, University of
Maryland.
[1998b] Carr P., Madan D., Towards a theory of volatility trading. in R. A. Jarrow ed. volatility : new estimation
techniques for pricing derivatives, chap. 29, pp 417–427.
[2005] Carr P., Verma A., Potential problems in market models of implied volatility. Working Paper.
[2005b] Carr P., Wu L., A note on sufficient conditions for no arbitrage. Finance Research Letters, 2, pp 125–130.
[2014] Chance D.M., Hanson T.A., Li W., Muthuswamy J., A bias in the volatility smile. Working Paper.
[2004] Cherubini U., Luciano E., Vecchiato W., Copula methods in finance. The Wiley Finance Series, Wiley.
[1997] Chriss N., Black-Scholes and beyond: Option pricing models. Irwin Professional Publishing, Chicago.
[1973] Clark P.K., A subordinated stochastic process model with finite variance for speculative prices. Econometrica,
41, (1), pp 135–155.
[2006] Clegg R.G., A practical guide to measuring the Hurst parameter. International Journal of Simulation: Systems,
Science and Technology, 7, (2), pp 3–14.
[1999] Clemen R., Reilly T., Correlations and copulas for decision and risk analysis. Management Science, 45, pp
208–224.
[1999] Coles S., Hefferman J., Tawn J.A., Dependence measure for extreme value analyses. Extremes, 2, pp 339–365.
[2002] Cont R., da Fonseca J., Dynamics of implied volatility surfaces. Quantitative Finance, 2, pp 45–60.
310
[2003] Cont R., Tankov P., Financial modelling with jump processes. Chapman & Hall, CRC Press.
[2005] Cont R., Model uncertainty and its impact on the pricing of derivative instruments. CMAP, Ecole Polytech-
nique, F-91128 Palaiseau, France.
[2008] Cont R., Deguest R., Extracting implied correlation matrices from index option prices: A statistical approach.
finance-innovation.org.
[1996] Corrado C., Su T., Skewness and kurtosis in S&P 500 index returns implied by option prices. Journal of
Financial Research, 19, (2), pp 175–192.
[1996b] Corrado C.J., Miller Jr. T.W., A note on a simple, accurate formula to compute implied standard deviations.
Journal of Banking and Finance, 20, pp 595–603.
[1996c] Corrado C., Su T., S&P 500 index option tests of Jarrow and Rudd’s approximate option valuation formula.
Journal of Futures Markets, 16, (6), pp 611–629.
[1997] Corrado C.J., Su T., Implied volatility skews and stock index skewness and kurtosis implied by S&P 500
index option prices. Journal of Derivatives, 4, pp 8–19.
[2007] Corrado C., The hidden martingale restriction in Gram-Charlier option prices. Journal of Futures Markets, 27,
(6), pp 517–534.
[2003] Costa R.L., Vasconcelos G.L., Long-range correlations and nonstationarity in the Brazilian stock market.
Physica A, 329, pp 231–248.
[1976] Cox J.C., Ross S., The valuation of options for alternative stochastic processes. Journal of Financial Economics,
3 (January-March), pp 145–166.
[1979] Cox J.C., Ross S.A., Rubinstein M., Option pricing: A simplified approach. Journal of Financial Economics, 7,
(3), pp 229–263.
[2005] Cox A., Hobson D., Local martingales, bubbles and option prices. Finance and Stochastics, 9, (4), pp 477–492.
[1957] Cramer H., Mathematical methods of statistics. Princeton University Press, Princeton.
[2004] Crepey S., Delta-hedging vega risk? Quantitative Finance, 4, pp 559–579.
[2011] Cui C., Frank D., Equity implied volatility surface. Quantitative Research and Development, Equities Team,
Bloomberg.
[1988] Curtis C.E., Carriker G.L., Estimating implied volatility directly from "nearest-to-the-money" commodity
option premiums. Working Paper 081588, Clemson University.
[2008] Czarnecki L., Grech D., Pamula G., Comparison study of global and local approches describing critical
phenomena on the Polish stock exchange market. Physica A: Statistical Mechanics and its Applications, 387, (29), pp
6601–6811.
[2001] Dacorogna M.M., Gencay R., Muller U.A., Olsen R.B., Pictet O.V., An introduction to high frequency
finance. Academic Press, San Diego, CA.
[2006] Daglish T., Hull J., Suo W., Volatility surfaces: Theory, rules of thumb and empirical evidence. Working
Paper, August.
[1995] Davis R.A., Hsing T., Point process and partial sum convergence for weakly dependent random variables
with infinite variance. Ann. Prob., 23, pp 879–917.
311
[2007] Davis M.H.A., Hobson D.G., The range of traded option prices. Mathematical Finance, 17, (1), pp 1–14.
[2002] Deelstra G., Liinev J., Vanmaele M., Option pricing: A simplified approach. Working Paper, Ghent Univer-
sity.
[2014] Delanoe P., Local correlation with local vol and stochastic vol. Working Paper, HSBC Equity Derivatives.
[2002] Delbaen F., Shirakawa H., An interest rate model with upper and lower bounds. Asia-Pacific Financial Markets,
9, pp 191–209.
[2004] Delbaen F., Schachermayer W., What is a free lunch? Notices of the AMS, 51, (5), pp 526–528.
[2008] Deng Q., Volatility dispersion trading. Working Paper, University of Illinois at Urbana-Champaign.
[1994] Derman E., Kani I., The volatility smile and its implied tree. Goldman Sachs Quantitative Strategies Reports.
[1999] Derman E., Regimes of volatility. Risk, 7, (2).
[2008] Derman E., Lecture Notes 1–12.
[1993] Ding Z., Granger C.W.J., Engle R.F., A long memory property of stock market returns and a new model.
Journal of the Empirical Finance, 1, pp 83–106.
[2005] Driessen J., Maenhout P., Vilkov G., Option-implied correlations and the price of correlation risk. University
of Amsterdam and INSEAD. Amsterdam and Fontainbleau.
[1996] Duffie D., Kan R., A yield factor model of interest rates. Mathematical Finance, 6, (4), pp 379–406.
[1997] Duffie D., Pan J., An overview of Value at Risk. Journal of Derivatives, pp 7-49.
[2000] Duffie D., Pan J., Singleton K., Transform analysis and asset pricing for affine jump-diffusions. Econometrica,
68, (6), pp 1343–1376.
[2001] Dufresne D., The integrated square-root process. Research Paper Number 90, University of Montreal.
[1994] Dupire B., Pricing with a smile. Risk, 7, pp 18–20.
[1996] Dupire B., A unified theory of volatility. Working Paper, January.
[2003] Dupire B., New Advances in Volatility Modelling and Trading. Risk Europe 2003, Paris.
[2003] Durrleman V., A note on initial volatility surface. Working Paper.
[2006] Ekstrom E., Tysk J., Convexity preserving jump-diffusion models for option pricing. Journal of Mathematical
Analysis and Applications, 330, pp 715–728.
[1989] El Karoui N., Rochet JC., A pricing formula for options on coupon-bonds. Cahier 8925, October.
[1998] El Karoui N., Jeanblanc M., Shreve S., Robustness of the BS formula. Mathematical Finance, 8, pp 93-126.
[1998] Embrechts P., Resnick S., Samorodnitsky G., Living on the edge. Risk Magazine, 11, (1), pp 96-100.
[1999] Embrechts P., Resnick S., Samorodnitsky G., Extreme value theory as a risk management tool. North American
Actuarial Journal, 26, pp 30-41.
[1982] Engle R.F., Autoregressive conditional heteroskedasticity with estimates of the variance of U.K. inflation.
Econometrica, 50, pp 987–1008.
[1965a] Fama E.F., The behavior of stock-market prices. Journal of Business, 38, (1), pp 34–105.
312
[1970] Fama E.F., Efficient capital markets: A review of theory and empirical work. Journal of Finance, 25, (2), pp
383–417.
[2005] Fengler M.R., Semiparametric modeling of implied volatility, Springer Finance, Springer-Verlag Berlin Hei-
delberg.
[2009] Fengler M.R., Herwartz H., Menn C., Werner C., Explaining the index skew via stochastic correlation
models. Technical report, EEA-ESEM.
[2009] Fengler M.R., Arbitrage-free smoothing of the implied volatility surface. Quantitative Finance, 9, (4), pp. 417–
428.
[2005] Ferson S., Nelsen R., Hajagos J., Berleant D., Zhang J., Tucker W.T., Ginzburg L., Oberkampf W.L.,
Myths about correlations and dependencies and their implications for risk analysis. Working Paper.
[1985] Figlewski S., Arbitrage-based pricing of stock index options. Working Paper, New York University.
[1989] Figlewski S., What does an option pricing model tell us about option prices? Financial Analysts Journal, 45, pp
12–15.
[2002] Flamouris D., Giamouridis D., Estimating implied pdfs from American options on futures: A new semipara-
metric approach. Journal of Futures Markets, 22, (1), pp. 1–30.
[1990] Follmer H., Schweizer M., Hedging of contingent claims under incomplete information. In M.H.A. Davis
and R.J. Elliott, editors, Applied Stochastic Analysis, pp 389–414, Gordon and Breach, London.
[2004] Follmer H., Schied A., Stochastic finance: An introduction in discrete time. second revised and extended ed.,
Walter de Gruyter & Co., Berlin.
[2003] Fouque J.P., Papanicolaou G., Sircar R., Solna K., Multiscale stochastic volatility asymptotics. SIAM J.
Multiscale Modeling and Simulation, 2, (1), pp 22–42.
[2005] Garel B., d’Estampes L., Tjostheim D., Revealing some unexpected dependence properties of linear combi-
nations of stable random variables using symmetric covariations. Communications in Statistics, Theory and Methods,
33, (4), pp 769–786.
[2009] Garleanu N., Pedersen L.H., Poteshman A.M., Demand-based option pricing. The Review of Financial Studies,
22, (10), pp 4259–4299.
[1999] Gatheral J., The volatility skew: Arbitrage constraints and asymptotic behaviour. Merrill Lynch.
[2004] Gatheral J., A parsimonious arbitrage-free implied volatility parameterization with application to the valuation
of volatility derivatives. Presentation at Global Derivatives.
[2006] Gatheral J., The volatility surface: A practitioner’s guide, John Wiley & Sons, Hoboken, New Jersey.
[2012] Gatheral J., Jacquier A., Arbitrage-free SVI volatility surfaces. Working Paper, Baruch College, CUNY, and
Imperial College, London.
[1809] Gauss C.F., Theoria motvs corporvm coelestivm in sectionibvs conicis Solem ambientivm.
313
[1989] Geman H., The importance of the forward neutral probability in a stochastic approach of interest rates. Work-
ing paper, 7, Essec.
[1995] Geman H., El Karoui N., Rochet J., Changes of numeraire, change of probability measure and option pricing.
Journal of Applied Probability, 32, pp 443–458.
[2004] Genz A., Numerical computation of rectangular bivariate and trivariate normal and t probabilities. Statistics
and Computing, 14, pp 151–160.
[2012] Gerhold S., Can there be an explicit formula for implied volatility? Working Paper, Id=2179924.
[1979] Geske R., A mote on an analytical formula for unprotected American call options on stocks with known
dividends. Journal of Financial Economics, 7, pp 375–380.
[1943] Gnedenko B.V., Sur la distribution limite du terme maximum d’une serie aleatoire. Annals of Mathematics, 44,
pp 423–453.
[1954] Gnedenko B.V., Kolmogorov A.N., Limit distributions for sums of independent random variables. Addison-
Wesley, Cambridge.
[2004] Granger C., Hyng N., Occasional structural breaks and long memory with an application to the SP500
absolute stock returns. Journal of Empirical Finance, 11, pp 399–421.
[2005] Grech D., Mazur Z., Statistical properties of old and new techniques in detrended analysis of time series. Acta
Physica Polonica B, 36, (8), pp 2403–2413.
[1992] Grimmett G.R., Stirzaker D.R., Probability and random processes. Oxford Science Publications, Second
Edition.
[2010] Gu G-F., Zhou W-X., Detrending moving average algorithm for multifractals. Physical Review E, 82, 011136,
pp 1–12.
[2012] Guo G., Jacquier A., Martini C. and Neufcourt L., Generalised arbitrage-free SVI volatility surfaces.
Working Paper.
[2011] Gurrieri S., A class of term structures for SVI implied volatility. Working Papaer, Risk Management Depart-
ment, Mizuho Securities, Tokyo.
[2002] Gushchin A.A., Mordecki E., Bounds on option prices for semimartingale market models. Proc. Steklov Inst.
Math., 237, pp 73–113.
[1986] Gyongy I., Mimicking the one-dimensional marginal distributions of processes having an Ito differential.
Probability Theory and Related Fields, 71, pp 501–516.
[1999] Hagan P.S., Woodward D.E., Equivalent Black volatilities. App. Math. Finance, 6, pp 147–157.
[2002] Hagan P.S., Kumar D., Lesniewski A.S., Woodward D.E., Managing smile risk. Wilmott Magazine, pp 84–108.
[1979] Harrison J.M., Kreps D., Martingale and arbitrage in multiperiods securities markets. Journal of Economic
Theory, 20, pp 381–408.
[1968] Hart J., Computer approximations. Wiley, Algorithm 5666 for the error function.
[2003] Haug E.G., Haug J., Lewis A., Back to basics: A new approach to the discrete dividend problem. Wilmott
Magazine, pp 37–47.
[2009] Haugh M., Black-Scholes and the volatility surface. Financial Engineering: Continuous-Time Models, pp 1–17.
314
[1993] Heston S., A closed-form solution for options with stochastic volatility with applications to bond and currency
options. Review of Financial Studies, 6, pp 327–343.
[1975] Hill B.M., A simple general approach to inference about the tail of a distribution. Annals of Statistics, 3, pp
1163–1174.
[2004] Ho D-S., Lee C-K., Wang C-C., Chuang M., Scaling characteristics in the Taiwan stock market. Physica A,
332, pp 448–460.
[1999] Hoeting J.A., Madigan D., Raftery A.E., Volinsky C.T., Bayesian model averaging: A tutorial. Statistical
Science, 14, (4), pp 382–417.
[1987] Hull J., White A., The pricing of options on assets with stochastic volatilities. Journal of Finance, 42, (2), pp
281–300.
[1993] Hull J., White A., Efficient procedures for valuing european and american pathdependent options. The Journal
of Derivatives, 1, (1), pp 21–31.
[2012] Hull J.C., Options, futures and other derivatives. Prentice & Hall, Pearson Education, Eight Edition.
[1951] Hurst H.E., Long-term storage capacity of reservoirs. Trans. Amer. Soc. Civil Engineers, 116, pp 770–799.
[1990] Hutchinson T.P., Lai C.D., Continuous bivariate distributions, emphasizing applications. Rumsby Scientific
Publishing, Adelaide 5000, Australia.
[2011] Hutchinson D., Relative value concepts in volatility. CBOE Risk Management Conference, Dana Point, CA.
[2007] Isengildina-Massa O., Curtis Jr. C.E., Bridges Jr. W.C., Nian M., Accuracy of implied volatility approxima-
tions using "nearest-to-the-money" option premiums. Southern Agricultural Economics Association Meetings,
Mobile, AL, February.
[1998] Ingersoll J.E., Approximating american options and other financial contracts using barrier derivatives. Journal
of Computational Finance, 2 (Fall), pp 85–112.
[2000] Ingersoll J.E., Digital contracts: Simple tools for pricing complex derivatives. The Journal of Business, 73, (1),
pp 67–88.
[2004] Jacod J., Protter P., Probability essentials. Springer Second Edition, Universitext.
[2007] Jacquier A., Slaoui S., Variance dispersion and correlation swaps. School of Economics, Mathematics and
Statistics, Birkbeck University of London.
[1993] Jakubowski A., Minimal conditions in p-stable limit theorems. Stoch. Proc. Appl., 44, pp 291–327.
[1997] Jakubowski A., Minimal conditions in p-stable limit theorems II. Stoch. Proc. Appl., 68, pp 1–20.
[1991] Jansen D.W., de Vries C.G., On the frequency of large stock market returns: Putting booms and busts into
perspective. Review of Economics and Statistics, 73, (1), pp 18–24.
315
[1982] Jarrow R.A., Rudd A., Approximate option valuation for arbitrary stochastic processes. Journal of Financial
Economics, 10, (3), pp 347–369.
[1983] Jarrow R.A., Rudd A., Option pricing. Homewood, Illinois, Richard Irwin, pp 183–188.
[1999] Jex M., Henderson R., Wang D., Pricing exotics under the smile. Risk, November, pp 72–75.
[2012] Jizba P., Korbel J., Methods and techniques for multifractal spectrum estimation in financial time series.
FNSPE, Czech Technical University, Prague.
[2001] Jondeau E., Rockinger M., Gram-Charlier densities. Journal of Economic Dynamics and Control, 25, pp 1457–
1483.
[2002] Jones C.S., Wang T., The term structure of equity option implied volatility. Working Paper
[2004] Jones M.C., Families of distributions arising from distributions of order statistics. Test, 13, pp 143.
[2009] Jones M.C., Kumaraswamy’s distribution: A beta-type distribution with some tractability advantages. Statisti-
cal Methodology, 6, (1), pp 70–81.
[1998] Ju N., Pricing American option by approximating its early exercise boundary as a multipiece exponential
function. Review of Financial Studies, 11, (3), pp 627–646.
[1999] Ju N., Zhong R., An approximate formula for pricing American options. Journal of Derivatives, 7, (2), pp
31–40.
[2000] Kaas R., Dhaene J., Goovaerts M.J., Upper and lower bounds for sums of random variables. Insurance:
Mathematics and Economics, 27, pp 151–168.
[2004] Kahalé N., An arbitrage-free interpolation of volatilities. Risk, 17, (5), pp. 102–106.
[2002] Kantelhardt J., Zschiegner S., Koscielny-Bunde E., Bunde A., Havlin S., Stanley H.E., Multifractal
detrended fluctuation analysis of nonstationary time series. Physica A, 316, (1-4), pp 87–114.
[1988] Karatzas I., On the pricing of American options. Appl. Math. Optim, 17, pp 37–60.
[2004] Kim S-J., Choi J-S., Multifractal measures for the yen-dollar exchange rate. Journal of the Korean Physical
Society, 44, (3), pp 643–646.
[1995] Kirk E., Correlations in the energy markets. Managing energy price risk. Risk Publications and Enron, London,
pp 71–78.
[1980] Koutrouvelis I.A., Regression-type estimation of the parameters of stable laws. Journal of the American Statistical
Association, 75, pp 918–928.
[2013] Kristoufek L., Vosvrda M., Measuring capital market efficiency: Global and local correlations structure.
Physica A, 392, (1), pp 184–193.
[2003] Kruse S., On the pricing of forward starting options under stochastic volatility. Institut Techno und
Wirtschaftsmathematik.
[1980] Kumaraswamy P., A generalized probability density function for double-bounded random processes. Journal
of Hydrology, 46, (1-2), pp 79–88.
[1967] Kunita H., Watanabe S., On square integrable martingales. Nagoya Math. Journal, 30, pp 209–245.
[1997] Kwiatkowski J., Current issues in Value at Risk for portfolio of derivatives. Global Risk Management, BWZ.
316
[2009] Ladokhin S., Volatility modeling in financial markets. Master Thesis, VU University Amsterdam.
[2009] Langnau A., Introduction into local correlation modelling. ArXiv Working Paper 090 : 3441v3.
[1976] Latane H., Rendleman R., Standard deviations of stock price ratios implied in option prices. The Journal of
Finance, 31, pp 369–381.
[1991] L’Ecuyer P., Tables of maximality-equidistributed combined LFSR generators. Working Paper, DIRO, Uni-
versite de Montreal.
[1996b] L’Ecuyer P., Maximally equidistributed combined Tausworthe generators. Mathematics of Computation, 65,
(213), pp 203–213.
[1998] L’Ecuyer P., Good parameters and implementations for combined multiple recursive random number genera-
tors. Working Paper, DIRO, Universite de Montreal, May 4.
[1999b] L’Ecuyer P., Tables of maximally-equidistributed combined LFSR generators. Mathematics of Computation,
68, (225), pp 261–269.
[2000b] L’Ecuyer P., Touzin R., Fast combined multiple recursive generators with multipliers of the form a =
±2q ± 2r . in J.A. Joines, R.R. Barton, K. Kang, P.A. Fishwick, eds, Proceedings of the 2000 Winter Simulation
Conference, pp 683–689.
[2007] L’Ecuyer P., Pseudorandom number generators. Working Paper, DIRO, Universite de Montreal, August 28.
[2001] Lee R.W., Implied and local volatilities under stochastic volatility. International Journal of Theoretical and Applied
Finance, 4, (1), pp 1178–1192.
[2004] Lee R.W., The moment formula for implied volatility at extreme strikes. Mathematical Finance, 14, (3), pp
469–480.
[2005] Lee R.W., Implied volatility: Statistics, dynamics, and probabilistic interpretation. Recent advances in applied
probability, Springer, New York, pp 241–268.
[2000] Li D., On default correlation: A copula function approach. Journal of Fixed Income, 9, (4), pp 43–54.
[2009] Liang S., Tahara Y., A formula to compute implied volatility with error estimate. Interdisciplinary Iinformation
Sciences, 15, No. 2, pp 267–272.
[2010] Lisauskas J., Dispersion trading in German option market. Master Thesis, Tilburg University.
[1991] Lo A.W., Long-term memory in stock market prices. Econometrica, 59, (5), pp 1279–1313.
[2005] London J., Modeling derivatives in C++. John Wiley & Sons, Hoboken, New Jersey.
317
[1996] Longin F., The asymptotic distribution of extreme stock market returns. Journal of Business, 69, pp 383–408.
[1994] Loretan M., Phillips P., Testing the covariance stationarity of heavy-tailed time series. Journal of Empirical
Finance, 1, pp 211–248.
[2005] Lozovaia T., Hizhniakova H., How to extend modern portfolio theory to make money from trading equity
options. EGAR Technology.
[1996] Lux T., The stable Paretian hypothesis and the frequency of large returns: An examination of major German
stocks. Applied Economics Letters, 6, pp 463–475.
[2012] Lye C-T., Hooy C-W., Multifractality and efficiency: Evidence from Malaysian sectoral indices. Int. Journal
of Economics and Management, 6, (2), pp 278–294.
[2009a] Ma J., Pricing foreign equity options with stochastic correlation and volatility. Annals of Economics and Fi-
nance, 10, (2), pp 303–327.
[2009b] Ma J., A stochastic correlation model with mean reversion for pricing multi-asset options. Asia-pacific Finan-
cial Markets, 66, pp 97–109.
[1993] Maheswaran S., Sims C., Empirical implications of arbitrage-free asset markets. in P.C.B. Phillips, ed.,
Models, Methods and Applications of Econometrics, Cambridge, Basil Blackwell, pp 301–316.
[1997] Malz A., Estimating the probability distribution of the future exchange rate from option prices. Journal of
Derivatives, Winter, pp 18–36.
[1982] Manaster S., Koehler G., The calculation of implied variances from the Black-Scholes-Merton model: A
note. The Journal of Finance, 37, pp 227–230.
[1963] Mandelbrot B.B., The variation of certain speculative prices. Journal of Business, 36, (4), pp 394–419.
[1965] Mandelbrot B.B., Une classe de processus stochastiques homothetiques a soi: Application a loi climatologique
de H.E. Hurst. Comptes Rendus Academic Sciences Paris, 240, pp 3274–3277.
[1968] Mandelbrot B.B., van Ness J., Fractional Brownian motions, fractional noises and applications. SIAM Review,
10, (4), pp 422–437.
[1968b] Mandelbrot B.B., Wallis J., Noah, Joseph and operational hydrology. Water Resources Research, 4, pp 909–
918.
[1969a] Mandelbrot B.B., Wallis J., Computer experiments with fractional Gaussian noises: Part 1, averages and
variances. Water Resources Research 5.
[1969b] Mandelbrot B.B., Wallis J., Computer experiments with fractional Gaussian noises: Part 2, rescaled ranges
and spectra. Water Resources Research 5.
[1974] Mandelbrot B.B., When can price be arbitraged efficiently? A limit to the validity of the random walk and
martingale models. Re. Econom. Statist., 53, pp 225–236.
[1975] Mandelbrot B.B., Les objets fractals: forme, hasard et dimension. Paris, Flammarion.
[1979] Mandelbrot B.B., Taqqu M., Robust R/S analysis of long-run serial correlation. in Proceedings of the 42nd
Session of the International Statistical Institute, , pp 69–104, Manila, Bulletin of the I.S.I..
[1997] Mandelbrot B.B., Fisher A., Calvet L., A multifractal model of asset returns. Cowles Foundation Discussion
Paper No. 1164.
318
[2000] Mantegna R.N., Stanley H.E., An introduction to econophysics: Correlation and complexity in finance.
Cambridge University Press, Cambridge.
[1978] Margrabe W., The value of an option to exchange one asset for another. Journal of Finance, 33, pp 177–186.
[1952] Markowitz H.M., Portfolio selection. Journal of Finance, 7, (1), pp 77–91.
[1999] Marris D., Financial option pricing and skewed volatility. MPhil Thesis, University of Cambridge.
[2008] Marshall C.M., Dispersion trading: Empirical evidence from US options markets. Global Finance Journal, (20),
pp 289–301.
[2003] Matia K., Ashkenazy Y., Stanley H.E., Multifractal properties of price fluctuations of stocks and commodi-
ties. Europhysics Letters, 61, (3), pp 422–428.
[2004] Matos J.A.O., Gama S.M.A., Ruskin H., Duarte J., An econophysics approach to the Portuguese stock
index-psi-20. Physica A, 342, (3-4), pp 665–676.
[2008] Matos J.A.O., Gama S.M.A., Ruskin H.J., Al Sharkasi A., Crane M., Time and scale Hurst exponent
analysis for financial markets. Physica A, 387, pp 3910–3915.
[1996] McCulloch J.H., Financial applications of stable distributions. in G.S. Maddala, C.R. Rao, eds., Handbook of
Statistics, 14, Elsevier, pp 393–425.
[1997] McCulloch J.H., Measuring tail thickness to estimate the stable index α: A critique. Journal of Business &
Economic Statistics, 15, (1), pp 74–81.
[1984] McDonald J.B., Some generalized functions for the size distribution of income. Econometrica, 52, pp 647–663.
[1997] Melick W., Thomas C., Recovering an asset’s implied pdf from option prices: An application to crude oil
during the Gulf war. Journal of Financial and Quantitative Analysis, 32, (1), pp 91–115.
[1973] Merton R.C., Theory of rational option pricing. Bell J. of Econ. and Management Sci., 4, (1), pp 141–183.
[1976] Merton R.C., Option pricing when underlying stock returns are discontinuous. Journal of Financial Economics,
3, pp 125-144.
[2003] Mikosch T., Starica C., Long-range dependence effects and ARCH modeling. in Theory and Applications of
Long-range Dependence, Birkhauser Boston, Boston, pp 439–459.
[1998] Milevsky M., Posner S., A closed-form approximation for valuing basket options. The Journal of Derivatives,
5, (4), pp 54–61.
[1999] Mittnik S., Doganoglu T., Chenyao D., Computing the probability density function of the stable Paretian
distribution. Mathematical and Computer Modelling, 29, pp 235–240.
[1964] Moran P., On the range of cumulative sums. Annals of the Institute of Statistical Mathematics, 16, pp 109-112.
[2003] Moskowitz T., An analysis of covariance risk and pricing anomalies. Review of Financial Studies, 16, pp 417-457.
[2006] Moyano L.G., de Souza J., Duarte Queiros S.M., Multi-fractal structure of traded volume in financial
markets. Physica A, 371, pp 118–121.
[2005] Muck M., On the similarity between displaced diffusion and constant elasticity of variance market models of
the term structure. WHU - Otto Beisheim Graduate School of Management.
[2010] Nelsen R.B., An introduction to Copulas. Springer.
319
[2003] Nielsen J.A., Sandmann K., Pricing bounds on Asian options. Journal of Financial and Quantitative Analysis, 38,
(2), pp 449–473.
[2013] Niere H.M., A multifractality measure of stock market efficiency in Asean region. European Journal of Business
and Management, 5, (22), pp 13–19.
[2001] Nolan J.P., Maximum likelihood estimation and diagnostics for stable distributions. in O.E. Barndorff-Nielsen,
T. Mikosch, S. Resnick, eds., Levy Processes, Brikhauser, Boston.
[2014] Nolan J.P., Stable distributions: Models for heavy-tailed data. Math/Stat Department, American University.
[2005] Norouzzadeh P., Jafari G.R., Application of multifractal measures to Tehran price index. Physica A, 356, pp
609–627.
[2006] Norouzzadeh P., Rahmani B., A multifractal detrended fluctuation description of Iranian rial-US dollar ex-
change rate. Physica A, 367, pp 328–336.
[2012] Oh G., A multifractal analysis of Asian foreign exchange markets. European Physical Journal B, pp 85–214.
[1998] Oksendal V, Stochastic differential equations. Springer Fifth Edition.
[2006] Ouyang Z., Stable process. Working Notes.
[2002] Overhaus M., Ferraris A., Knudsen T., Milward R., Nguyen-Ngoc L., Schindlmayr G., Equity derivatives.
Theory and Application John Wiley & Sons, New York.
[1995] Page M., Feng E., Measuring the risk of a portfolio of options on several exchange rates. Working Paper,
Susquehanna Investment Group, Philadelphia.
[2006] Panneton F., L’Ecuyer P., Matsumoto M., Improved long-period generators based on linear recurrences
modulo 2. ACM Transactions on Mathematical Software, 32, (1), pp 1–16.
[2000] Pasquini M., Serva M., Clustering of volatility as a multiscale phenomenon. European Physical Journal B, 16,
(1), pp 195–201.
[1994] Peng C.K., Buldyrev S.V., Havlin S., Simons M., Stanley H.E., Goldberger A.L., Mosaic organization of
DNA nucleotides. Physical Review E, 49, (2), pp 1685–1689.
[1994] Peters E.E., Fractal market analysis: Applying chaos theory to investment and economics. John Wiley &
Sons.
[2005] Piterbarg V., A multi-currency model with FX volatility skew. Working paper, February.
[2003] Poon S., Rockinger M., Tawn J.A., Modelling extreme-value dependence in international stock markets.
Statistica Sinica, 13, pp 929–953.
[2004] Poon S., Rockinger M., Tawn J.A., Extreme value dependence in financial markets: Diagnostic, models and
financial implications. Review of Financial Studies, 17, pp 581–610.
[1992] Press W.H., Teukolsky S.A., Vetterling W.T., Flannery B.P., Numerical recipes. 2nd ed. Cambridge, Cam-
bridge University Press.
[2002] Rebonato R., Assigning future smile surfaces: Conditions for uniqueness and abscence of arbitrage. Working
Paper
[2004] Rebonato R., Cardoso M.T., Unconstrained fitting of implied volatility surfaces using a mixture of normals,
Working Paper, QUARC and Oxford University, July 6.
320
[2000] Resnick S., Van Den Berg E., Sample correlation behavior for the heavy tailed general bilinear process.
Comm. Statist. Stochastic Models, 16, pp 233–258.
[1984] Rodgers J.L., Nicewander W.A., Toothaker L., Linearly independent, orthogonal, and uncorrelated variables.
The American Statistician, 38, (2), pp 133–134.
[1995] Rogers L.C.G., Shi Z., The value of an Asian option. Journal of Applied Probability, 32, pp 1077–1088.
[2010] Rogers L.C.G., Tehranchi M.R., Can the implied volatility surface move by parallel shifts? Finance and
Stochastics, 14, (2), pp 235–248.
[2010] Roper M., Arbitrage free implied volatility surfaces. Working Papaer, School of Mathematics and Statistics,
The University of Sydney, Australia.
[1998] Rosenberg J.V., Pricing multivariate contingent claims using estimated risk-neutral density functions. Journal
of International Money and Finance, 17, pp 229–247.
[2003] Rosenberg J.V., Nonparametric pricing of multivariate contingent claims. Federal Reserve Bank of New York Staff
Reports, 162.
[2001] Routledge B.R., Zin S., Model uncertainty and liquidity. NBER Working Paper 8683.
[1983] Rubinstein M., Displaced diffusion option pricing. Journal of Finance, 38, (1), pp 213–217.
[1994] Rubinstein M., Implied binomial trees. Journal of Finance, 49, (3), pp 771–818.
[1998] Rubinstein M., Edgeworth binomial trees. Journal of Derivatives, 5, (3), pp 20–27.
[1994] Samorodnitsky G., Taqqu M.S., Stable non-Gaussian random processes. Stochastic Modeling, Chapman &
Hall, New York-London.
[1960] Samuelson P.A., Efficient paths of capital accumulation in terms of the calculus of variations. in K.J. Arrow,
S. Karlin, P. Suppes Mathematical Models in the Social Sciences, 1959 Proceedings of the first Stanford Symposium,
Stanford University Press, pp 77–88.
[1965] Samuelson P.A., Proof that properly anticipated prices fluctuate randomly. Industrial Management Review, 6, pp
41–50.
[1965a] Samuelson P.A., Rational theory of warrant pricing. Indus. Management Rev., 6, pp 13–31. Reprinted in Cootner
(1967), pp 506–532.
[1969] Samuelson P.A., Merton R.C., A complete model of warrant pricing that maximizes utility. Industrial Manage-
ment Review, pp 17–46.
[2010] Schlogl E., Option pricing where the underlying assets follow a Gram/Charlier density of arbitrary order.
Working Paper, School of Finance and Economics, University of Technology, Sydney.
321
[1999] Schmitt F., Schertzer D., Lovejoy S., Multifractal analysis of foreign exchange data. Applied Stochastic Models
and Data Analysis, 15, pp 29–53.
[1978] Schonfelder J.L., Chebyshev expansions for the error and related functions. Mathematics of Computation, 32,
No. 144, pp 1232–1240.
[1989] Schroder M., Computing the constant elasticity of variance option pricing formula. Journal of Finance, 44, pp
211–219.
[1970] Sharpe W.F., Portfolio theory and capital markets. New York: McGraw-Hill.
[2003] Shreve S., Stochastic calculus for finance I. The binomial asset pricing model. Springer Finance.
[2004] Shreve S.E., Stochastic calculus for finance II, Continuous-time models. Springer Finance, Textbook, NY,
USA.
[1955] Simon H.A., On a Class of Skew Distribution Functions. Biometrika, 42, (3/4), pp 425–440.
[2000] Simon S., Goovaerts M.J., Dhaene J., An easy computable upper bound for the price of an arithmetic Asian
option. Insurance: Mathematics and Economics, 26, (2-3), pp 175–184.
[2003] Simpson J., Currency options and volatility. Goldman Sachs Working Paper, Chapter 11, Siegel M.J. at Stern
NYU.
[1959] Sklar A., Fonctions de repartition a n dimensions et leurs marges. Publications de l’Institut de Statistiques de
l’Universite de Paris 8, pp 229–231.
[1961] Sprenkle C., Warrant prices as indications of expectations. Yale Econ. Essays 1, pp 179–232. Reprinted in
Cootner (1967), pp 412–474.
[2008] Staum J., Incomplete Markets. Handbooks in OR & MS, Vol. 15: Financial Engineering, ed. J. R. Birge and
V. Linetsky, Elsevier, pp 511–563.
[1991] Stein E.M., Stein J.C., Stock price distribution with stochastic volatility: An analytic approach. The Review of
Financial Studies, 4, (4), pp 727–752.
[1989] Sterge A.J., On the distribution of financial futures price changes. Financial Analysts Journal, 45, (3), pp 75–78.
[1969] Stoll H.R., The relationship between put and call option prices. Journal of Finance, 24, pp 802–824.
[2014] Stosic D., Stosic D., Stosic T., Stanley H.E., Multifractal analysis of managed and independent float
exchange rates. Department of Physics, Boston University.
[1999] Struzik Z.R., Local effective Holder exponent estimation on the wavelet transform maxima tree. in Fractals:
Theory and Applications in Engineering, eds., M. Dekking, J. Levy Vehel, E. Lutton, C. Tricot, Springer Verlag, pp
93–112.
[2002] Struzik Z.R., Siebes A., Wavelet transform based multifractal formalism in outlier detection and localisation
for financial time series. Physica A, 309, pp 388–402.
[2007] Svoboda-Greenwood S., The displaced diffusion as an approximation of the CEV. Working paper, April.
[2000] Tankov P., GRM smile model. Rapport de stage de fin d’etude de l’X.
[2010] Tankov P., Touzi N., Calcul stochastique en finance. Ecole Polytechnique Paris, Departement de Mathema-
tiques Appliquees.
322
[1995] Taqqu M.S., Teverovsky V., Willinger W., Estimators for long-range dependence: An empirical study.
Fractals, 3, (4), pp 785–798.
[2009] Tehranchi M.R., Asymptotics of implied volatility far from maturity. Journal of Applied Probability, 46, (3), pp
629–650.
[1967] Thorp E.O., Kassouf S.T., Beat the market. New York: Random House.
[2010] Tian Y.S., Extracting risk-neutral density and its moments from American option prices. Schulich School of
Business, York University, Toronto, Canada.
[1991] Trigeorgis L., A log-transformed binomial numerical analysis method for valuing complex multi-option in-
vestments. Journal of Financial and Quantitative Analysis, 26, pp 309–326.
[2006] Turiel A., Perez-Vicente C.J., Grazzini J., Numerical methods for the estimation of multifractal singularity
spectra on sampled data: A comparative study. Journal of Computational Physics, 216, pp 362–390.
[1999] Uchaikin V.V., Zolotarev V.M., Chance and stability: Stable distributions and their applications. Modern
Probability and Statistics, De Gruyter, The Netherlands.
[1997] Vandewalle N., Ausloos M., Coherent and random sequences in financial fluctuations. Physica A, 246, (3), pp
454–459.
[1998] Vandewalle N., Ausloos M., Crossing of two mobile averages: A method for measuring the robustness
exponent. Phys. Rev., 58, pp 177–188.
[1998b] Vandewalle N., Ausloos M., Multi-affine analysis of typical currency exchange rates. European Physical
Journal B., 4, (2), pp 257–261.
[1998c] Vandewalle N., Ausloos M., Boveroux PH., Detrended fluctuation analysis of the foreign exchange market.
Working Paper.
[2006] Vellekoop M.H., Nieuwenhuis J.W., Efficient pricing of derivatives on assets with discrete dividends. Applied
Mathematical Finance, 13, (3), pp 265–284.
[2004] Wang Y., Yin H., Qi L., No-arbitrage interpolation of the option price function and its reformation. Journal
of Optimization Theory and Applications, 120, (3), pp 627–649.
[2009] Wang Y., Liu L., Gu R., Analysis of efficiency for Shenzhen stock market based on multifractal detrended
fluctuation analysis. International Review of Financial Analysis, 18, pp 271–276.
[2011] Wang Y., Wei Y., Wu C., Analysis of the efficiency and multifractality of gold markets based on multifractal
detrended fluctuation analysis. Physica A: Statistical Mechanics and its Applications, 390, pp 817–827.
[2011b] Wang Y., Wu C., Pan Z., Multifractal detrending moving average analysis on the US Dollar exchange rates.
Physica A, 390, pp 3512–3523.
[1981] Whaley R.E., On the valuation of American call options on stocks with known dividends. Journal of Financial
Economics, 9, pp 207–211.
[2004] G. West, Better approximations to cumulative normal functions. Working Paper, University of the Witwater-
srand.
[1991] Williams D., Statistical laboratory. DPMMS, Cambridge University, Probability with Martingales, Cambridge
University Press.
323
[1993] Wilmott P., Option pricing: Mathematical models and computation. Oxford Financial Press, Oxford.
[2009] Xu S.J., Jin X.J., Predicting drastic drop in Chinese stock market with local Hurst exponent. International
Conference on Management Science and Engineering, pp 1309–1315.
[2009] Yuan Y., Zhuang X-T., Jin X., Measuring multifractality of stock price fluctuation using multifractal de-
trended fluctuation analysis. Physica A, 388, (11), pp 2189–2197.
[2009] Zeliade Systems, Quasi-explicit calibration of Gatheral’s SVI model. Zeliade White Papaer, Zeliade Systems,
Paris, France.
[2015] Zheng H., Avanced methods in derivatives pricing. MSc Mathematics and Finance, Imperial College London.
[2007] Zhu J., An extended libor market model with nested stochastic volatility dynamics. Working Paper, January
5.
[2007] Zunino L., Tabak B.M., Perez D.G., Garavaglia M., Rosso O.A., Inefficiency in Latin-American market
indices. The European Physical Journal B, 60, (1), pp 111–121.
[2008] Zunino L., Tabak B.M., Figlioa A., Perez D.G., Garavaglia M., Rosso O.A., A multifractal approach for
stock market inefficiency. Physica A, 387, pp 6558–6566.
324