00 Required Background
00 Required Background
Tamas Vadasz
Contents
1 Modelling financial assets 2
1.1 Asset returns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Compounding returns . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Returns as random variables . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Compounding variances . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.5 Covariance and correlation . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.6 The variance-covariance matrix . . . . . . . . . . . . . . . . . . . . . . . 8
1.7 (Linear) combination of random variables . . . . . . . . . . . . . . . . . 8
2 Econometrics 10
2.1 Sample moments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.2 Linear regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.3 Time series vs. cross-sectional regression . . . . . . . . . . . . . . . . . . 11
2.4 Hypothesis testing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.5 Multivariate regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3 Analysis 14
3.1 Quick reminder: derivatives . . . . . . . . . . . . . . . . . . . . . . . . . 14
3.2 Maximization problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1
1 Modelling financial assets
This section reviews the most important terms and concepts of modelling financial assets,
and the mathematical/statistical background for portfolio management.
Pt+1
1 + rt = ⇔ Pt+1 = Pt (1 + rt )
Pt
Pt+1
it = ln[ ] = ln Pt+1 − ln Pt ⇔ Pt+1 = Pt eit
Pt
The two returns i and r capture the same price evolution with the relationship i = ln[R].
🤔 You gain 10%, then lose 10% on your portfolio? How much do you
have at the end (i) if these numbers represent log-returns, (ii) if these
numbers represent simple net returns?
2
• Usual advice: use log-returns when you work with time-series models, use simple
returns when you work with cross-sectional data or portfolios of assets.
💭 In practice the difference is usually small, especially for small returns / short horizons. For
this course think always of ‘simple returns’, unless otherwise noted. In practical applications the
difference is important.
• Log-returns can take any value. Simple returns are between [−1, ∞].
❗ Notice that simple returns are logically inconsistent with the “usual” assertion that “returns
are normally distributed”. (Why? Normal distribution has unbounded support.) Formal
asset pricing theory therefore often assumes that log-returns are normally distributed.
ra = (1 + 0.001)252 − 1 = 28.6%
2. With log-returns:
i02 = i01 + i12
When the subsequent returns are the same over multiple periods it follows that
Pt = P0 eit
Usually we work with annualized returns. For example, if for two periods we have
√
r02 = 0.21, then the annualized two-year return is r2 = 1 + 0.21 − 1 = 0.1. The same
goes with log-returns, if i02 = 0.2 then the annualized log-return is i2 = 0.2
2
= 0.1. A
monthly simple return of 1% corresponds to an annualized rate of return of (1.0112 − 1),
while 1% monthly log-return corresponds to 12 × 1 = 12% annualized log-return. This
shows why it’s so much easier to work with log-returns in a time-series context.
1
Here let rtk denote the return between period t and k. When I write rt (just one subscript), I think
of the per period (i.e., annualized) return over a period of length t.
3
🤔 (The power of compounding): your great-grandfather has invested $100
in a portfolio of US stocks in 1930 which you inherit today. How much money
do you have today? (Make a guess before opening the link!)
a $1000 (i.e. 10 times the original investment)
b $10.000 (i.e. 100 times the original investment)
c $100.000 (i.e. 1000 times the original investment)
d $1.000.000 (i.e. 10 000 times the original investment)
Answer: see this graph. Also, read this classic story about exponential growth.
❗ We do not know the realization, but we do know the rules which generate those realizations,
that is, the return generating process. Sometimes it is useful to distinguish the random variable
r̃ from the actual realization ri .
i ∼ N (µ, σ 2 ) ⇔ ei ∼ LN (µ, σ 2 )
As we often say, investors care about return and risk. We proxy these with the
characteristics (first two moments) of the underlying return distribution. Namely,
T
1X
µ̂ = rt
T t=1
T
1 X
2
σ̂ = (rt − µ)2
T − 1 t=1
2
The great thing about normal distribution: it is completely characterized by two parameters, its mean
µ and its variance σ 2 (or standard deviation σ). In reality, returns are not normal, but nevertheless, this
is often a useful approximation.
4
What do we mean by investors care? This is the subject of the utility theory
(1st lecture). Under certain restrictive assumptions, one can derive a convenient, very
simple mean-variance utility function, which is often used in education as it leads to nice
tractable solutions.3
1
U (µ, σ; A) = µ − Aσ 2
2
A is a parameter which captures the attitudes towards risk.
• Can we really say that investors only care about return and risk? What about
social responsibility? What about ethics? What about behavioural biases?
Theory works only under very specific assumptions about human behaviour.
• Can we precisely estimate expected value and variance from historical data?
The underlying technical assumption (usually implicit) is called statistical stationarity.
A time series is stationary if its statistical properties (mean, variance, etc.) do not change
over time. This is generally not true! (Imagine how much COVID-19 has influenced the
statistical properties of the return distributions of stocks or the volatility of prices).
See the pictures below. Figure (a) shows that the S&P500 index moves fairly ran-
domly. However, looking at the daily realizations (plot b) reveals what is called ‘volatility
clustering’ - volatile days tend to follow volatile days, and calm days tend to follow calm
days. This is one example of non-stationarity. Part (c) illustrate the empirical distribu-
tion of returns. It looks random, but when the true random distribution (with the same
expected value and volatility) is superimposed, one can see the famous “fat tails” phe-
nomenon: extreme values are much more likely than what would follow from normality.
3
We will discuss in class what assumptions are needed for such mean-variance preferences to be
accurate.
5
S&P closing prices [2004−01−02/2018−08−28]
140 S&P returns 2004−01−02 / 2018−08−28
Last 99.739998
0.10 0.10
100
0.05 0.05
80
0.00 0.00
60
−0.05 −0.05
40 −0.10 −0.10
−0.15 −0.15
Jan 02 2004 Jul 03 2006 Jan 02 2009 Jul 01 2011 Jan 02 2014 Jul 01 2016 Jan 02 2004 Jan 03 2006 Jan 02 2008 Jan 04 2010 Jan 03 2012 Jan 02 2014 Jan 04 2016 Jan 02 2018
(a) S&P 500 price realizations. (b) S&P 500 return realizations.
S&P return histogram S&P return histogram
25
25
20
20
15
15
Density
Density
10
10
5
5
0
0
−0.15 −0.10 −0.05 0.00 0.05 0.10 0.15 0.20 −0.15 −0.10 −0.05 0.00 0.05 0.10 0.15 0.20
ret ret
Volatility is more complicated. We want to calculate the standard deviation of the sum of
random variables. There is a well-known statistical property of the normal distribution:
if X ∼ N (µX , σX2
) and Y ∼ N (µY , σY2 ) are independent, then
2
X + Y ∼ N (µX + µY ; σX + σY2 )
🤔 Our daily return series exhibit a volatility of 2%. What is the annual
volatility (N = 252) of this time series?
p
σ= N × 0.022 = 31.7%
❗ This is known as the square-root rule. The underlying assumption is that subsequent
return realizations are independent from each other. Such serial independence is a very
strong assumption (↔ efficient market hypothesis).
4
I use log-returns in this section to keep the math simple. This is not crucial.
6
1.5 Covariance and correlation
One rarely invests in just one asset5 . With more than one assets in your portfolio, you
care about the co-movement of those assets. This is the main subject of portfolio theory.
(The first expression is a definition for random variables. The second is estimation from a sample.)
– ρ = 1, perfectly correlated
– ρ = 0, uncorrelated (6= independent!!!)
– ρ = −1, perfectly negatively correlated
❗Pitfalls
• Can be very misleading if the two series’ have a common underlying trend.
5
I hope to convince you by the end of this course to never do that.
6
The rooster’s crow is highly correlated with sunrise, yet roosters likely do not cause the sun to rise.
7
In case of multivariate normal distribution it is actually true that ρ = 0 ⇒ independent).
7
1.6 The variance-covariance matrix
A generalization of covariance to N random variables is the variance-covariance ma-
trix (or just covariance matrix for brevity). It contains the covariance between every
pair of the random variables. Typically is denoted by Σ (large Sigma) .
and we want to know the properties of Z = αX + βY . From the properties of the normal
distribution, we can derive that
µz = αµx + βµy
σz2 = α2 σx2 + β 2 σy2 + 2αβσxy
Understanding matrix-algebra is not required for the exam, although sometimes I’m going to use it.
9
Financial assets are modelled as random variables. A portfolio is a linear combination.
10
Subadditivity of risk is probably the most important element of investment management. Without
this sub-additive property, there would be no Investment Management as a profession, no Finance as a
masters degree program, and you would likely be studying for a philosophy or an art degree.
8
the weights given in vector α := (α1 ...αN )0 , then
µz = α 0 µ =
X
α i µi
i
σz2 = α0 Σα =
XX
αi αj σij
i j
9
2 Econometrics
📖 An excellent book for econometrics with finance applications: Brooks, C. (2019).
Introductory econometrics for finance. Cambridge university press. (Figures below taken
from this book.)
• Bias: is the expected difference between the estimated value and the true parameter
to be estimated. For unbiased estimators this is zero (even in small samples).
An estimator can be consistent but biased, or can be unbiased but not consistent (rare).
Estimating mean and variance.
• The consistent and unbiased estimators for mean and variance are:
T
1X
µ̂ = rt (1)
T t=1
T
1 X
σ̂ 2 = (rt − µ)2 (2)
T − 1 t=1
• These estimators are also random variables. This means they are estimated with
uncertainty, the point estimate may or may not be the true value.
If I estimate the performance of my fund manager as a past average, I obtain a realization
of the sample estimate as a random variable. Is this skill or luck?
yi = α + βxi + εi
E[y|xi ] = α + βxi
10
or unconditionally:
E[y] = α + βE[xi ]
• The Gauss-Markov theorem: OLS is BLUE under the classical assumptions (best
linear unbiased estimator).
• Classical assumptions: (1) the relationship is linear; (2) observations are randomly
sampled; (3) zero conditional mean (E[e|x] = 0); (4) No multicollinearity; (5)
homoscedasticity, no autocorrelation; (6) [optional] the error term is normally dis-
tributed.
yt = α + βxt + εt
Cross-sectional regression:
yi = α + βxi + εi
• Example: regressing expected stock returns for a universe of stocks (one number for
each stock) on possible explanatory variables (firm characteristics, riskiness, etc.).
ε̂2i
P
s2 =
T −2
11
• Estimators of the standard errors (of the coefficient):
qP
x2t
s s
SE[α] = p P and SE[β] = pP
T (xt − x)2 (xt − x)2
α̂ − α β̂ − β
∼ tT2 and ∼ tT2
SE(α̂) SE(β̂)
β̂ − β
−tcrit ≤ ≤ +tcrit
SE(β̂)
12
Example: can investment funds beat the market? Jensen (1968)
Excess return of the portfolio regressed on excess return of the market and a constant.
• The R2 -measure.
13
3 Analysis
3.1 Quick reminder: derivatives
• f 0 (x) > ⇔ f is increasing function.
• Solution:
14