Lec1 17
Lec1 17
Lecture 1: Introduction
Two topics are highly related, but FTS has added uncertainty,
because it must deal with the ever-changing business & economic
environment and the fact that volatility is not directly observed.
1
• to analyze high-dimensional asset returns, including co-movement
2
0.10
0.05
0.00
ln−rtn
−0.05
−0.10
−0.15
−0.20
date
3
25
20
15
density
10
5
0
ln−rtn
Figure 2: Empirical density function of daily log returns of Apple stock: 2007 to 2016
4
CDS of JPM: 3−yr spread
0.020
0.015
spread3y
0.010
0.005
0.000
year
Figure 3: Time plot of daily 3-year CDS spreads of JPM: from July 20, 2004 to September
19, 2014.
5
VIXCLS [2004−01−02/2014−03−07]
Last 14.11 80
70
60
50
40
30
20
10
6
EPS of Coca Cola: 1983−2009
1.0
0.8
0.6
y
0.4
0.2
0.0
Time
7
Dollars per Euro
1.6
1.4
eu
1.2
1.0
0.8
8
ln−rtn: US−EU
0.04
0.02
rtn
0.0
−0.02
9
useu: ln−rtn
400
300
200
100
0
10
15
10
rate
5
0
11
1.5
1.00.5
spread
0.0 −0.5
−1.0
12
Norwegian Fire Insurance Data: 1972−1992
•
400000
200000 300000
claim size
• •
• •
•
100000
• • •
• • •
• • • •
• • •
• • • •• • •
• •• •• •• • • • • •••
• • • •• ••• •• •••
•• •• •• •• •••• •• ••• ••
••• ••
•• ••
•• ••• ••• ••
••
•• •••• ••• •• •• •••• ••• ••• ••• •••
••
•• •••
••
•• ••
•• • • • • • •
0
75 80 85 90
year
Figure 11: Claim sizes of the Norwegian fire insurance from 1972 to 1992, measured in 1000
Krone and exceeded 500.
13
CAT trade data on January 04, 2010.
date hour minute second price size
20100104 9 30 0 57.65 3910
20100104 9 30 0 57.7 400
20100104 9 30 0 57.68 100
20100104 9 30 0 57.69 300
20100104 9 30 1 57.65 462
20100104 9 30 1 57.65 100
20100104 9 30 1 57.65 100
20100104 9 30 1 57.65 100
20100104 9 30 1 57.7 100
20100104 9 30 1 57.7 100
20100104 9 30 1 57.72 500
20100104 9 30 1 57.72 100
20100104 9 30 2 57.73 100
20100104 9 30 3 57.73 300
20100104 9 30 3 57.72 100
20100104 9 30 4 57.72 300
20100104 9 30 5 57.57 100
20100104 9 30 5 57.57 500
20100104 9 30 5 57.56 300
......
20100104 9 30 35 57.77 100
20100104 9 30 36 57.77 100
20100104 9 30 42 57.54 83600
20100104 9 30 42 57.57 100
.....
20100104 9 30 42 57.55 100
20100104 9 30 42 57.55 2400
20100104 9 30 42 57.56 100
20100104 9 30 42 57.55 100
20100104 9 30 42 57.55 100
20100104 9 30 42 57.55 100
20100104 9 30 42 57.54 170
20100104 9 30 42 57.54 200
Asset Returns
Let Pt be the price of an asset at time t, and assume no dividend.
One-period simple return: Gross return
Pt
1 + Rt = or Pt = Pt−1(1 + Rt)
Pt−1
Simple return:
Pt Pt − Pt−1
Rt = −1= .
Pt−1 Pt−1
Multiperiod simple return: Gross return
Pt Pt Pt−1 Pt−k+1
1 + Rt(k) = = × × ··· ×
Pt−k Pt−1 Pt−2 Pt−k
= (1 + Rt)(1 + Rt−1) · · · (1 + Rt−k+1).
Pt
The k-period simple net return is Rt(k) = Pt−k − 1.
j=0
An approximation:
1 k−1
Annualized[Rt(k)] ≈ Rt−j .
X
k j=0
Continuously compounding: Illustration of the power of compound-
ing (int. rate 10% per annum)
Type #(payment) Int. Net
Annual 1 0.1 $1.10000
Semi-Annual 2 0.05 $1.10250
Quarterly 4 0.025 $1.10381
Monthly 12 0.0083 $1.10471
0.1
Weekly 52 52 $1.10506
0.1
Daily 365 365 $1.10516
Continuously ∞ $1.10517
A = C exp[r × n]
where r is the interest rate per annum, C is the initial capital, n is
the number of years, and exp is the exponential function.
16
Present value:
C = A exp[−r × n]
Continuously compounded (or log) return
Pt
rt = ln(1 + Rt) = ln = pt − pt−1,
Pt−1
where pt = ln(Pt).
Multiperiod log return:
rt(k) = ln[1 + Rt(k)]
= ln[(1 + Rt)(1 + Rt−1) · · · (1 + Rt−k+1)]
= ln(1 + Rt) + ln(1 + Rt−1) + · · · + ln(1 + Rt−k+1)
= rt + rt−1 + · · · + rt−k+1.
i=1
19
• sample mean:
1 T
µ̂x = xt ,
X
T t=1
• sample variance:
1 X T
σ̂x2 = (xt − µ̂x)2,
T − 1 t=1
• sample skewness:
1 T
3
Ŝ(x) = (x − µ̂ ) ,
X
t x
(T − 1)σ̂x3 t=1
• sample kurtosis:
1 T
4
K̂(x) = (x − µ̂ ) .
X
t x
(T − 1)σ̂x4 t=1
Under normality assumption,
6 24
Ŝ(x) ∼ N (0, ), K̂(x) − 3 ∼ N (0, ).
T T
Some simple tests for normality (for large T ).
1. Test for symmetry:
Ŝ(x)
S∗ = r ∼ N (0, 1)
6/T
if normality holds.
Decision rule: Reject Ho of a symmetric distribution if |S ∗| >
Zα/2 or p-value is less than α.
2. Test for tail thickness:
K̂(x) − 3
K∗ = r ∼ N (0, 1)
24/T
20
if normality holds.
Decision rule: Reject Ho of normal tails if |K ∗| > Zα/2 or
p-value is less than α.
3. A joint test (Jarque-Bera test):
JB = (K ∗)2 + (S ∗)2 ∼ χ22
if normality holds, where χ22 denotes a chi-squared distribution
with 2 degrees of freedom.
Decision rule: Reject Ho of normality if JB > χ22(α) or p-
value is less than α.
Empirical properties of returns
Data sources: Use packages, e.g. quantmod
• Course web:
http://faculty.chicagobooth.edu/ruey.tsay/teaching/bs41202/sp2017/
• CRSP: Center for Research in Security Prices (Wharton WRDS)
https://wrds-web.wharton.upenn.edu/wrds/
• Various web sites, e.g. Federal Reserve Bank at St. Louis
https://research.stlouisfed.org/fred2/
• Data sets of the textbook:
http://faculty.chicagobooth.edu/ruey.tsay/teaching/fts3/
Empirical dist of asset returns tends to be skewed to the left with
heavy tails and has a higher peak than normal dist. See Table 1.2 of
the text.
0.06
0.05
0.05
0.04
0.04
density
density
0.03
0.03
0.02
0.02
0.01
0.01
0.0
0.0
Figure 12: Comparison of empirical IBM return densities (solid) with Normal densities
(dashed)
22
R demonstration: Use monthly IBM stock returns from 1967 to
2008.
**** Task: (a) Set the working directory
(b) Load the library ‘‘fBasics’’.
(c) Compute summary (or descriptive) statistics
(d) Perform test for mean return being zero.
(e) Perform normality test using the Jaque-Bera method.
(f) Perform skewness and kurtosis tests.
> da=read.table("m-ibm-6815.txt",header=T)
> head(da)
PERMNO date PRC ASKHI BIDLO RET vwretd ewretd sprtrn
1 12490 19680131 594.50 623.0 588.75 -0.051834 -0.036330 0.023902 -0.043848
2 12490 19680229 580.00 599.5 571.00 -0.022204 -0.033624 -0.056118 -0.031223
3 12490 19680329 612.50 612.5 562.00 0.056034 0.005116 -0.011218 0.009400
4 12490 19680430 677.50 677.5 630.00 0.106122 0.094148 0.143031 0.081929
5 12490 19680531 357.00 696.0 329.50 0.055793 0.027041 0.091309 0.011169
6 12490 19680628 353.75 375.0 346.50 -0.009104 0.011527 0.016225 0.009120
> dim(da)
[1] 576 9
> ibm=da$RET % Simple IBM return
> lnIBM <- log(ibm+1) % compute log return
> ts.plot(ibm,main="Monthly IBM simple returns: 1968-2015") % Time plot
> mean(ibm)
[1] 0.008255663
> var(ibm)
[1] 0.004909968
> skewness(ibm)
[1] 0.2687105
attr(,"method")
[1] "moment"
> kurtosis(ibm)
[1] 2.058484
attr(,"method")
[1] "excess"
> basicStats(ibm)
ibm
nobs 576.000000
NAs 0.000000
Minimum -0.261905
Maximum 0.353799
1. Quartile -0.034392
23
3. Quartile 0.048252
Mean 0.008256
Median 0.005600
Sum 4.755262
SE Mean 0.002920
LCL Mean 0.002521
UCL Mean 0.013990
Variance 0.004910
Stdev 0.070071
Skewness 0.268710
Kurtosis 2.058484
> basicStats(lnIBM) % log return
lnIBM
nobs 576.000000
NAs 0.000000
Minimum -0.303683
Maximum 0.302915
1. Quartile -0.034997
3. Quartile 0.047124
Mean 0.005813
Median 0.005585
Sum 3.348008
SE Mean 0.002898
LCL Mean 0.000120
UCL Mean 0.011505
Variance 0.004839
Stdev 0.069560
Skewness -0.137286
Kurtosis 1.910438
> t.test(lnIBM) %% Test mean=0 vs mean .not. zero
data: lnIBM
t = 2.0055, df = 575, p-value = 0.04538
alternative hypothesis: true mean is not equal to 0
95 percent confidence interval:
0.0001199015 0.0115051252
sample estimates:
mean of x
0.005812513
> normalTest(lnIBM,method=’jb’)
Title: Jarque - Bera Normalality Test
Test Results:
24
STATISTIC:
X-squared: 90.988
P VALUE:
Asymptotic p Value: < 2.2e-16
σy2
µy
E(X) = ln , V (X) = ln 1 + 2 .
v
2 µy
u
u σy
t
1+ µ2y
Processes considered
• return series (e.g., ch. 1, 2, 5)
• volatility processes (e.g., ch. 3, 4, 10, 12)
• continuous-time processes (ch. 6)
• extreme events (ch. 7)
• multivariate series (ch. 8, 9, 10)
26
For two consecutive returns r1 and r2, we have
f (r2, r1) = f (r2|r1)f (r1).
For three returns r1, r2 and r3, by repeated application,
f (r3, r2, r1) = f (r3|r2, r1)f (r2, r1)
= f (r3|r2, r1)f (r2|r1)f (r1).
In general, we have
f (rT , rT −1, · · · , r2, r1)
= f (rT |rT −1, · · · , r1)f (rT −1, · · · , r1)
= f (rT |rT −1, · · · , r1)f (rT −1|rt−2, · · · , r1)f (rT −2, · · · , r1)
= ...
T
2πσt 2σt2
t=2
2πσ 2σ 2
t=2 t t
This is the conditional likelihood function of the returns under nor-
mality.
Other dists, e.g. Student-t, can be used to handle heavy tails.
Model specification
27
• µt: discussed in Chapter 2
• σt2: Chapters 3 and 4.
28
• Correlation coefficient focuses no linear dependence and is not
robust to outliers.
• The actual range of the correlation coefficient can be much smaller
than [−1, 1].
R Demonstration
> head(da)
PERMNO date PRC ASKHI BIDLO RET vwretd ewretd sprtrn
1 12490 19680131 594.50 623.0 588.75 -0.051834 -0.036330 0.023902 -0.043848
2 12490 19680229 580.00 599.5 571.00 -0.022204 -0.033624 -0.056118 -0.031223
3 12490 19680329 612.50 612.5 562.00 0.056034 0.005116 -0.011218 0.009400
4 12490 19680430 677.50 677.5 630.00 0.106122 0.094148 0.143031 0.081929
5 12490 19680531 357.00 696.0 329.50 0.055793 0.027041 0.091309 0.011169
6 12490 19680628 353.75 375.0 346.50 -0.009104 0.011527 0.016225 0.009120
> ibm <- da$RET
> sp <- da$sprtrn
> plot(sp,ibm)
> cor(sp,ibm)
[1] 0.5785249
> cor(sp,ibm,method="kendall")
[1] 0.4172056
> cor(sp,ibm,method="spearman")
[1] 0.58267
> cor(rank(ibm),rank(sp))
[1] 0.58267
Takeaway
1. Understand the summary statistics of asset returns
2. Understand various definitions of returns & their relationships
29
3. Learn basic characteristics of FTS
4. Learn the basic R functions. (See Rcommands-lec1.txt on the
course web.)
30
[1] 16.3
> plot(tdx,int[,1],xlab=’year’,ylab=’rate’,type=’l’,ylim=c(0,16.5))
> lines(tdx,int[,2],lty=2) <== Plot the 6m-TB rate on the same frame.
> plot(tdx,int[,2]-int[,1],xlab=’year’,ylab=’spread’,type=’l’)
> abline(h=c(0)) <== Draw a horizontal like to ‘‘zero’’.
> hist(r,nclass=50)
> title(main=’useu: ln-rtn’)
31
Linear Time Series (TS) Models
Basic concepts
• Stationarity:
– Strict: distributions are time-invariant
– Weak: first 2 moments are time-invariant
What does weak stationarity mean in practice?
Past: time plot of {rt} varies around a fixed level within a
finite range!
Future: the first 2 moments of future rt are the same as those of
the data so that meaningful inferences can be made.
• Mean (or expectation) of returns:
µ = E(rt)
• Sample mean and sample variance are used to estimate the mean
and variance of returns.
1 XT 1 X T
r̄ = rt & Var(rt) = (rt − r̄)2
T t=1 T − 1 t=1
32
• Test Ho : µ = 0 vs Ha : µ 6= 0. Compute
r̄ r̄
t= =r
std(r̄) Var(rt)/T
Compare t ratio with N (0, 1) dist.
Decision rule: Reject Ho of zero mean if |t| > Zα/2 or p-value
is less than α.
• Lag-k autocovariance:
γk = Cov(rt, rt−k ) = E[(rt − µ)(rt−k − µ)].
`=1 T − `
34
• Rt: Q(5) = 27.8 and Q(10) = 36.0
• rt: Q(5) = 26.9 and Q(10) = 32.7
All highly significant. Implication: there exist significant serial corre-
lations in the value-weighted index returns. (Nonsynchronous trading
might explain the existence of the serial correlations, among other
reasons.) Similar result is also found in equal-weighted index returns.
35
X-squared = 7.2759, df = 10, p-value = 0.6992
i=0
White noise: iid sequence (with finite variance), which is the build-
ing block of linear TS models.
White noise is not predictable, but has zero mean and finite variance.
38
In this course, we use statistical methods to find models that fit
the data well for making inference, e.g. prediction. On the other
hand, there exists economic theory that leads to time-series models
for economic variables. For instance, consider the real business-cycle
theory in macroeconomics. Under some simplifying assumptions, one
can show that ln(Yt), where Yt is the output (GDP), follows an AR(2)
model. See Advanced Macroeconomics by David Romer (2006, 3rd,
pp. 190).
In this course, we shall discuss some reasons for the observed se-
rial dependence in index returns. See, for example, Chapter 5 on
nonsynchronous trading.
39