BA3202-L2 Distribution
BA3202-L2 Distribution
Actuarial Statistics
Lecture 2:
- Loss distributions
Instructor:
Wenjun Zhu, PhD, FSA, CERA
Assistant Professor
Email: [email protected]
Office: S3-B1B-71
Tel: (65)6592-1859
BA3202
L2
Objectives
Actuarial Statistics
1. Describe the properties of the statistical distributions which
are suitable for modelling individual and aggregate losses.
2. Derive moments and moment generating functions (where
defined) of loss distributions including the gamma,
BA3202
L2
Insurance overview
Actuarial Statistics
• Insurance losses:
• Frequency of loss: how often does a loss occur over the insured
period
• Severity of loss: how much does each loss cost?
• Expected total loss=expected frequency * expected average severity
Actuarial Statistics
• We make the assumptions that historical losses (claims) come
from a familiar distribution.
• However, the exact distribution can never be known with
absolute certainty.
Actuarial Statistics
• Several methods to estimate distribution parameters:
• Method of Moments (MM)
• Method of Percentiles (MP)
• Maximum Likelihood Estimation (MLE)
BA3202
L2
Method of Moments
Actuarial Statistics
• {𝑥" , 𝑥! , … , 𝑥# } is a random sample with size 𝑛, from a random
variable 𝑋
• The 𝑗$% moment of this sample is defined as
#
1 (
BA3202
L2
Method of Percentiles
Actuarial Statistics
• Works similarly to MM
• Procedure In MM when there are 2 parameters to be estimated, using the
first 2 moments:
1. Derive the 1st and 2nd moments of the distribution where the 2 parameter
values will be unknown;
Actuarial Statistics
• 𝑓 𝑥 𝜃 : the conditional pdf of random variable 𝑋 with
parameter set 𝜃, where 𝜃 is a set of unknown variable
• 𝐿(𝜃): the likelihood function
• 𝐿 𝜃 = ∏%"#$ 𝑃(𝑋" = 𝑥" |𝜃), when 𝑋 is discrete
• 𝐿 𝜃 = ∏%"#$ 𝑓 𝑥" 𝜃 , when 𝑋 is continuous
Actuarial Statistics
• The Lognormal distribution
• The Exponential distribution
• The Gamma distribution
• The Pareto distribution
BA3202
L2
The normal distribution
Actuarial Statistics
• If a random variable 𝑋 follows normal distribution:
X~𝑁 𝜇, 𝜎 ! with parameters 𝜇 and 𝜎
!"#$
$ )
• PDF: 𝑓 𝑥 = 𝑒 $%$
& '(
*)+
• CDF: 𝐹 𝑥 = Φ , where Φ(⋅) is CDF of standard normal.
&
10
BA3202
L2
The normal distribution
Actuarial Statistics
• Simulation example with R code:
• 𝜇 = 2, 𝜎 = 1: library(ggplot2)
x_norm = rnorm(1000, mean = 2, sd = 1)
ggplot(data.frame(x_norm), aes(x_norm)) +
geom_histogram(aes(y=..density..),bins =
50,color = 'grey') +
stat_function(fun=function(x)dnorm(x,
mean = 2, sd = 1), color="magenta", size=1)
BA3202
L2
Example:
Actuarial Statistics
• Calculate the MM, MP, and ML estimator of a normal
distribution with parameters 𝜇, 𝜎
• MM estimator:
$
• 𝜇6 = 𝑥, 𝜎6 ' = ∑ 𝑥" − 𝑥 ' .
BA3202
L2
The lognormal distribution
Actuarial Statistics
• If a random variable 𝑋 satisfies ln 𝑋 ~𝑁(𝜇, 𝜎 ! ), then
𝑋~𝐿𝑁(𝜇, 𝜎 ! ), i.e. lognormally distributed with parameters 𝜇
and 𝜎
!"# $ %& '
! &
• PDF: 𝑓 𝑥 = 𝑒 '('
"# $%
'
• Mean: 𝑒 '(# /$
13
BA3202
L2
The lognormal distribution
Actuarial Statistics
• Simulation example with
• 𝜇 = 2, 𝜎 = 1:
R code:
library(ggplot2)
x_logn = rlnorm(1000, meanlog = 2, sdlog = 1)
14
BA3202
L2
The exponential distribution
Actuarial Statistics
• If 𝑋~𝐸𝑥𝑝(𝜆)
• PDF of 𝑋: 𝑓 𝑥 = 𝜆𝑒 )4* , 𝑥 > 0.
• CDF of 𝑋: 𝐹 𝑥 = 1 − 𝑒 )4* , 𝑥 > 0.
$
• Mean:
$) $ +"
• MGF: 𝑀 𝑡 = 𝐸 𝑒 = 1− ,𝑡 < 𝜆
*
15
BA3202
L2
The exponential distribution
Actuarial Statistics
• Simulation example with
• 𝜆 = 0.5:
R code:
library(ggplot2)
x_exp = rexp(1000, rate = 1/2)
16
BA3202
L2
The Gamma distribution
Actuarial Statistics
• If 𝑋~Γ(𝛼, 𝜆)
!)
• PDF of 𝑋: 𝑓 𝑥 = 𝑥 #$% 𝑒 $!& , 𝑥 > 0
" #
# #
• 𝐸 𝑋 = , 𝑉𝑎𝑟 𝑋 =
! !'
• Estimation of parameters:
Actuarial Statistics
• Simulation example with
• 𝛼 = 4, 𝜆 = 2: R code:
library(ggplot2)
x_gam = rgamma(1000, shape = 4, rate = 2)
ggplot(data.frame(x_gam), aes(x_gam)) +
18
BA3202
L2
Example: Invariance Property of MLE
Actuarial Statistics
• Invariance Property of MLE:
• If 𝜃E is the MLE of 𝜃, then for any function 𝑔 𝜃 , the MLE of 𝑔 𝜃
is 𝑔 𝜃E .
• Given a sample 𝑥 = (𝑥$ , 𝑥' , … , 𝑥% ), derive the MLE of population
mean given the following two distributional assumptions:
BA3202
L2
The Pareto distribution
Actuarial Statistics
• If 𝑋~𝑃𝑎(𝛼, 𝜆)
4 6
• CDF of 𝑋: 𝐹 𝑥 = 1 −
47*
64/
• PDF of 𝑋: 𝑓 𝑥 = ,𝑥 >0
47* /0)
BA3202
L2
The Pareto distribution
Actuarial Statistics
• Simulation example with
4
• 𝛼 = 3, 𝜆 = 4 (𝜇 = = 2):
6)$
R code:
ggplot(data.frame(x_pare), aes(x_pare)) +
geom_histogram(aes(y=..density..),bins = 60,color = 'grey') +
stat_function(fun=function(x)dpareto(x, shape = 3, scale = 4),
color="magenta", size=1)
21
BA3202
L2
Variations on Pareto
Actuarial Statistics
• The Burr (or transformed Pareto) distribution
4/
• CDF: 𝐹 𝑥 = 1 −
47*1 /
/)
• Compare with Pareto CDF: 𝐹 𝑥 = 1 − /(" )
• The additional parameter 𝛾 gives extra flexibility
BA3202
L2
The Weibull distribution
Actuarial Statistics
• If 𝑋~𝑊(𝑐, 𝛾)
+
• CDF of 𝑋: 𝐹 𝑥 = 1 − 𝑒 $*&
!
• Weibull tail: Pr 𝑋 > 𝑥 = 𝑒 345
• Exponential tail: Pr 𝑋 > 𝑥 = 𝑒 365
6 8
BA3202
L2
The Weibull distribution
Actuarial Statistics
• Simulation example with 𝜇 = 2
)
( : $71 ' $
• 𝛾 = 2, 𝑐 = 𝜇= ) = 2 , where Γ = 𝜋:
$< > '
=1
R code:
26
BA3202
L2
Goodness-of-fit Test
Actuarial Statistics
• The 𝜒 ) test is defined for the hypothesis:
• 𝐻9 : The data follow a specified distribution
• 𝐻- : The data do not follow the specified distribution
• Test statistic:
/ )
𝑂' − 𝐸'
𝜒) = < ,
𝐸'
Actuarial Statistics
• A game involves rolling 3 identical dices. The game is repeated
100 times, with the following observed counts:
Number of Sixes (N𝟔 ) Number of Rolls
0 48
1 35
28
BA3202
L2
Example: Goodness-of-fit Test
Actuarial Statistics
"
• Under null hypothesis, the game follows 𝐵𝑖𝑛(3, ):
,
@ A - @ B
• 𝑃 𝑁? = 0 = ?
= 0.579; 𝑃 𝑁? = 1 = 3 ⋅ ?
⋅ ?
= 0.347
- B @ - A
• 𝑃 𝑁? = 2 = 3 ⋅ ⋅ = 0.069; 𝑃 𝑁? = 3 = = 0.005
? ? ?
• Test statistic:
! !
𝜒 ! = 23.367 > 𝜒"+<,>+?+" = 𝜒@.AB,C+@+" = 7.815
• Therefore, the null hypothesis is rejected. 29
BA3202
L2
Mixture distributions
Actuarial Statistics
• So far, we have assumed that losses are realizations of a
random variable from ONE distribution
• Actual losses may be more complex
• Claims can be generated from :
• More than one distribution: Insurance company may have losses
30
BA3202
L2
Mixture distributions
Actuarial Statistics
• Claims can be generated from :
• Other non-regular loss curves (e.g., piecewise linear/Splines)
• Example: Munich Re underwriting cycle & credit cycle:
BA3202
L2
Example 1: Pareto
Actuarial Statistics
Example:
Assume the insurance loss X~𝐸𝑥𝑝(Λ), where the parameter Λ is also a random variable
with Λ~Γ(𝛼, 𝛿).
• The gamma distribution for Λ is called the mixing distribution
• The resulting distribution of 𝑋 is called the mixture distribution
D$
• The PDF of Λ: 𝑓C 𝜆 = E 8 𝜆83-𝑒 3D6 , 𝜆 > 0
Actuarial Statistics
Example:
Assume the insurance loss X~Γ(𝑘, Λ), where the parameter Λ is also a random variable
with Λ~Γ(𝛼, 𝛿).
• The gamma distribution for Λ is called the mixing distribution
• The resulting distribution of 𝑋 is called the mixture distribution
D$
• The PDF of Λ: 𝑓C 𝜆 = E 8 𝜆83-𝑒 3D6 , 𝜆 > 0