CM15 Extreme Value Distributions
CM15 Extreme Value Distributions
𝐹𝑌 𝑛 (𝑦) = [ ]𝑛
dFYn ( y)
= n
n −1
fYn ( y) = f X ( y)
dy
1
Seoul National University Instructor: Junho Song
Dept. of Civil and Environmental Engineering [email protected]
1
f X ( x) = , x 1
x2
When someone constructs a sample of size n, derive the CDF and PDF of the largest in the
sample, i.e., Yn = max ( X1 , X 2 ,..., X n ) .
2
Seoul National University Instructor: Junho Song
Dept. of Civil and Environmental Engineering [email protected]
2. Asymptotic Distributions
An asymptotic distribution can be derived for a large-size sample, i.e., n → , using Cramer’s
method (1946). For the extreme value distribution in Example 1, the exact (i.e., derived) and
asymptotic distributions by Cramer’s method are compared as follows:
3
Seoul National University Instructor: Junho Song
Dept. of Civil and Environmental Engineering [email protected]
The asymptotic distributions of the extremes tend to converge on certain limiting forms
(Gumbel 1958):
Type III: The exponential form with upper/lower bound, exp − A(n) / ( − y) k
The type is determined by the ( ) behavior of the original probability density function.
Let us check whether the distribution of the maximum value among 𝑛 = 10, 100 and 200
standard normal random variables converge to the Gumbel distribution:
install.packages('evd')
library('evd')
NormalMaxGen = function(n,N){
# n: number of sample points in each set
# N: number of sets
Sample = rnorm(n*N,0,1)
Sample_matrix = matrix(Sample,n,N)
SampleMax = apply(Sample_matrix,2,max) # Get maximum of each column
return(SampleMax)
}
4
Seoul National University Instructor: Junho Song
Dept. of Civil and Environmental Engineering [email protected]
u = function(n){
# location parameter u_n of Gumbel (Example 4.18 in A&T)
sqrt(2*log(n))-(log(log(n))+log(4*pi))/2/sqrt(2*log(n))
}
b = function(n){
# scale parameter b_n = 1/alpha_n (See Example 4.18 in A&T)
1/sqrt(2*log(n))
}
xv = seq(-1,5,length=100)
plot(ecdf(X10),main="N=10,000",xlim=c(-1,5),col='red')
lines(xv,pgumbel(xv,u(10),b(10)),col='red',lty='dotted')
lines(ecdf(X100),col='black')
lines(xv,pgumbel(xv,u(100),b(100)),col='black',lty='dotted')
lines(ecdf(X200),col='blue')
lines(xv,pgumbel(xv,u(200),b(200)),col='blue',lty='dotted')
5
Seoul National University Instructor: Junho Song
Dept. of Civil and Environmental Engineering [email protected]
A general extreme value distribution model that can represent Type I (Gumbel), Type II
(Fréchet) and Type III (Weibull) distributions.
−1/𝜉
𝑦−𝑢
𝐹𝑌 (𝑦) = exp [− (1 + 𝜉 ( )) ]
𝛽
where 𝑢, β(> 0), and 𝜉 are the location, scale and shape parameters. As ξ → 0, GEV becomes
Gumbel. For 𝜉 > 0, GEV is Fréchet distribution. When 𝜉 < 0, GEV is Weibull distribution. The
“evd” package of R provides dgev(), pgev(), qgev() and rgev() functions. One can estimate the
parameters from a data set using fgev() in the package as well.
6
Seoul National University Instructor: Junho Song
Dept. of Civil and Environmental Engineering [email protected]
Assuming the yearly maximum tension force follows a GEV distribution, estimate the
parameters of GEV and plot the corresponding PDF.
0.02
0.01
0.00
xv