QTA 3 - Common Univariate Random Variables
QTA 3 - Common Univariate Random Variables
QTA 3 -
COMMON UNIVARIATE
RANDOM VARIABLES
Source - Global Association of Risk Professionals (GARP)
Micky Midha
BE, CFA®, FRM, LLB www.midhafin.com
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
2 of 65
Bernoulli distribution,
Poisson distribution,
Normal distribution,
Lognormal distribution,
Chi-squared distribution,
F distribution
INTRODUCTION www.midhafin.com
• There are three common discrete distributions:
The Bernoulli is a general purpose distribution that is typically used to model binary events.
The binomial distribution describes the sum of 𝑛 independent Bernoulli random variables.
The Poisson distribution is commonly used to model hazard rates, which count the number
of events that occur in a fixed unit of time (e.g., the number of corporations defaulting in
the next quarter).
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
4 of 65
INTRODUCTION www.midhafin.com
• The most basic is a uniform distribution, which serves as a foundation for all random variables.
• The most widely used distribution is the normal, which is used for tasks such as modeling
financial returns and implementing statistical tests.
• Many other frequently used distributions are closely related to the normal. These include
Student’s 𝑡,
Chi-square 𝜒 , and
𝐹,
• Mixture distributions are built using two or more distinct component distributions. Mixtures
can be used to build distributions that match important features of financial data. For example,
mixing two normal random variables with different variances produces a random variable
that has a larger kurtosis than either of the mixture components.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
5 of 65
• The Bernoulli distribution depends on a single parameter, 𝑝, which is the probability that a
success is observed.
• Suppose that 𝑌 is a Bernoulli random variable with parameter 𝑝. This can be expressed as
𝑌~𝐵𝑒𝑟𝑛𝑜𝑢𝑙𝑙𝑖 𝑝
• The mean of 𝑌 is
𝐸𝑌 𝑝 1 1 𝑝 0 𝑝,
• The Variance of 𝑌 is
𝑉𝑌 𝐸𝑌 𝐸𝑌 𝑝 1 1 𝑝 0 𝑝 𝑝 1 𝑝 𝑝𝑞
where
𝑞 1 𝑝 is the failure probability.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
6 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
7 of 65
𝑌 𝑋
and expressed as
𝑌 𝐵 𝑛, 𝑝
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
8 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
9 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
10 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
11 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
12 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
13 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
14 of 65
• Poisson random variables are always non-negative and integer-valued. The Poisson distribution
has a single parameter, which is called the hazard rate and expressed as 𝜆, that signifies the
average number of events per interval.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
15 of 65
• A useful feature of the Poisson (one that is uncommon among distributions) is that it is
infinitely divisible.
If 𝑋 ∼ 𝑃𝑜𝑖𝑠𝑠𝑜𝑛 𝜆 , and 𝑋 ∼ 𝑃𝑜𝑖𝑠𝑠𝑜𝑛 𝜆 are independent, and
𝑌 𝑋 𝑋 , then 𝑌 ∼ 𝑃𝑜𝑖𝑠𝑠𝑜𝑛 𝜆 𝜆 .
This feature makes the Poisson well-suited to work with timeseries data, because summing the
number of events in a sampling interval (e.g., a week, month, or quarter) does not change the
distribution.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
16 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
17 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
18 of 65
• The PDF of a uniform random variable does not depend on 𝑦, because all values are equally
likely.
• The CDF returns the cumulative probability of observing a value less than or equal to the
argument. The CDF is 0 to the left of 𝑎, which is the smallest value that could be produced,
linearly increases from 𝑎 𝑡𝑜 𝑏, and then is 1 above 𝑏.
• When 𝑎 0 and 𝑏 1, the distribution is called the standard uniform. Any uniform random
variable can be constructed from a standard uniform 𝑈 using the relationship:
𝑈 𝑎 𝑏 𝑎 𝑈,
where 𝑎 and 𝑏 are the bounds of 𝑈 .
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
19 of 65
1 𝑦 𝑎
𝑓 𝑦 𝑖𝑓 𝑎 𝑦 𝑏 𝐹 𝑦 𝑖𝑓 𝑎 𝑦 𝑏
𝑏 𝑎 𝑏 𝑎
0 𝑖𝑓 𝑦 𝑏 1 𝑖𝑓 𝑦 𝑏
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
20 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
21 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
22 of 65
• The mean and variance of a standard uniform random variable are ⁄ and ⁄ , respectively.
• The probability that 𝑌 falls into an interval with lower bound 𝑙 and upper bound 𝑢 i.e.,
𝑚𝑖𝑛 𝑢, 𝑏 𝑚𝑎𝑥 𝑙, 𝑎
𝑃 𝑙 𝑌 𝑢
𝑏 𝑎
which simplifies to
𝑢 𝑙
𝑏 𝑎
when both 𝑙 𝑎 𝑎𝑛𝑑 𝑢 𝑏
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
23 of 65
𝑚𝑖𝑛 𝑢, 𝑏 𝑚𝑎𝑥 𝑙, 𝑎
𝑃 𝑙 𝑌 𝑢
𝑏 𝑎
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
24 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
25 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
26 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
27 of 65
• Kurtosis=3. This kurtosis is often used as a benchmark when assessing whether another
distribution is heavy/fat-tailed.
• The normal can generate any value in ( ∞, ∞), although it is unlikely to observe values more
than 3𝜎 away from the mean. In fact, values more than 3𝜎 away from the mean are expected in
only one in 370 realizations of a normal random variable.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
28 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
29 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
30 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
31 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
32 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
33 of 65
• The return on a stock is 12% per annum and the standard deviation of annual returns is 15%.
Assuming that the returns follow a normal distribution, find the 90% confidence interval for the
stock return over this year.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
34 of 65
• It is also common to use 𝜙 𝑧 to denote the standard normal PDF and Φ 𝑧 to denote the
standard normal CDF.
PDF is
1
𝜙 e
2𝜋
• The z-value is the number of standard deviations a given observation is away from the mean.
Standardization is converting an observed value 𝑦 to its z value by using the formula:
𝑦 𝜇
z
𝜎
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
35 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
36 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
37 of 65
• The values in the z-table are the probabilities of observing a z-value that is less than a given value
(or 𝑃 𝑍 𝑧 ). It can also be used to find the probability that a z-value is observed above (by
using 𝑃 𝑍 𝑧 1 𝑃 𝑍 𝑧 ), or between values on the standard normal distribution, and by
extension, any normal distribution.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
38 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
39 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
40 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
41 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
42 of 65
• The Poisson can also be approximated by a normal random variable. When 𝜆 is large, then a
Poisson 𝜆 can be well approximated by a Normal 𝜆, 𝜆 . This approximation is commonly
applied when 𝜆 1000.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
43 of 65
𝑌 𝑒
where 𝑋 ∼ 𝑁 𝜇, 𝜎 .
• An important property of the log-normal distribution is that 𝑌 can never be negative, whereas
𝑋 can be negative because it is normally distributed. This log-normal property can be desirable
when constructing certain models. For example, if stock prices are assumed to be normally
distributed, there is a positive (although perhaps tiny) probability that the stock price becomes
negative. This is impossible under a log-normal model.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
44 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
45 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
46 of 65
• The mean of 𝑌 is
𝐸𝑌 𝑒
• The variance of 𝑌 is
𝑉𝑌 𝑒 1 𝑒
• It peaks at 𝑒
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
47 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
48 of 65
• A 𝜒 random variable is defined as the sum of the squares of 𝜈 (Greek nu) independent
standard normal random variables
𝑌 𝑍
• Note that a 𝜒 distribution has 𝜈 degrees of freedom, a concept that arises when dealing with
models that have 𝑘 parameters using 𝑛 data points. Degrees of freedom measure the amount of
data that is available to test model parameters, because estimating model parameters requires a
minimum number of observations (e.g., 𝑘). In many models, the degree of freedom used in
testing is 𝑛 𝑘.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
49 of 65
• The variance of 𝑌 is
𝑉𝑌 2𝜈
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
50 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
51 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
52 of 65
• The variance of 𝑌 is
𝜈
𝑉𝑌
𝜈 2
The variance is only finite if 𝜈 2
• The kurtosis of 𝑌 is
𝜈 2
𝑘𝑢𝑟𝑡𝑜𝑠𝑖𝑠 𝑌 3
𝜈 4
The kurtosis is defined for 𝜈 4 and is always larger than 3
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
53 of 65
This distribution is known as a standardized Student’s 𝑡, because it has mean 0 and variance 1
for any value of 𝜈.
• The above distribution can be rescaled to have any variance and re-centered to have any mean if
𝜈 2. For example, 𝑋 𝜇 𝜎𝑆 is known as a generalized Student’s 𝑡 and has mean 𝜇,
variance 𝜎 , and degrees of freedom parameter 𝜈. It is parameterized with three parameters
reflecting the mean, variance, and degrees of freedom, and is denoted as
𝐺𝑒𝑛. 𝑡 𝜇, 𝜎
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
54 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
55 of 65
F DISTRIBUTION www.midhafin.com
• The 𝐹 is another distribution that is commonly encountered when testing hypotheses about
model parameters. The 𝐹 has two parameters, 𝜈 and 𝜈 , respectively known as the numerator
and denominator degrees of freedom.
• An 𝐹 distribution is defined as the ratio of two independent 𝜒 random variables where each has
been divided by its degree of freedom
𝑊
𝜈
𝑌 𝑊
𝜈
where 𝑊 ∼ 𝜒 , and 𝑊 ∼ 𝜒 , and
𝑊 and 𝑊 are independent.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
56 of 65
F DISTRIBUTION www.midhafin.com
• If 𝑌 ∼ 𝐹 , , then the mean of 𝑌 is
𝜈
𝐸𝑌
𝜈 2
which is only finite when 𝜈 is larger than 2.
• The variance of 𝑌 is
2𝜈 𝜈 𝜈 2
𝑉𝑌
𝜈 𝜈 2 𝜈 4
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
57 of 65
F DISTRIBUTION www.midhafin.com
• When using an 𝐹 in hypothesis testing, 𝜈 is usually determined by the hypothesis being tested
and is typically small (e.g., 1, 2, 3, …), while 𝜈 is related to the sample size (and so is relatively
large).
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
58 of 65
𝐸𝑌 𝛽
• The variance of 𝑌 is
𝑉𝑌 𝛽
• Exponential variables are also memoryless, meaning that their distributions are independent
of their histories.
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
59 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
60 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
61 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
62 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
63 of 65
• A two-component mixture first draws a value from a Bernoulli random variable. Then,
depending on the value (0 or 1), draws from one of two component distributions. This
structure makes it simple to compute the CDF of the mixture when the components are
normal random variables
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
64 of 65
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED
65 of 65
END OF LECTURE
THANK YOU
Micky Midha
BE, CFA®, FRM, LLB www.midhafin.com
____________________________________________________________________________________________________________
www.midhafin.com
© 2019 MidhaFin
REPRODUCTION OF ANY PART OF THIS DOCUMENT IS STRICTLY PROHIBITED