Week 5 Cheat Sheet
Week 5 Cheat Sheet
Charlie Nu©elman
Here, I provide the mathemacal equaons and some of the important Excel funcons required to
perform various calculaons in Week 5 of the course. The headings represent the screencasts in which
you will find those calculaons and concepts. Not all screencasts are referenced below – just the ones
that have complex mathemacal formulas or Excel formulas that are tricky to use.
NOTE: For Week 5, I also have provided you with a cheat-sheet specifically for Excel formulas (“Excel
Funcons for Connuous Distribuons”). So, be sure to check that out, too!
Like for discrete density funcons, we can calculate probabilies by taking differences in cumulave
probability density funcons:
Õ
𝑃[𝑎 ≤ 𝑥 ≤ 𝑏] = ± 𝑓(𝑥) ∙ 𝑑𝑥 = 𝐹(𝑏) − 𝐹(𝑎)
Ô
We typically don’t need to perform the integraon (the term between the two equal signs) since we can
either look up cumulave density funcons or obtain them from Excel (or other compung tool).
Some useful properes of the normal distribuon are that about 68% of the distribuon lies within ±𝜎
(one standard deviaon) of the mean, about 95% of the distribuon lies within ±2𝜎 (two standard
deviaons) of the mean, and 99.7% of the distribuon lies within ±3𝜎 (three standard deviaons) of the
mean.
Cumulave standard normal distribuon tables can be helpful in calculang probabilies associated with
standard normal distribuons. The NORM.S.DIST and NORM.S.INV funcons are useful Excel funcons
for dealing with the standard normal distribuon.
Exponenal Distribuon
Another important connuous distribuon is the exponenal distribuon. The exponenal distribuon
is commonly associated with Poisson processes. It provides the probability that there will be a certain
interval (me, length, area, volume) between two events of a Poisson process. It is common to use the
cumulave exponenal distribuon to calculate probabilies. The cumulave exponenal distribuon is
given by:
𝐹(𝑥) = 1 − 𝑒 ?
Here, 𝜆 is the long-term average rate (events per interval). The EXPON.DIST funcon in Excel is useful for
probability calculaons related to the exponenal distribuon.
Probability PIots
A probability plot is used to determine whether a set of sample data likely came from a normally
distributed populaon. Typically, in order to show that the data were derived from a normally
distributed populaon, the P-value of the AD stasc should be greater than 0.05. If the P-value of the
AD stasc is less than 0.05, then it can be concluded that the data do not come from a normally
distributed populaon.
Many sophiscated soLware tools will provide probability plots and P-values of the AD stasc. Excel,
unfortunately, won’t do this, but in the course, I provide a file “Probability plot.xlsx” that will do this for
you. Please use this file to explore normality of the data – this will be much more important in Part 2 of
the course.