Distrib: Probability Distribution Analysis
Distrib: Probability Distribution Analysis
Introduction
The DISTRIB main dialog window contains all the features necessary to analyze a set of
univariate data.
DISTRIB Page1
There are 5 major parts to the main dialog window:
Univariate data are directly entered into the Data Entry Spreadsheet (upper). The
white portion of the spreadsheet allows for the entry ofthis data. Data should be
entered with one entry per row.
Once all data are entered, use the mouse to click in the Prediction column of the
upper Data Entry Spreadsheet or in the Select Distribution column, select the
distribution type you wish to “fit” the data.
* DISTRIB Example 1 *
Using the maximum annual flowrate data as shown, predict the 25 year return period flow
using the Log Pearson Type III distribution.
DISTRIB Page
The data should be entered as shown:
Next click the Log Pearson Type III button in the Select Distribution frame.
From the lower Prediction Spreadsheet we can see that the 25 year return period
prediction is 165.46 cms.
* End of Example 1 *
The DISTRIB main dialog window allows for a number of options. To see a
close-up of the plot, click on the actual distribution plot in the lower left
corner of the main dialog window - an enlarged plot of the data will appear.
7 Actual Data
f Distribution
_
u ->_
You may now Print or Copy (to clipboard) the plot. You may also change the
grid displayed by clicking on the Grid button. Each consecutive click will add
horizontal lines, vertical lines, or retumn the plot to the above shown empty grid.
Select the Done button to return to the main DISTRIB dialog window.
The Prediction Spreadsheet allows for the entry of different Probabilities
and calculation of predictions based on different return periods. The return
period can be calculated from probability using;
RP=—-
l-p
where,
RP = Retum
Period
p = probability
DISTRIB allows for optional plotting position formulas. Clicking on the word
Weibull in the upper corner of the Data Entry Spreadsheet will allow you to scroll
through plotting position formula;
Weibull m/(n+1)
California m/n
Foster (2m-1)/2n
Exceedence (m-1)/n
I. DISTRIB Options
A. Binomial Calculator
The Binomial Calculator is available from the Calculators menu. The Binomial
Calculator, as shown below, allows for calculation ofthe probability of any discrete
value using the binomial distribution.
* DISTRIB Example 2 *
If the probability of any a certain event is 0.2, what is the probability of that event
occurring exactly 3 times in 10 tries?
H
Pr(x) =— P 1- D"*
X!'(A— )!
u 0.10737 [
0.4
0.3 *
x
Prnbabilay |]2 5e
0.1 =
x
0.0 ! !— —
0 2 4 6 8 10
Numba ol
Tues
x
0.0 ! ! — —
0 2 4 6 8 10
Numba ol
Tues
From the Binomial Calculator it can be seen that the probability 1s 0.20133 or
20.133 %.
DISTRIB Page 7
* DISTRIB Example 3 *
n
Pr(x) = pa-py*
X! (N — x)!
0.30
0.25
0.20
Pobobilur » x
0.15
9.10
0.05 x x
0.00%
X
Ea
0
Nueby el Tuas
The probability ofless than 3 heads, is the sum of 0, 1, and 2 heads in thetoss.
Pr(0) = 0.00781
Pr(1) 0.05469
=
Pr(2) = 0.16406
PR (x <3) = 0.22656 or the probability of having less than 3 headsin a toss of 7 coins
is
22.656 %.
A
Ve
Pr(x) =
x!
Probability = 9.02235221577418E-02
0.30
0.25
0.20
Pl ay
0.15
0.10
0.05
0.00
0 3 A 5
Fireule Tuga
DISTRIB Page 9
C. Annualization Factor
The Anmnualization Factor is available from the Edit menu. The Annualization Factor
allows for the use of partial duration series data in analysis using DISTRIB.
The Annualization Factor is the ratio of number of data points used in the
analysis to the span ofyears the data covers. This factor is used in the prediction
to put the return period predictions on an annual basis.
D. Statistics
The Statistics menu option allows you to activate calculation or plotting of the
statistics without having to click on any of the Results frames (Prediction
columns, Standard Deviation columns, or the Return Period column).
The Fit Distribution selection will calculate all of the Results frame data. The Plot
Distribution will activate the enlarged plot ofthe distribution without having to click
on the small plot in the lower left-hand corner ofthe main DISTRIB dialog window.
DISTRIB Page
10
I. Types of Distributions
DISTRIB uses the common distribution types which are used in the field of
water resources. These distribution are represented by probability density
functions, and are shown below:
A. Normal Distribution
p (x)= )= —
— exp| - 1 (2=2)
— — — |
" o 2 o
where,
- o 27 2 o
where,
In the Three Parameter Log Normal distribution the parameter y of the log normal
distribution is calculated as:
y = In (x - a)
where,
D. Pearson Distribution
P () = p…,[1+g)%e%
where,
where,
F. Gumbel Distribution
a— Xy
P(D=" 1( e
F
6-7 B-7
where,
A number of methods have been used to handle 0.0 values. DISTRIB does not
perform any of these conversions for you and may crash if you attempt to
fit O data to a log distribution. Toalleviate this problem you may:
DULDWN
In most real world cases a type of distribution will not be specified. In this situation the
best distribution for the data must be selected. If is recommended that:
The “region ofinterest” is the portion of the fit which is going to be predicted. When
making predictions of extreme events, the left side ofthe plot (high return periods) should
fit well to the data. It is not important that the low end numbers reflect a
good fit in this case.