Basic Probability and Statistical Distribution
Basic Probability and Statistical Distribution
Fx(4) =r(m) = 6 + 6
m=l
Notice, however, that if the argument to the c.d.f. is not an element of the
sample space S,
2.1.3 Means, variances, and expected values
The various probability functions for discrete and continuous random variables
introduced in
Section 2. 1.2 provide general characterizations of a variable's probability structure. In
cases, however, it is useful to construct summary measures of the random
many
variable that
sulate its various features. These can be derived from the underlying p.m.f. or p.d.f.encap
The
general form of such a summary measure is called an expected value, and it is based on a
mathematical construct known as an expectation operator, E[·]. In its most general usage, the
expected value ofa function of a random variable, g(X), is defined as
for a p.d.f. fxr). That is, expectation involves summation for discrete random variables and
integration for continuous random variables. In effect, the expected value of a function g(X)
is aweighted average of gX). with weights taken as the probability mass or density of the
random variable X.
2.1.4 Median, quartiles, and quantiles
Another uscful summarization for a random variable Xrelates the values it achieves to the
C.d.I.. Fx(). at those values. Consider. for cxample. the c.d.f. at its middle value. 50%.
One might ask what value of x in the sample space S satisfies Fx(x) = 1/2? The point that
does so is called the median of X. More formally, the median is the quantity ,ES where
above which
P\X < O|> and P[X >O,|>.That is. O, is the point below which and
notation 2, willbe
at least SO% of the probability mass or probabilitydensity rests. (The
explained below.)
discrete distributions,
For most continuous distributions, the median is unique. For many
for a dis
however, its defining equations may not be unambiguously satisfied. For example,
m|=;2 and
crete random variable, Xmay have two adjacent values m, < m, such that P[X <
called the median of
P[X > m,]= In this case, any value of Xbetween m, and m, could be
=;(m + m).
X. If this occurs in practice, Q, is set equal to the midpoint of theinterval. @,
six-sided die, and
Example 2.1.5 Six-sided die roll (Example 2.1.4, continued). Roll a fair,
the sample space is
let X be the number of 'pips' seen on that roll. As seen in Example 2.1.4,
S= {1.2, ... .6} and the p.m.f. is fr(m) =; for any me S. I or X= 2 or X =3]
To find the population median, recognize that P[X < 3]=P[X = PIX >
=P[X = 1]+ P[X =2]+ P[X = 3] = 0.5, because the events are disjoint. Similarly.
including)
4] =P[X =4or X = 5 or X =6] = 0.5. Thus any value of , between (but not
to let ,
m =3and m = 4 would satisfy the definition of a median for X. Simplest here is
ofS. As with the
be the midpoint: , = (3 +4) =3.5. Notice that this is not an element
population mean, the population median need not be an element of the sample space.
the mean
As it characterizes the 'center' of a distribution, O, is used as an alternative to
E[X] to measure X's central tendency. In fact, the two values can be equal - as in Examples
2.1.4 and 2.I.5 although this is not guaranteed. When a random variable exhibits a large
skew, the median will be less influenced than the population mean by the extreme values in
the skewed tail of the distribution. Thus it can be particularly useful for measuring central
tendency with skewed distributions.
One can extend the concept of a median that is, the 50% point of a distribution - to any
and 75%
desired probability point along the range of Fy(x). Two obvious values are the 25%
points. These are known as the first (or lower) andthird (or upper) quartiles of the distribution
andare denoted as Q, and 3, respectively. (The second or middle quartile is just the median,
Q. which explains its notation.) Formally, the first (lower) quartile is defined as the point
Q, eS such that P[X < l 0.25 and P[X > ]>0.75.Similarly. the third (upper) quar
tile is defined as the point @3 such that P[X < Q,l>0.75 and P[X >Ql>0.25.
variable into equal-probability
Quartiles act to separate the distribution of a random desired separalion, so that,
Tourths (hence, their name). This concept can be applied to any
intotenths, andpercentiles
lor example, quintiles separate Sintofifths, deciles separate S quantle of a
into hundredths. Fully generalized to any desired probability point. the pth
distribution is the point q, that satisies P[X <al> pand P[X > q,]> I-P. for O<p<l.
IH the c.d.f. of X, Fya), is continuous and strictly increasing such that it has an inverse
function F). the quantiles can be defined by inverting Fy(): 4, = '(P).
BIVARIATE DISTRIBUTIONS
f(z)
-1 0.25
1 0.75
Let z and y be variables that assume values in the the sets {1,T2,...,I,} and
{y1-y2... . .ym}, respectively. Then the function f(r;, yi), which gives the relative
frequencies of the occurrence of the pairs (T;, y;) is called a bivariate frequency
function. It must be true that
The values of f(ri, yi) are within the body of the table.
The marginal frequency function of r gives the relative frequencies of the values
of ; regardless of the values of y; with which thev are associated: and it is defined
by
f(ri)= f(ri.yj): i=1,... ,n.
It follows that