7 Random Variables and Distribution Functions
7.1 Introduction
From the universe of possible information, we ask a question. To address this question, we might collect quantitative data and organize it, for example, using the empirical cumulative distribution function. With this information, we are able to compute sample means, standard deviations, medians and so on.
Similarly, even a fairly simple probability model can have an enormous number of outcomes. For example, flip a coin 332 times. Then the number of outcomes is more than a googol (10^100) – a number at least 100 quintillion times the number of elementary particles in the known universe. We may not be interested in an analysis that considers separately every possible outcome but rather some simpler concept like the number of heads or the longest run of tails. To focus our attention on the issues of interest, we take a given outcome and compute a number. This function is called a random variable.

    statistics                                probability
    universe of information                   sample space Ω and probability P
    ask a question and collect data           define a random variable X
    organize into the empirical               organize into the
      cumulative distribution function          cumulative distribution function
    compute sample means and variances        compute distributional means and variances

Table I: Corresponding notions between statistics and probability. Examining probability models and random variables will lead to strategies for the collection of data and inference from these data.
Definition 7.1. A random variable is a real-valued function on the probability space,

    X: Ω → R.
Generally speaking, we shall use capital letters near the end of the alphabet, e.g., X, Y, Z for random variables.
The range S of a random variable is sometimes called the state space.
Exercise 7.2. Roll a die twice and consider the sample space Ω = {(i, j); i, j = 1, 2, 3, 4, 5, 6} and give some random variables on Ω.
Exercise 7.3. Flip a coin 10 times and consider the sample space Ω, the set of 10-tuples of heads and tails, and give some random variables on Ω.
Introduction to the Science of Statistics Random Variables and Distribution Functions
    ω ↦ X(ω) ↦ f(X(ω))
and so on. The last of these, rounding down X to the nearest integer, is called the floor function.
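In R, for example, the floor function is available directly as floor (a small illustration of our own, not from the text):

```r
floor(2.718)    # rounds down to 2
floor(-2.718)   # rounds down to -3, the greatest integer below -2.718
```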
Exercise 7.4. How would we use the floor function to round down a number x to n decimal places?
We write

    {ω ∈ Ω; X(ω) ∈ B}    (7.1)

to indicate those outcomes ω which have X(ω), the value of the random variable, in the subset B. We shall often abbreviate (7.1) to the shorter statement {X ∈ B}. Thus, for the example above, we may write the events
    {X is an odd number}, {X is greater than 1} = {X > 1}, {X is between 2 and 7} = {2 < X < 7}.
Definition 7.6. The (cumulative) distribution function of a random variable X is

    F_X(x) = P{ω ∈ Ω; X(ω) ≤ x}.

Recall that with quantitative observations, we called the analogous notion the empirical cumulative distribution function. Using the abbreviated notation above, we shall typically write the less explicit expression

    F_X(x) = P{X ≤ x}.
Exercise 7.7. For a random variable X and subset B of the sample space S, define

    P_X(B) = P{X ∈ B}.

Show that P_X satisfies the three axioms of a probability.
Choose a < b, then the event {X ≤ a} ⊂ {X ≤ b}. Their set theoretic difference

    {X ≤ b} \ {X ≤ a} = {a < X ≤ b}.

In words, the event that X is less than or equal to b but not less than or equal to a is the event that X is greater than a and less than or equal to b. Consequently, by the difference rule for probabilities,

    P{a < X ≤ b} = P{X ≤ b} − P{X ≤ a} = F_X(b) − F_X(a).
Thus, we can compute the probability that a random variable takes values in an interval by subtracting the distribution function evaluated at the endpoints of the interval. Care is needed on the issue of the inclusion or exclusion of the endpoints of the interval.
Example 7.8. To give the cumulative distribution function for X, the sum of the values for two rolls of a die, we start
with the table
x 2 3 4 5 6 7 8 9 10 11 12
P {X = x} 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36
Figure 7.1: Graph of FX , the cumulative distribution function for the sum of the values for two rolls of a die.
If we look at the graph of this cumulative distribution function, we see that it is constant in between the possible
values for X and that the jump size at x is equal to P {X = x}. In this example, P {X = 5} = 4/36, the size of the
jump at x = 5. In addition,
    F_X(5) − F_X(2) = P{2 < X ≤ 5} = P{X = 3} + P{X = 4} + P{X = 5} = Σ_{2 < x ≤ 5} P{X = x}
                    = 2/36 + 3/36 + 4/36 = 9/36.
We shall call a random variable discrete if it has a finite or countably infinite state space. Thus, we have in general
that:
    P{a < X ≤ b} = Σ_{a < x ≤ b} P{X = x}.
Exercise 7.9. Let X be the number of heads on three independent flips of a biased coin that turns up heads with probability p. Give the cumulative distribution function F_X for X.
Exercise 7.10. Let X be the number of spades in a collection of three cards. Give the cumulative distribution function
for X. Use R to plot this function.
Exercise 7.11. Find the cumulative distribution function of Y = X³ in terms of F_X, the distribution function for X.
A cumulative distribution function F_X has the following properties:

1. F_X is nondecreasing.
2. lim_{x→−∞} F_X(x) = 0 and lim_{x→∞} F_X(x) = 1.
3. F_X is right continuous: for every x_0,

       lim_{x→x_0+} F_X(x) = F_X(x_0).

Exercise 7.13. Prove the statement concerning the right continuity of the distribution function from the continuity property of a probability.

Definition 7.14. A continuous random variable has a cumulative distribution function F_X that is differentiable.

Example 7.15. Consider a dartboard having unit radius. Assume that the dart lands randomly uniformly on the dartboard, and let X be the distance of the dart from the center. For 0 ≤ x ≤ 1, the event {X ≤ x} is the disk of radius x, so

    F_X(x) = P{X ≤ x} = πx²/(π·1²) = x²,

and thus

    F_X(x) = 0    if x < 0,
             x²   if 0 ≤ x < 1,
             1    if 1 ≤ x.
The first line states that X cannot be negative. The third states that X is at most 1, and the middle line describes how X distributes its values between 0 and 1. For example,

    F_X(1/2) = 1/4
indicates that with probability 1/4, the dart will land within 1/2 unit of the center of the dartboard.
Exercise 7.16. Find the probability that the dart lands between 1/3 unit and 2/3 unit from the center.
Exercise 7.17. Let the reward Y for throwing the dart be the inverse 1/X of the distance from the center. Find the cumulative distribution function for Y.
Exercise 7.18. An exponential random variable X has cumulative distribution function

    F_X(x) = P{X ≤ x} = 0               if x ≤ 0,
                        1 − exp(−λx)    if x > 0,    (7.3)

for some λ > 0. Show that F_X has the properties of a distribution function.
Its value at x can be computed in R using the command pexp(x,0.1) for λ = 1/10 and drawn using
> curve(pexp(x,0.1),0,80)
Figure 7.3: Cumulative distribution function for an exponential random variable with λ = 1/10.
Exercise 7.19. The time until the next bus arrives is an exponential random variable with λ = 1/10 minutes. A person waits for a bus at the bus stop until the bus arrives, giving up when the wait reaches 20 minutes. Give the cumulative distribution function for T, the time that the person remains at the bus station, and sketch a graph.
Even though the cumulative distribution function is defined for every random variable, we will often use other characterizations, namely, the mass function for discrete random variables and the density function for continuous random variables. Indeed, we typically will introduce a random variable via one of these two functions. In the next two sections we introduce these two concepts and develop some of their properties.
Definition 7.20. The (probability) mass function of a discrete random variable X is

    f_X(x) = P{X = x}.

The mass function has two basic properties:

1. f_X(x) ≥ 0 for all x in the state space S.
2. Σ_{x ∈ S} f_X(x) = 1.

The first property is based on the fact that probabilities are non-negative. The second follows from the observation that the collection C_x = {ω; X(ω) = x} for all x ∈ S, the state space for X, forms a partition of the probability space Ω. In Example 7.8, we saw the mass function for the random variable X that is the sum of the values on two independent rolls of a fair die.
Example 7.21. Let’s make tosses of a biased coin whose outcomes are independent. We shall continue tossing until
we obtain a toss of heads. Let X denote the random variable that gives the number of tails before the first head and p
denote the probability of heads in any given toss. Then
    f_X(0) = P{X = 0} = P{H} = p
    f_X(1) = P{X = 1} = P{TH} = (1 − p)p
    f_X(2) = P{X = 2} = P{TTH} = (1 − p)²p
        ⋮
    f_X(x) = P{X = x} = P{T···TH} = (1 − p)^x p

So, the probability mass function is f_X(x) = (1 − p)^x p. Because the terms in this mass function form a geometric sequence, X is called a geometric random variable. Recall that a geometric sequence c, cr, cr², . . . , cr^n has sum

    s_n = c + cr + cr² + ··· + cr^n = c(1 − r^{n+1})/(1 − r)

for r ≠ 1. If |r| < 1, then lim_{n→∞} r^n = 0 and thus s_n has a limit as n → ∞. In this case, the infinite sum is the limit

    c + cr + cr² + ··· + cr^n + ··· = lim_{n→∞} s_n = c/(1 − r).
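As a quick numerical check that the geometric mass function sums to 1, we can truncate the infinite series at a large n (an illustration of our own):

```r
p <- 1/3
x <- 0:200
sum((1 - p)^x * p)   # partial sum of the geometric series; essentially 1
```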
Exercise 7.22. Establish the formula above for sn .
The mass function above forms a geometric sequence with the ratio r = 1 − p. Consequently, for positive integers a and b,

    P{a < X ≤ b} = Σ_{x=a+1}^{b} (1 − p)^x p = (1 − p)^{a+1}p + ··· + (1 − p)^b p

                 = [(1 − p)^{a+1}p − (1 − p)^{b+1}p] / [1 − (1 − p)] = (1 − p)^{a+1} − (1 − p)^{b+1}.

We can take a = 0 and add F_X(0) = f_X(0) = p to find the distribution function for a geometric random variable:

    F_X(b) = P{X ≤ b} = P{X = 0} + P{0 < X ≤ b} = p + (1 − p) − (1 − p)^{b+1} = 1 − (1 − p)^{b+1}.
Exercise 7.23. Give a second way to find the distribution function above by explaining why P{X > b} = (1 − p)^{b+1}.
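R's pgeom also counts tails before the first head, exactly our X, so it can be used to check the formula F_X(b) = 1 − (1 − p)^{b+1} numerically (a quick sketch of our own):

```r
p <- 1/3
b <- 4
pgeom(b, p)          # P{X <= 4} computed by R
1 - (1 - p)^(b + 1)  # the formula derived above; the two values agree
```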
The mass function and the cumulative distribution function for the geometric random variable with parameter
p = 1/3 can be found in R by writing
> x<-c(0:10)
> f<-dgeom(x,1/3)
> F<-pgeom(x,1/3)
The initial d indicates density and p indicates the probability from the distribution function.
> data.frame(x,f,F)
x f F
1 0 0.333333333 0.3333333
2 1 0.222222222 0.5555556
3 2 0.148148148 0.7037037
4 3 0.098765432 0.8024691
5 4 0.065843621 0.8683128
6 5 0.043895748 0.9122085
7 6 0.029263832 0.9414723
8 7 0.019509221 0.9609816
9 8 0.013006147 0.9739877
10 9 0.008670765 0.9826585
11 10 0.005780510 0.9884390
Note that the difference in values in the distribution function, F_X(x) − F_X(x − 1), giving the height of the jump in F_X at x, is equal to the value of the mass function. For example,

    F_X(3) − F_X(2) = 0.8024691 − 0.7037037 = 0.0987654 = f_X(3).

Exercise 7.24. Check that the jumps in the cumulative distribution function for the geometric random variable above are equal to the values of the mass function.
Exercise 7.25. For the geometric random variable above, find P{X ≤ 3}, P{2 < X ≤ 5}, and P{X > 4}.
We can simulate 100 geometric random variables with parameter p = 1/3 using the R command rgeom(100,1/3).
(See Figure 7.4.)
Figure 7.4: Histogram of 100 and 10,000 simulated geometric random variables with p = 1/3. Note that the histogram looks much more like a
geometric series for 10,000 simulations. We shall see later how this relates to the law of large numbers.
Definition 7.26. For X a random variable whose distribution function F_X has a derivative, the function f_X satisfying

    F_X(x) = ∫_{−∞}^{x} f_X(t) dt

is called the probability density function and X is called a continuous random variable.
By the fundamental theorem of calculus, the density function is the derivative of the distribution function.
    f_X(x) = lim_{Δx→0} [F_X(x + Δx) − F_X(x)] / Δx = F′_X(x).
In other words,
    F_X(x + Δx) − F_X(x) ≈ f_X(x)Δx.
We can compute probabilities by evaluating definite integrals
    P{a < X ≤ b} = F_X(b) − F_X(a) = ∫_a^b f_X(t) dt.
For the dartboard example, where F_X(x) = x² on [0, 1], and for small Δx,

    P{x < X ≤ x + Δx} = F_X(x + Δx) − F_X(x) ≈ f_X(x)Δx = 2xΔx

Figure 7.5: The probability P{a < X ≤ b} is the area under the density function, above the x axis, between x = a and x = b.
and X has density

    f_X(x) = 0     if x < 0,
             2x    if 0 ≤ x ≤ 1,
             0     if x > 1.
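We can check numerically that this density integrates to 1, as a probability density must (a quick sketch using R's integrate):

```r
total <- integrate(function(x) 2*x, lower = 0, upper = 1)  # area under the density
total$value   # equals 1
```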
Exercise 7.27. Let fX be the density for a random variable X and pick a number x0 . Explain why P {X = x0 } = 0.
Example 7.28. For the exponential distribution function (7.3), we have the density function
    f_X(x) = 0           if x ≤ 0,
             λe^{−λx}    if x > 0.
Example 7.29. Density functions do not need to be bounded, for example, if we take
    f_X(x) = 0       if x ≤ 0,
             c/√x    if 0 < x < 1,
             0       if 1 ≤ x.
The total area under the density must be 1. Because

    ∫_0^1 c/√t dt = 2c√t |_0^1 = 2c,

we must have 2c = 1. So c = 1/2.
For 0 ≤ a < b ≤ 1,

    P{a < X ≤ b} = ∫_a^b 1/(2√t) dt = √t |_a^b = √b − √a.
Exercise 7.30. Give the cumulative distribution function for the random variable in the previous example.
Exercise 7.31. Let X be a continuous random variable with density fX , then the random variable Y = aX + b has
density

    f_Y(y) = (1/|a|) f_X((y − b)/a).
(Hint: Begin with the definition of the cumulative distribution function FY for Y . Consider the cases a > 0 and a < 0
separately.)
For continuous random variables, we consider B_1 = (x_1, x_1 + Δx_1] and B_2 = (x_2, x_2 + Δx_2] and ask that, for some function f_{X_1,X_2}, the joint probability density function,

    P{X_1 ∈ B_1, X_2 ∈ B_2} ≈ f_{X_1,X_2}(x_1, x_2)Δx_1Δx_2.

Example 7.32. Generalize the notion of mass and density functions to more than two random variables.

Random variables X_1 and X_2 are independent if, for every choice of subsets B_1 and B_2,

    P{X_1 ∈ B_1, X_2 ∈ B_2} = P{X_1 ∈ B_1}P{X_2 ∈ B_2}.

In words, the probability that the two events {X_1 ∈ B_1} and {X_2 ∈ B_2} happen simultaneously is equal to the product of the probabilities that each of them happen individually.

For independent discrete random variables, we have that

    f_{X_1,X_2}(x_1, x_2) = P{X_1 = x_1, X_2 = x_2} = P{X_1 = x_1}P{X_2 = x_2} = f_{X_1}(x_1)f_{X_2}(x_2).

In this case, we say that the joint probability mass function is the product of the marginal mass functions.
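For example, for two independent rolls of a fair die, every pair of values has joint mass 1/36; in R (an illustration of our own):

```r
f1 <- rep(1/6, 6)        # marginal mass function for a single roll
joint <- outer(f1, f1)   # joint mass function for two independent rolls
joint[2, 5]              # P{X1 = 2, X2 = 5} = 1/36
```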
Thus, for independent continuous random variables, the joint probability density function is the product of the marginal density functions,

    f_{X_1,X_2}(x_1, x_2) = f_{X_1}(x_1)f_{X_2}(x_2).
To simulate discrete random variables in R, we can use the sample command with the prob option. For example, take the values x = 1, 2, 3, 4 with mass function f. First check that f sums to 1.
> x<-c(1,2,3,4)
> f<-c(0.1,0.2,0.3,0.4)
> sum(f)
[1] 1
> data<-sample(x,50,replace=TRUE,prob=f)
> data
[1] 1 4 4 4 4 4 3 3 4 3 3 2 3 3 3 4 4 3 3 2 4 1 3 3 4 2 3 3 3 1 2 4 3 2 3 4 4 4 4 2 4 1
[43] 2 3 4 4 1 4 3 4
Notice that 1 is the least represented value and 4 is the most represented. If the command prob=f is omitted, then
sample will choose uniformly from the values in the vector x. Let’s check our simulation against the mass function
that generated the data. (Notice the double equal sign, ==.) First, recount the observations that take on each possible
value for x. We can make a table.
> table(data)
data
1 2 3 4
5 7 18 20
> counts<-rep(0,max(x)-min(x)+1)
> for (i in min(x):max(x)){counts[i]<-length(data[data==i])}
> simprob<-counts/(sum(counts))
> data.frame(x,f,simprob)
x f simprob
1 1 0.1 0.10
2 2 0.2 0.14
3 3 0.3 0.36
4 4 0.4 0.40
Exercise 7.35. Simulate the sums on each of 20 rolls of a pair of dice. Repeat this for 1000 rolls and compare the
simulation with the appropriate mass function.
Figure 7.6: Illustrating the probability transform. First simulate uniform random variables u_1, u_2, . . . , u_n on the interval [0, 1]. About 10% of the random numbers should be in the interval [0.3, 0.4]. This corresponds to the 10% of the simulations on the interval [0.28, 0.38] for a random variable with distribution function F_X shown. Similarly, about 10% of the random numbers should be in the interval [0.7, 0.8], which corresponds to the 10% of the simulations on the interval [0.96, 1.51] for a random variable with distribution function F_X. These values on the x-axis can be obtained by taking the inverse function of F_X, i.e., x_i = F_X^{−1}(u_i).
Given a random variable X with distribution function F_X, consider the random variable

    U = F_X(X).

Note that F_X has range from 0 to 1. It cannot take values below 0 or above 1. Thus, U takes on values between 0 and 1, and so

    F_U(u) = 0 for u < 0  and  F_U(u) = 1 for u ≥ 1.

For values of u between 0 and 1, note that for F_X increasing and continuous, {F_X(X) ≤ u} = {X ≤ F_X^{−1}(u)}, and therefore

    F_U(u) = P{F_X(X) ≤ u} = P{X ≤ F_X^{−1}(u)} = F_X(F_X^{−1}(u)) = u.

In other words, U is a uniform random variable on the interval [0, 1].
If we can simulate U, we can simulate a random variable with distribution F_X via the quantile function

    X = F_X^{−1}(U).    (7.4)
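For example, for the exponential distribution function (7.3), solving u = 1 − exp(−λx) for x gives the quantile function F_X^{−1}(u) = −ln(1 − u)/λ. A sketch of the simulation (rate λ = 1/10 as above):

```r
lambda <- 1/10
u <- runif(10000)          # simulate uniform random variables on [0, 1]
x <- -log(1 - u)/lambda    # apply the quantile function to each
mean(x <= 5)               # close to pexp(5, lambda), approximately 0.393
```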
Example 7.37. For the dart board, for x between 0 and 1, the distribution function is

    u = F_X(x) = x²  and thus the quantile function is  x = F_X^{−1}(u) = √u.

We can simulate independent observations of the distance from the center by simulating independent uniform random variables U_1, U_2, . . . , U_n and applying the quantile function,

    X_i = √(U_i).

Figure 7.7: The distribution function (red) and the empirical cumulative distribution function (black) based on 100 simulations of the dart board distribution. R commands given below.
> u<-runif(100)
> x<-sqrt(u)
> xd<-seq(0,1,0.01)
> plot(sort(x),1:length(x)/length(x),
type="s",xlim=c(0,1),ylim=c(0,1), xlab="x",ylab="probability")
> par(new=TRUE)
> plot(xd,xd^2,type="l",xlim=c(0,1),ylim=c(0,1),xlab="",ylab="",col="red")
Answers to selected exercises

7.7. Let’s check the three axioms. Each verification is based on the corresponding axiom for the probability P.
1. For any subset B, P_X(B) = P{X ∈ B} ≥ 0.
2. For the sample space S, P_X(S) = P{X ∈ S} = P(Ω) = 1.
3. For mutually exclusive subsets B_i, i = 1, 2, . . ., we have by the exercise above the mutually exclusive events {X ∈ B_i}, i = 1, 2, . . .. Thus,

    P_X(∪_{i=1}^∞ B_i) = P{X ∈ ∪_{i=1}^∞ B_i} = P(∪_{i=1}^∞ {X ∈ B_i}) = Σ_{i=1}^∞ P{X ∈ B_i} = Σ_{i=1}^∞ P_X(B_i).
7.10. From the example in the section Basics of Probability, we know that
x 0 1 2 3
P {X = x} 0.41353 0.43588 0.13765 0.01294
To plot the distribution function, we use,
> hearts<-c(0:3)
> f<-choose(13,hearts)*choose(39,3-hearts)/choose(52,3)
> (F<-cumsum(f))
[1] 0.4135294 0.8494118 0.9870588 1.0000000
> plot(hearts,F,ylim=c(0,1),type="s")
Thus, the cumulative distribution function,

    F_X(x) = 0         for x < 0,
             0.41353   for 0 ≤ x < 1,
             0.84941   for 1 ≤ x < 2,
             0.98706   for 2 ≤ x < 3,
             1         for 3 ≤ x.

7.11. F_Y(y) = P{Y ≤ y} = P{X³ ≤ y} = P{X ≤ ∛y} = F_X(∛y).
7.13. Let x_1 > x_2 > · · · be a strictly decreasing sequence with limit x_0. Then the events {X ≤ x_1} ⊃ {X ≤ x_2} ⊃ · · · with intersection ∩_n {X ≤ x_n} = {X ≤ x_0}. (Check this last equality.) Then P{X ≤ x_1} ≥ P{X ≤ x_2} ≥ · · ·. Now, use the second continuity property of probabilities to obtain lim_{n→∞} F_X(x_n) = lim_{n→∞} P{X ≤ x_n} = P{X ≤ x_0} = F_X(x_0). Because this holds for every strictly decreasing sequence with limit x_0, we have that

    lim_{x→x_0+} F_X(x) = F_X(x_0).
7.17. Using the relation Y = 1/X, we find the distribution function for Y, for y ≥ 1,

    F_Y(y) = P{Y ≤ y} = P{1/X ≤ y} = P{X ≥ 1/y} = 1 − P{X < 1/y} = 1 − 1/y².

This uses the fact that P{X = 1/y} = 0.
7.18. We use the fact that the exponential function is increasing, and that lim_{u→∞} exp(−u) = 0. Using the numbering of the properties above
7.19. The distribution function has the graph shown in Figure 7.8.
Figure 7.8: Cumulative distribution function for an exponential random variable with λ = 1/10 and a jump at x = 20.
The formula:

    F_T(x) = P{T ≤ x} = 0                   if x < 0,
                        1 − exp(−x/10)      if 0 ≤ x < 20,
                        1                   if 20 ≤ x.
7.22. For r ≠ 1, write the expressions for s_n and rs_n and subtract. Notice that most of the terms cancel.

    s_n  = c + cr + cr² + ··· + cr^n
    rs_n =     cr + cr² + ··· + cr^n + cr^{n+1}

    (1 − r)s_n = c − cr^{n+1} = c(1 − r^{n+1})

Thus, s_n = c(1 − r^{n+1})/(1 − r).
7.30. Because the density is non-negative on the interval [0, 1], F_X(x) = 0 if x < 0 and F_X(x) = 1 if x ≥ 1. For x between 0 and 1,

    ∫_0^x 1/(2√t) dt = √t |_0^x = √x.

Thus,

    F_X(x) = 0     if x ≤ 0,
             √x    if 0 < x < 1,
             1     if 1 ≤ x.
7.31. For a > 0,

    F_Y(y) = P{X ≤ (y − b)/a} = F_X((y − b)/a).

Now take a derivative and use the chain rule to find the density

    f_Y(y) = F′_Y(y) = f_X((y − b)/a) · (1/a) = (1/|a|) f_X((y − b)/a).

For a < 0,

    F_Y(y) = P{X ≥ (y − b)/a} = 1 − F_X((y − b)/a).

Now the derivative

    f_Y(y) = F′_Y(y) = −f_X((y − b)/a) · (1/a) = (1/|a|) f_X((y − b)/a).
7.32. For independent random variables X_1, X_2, . . . , X_n,

    f_{X_1,X_2,...,X_n}(x_1, x_2, . . . , x_n) = f_{X_1}(x_1) f_{X_2}(x_2) · · · f_{X_n}(x_n).
Figure 7.9: Sum on two fair dice. The empirical cumulative distribution function from the simulation (in black) and the cumulative distribution function (in red) are shown for Exercise 7.35.
3 4 0.08333333 0.065
4 5 0.11111111 0.096
5 6 0.13888889 0.120
6 7 0.16666667 0.167
7 8 0.13888889 0.157
8 9 0.11111111 0.121
9 10 0.08333333 0.098
10 11 0.05555556 0.058
11 12 0.02777778 0.033
We also have a plot to compare the empirical cumulative distribution function from the simulation with the cumu-
lative distribution function.
> plot(sort(twodice),1:length(twodice)/length(twodice),type="s",xlim=c(2,12),
ylim=c(0,1),xlab="",ylab="")
> par(new=TRUE)
> plot(x,F,type="s",xlim=c(2,12),ylim=c(0,1),col="red")
7.39. F_X is increasing and continuous, so the set {x; F_X(x) ≤ u} is the interval (−∞, F_X^{−1}(u)]. In addition, x is in this interval precisely when x ≤ F_X^{−1}(u).
7.40. Let’s find F_V. If v < 0, then

    0 ≤ P{V ≤ v} ≤ P{V ≤ 0} = P{1 − U ≤ 0} = P{1 ≤ U} = 0

because U is never greater than 1. Thus, F_V(v) = 0. Similarly, if v ≥ 1,

    1 ≥ P{V ≤ v} ≥ P{V ≤ 1} = P{1 − U ≤ 1} = P{0 ≤ U} = 1

because U is always greater than 0. Thus, F_V(v) = 1. For 0 ≤ v < 1,

    F_V(v) = P{V ≤ v} = P{1 − U ≤ v} = P{1 − v ≤ U} = 1 − P{U < 1 − v} = 1 − (1 − v) = v.

This matches the distribution function of a uniform random variable on [0, 1].