Probability Distributions
A random variable is discrete if the set of all its possible values is finite or
countably infinite
Discrete random variables usually produce values that are nonnegative whole numbers
Discrete distributions are constructed from discrete random variables
CONTINUOUS RANDOM VARIABLES
PROBABILITY DISTRIBUTION FOR THE NUMBER OF AUTOMOBILES SOLD DURING A DAY AT DICARLO
MOTORS
DISCRETE UNIFORM PROBABILITY DISTRIBUTION
WHY DO WE NEED PROBABILITY DISTRIBUTIONS?
The standard deviation, σ, is defined as the positive square root of the variance
The standard deviation is measured in the same units as the random variable and
therefore is often preferred in describing the variability of a random variable
MEAN, VARIANCE, AND STANDARD DEVIATION
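As a minimal sketch of these definitions, the Python below computes the mean, variance, and standard deviation of a discrete random variable from its probability function. The fair six-sided die (a discrete uniform distribution) and the function name `summarize` are illustrative assumptions, not data from these slides.

```python
import math

def summarize(distribution):
    """Return (mean, variance, std dev) for a dict mapping value -> probability."""
    # Expected value: probability-weighted average of the values.
    mean = sum(x * p for x, p in distribution.items())
    # Variance: probability-weighted average of squared deviations from the mean.
    variance = sum((x - mean) ** 2 * p for x, p in distribution.items())
    # Standard deviation: positive square root of the variance.
    return mean, variance, math.sqrt(variance)

# Assumed example: a fair die, f(x) = 1/6 for x = 1..6 (discrete uniform).
die = {x: 1 / 6 for x in range(1, 7)}
mean, variance, std_dev = summarize(die)
print(f"mean={mean:.4f} variance={variance:.4f} std_dev={std_dev:.4f}")
```

The standard deviation comes out in the same units as the die values, which is why it is usually the easier measure of variability to interpret.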
BINOMIAL PROBABILITY DISTRIBUTION
Consider the experiment of tossing a coin five times and on each toss observing whether
the coin lands with a head or a tail on its upward face
In a binomial distribution experiment, any one trial can have only two possible, mutually
exclusive outcomes (right-handed/left-handed, defective/good, male/female, etc.).
MARTIN CLOTHING STORE PROBLEM
Let us consider the purchase decisions of the next three customers who enter the Martin
Clothing Store. On the basis of past experience, the store manager estimates the probability
that any one customer will make a purchase is .30. What is the probability that two of the next
three customers will make a purchase?
BINOMIAL PROBABILITY FUNCTION
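The standard binomial probability function is f(x) = C(n, x) p^x (1 - p)^(n - x), where n is the number of trials, p is the probability of success on any one trial, and C(n, x) counts the ways to choose x successes out of n trials. A minimal Python sketch, applied to the three-customer Martin Clothing Store question (the function name `binomial_pmf` is an illustrative choice):

```python
from math import comb

def binomial_pmf(x, n, p):
    """Probability of exactly x successes in n independent trials."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)

# Martin Clothing Store: n = 3 customers, p = .30, x = 2 purchases.
prob_two_of_three = binomial_pmf(2, 3, 0.30)
print(round(prob_two_of_three, 3))  # 0.189
```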
SOLVE
Consider the purchase decisions of the next ten customers who enter the Martin
Clothing Store. On the basis of past experience, the store manager estimates the
probability that any one customer will make a purchase is .30. What is the
probability that four of the next ten customers will make a purchase?
USING TABLES OF BINOMIAL PROBABILITIES
BINOMDIST has four arguments:
• x (the number of successes)
• n (the number of trials)
• p (the probability of success)
• Cumulative (True/False)
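BINOMDIST is a spreadsheet function (Excel). For readers working outside a spreadsheet, the sketch below mirrors the same four arguments in Python. The function `binomdist` is a hypothetical stand-in written for this illustration, not a library call; it is applied to the ten-customer Martin Clothing Store problem.

```python
from math import comb

def binomdist(x, n, p, cumulative):
    """Illustrative stand-in for a BINOMDIST-style call (not Excel itself)."""
    def pmf(k):
        return comb(n, k) * p**k * (1 - p) ** (n - k)

    if cumulative:
        # True: probability of x or fewer successes.
        return sum(pmf(k) for k in range(x + 1))
    # False: probability of exactly x successes.
    return pmf(x)

exactly_four = binomdist(4, 10, 0.30, False)
four_or_fewer = binomdist(4, 10, 0.30, True)
print(round(exactly_four, 4), round(four_or_fewer, 4))
```

With Cumulative set to False the call answers "exactly four of ten"; with True it answers "four or fewer of ten".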
EXPECTED VALUE AND VARIANCE FOR THE BINOMIAL DISTRIBUTION
Suppose that for the next month the Martin Clothing Store forecasts 1000 customers will
enter the store. What is the expected number of customers who will make a purchase?
What are the variance and standard deviation of the number of customers who will make
a purchase?
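For a binomial distribution the expected value is np and the variance is np(1 - p). A short sketch of the arithmetic for this forecast, with variable names chosen for illustration:

```python
import math

n = 1000   # forecast number of customers entering the store
p = 0.30   # probability that any one customer makes a purchase

expected_purchases = n * p          # E(x) = np
variance = n * p * (1 - p)          # Var(x) = np(1 - p)
std_dev = math.sqrt(variance)       # positive square root of the variance

print(round(expected_purchases), round(variance), round(std_dev, 2))
```

This gives about 300 expected purchases, a variance of about 210, and a standard deviation of roughly 14.49 customers.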
PRACTICE PROBLEM
What is the first big change that American drivers made due to higher gas prices? According to an Access
America survey, 30% said that it was cutting recreational driving. However, 27% said that it was
consolidating or reducing errands. If these figures are true for all American drivers, and if 20 such drivers
are randomly sampled and asked what is the first big change they made due to higher gas prices,
a. What is the probability that exactly 8 said that it was consolidating or reducing errands?
b. What is the probability that none of them said that it was cutting recreational driving?
c. What is the probability that more than 7 said that it was cutting recreational driving?
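One way to set up the three parts, assuming each sampled driver is an independent binomial trial with n = 20. The variable names are illustrative, and part c uses the complement of the cumulative probability of 7 or fewer.

```python
from math import comb

def binomial_pmf(x, n, p):
    """Probability of exactly x successes in n independent trials."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)

n = 20
p_errands = 0.27        # consolidating or reducing errands
p_recreational = 0.30   # cutting recreational driving

# a. Exactly 8 drivers name consolidating or reducing errands.
part_a = binomial_pmf(8, n, p_errands)
# b. None of the 20 drivers names cutting recreational driving.
part_b = binomial_pmf(0, n, p_recreational)
# c. More than 7 name cutting recreational driving: 1 - P(x <= 7).
part_c = 1 - sum(binomial_pmf(k, n, p_recreational) for k in range(8))

print(round(part_a, 4), round(part_b, 4), round(part_c, 4))
```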
POISSON PROBABILITY DISTRIBUTION
Useful in estimating the number of occurrences (random) over a specified interval of time
or space
Often used to model random arrivals in waiting line situations
the number of arrivals at a car wash in one hour
the number of repairs needed in 10 miles of highway
the number of leaks in 100 miles of pipeline
the number of serious accidents at a chemical plant
COMPARE BINOMIAL WITH POISSON DISTRIBUTION
Suppose the average number of car arrivals at the drive-up teller window of a bank during a
15-minute period on weekday mornings is 10.
What is the probability of exactly five arrivals in 15 minutes?
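The standard Poisson probability function is f(x) = λ^x e^(-λ) / x!, where λ is the mean number of occurrences in the interval. A minimal sketch for the bank teller question, with λ = 10 arrivals per 15 minutes (the function name `poisson_pmf` is an illustrative choice):

```python
from math import exp, factorial

def poisson_pmf(x, lam):
    """Probability of exactly x occurrences when the interval mean is lam."""
    return lam**x * exp(-lam) / factorial(x)

# Mean of 10 arrivals per 15-minute period; exactly 5 arrivals.
prob_five_arrivals = poisson_pmf(5, 10)
print(round(prob_five_arrivals, 4))  # 0.0378
```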
EXAMPLE:
Suppose the probability of a bank making a mistake in processing a deposit is .0003. If 10,000
deposits (n) are audited, what is the probability that more than 6 mistakes were made in
processing deposits?
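With n large and p very small, a Poisson distribution with λ = np is commonly used to approximate the binomial. The sketch below computes the answer both ways so the two can be compared; the variable names are illustrative.

```python
from math import comb, exp, factorial

n = 10_000     # deposits audited
p = 0.0003     # probability of a mistake on any one deposit
lam = n * p    # Poisson mean, np = 3 mistakes

# Poisson approximation: P(x > 6) = 1 - P(x <= 6).
poisson_more_than_6 = 1 - sum(
    lam**k * exp(-lam) / factorial(k) for k in range(7)
)

# Exact binomial probability of the same event.
binomial_more_than_6 = 1 - sum(
    comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(7)
)

print(round(poisson_more_than_6, 4), round(binomial_more_than_6, 4))
```

Both values come out near .0335, which is why the simpler Poisson calculation is acceptable here.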
HYPERGEOMETRIC PROBABILITY DISTRIBUTION
Trials are not independent, and the probability of success changes from trial to trial.
Used when sampling is done without replacement; information about the population
makeup must be known in order to redetermine the probability of a success in each
successive trial as the probability changes
It is a discrete distribution
Each outcome consists of either a success or a failure
Sampling is done without replacement
The population, N, is finite and known
The number of successes in the population, A, is known
HYPERGEOMETRIC PROBABILITY DISTRIBUTION FORMULAE
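The standard hypergeometric probability function, written with the symbols used above (N for the population size, A for the number of successes in the population) plus two assumed symbols, n for the sample size and x for the number of successes in the sample:

```latex
P(x) = \frac{\binom{A}{x}\,\binom{N-A}{n-x}}{\binom{N}{n}}
```

The numerator counts the samples that contain exactly x successes and n - x failures; the denominator counts all possible samples of size n drawn without replacement.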
BINOMIAL DISTRIBUTION AS AN APPROXIMATION OF HYPERGEOMETRIC
DISTRIBUTION
Even though the binomial distribution theoretically applies only when sampling is done
with replacement and p stays constant, recall that, if the population is large enough in
comparison with the sample size, the impact of sampling without replacement on p is
minimal. Thus the binomial distribution can be used in some situations when sampling
is done without replacement. Because of the tables available, using the binomial
distribution instead of the hypergeometric distribution whenever possible is preferable.
As a rule of thumb, if the sample size is less than 5% of the population, use
of the binomial distribution rather than the hypergeometric distribution is
acceptable
The hypergeometric distribution yields the exact probability when sampling is done
without replacement, and the binomial distribution yields a good approximation of the
probability in these situations
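A small numerical check of the 5% rule of thumb. Both populations below are hypothetical figures chosen for illustration: each has 30% successes, but the sample of 10 is 1% of the first population and 50% of the second.

```python
from math import comb

def hypergeometric_pmf(x, N, A, n):
    """Exact probability of x successes when sampling n of N without replacement."""
    return comb(A, x) * comb(N - A, n - x) / comb(N, n)

def binomial_pmf(x, n, p):
    """Binomial probability of x successes in n trials with constant p."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)

n, x = 10, 4
approximation = binomial_pmf(x, n, 0.30)

# Hypothetical large population: sample is 1% of N = 1000.
error_small_sample = abs(hypergeometric_pmf(x, 1000, 300, n) - approximation)
# Hypothetical small population: sample is 50% of N = 20.
error_large_sample = abs(hypergeometric_pmf(x, 20, 6, n) - approximation)

print(round(error_small_sample, 4), round(error_large_sample, 4))
```

The approximation error is much smaller when the sample is a small fraction of the population, which is the situation the rule of thumb describes.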
EXAMPLE
Suppose 18 major computer companies operate in the United States and that 12 are located
in California’s Silicon Valley. If three computer companies are selected randomly from the
entire list, what is the probability that one or more of the selected companies are located in
the Silicon Valley?
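A sketch of this example using the complement: the probability of one or more Silicon Valley companies equals 1 minus the probability of none. Here N = 18, A = 12, and the sample size n = 3; the function name is an illustrative choice.

```python
from math import comb

def hypergeometric_pmf(x, N, A, n):
    """Probability of x successes when sampling n of N without replacement."""
    return comb(A, x) * comb(N - A, n - x) / comb(N, n)

N, A, n = 18, 12, 3

# P(x >= 1) = 1 - P(x = 0)
prob_none = hypergeometric_pmf(0, N, A, n)
prob_one_or_more = 1 - prob_none
print(round(prob_one_or_more, 4))  # 0.9755
```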
CONTINUOUS DISTRIBUTIONS
Constructed from continuous random variables in which values are taken on for
every point over a given interval
Usually generated from experiments in which things are “measured” as opposed
to “counted.”
Probabilities of outcomes occurring between particular points are determined by
calculating the area under the curve between those points
The area under the graph of f(x), the probability density function, over an
interval gives the probability that the continuous random variable x assumes a
value in that interval.
The entire area under the whole curve is equal to 1.
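To make the area idea concrete, the sketch below approximates areas under a simple density numerically. The density f(x) = 2x on [0, 1] is an assumed example chosen because its areas are easy to check by hand; it does not come from these slides.

```python
def density(x):
    """Assumed example density: f(x) = 2x on [0, 1], zero elsewhere."""
    return 2 * x if 0 <= x <= 1 else 0.0

def area(f, a, b, steps=10_000):
    """Approximate the area under f between a and b with the midpoint rule."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width

total_area = area(density, 0, 1)          # whole curve: should be 1
prob_middle = area(density, 0.25, 0.75)   # P(0.25 <= x <= 0.75)
print(round(total_area, 4), round(prob_middle, 4))
```

By hand, the second area is 0.75² - 0.25² = 0.5, so half of the probability lies between 0.25 and 0.75.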
DISCRETE VS CONTINUOUS
Bell shaped
Unimodal
Symmetrical about its mean
Location is characterized by the mean, μ
Spread is characterized by the standard deviation, σ
The mechanism by which all normal distributions can be converted into a single distribution: the z
distribution
A z score is the number of standard deviations that a value, x, is above or below the mean
This formula allows conversion of the distance of any x value from its mean into standard deviation units.
A standard z score table can be used to find probabilities for any normal curve problem that has been
converted to z scores.
The z distribution is a normal distribution with a mean of 0 and a standard deviation of 1.
Any value of x at the mean of a normal curve is zero standard deviations from the mean. Any value of x
that is one standard deviation above the mean has a z value of 1.
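The z score formula is standard. Written out in LaTeX, with x the value of interest, μ the mean, and σ the standard deviation:

```latex
z = \frac{x - \mu}{\sigma}
```

For x = μ the numerator is zero, so z = 0; for x = μ + σ the ratio is σ/σ, so z = 1, matching the two cases described above.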
STANDARDIZED NORMAL DISTRIBUTION
z Distribution
SOLVING NORMAL CURVE PROBLEMS
The Graduate Management Admission Test (GMAT), produced by the Educational Testing
Service in Princeton, New Jersey, is widely used by graduate schools of business in the United
States as an entrance requirement. Assuming that the scores are normally distributed,
probabilities of achieving scores over various ranges of the GMAT can be determined. In a
recent year, the mean GMAT score was 494 and the standard deviation was about 100. What
is the probability that a randomly selected score from this administration of the GMAT is
between 600 and the mean?
A GMAT score of 600 is 1.06 standard deviations above the mean.
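The arithmetic behind this answer, worked in LaTeX. The value .3554 is the standard normal table entry for the area between z = 0 and z = 1.06.

```latex
z = \frac{x - \mu}{\sigma} = \frac{600 - 494}{100} = 1.06
\qquad
P(494 \le x \le 600) = P(0 \le z \le 1.06) \approx 0.3554
```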
What is the probability of obtaining a score greater than 700 on a GMAT test
that has a mean of 494 and a standard deviation of 100? Assume GMAT
scores are normally distributed.
For the same GMAT examination, what is the probability of randomly
drawing a score that is 550 or less?
What is the probability of randomly obtaining a score between 300
and 600 on the GMAT exam?
What is the probability of getting a score between 350 and 450 on the
same GMAT exam?
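The four questions above can be checked without a z table. The sketch below builds the normal cumulative probability from `math.erf` in the standard library; the function and variable names are illustrative, and the results may differ from table lookups in the fourth decimal place because z is not rounded.

```python
from math import erf, sqrt

MEAN, STD_DEV = 494, 100   # GMAT mean and standard deviation from the problem

def normal_cdf(x, mean=MEAN, std_dev=STD_DEV):
    """P(score <= x) for a normal distribution, via the z score."""
    z = (x - mean) / std_dev
    return 0.5 * (1 + erf(z / sqrt(2)))

prob_over_700 = 1 - normal_cdf(700)                    # upper tail
prob_550_or_less = normal_cdf(550)                     # lower tail plus part of upper half
prob_300_to_600 = normal_cdf(600) - normal_cdf(300)    # interval spanning the mean
prob_350_to_450 = normal_cdf(450) - normal_cdf(350)    # interval on one side of the mean

for label, value in [
    ("score > 700", prob_over_700),
    ("score <= 550", prob_550_or_less),
    ("300 to 600", prob_300_to_600),
    ("350 to 450", prob_350_to_450),
]:
    print(f"{label}: {value:.4f}")
```

Each case reduces to a difference of two cumulative probabilities, which is the same logic as adding or subtracting areas read from the z table.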
EXPONENTIAL DISTRIBUTION
It describes a probability distribution of the times between random occurrences
It is a family of distributions
It is skewed to the right
The x values range from zero to infinity
Its apex is always at x = 0
The curve steadily decreases as x gets larger
The mean and standard deviation are equal
SCHIPS LOADING DOCK PROBLEM
Suppose that x represents the loading time for a truck at the Schips loading dock and follows
an exponential distribution. The mean, or average, loading time is 15 minutes.
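For an exponential distribution with mean μ, the cumulative probability is P(x ≤ x0) = 1 - e^(-x0/μ). The sketch below applies this with μ = 15 minutes. The 6-minute and 18-minute thresholds are assumed values chosen for illustration, since no specific question is stated here.

```python
from math import exp

MEAN_LOADING_MINUTES = 15   # mean loading time from the problem statement

def prob_loading_within(minutes, mean=MEAN_LOADING_MINUTES):
    """P(x <= minutes) for an exponential distribution with the given mean."""
    return 1 - exp(-minutes / mean)

# Assumed thresholds for illustration only.
within_6 = prob_loading_within(6)
within_18 = prob_loading_within(18)
between_6_and_18 = within_18 - within_6

print(round(within_6, 4), round(within_18, 4), round(between_6_and_18, 4))
```

Because the mean and standard deviation of an exponential distribution are equal, the standard deviation of the loading time is also 15 minutes.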