CHAPTER 5-Probability Distribution
CHAPTER 5-Probability Distribution
PROBABILITY DISTRIBUTIONS
CHAPTER OBJECTIVES
To introduce the basic concepts of discrete probability
distributions.
To present applications of the binomial distribution.
To present applications of the Poisson distribution.
To introduce the basic concepts of continuous probability
distributions.
To present applications of the standard normal distribution.
To develop confidence interval estimates for the mean, proportion
and variance
Introduction
Random variable: variable whose value depends on the outcome of a
random experiment.
A random variable is denoted by a capital letter (i.e., X).
A value assigned to a random variable is denoted by a small letter (i.e., x).
Types of random variables:
Discrete random variables: These variables usually occur in counting experiments.
Continuous random variables: These variables usually occur in experiments where
measurements are taken
Introduction
Probability distribution: list of all the possible outcomes of a random
variable and their associated probabilities of occurrence.
Two types of probability distributions: discrete and continuous.
Discrete probability distribution: assumes that the outcomes of a
random variable under study can take on only specific (usually integer)
values.
Examples of discrete probability distribution: Binomial distribution and
Poisson distribution.
Example of continuous probability distribution is the normal distribution.
Introduction
There are two conditions that a probability distribution has to satisfy:
The probability assigned to each value of the random variable occurs in the
range 0 to 1, that is, for each .
The sum of the probabilities assigned to all possible values of the random
variable is equal to 1, that is,
The probability that a random variable will assume a specified value is denoted
by
The probability that a random variable takes on a value less than or equal to a
specified value of is denoted by .
The probability that a random variable takes on a value greater than a specified
value of is denoted by .
Discrete random variable
Mean of a discrete random variable
The mean is denoted by and is also known as the expected value of a
random variable.
The expected value of a random variable may also be denoted by .
or
Standard deviation of a discrete random variable.
The standard deviation gives a measures of how much a probability
distribution is dispersed or spread around the mean of a discrete random
variable.
Discrete random variable
The standard deviation is the square root of the variance. The formula for the variance is given
by:
0 1 2
1/4 2/4 1/4
Example
Exercise
A bank stock is currently selling at R10 a share. An investor plans to buy
shares and to hold the stock for one year. Let X denote the price of the
stock after one year. The probability distribution is shown below here:
10 11 12 13 14
0,35 0,25 0,2 0,15 0,05
X ~ Bin (n, p )
The Binomial Mathematical Formula
Let X represent the number of successes observed in n trials of a
Binomial experiment. Then
P ( X x) p ( x) C xn p x (1 p ) n x ,
a. 4
0 1 2 3 4 5
0,1681 0,3602 0,3087 0,1323 0,0284 0,0024
Example
Let X be a discrete random variable that possesses a binomial
distribution. Find the mean and standard deviation of the probability
distribution for which and .
Poisson Distribution
The Poisson distribution is a discrete probability distribution for the
counts of events that occur randomly in a given interval of time (or
space).
The Poisson distribution is given by:
Where.
Poisson experiment
A Poisson experiment is an experiment that has the following properties
The experiment consists of counting the number of times an event of
interest (termed a success) occurs in a specified interval of time, distance,
or volume (or any other unit of measurement).
The probability of a success in an interval is the same for all intervals of
equal size.
The number of successes in an interval is independent of the number of
success that occurs in other intervals.
The probability of a success is proportional to the size of the interval.
The intervals do not overlap.
Mean and standard deviation of
the Poisson distribution
Example
Assume that the number of patients that arrive at a physiotherapy
practice per hour follows a Poisson distribution. By reviewing past record,
it was determined that the mean is two patients per hour.
a. What is the probability that in a given hour exactly three patients will
arrive?
b. What is the probability that in a given two-hour period exactly eight
patients will arrive?
c. What is the probability that in a given half-hour period exactly two
patients will arrive?
Solution
X= the number of patients that arrive at a physiotherapy practice per hour.
a.
b.
c.
Example
The management of a supermarket receives, on average, two requests
per month to order items that the supermarket does not usually hold in
stock. Find the mean and the standard deviation of the probability
distribution.
Solution
X=the number of requests per months to order items that the supermarket
does not usually hold in stock.
Continuous probability
distribution
The random variable X is said to be continuous if it can take on any
value in a certain interval.
In other words, if X is a continuous random variable in the interval (a; b) then X
can have unlimited values within this interval.
Further it can be shown that for any continuous random variable the following
is valid:
The Normal distribution
The most common continuous distribution is the so called normal
distribution. The normal distribution is determined by two parameters,
namely the population average, µ, and the population variance,. It is
written as:
Mean=
The Normal distribution
For the normal distribution the following is valid:
the normal curve is bell shaped and symmetrical about the population
average, ;
the total area under this curve is exactly one;
the probabilities are directly proportional to the areas under the curve;
if the probability must be calculated, the area under the curve between a
and b must be calculated.
The Standard Normal
distribution
If, for a normal distribution, the population average, , and the population
variance, , then the distribution is called the standard normal distribution
and is denoted by:
Z
0,9 0,1 0,05 1,645
0,95 0,05 0,025 1,96
0,98 0,02 0,01 2,33
0,99 0,01 0,005 2,575
Example
The Management of a large national chain of motels decided to estimate
the mean cost per room of repairing damages caused by its customers. A
random sample of 150 vacated rooms were inspected by the
management and this indicated a mean repair cost of R84,30 and a
standard deviation of R37,20. Construct a 95% confidence interval for the
mean repair cost for the population of 2 000 motel rooms.
Solution
The mean number of days of absence for all clerical workers at the corporation last year was
between 8 days and 11 days at a 95% level of confidence.
Exercise
An auditor of a certain company found that the billing errors that occurred
on a random sample of invoices were:
R32; -R45; R66; R2; -R8; -R51; R12 and R18.
Positive errors indicate that the customer was billed too much and
negative errors indicate that the customer was billed too little. Find the
99% confidence interval for the mean billing error. Assume that the
population of billing errors is normally distributed.
Confidence Intervals
Confidence interval for population proportion
Each element in the population can be classified either as a success or
failure
Sample proportion
Proportion always between 0 and 1
For large samples the sampling distribution of the sample proportion is
approximately normal.
Confidence Intervals
The formula for constructing a confidence interval for is given by:
Example
A sales manager needs to determine the proportion of defective radio
returns that is made on a monthly basis. In December 65 new radios were
sold and in January 13 were returned for rework.
Estimate with 95% confidence the population proportion of returns for
December.
Solution
95% confident the mean monthly returns will be will be between 10,3%
and 29,7%
Exercise
According to a recent report by the Census Bureau, 26% of the single
male households own stocks, bonds and mutual funds. Although Census
Bureau estimates are based on very large sample, for convenience,
assume that this result is based on a random sample of 2 000 single male
households. Find a 95% confidence interval for the proportion of all single
male households that own stocks, bonds and mutual funds.
Confidence Intervals
Confidence interval for the population variance
Population variance very often important
Quality control
Sample variance is based on a sample of size n
Distribution of resulted from repeated sampling is a (chi-square)
distribution
Confidence interval for the population variance
90% confident the variation in temperature will be will be between 0,315 and
0,757 degrees
Exercise
A precision meter is guaranteed to be accurate within 2%. A sample of
five meter readings on the same object yielded the following
measurements:
350; 348; 348; 352 and 351.
Construct a 99% confidence interval for assuming the population of
measurements is normally distributed.
Determining sample sizes for
estimates
Sample size for estimating means
Confidence level
Accepted sampling error -
Need to know , else use
Example
The maintenance of charge accounts may become too costly if the
average account purchase falls below a certain level. A department store
manager would like to estimate the average amount of account purchases
per month by its customers who have charge accounts to within R2,50
with a probability of approximately 0,95.
How many accounts should be selected from the store’s records if the
standard deviation of monthly account balances is known to be R16,50?
Solution
and
==
Therefore, at least 168 accounts should be selected.
Determining sample sizes for
estimates
Sample size for estimating proportions
Confidence level
Accepted sampling error -
Need to know , else use
Example
A cable television company wants to estimate the proportion of its
customers who would purchase a cable television program guide. The
company would like to have 95% confidence that its estimate is correct to
within of the true population.
Past experience in other area indicates that 30% of the customers will
purchase the program guide. What sample size is required?
Solution
=
A sample size of at least 323 customer is required.