Chapter 6 Stats
Chapter 6 Stats
❑ Assume a student plans to study for an upcoming exam and they usually study
for anywhere between 2 to 5 hours per day, depending on their schedule.
❑ Questions?
• What is the random variable here?
• Is this continuous random variable? Why?
• What is the probability of the student studying exactly 4 hours?
• What is notation for finding the probability of the student studying between 1.5
and 3 hours?
Continuous Probability Distributions
• Probability of a continuous random variable taking a fixed value is always zero, i.e., P (x =
a) = 0, for any real number x
• For continuous random variables, we always talk about the probability of the random
variable falling in a certain interval
• P(a ≤ x ≤ b) ≥ 0
• Continuous random variables take values in the (subset of the) real line
Continuous Probability Distributions
• Cumulative distribution function: Denotes the probability of the random variable (X)
being less than a certain value
❑ Assume a student plans to study for an upcoming exam and they usually study for
anywhere between 2 to 5 hours per day, depending on their schedule. The
time spent studying each day is equally likely to be any value between 2 and 5
hours. This means the study time is uniformly distributed over the interval from 2
to 5 hours
❑ Let’s say you are waiting for a bus, and the bus is equally likely to arrive anytime
between 10 minutes and 20 minutes. This means the waiting time is uniformly
distributed over the interval from 10 to 20 minutes
❑ Suppose the temperature in a city is continuously distributed between 20°C and
30°C throughout the day. The temperature at any given time is a continuous
random variable, and we assume that the probability of the temperature being any
specific value is uniform across this range
Uniform Distribution Function, Expectation and Variance
• Density function
1. Give a mathematical expression for the probability density function of battery life.
2. What is the probability that the battery life for an iPad Mini will be 10 hours or less?
3. What is the probability that the battery life for an iPad Mini will be at least 11 hours?
4. What is the probability that the battery life for an iPad Mini will be between 9.5 and 11.5 hours?
5. In a shipment of 100 iPad Minis, how many should have a battery life of at least 9 hours?
Normal Probability Distribution
• In a normal distribution, outcomes are not equally likely; they cluster around a central
value
• A normal distribution is a continuous probability distribution that is symmetric about the
mean, showing that data near the mean are more frequent in occurrence than data far
from the mean.
Normal Probability Distribution
❑ Three normal distributions with the same standard deviation but three different
means (−10, 0, and 20) are shown below
Normal Distribution: Same Means, Different Standard Deviations
• Reading the z value from the left most column and the top row of the table, we
find that the corresponding z value is 1.28
• An area of approximately .9000 (actually .8997) will be to the left of z = 1.28
Computing Probabilities for Any Normal Probability Distribution
• Probabilities for all normal distributions are computed by using the standard normal
distribution
• When we have a normal distribution with any mean µ and any standard deviation σ, we
answer probability questions about the distribution by first converting to the standard
normal distribution
• Then we can use the standard normal probability table and the appropriate z values to
find the desired probabilities
• Converting any normal random variable to a standard normal random variable:
Exercise
❑ A random variable is normally distributed with a mean of µ = 50 and a standard
deviation of σ = 5.
• What is the probability the random variable will assume a value between 45 and 55?
• What is the probability the random variable will assume a value between 40 and 60?
Exercise
❑ The mean cost of domestic airfares in the United States rose to an all-time high of $385
per ticket (Bureau of Transportation Statistics website, November 2, 2012). Airfares were
based on the total ticket value, which consisted of the price charged by the airlines plus
any additional taxes and fees. Assume domestic airfares are normally distributed with a
standard deviation of $110.
1. What is the probability that a domestic airfare is $550 or more?
2. What is the probability that a domestic airfare is $250 or less?
3. What is the probability that a domestic airfare is between $300 and $500?
Exercise
❑ Automobile repair costs continue to rise with the average cost now at $367 per repair
(U.S. News & World Report website, January 5, 2015). Assume that the cost for an
automobile repair is normally distributed with a standard deviation of $88. Answer the
following questions about the cost of automobile repairs.
1. What is the probability that the cost will be more than $450?
2. What is the probability that the cost will be less than $250?
3. What is the probability that the cost will be between $250 and
$450?
4. If the cost for your car repair is in the lower 5% of automobile repair charges, what is
your cost?
Exercise: Home Work
❑ The port of South Louisiana, located along 54 miles of the Mississippi River between New
Orleans and Baton Rouge, is the largest bulk cargo port in the world. The U.S. Army Corps of
Engineers reports that the port handles a mean of 4.5 million tons of cargo per week (USA
Today, September 25, 2012). Assume that the number of tons of cargo handled per week is
normally distributed with a standard deviation of 0.82 million tons.
1. What is the probability that the port handles less than 5 million tons of cargo per week?
2. What is the probability that the port handles 3 or more million tons of cargo per week?
3. What is the probability that the port handles between 3 million and 4 million tons of
cargo per week?
4. Assume that 85% of the time the port can handle the weekly cargo volume without
extending operating hours. What is the number of tons of cargo per week that will require
the port to extend its operating hours?
Chebyshev’s Theorem
• Chebyshev’s Theorem, also known as Chebyshev's inequality, is a fundamental result in
probability and statistics that provides a way to understand the dispersion of data points in a
distribution, regardless of the shape of the distribution
• The theorem states that for any real-valued random variable with a finite mean (μ) and a
finite standard deviation (σ), the proportion of observations that fall within k standard
deviations of the mean is at least
Chebyshev’s Theorem