Chapter V Continuous Probability Distributions PDF - Dr. Ashraf
Chapter V Continuous Probability Distributions PDF - Dr. Ashraf
Ashraf Hossain
1/10
x
5 10 12 15
Salad Weight (oz.)
Continuous Probability Distributions
A probability distribution in which the random variable X can take on any
value (is continuous). Because there are infinite values that X could assume,
the probability of X taking on any one specific value is zero. ...
μ X
Graph of the Normal Probability Density Function
2 1 x 2
1 ( )
2 2
e 2 dx .95
3 1 x 2
1 ( )
3 2
e 2 dx .997
… we can
easily
Symmetry in the Tails
determine the
Because the Normal curve AUC in tails
is symmetrical and the 95%
total AUC is exactly 1…
Example: 68-95-99.7 Rule
Normal Densities
0.045
0.04
0.035
0.03
N(100,400)
0.025 N(100,100)
f(y)
N(100,900)
0.02 N(75,400)
N(125,400)
0.015
0.01
0.005
0
0 20 40 60 80 100 120 140 160 180 200
y
Standard Normal Distribution: Z-distribution
In short, the Z-distribution is a way of naming the Standard
Normal distribution.
The Z-distribution is a specific instance of the Normal Distribution
that has a mean of ‘0’ and a standard deviation of ‘1’.
The visual way to understand it would be the following image:
The red
curve is
the standard
normal
distribution
1 Z 0 2 1
1 ( ) 1 ( Z )2
p( Z ) e 2 1
e 2
(1) 2 2
Somebody calculated all the integrals for the standard normal and put them in a
table! So we never have to integrate!
Even better, computers now do all the integration.
Determining Normal Probabilities
19
When value do not fall directly on σ landmarks:
1. State the problem; 2. Standardize the value(s) (z score)
3. Sketch, label, and shade the curve; 4. Use Table B
?
Z ~ N(0,1)
Example: a Z variable
of 1.96 has cumulative
probability 0.9750.
Notation: Let zp
represents the z
score with cumulative
probability p,
7: Normal Probability Distributions
e.g., z.975 = 1.96
Example
Suppose SAT scores roughly follows a normal distribution in the
U.S. population of college-bound students (with range restricted
to 200-800), and the average math SAT is 500 ()with a
standard deviation (σ) of 50, then:
68% of students will have scores between 450 and 550
95% will be between 400 and 600
99.7% will be between 350 and 650
BUT…What if you wanted to know the math SAT score
corresponding to the 90th percentile (=90% of students are lower)?
Q 1 x 500 2
1 ( )
P(X≤Q) = .90
200
(50) 2
e 2 50 dx .90
Example
For example: What’s the probability of getting a math SAT score of
575 or less, =500 and =50?
575 500
Z 1.5
50
i.e., A score of 575 is 1.5 standard deviations above the mean
575 1 x 500 2 1.5 1
1 ( ) 1 Z2
P( X 575) (50)
200
2
e 2 50 dx
2
e 2 dz
Yikes!
But to look up Z= 1.5 in standard normal chart (or enter
into SAS) no problem! = .9332
Practice problem
If birth weights in a population are normally distributed with a mean of
109 oz and a standard deviation of 13 oz,
a. What is the chance of obtaining a birth weight of 141 oz or heavier
when sampling birth records at random?
b. What is the chance of obtaining a birth weight of 120 or lighter?
Z=1.51 Area is
93.45%
Z=1.51
Step 1: State Problem
Question: What gestational length is smaller than 97.5% of gestations?
Let X represent gestations length. We know from prior research that
X ~ N(39, 2)
A value that is smaller than .975 of gestations has a cumulative probability
of.025
Step 2 (z percentile)
Less than 97.5% (right tail) =
greater than 2.5% (left tail)
z lookup:
z.025 = −1.96
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
–1.9 .0287 .0281 .0274 .0268 .0262 .0256 .0250 .0244 .0239 .0233
Probabilities Between Points
a represents a lower boundary
b represents an upper boundary
Pr (a ≤ Z ≤ b) = Pr (Z ≤ b) − Pr(Z ≤ a)
-2 0.5 0.5 -2
Unstandardize and sketch
29
x z p 39 (1.96)( 2) 35
Median = 6,
Mean = 7.1,
Mode = 0
SD = 6.8,
Range = 0 to 24
Median = 3
Mean = 3.4
Mode = 3
SD =
2.5
Range =
0 to 12
Data from a BBA class…
Median = 5
Mean = 5.4
Mode = none
SD = 1.8
Range = 2 to 9
Median = 7:00
Mean = 7:04
Mode = 7:00
SD = :55
Range = 5:30 to 9:00
Normal probability plot
coffee…
Right-Skewed!
(concave up)
Normal probability plot love of
writing…
Neither right-skewed
or left-skewed, but
big gap at 6.
Norm prob. plot Exercise…
Right-Skewed!
(concave up)
Normal prob. Plot: Wake up time
Closest to a
straight line…
Formal tests for normality
Results:
Coffee: Strong evidence of non-
normality (p<.01)
Writing love: Moderate evidence
of non-normality (p=.01)
Exercise: Weak to no evidence of
non-normality (p>.10)
Wakeup time: No evidence of
non-normality (p>.25)
Take example of Wake-up Time … for
+ (% of AUC?)
5:14
8:54
7:04+/- 2*0:55
= 5:14 – 8:54
Re-expression of Non-Normal Random Variables
Many variables are not Normal but can be re-expressed with
a mathematical transformation to be Normal
Example of mathematical transforms used for this purpose:
logarithmic
exponential
square roots
Review logarithmic transformations…
Logarithms are exponents Base 10 log function
of their base
Common log (base 10)
log(100) = 0
log(101) = 1
log(102) = 2
Natural ln (base e)
ln(e0) = 0
ln(e1) = 1
Example: Logarithmic Re-expression
Prostate Specific Take exponents of “95%
Antigen (PSA) is range”
used to screen for e−1.9,1.3 = 0.15 and 3.67
prostate cancer Thus, 2.5% of non-
In non-diseased diseased population have
populations, it is not values greater than 3.67
use 3.67 as screening cutoff
Normally
distributed, but its
logarithm is:
ln(PSA) ~N(−0.3, 0.8)
95% of ln(PSA)
within
= μ ± 2σ
= −0.3 ± (2)(0.8)
= −1.9 to 1.3
Chapter Notes