Lect 4 The Normal distributionXIUGAI
Lect 4 The Normal distributionXIUGAI
医学统计学
Lecture 4 Normal distribution
正态分布
Intensity (Rate)
Ratio
20.38 8.40 17.32 29.64 19.69 21.69 23.90 17.45 19.08 20.52 24.14 23.77
18.36 23.04 24.22 24.13 21.53 11.09 18.89 18.26 23.29 17.67 15.38 18.61
14.27 17.40 22.55 17.55 16.10 17.98 20.13 21.00 14.56 19.89 19.82 17.48
14.89 18.37 19.50 17.08 18.12 26.02 11.34 13.81 10.25 15.94 15.83 18.54
24.52 19.26 26.13 16.99 18.89 18.46 20.87 17.51 13.12 11.75 17.40 21.36
17.14 13.77 12.50 20.40 20.30 19.38 23.11 12.67 23.02 24.36 25.61 19.53
14.77 14.37 24.75 12.73 17.25 19.09 16.79 17.19 19.32 19.59 19.12 15.31
21.75 19.47 15.51 10.86 27.81 21.65 16.32 20.75 22.11 13.17 17.55 19.26
12.65 18.48 19.83 23.12 19.22 19.22 16.72 27.90 11.74 24.66 14.18 16.52
Approach of frequency distribution table
20.00
15.00
10.00
5.00
0.00
μmol/L
Note: the lateral axis is group values, the vertical axis can be
frequency or frequency density.
Frequency density =frequency / group interval
1.2 Approaches of Normal Distribution
Increase the
sample size
and decrease
the group
interval .
12.50
10.00
7.50
5.00
2.50
0
7 9 11 13 15 17 19 21 23 25 27 29
serum iron (μmol/L)
1.3 Properties of normal distribution
Infinite extensibility.
Symmetry.
Two parameters.
The area under the curve.
1.3.1 Infinite extensibility
distribution.
1.3.2 Symmetry
-3 -2 -1 1 2 3
1.3.3 Two Parameters
The normal distribution is characterized by
two parameters: the mean (µ or X ) and the
standard deviation (σor s).
Two Parameters
The mean (µ or X ) is a measure of location or center
and the standard deviation (σor s) is a measure of
scale or spread.
The mean can be any value between infinity.
The standard deviation must be positive.
Each possible value of µ and σ define a specific
normal distribution and collectively all possible
normal distributions define the normal family.
Two Parameters---mean
Position translation
-6 -5 -4 1
-3 -2 -1 20 1 2 3 3 4 5
1 2 3 1 2 3
Position Translation
N ( -1 , 0.52 )、
Two Parameters---σ changing
1
2
3
1 2 3 1 2 3
-3 -2 -1 0 1 2 3
σ changing
0.9
0.8
0.7 σ =0.5
0.6
0.5
0.4 σ =1
0.3 σ =2
0.2
0.1
0
-6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6
1.3.4 The area under the curve
The total area under the curve is 1.
Probability-density function
The probability density function of a normal
distribution:
( x )2
f ( x)
1
e
2 2 X
2 e 2.71828
3.14159
Denoted with: N ( , 2 )
The area under the curve
The area under the curve
6 11.84
Example 4.2
6 10
Example 4.2
You can’t compare the areas directly.
5 and 2 ,Which one is bigger?
7 3
5 15 2 14 5 2
, , so ,
7 21 3 21 7 3
φ(Z2)=φ(-0.67)=0.5029/2=0.2515
Example 4.3
The heights of boys aged 8 are distributed
normally in Hefei city with mean 123.02
cm and SD of 4.79 cm.
What is the proportion of boys higher than
130 cm ?
What is the proportion of boys heights
between 120cm and 128cm .
Calculate the heights interval which covers
60% boys ?
Example 4.3
1)
2)
Example 4.3
1- 0.60=0.40
Turn to page 139.
φ(Zi)= 0.20 ? Zi = 1.28
X - X 123.02
Z 1.28
4.79 60%
X 123.02
1.28
4.79
X 123.02 4.79 1.28
116 .89 X 129.15
3. Application of normal distribution
Medical Reference range (Reference
interval).
Quality control.
The basis of other mathematic distribution.
3.1 Reference range
3.1 Reference range
Report of endocrine lab, the First Affiliated Hospital
of AHMU
Name:Tang xia Number of the sample : 310 Time : 2011-05-09
Sex : female Number of the subject: 2011178211 Type of sample: blood
Age: 39 years Department: FY Doctor: Chen xuhua
Two sided
A normal variance, too higher and too lower is abnormal.
Height, weight, Blood pressure
0.025 0.025
-1.96 1.96
Two sided ? one sided ?
one sided
Too higher is abnormal, Hair mercury, Blood lead,
Too lower is abnormal, IQ, Vital Capacity / lung’s
capacity: the index of respiratory function
0.05
0.05
1.64 -1.64
How to work out a reference range ?
Well define “normal person”
Needs grouping or not? (male-female? Age groups? )
Determine sample size and Random sampling
Measurement (instrument, method, quality control…)
Two sides? One side?
“Most individuals” ? (99%? 95%? 90%?…)
Statistical method?
Normal distribution method
Percentile method
Statistical method ?
If the frequency distribution is close to a normal
distribution: Normal distribution method.
The dataset is not distributed normal: Percentile method.
Normal distribution method Percentile method
One sided One sided
%
Two sided Lower Two sided Lower Upper
Upper limit
limit limit limit
X Z / 2 s
X 1.28s X 1.28s
90 X 1.64s P5~P95 P10 P90
X Z / 2 s X X Z / 2 s denoted with X Z / 2 s
One sided (1-α)range
X X Z s OR X X Z s
Percentile method
If the frequency distribution is not a normal
distribution.
Two side (1- 0.05) range
P2.5 X P97.5
One side (1- 0.05) range
X P5 X P95
Example 4.4
The hemoglobin of 120 healthy females
distributed a normal distribution with mean
117.4g/L and SD is 10.2g/L, calculate the
95% reference range of healthy females’
hemoglobin.
The hemoglobin is a normal index for human beings ,
too higher and too lower are abnormal.
days
3.3 The basis of other mathematic
distribution.
We often transform some other
distributions to normal distribution for
further analysis.
Sometimes we explain some theories with
normal distribution properties.