Lecture 4 - Normal and Nonnormal Dist - HS - 070323en
Lecture 4 - Normal and Nonnormal Dist - HS - 070323en
Distribution
Elias Namosha
n = 30
Median @ 30+1 / 2 = 15.5, i.e., between 15th and 16th position
Value at 15th position = 10
Value at 16th position = 10
So median = 10
Mode= 10
Mode: the most common value
in a children’s age group distribution
mean=median=mode
•
3/7/2023
How do you know a
distribution is Normal?
In order to be considered a normal distribution, a data
set (when graphed) must follow a bell shape symmetrical
curve centered around the mean.
-A standard normal distribution has a mean of 0
and a standard deviation of 1.
It must also adhere to the empirical rule that indicates
the percentage of the data set falls within (plus or
minus) 1, 2, and 3 standard deviations of the mean.
-The empirical rule says in a standard normal
distribution, 68% of the data points will fall within ± one
standard deviation from the mean and 95% will fall
within
3/7/2023
± two standard deviations.
Areas under the normal curve that lie between 1, 2
and 3 standard deviations (SD) on each side of the
mean.
Normal curves are symmetric
Skewed vs normal distributions
Left-skewed Right-skewed
Normal Distribution
▪This pattern occurs so often in biological and
natural world that mathematicians have
studied it and found that if the observed
measurement is the sum of many independent
small random factors, the resulting
measurements will take on values that are
distributed normally as the bell-curve above –
Normal or Gaussian distribution
3/7/2023
70 75 80 85 90
70 75 80 85 90
Are you getting irritated?
3/7/2023
Real life examples
Age distribution
3/7/2023
Real life examples
Babies Birth Weight
▪ The normal birth weight of a
newborn range from 2.5 to 3.5 kg.
▪ The majority of newborns have
normal birthweight whereas only a
few percentage of newborns have a
weight higher or lower than the
normal.
▪ Hence, birth weight also follows the
normal distribution curve.
▪ In general: Boys are usually a little
heavier than girls.
▪ The average birth weight for babies
is around 3.5 kg
3/7/2023
Non-Normal Distribution
▪ When a population follows a normal distribution,
we can describe its location and variability
completely with the two parameters of the mean
and variability (standard deviation)
3/7/2023
Interquartile range
A quartile divides the number of data points into four parts,
or quarters, of more-or-less equal size. The data must be ordered
from smallest to largest to compute quartiles; as such, quartiles are
a form of order statistics. The three main quartiles are as follows:
•The second quartile (Q2) is the median of a data set; thus 50% of
the data lies below this point.
•The third quartile (Q3) is the middle value between the median and
the highest value (maximum) of the data set. It is known as
the upper or 75th empirical quartile, as 75% of the data lies below
this point.
Q1 Q2 Q3
7 14
IQR = 7 (Q3-Q1)
Box and Whiskers plots
Three curves with different skewing
3/7/2023
Boxplots
Max
180
Q3
160
Median Q2
Q1
140
Min
120
Boxplots
180
160
140
120
Male Female
For more categories…
Boxplots
180
160
140
120
NB: Notice among female ages 30-45, there are some values that seat above
and outside the maximum value. These (three dots) are called outliers
3/7/2023
Box and whiskers diagrams tell us a lot about distribution
curves. Note that A, B, and C are all normal curves. Also
note that D and E are similar to B, but skewed right and
left.
THE END
Thank you..