Normal Distribution Review
Normal Distribution Review
1
I. The Normal Distribution
The normal distribution is considered the most prominent probability distribution because it has
several important properties that make it widely applicable in various fields, such as statistics,
finance, and natural sciences.
It is often used as a first approximation to describe real-valued random variables that tend to
cluster around a single mean
The “BELL" shape of the normal distribution makes it a convenient choice for modeling a large
variety of random variables encountered in practice.
2
The area under the curve represents the total probability, which sums to 1
The distribution is symmetric around the mean, meaning that half of the total probability lies to the
left of the mean, and the other half lies to the right
f(X)
P( X μ) 0.5 P(μ X ) 0.5
0.5 0.5
μ X
P( X ) 1.0
3
I. The Normal Distribution
Symmetrical.
4
1. How does σ affect the shape ?
The larger the standard deviation, the less concentrated the bell curve is, the more spread out the curve is.
5
2. The Empirical Rules
Approximately 68% of the data in a bell shaped distribution is within 1 standard deviation of the
mean, or µ ± σ
Approximately 95% of the data in a bell-shaped distribution lies within two standard deviations of
the mean, or µ ± 2σ
Approximately 99.7% of the data in a bell-shaped distribution lies within three standard deviations
of the mean, or µ ± 3σ
95% 99.7
%
μ 2σ μ 3σ
6
2. The Empirical Rules
Suppose that the variable “Stat scores” (noted X) is normally distributed with a mean of 13 and a standard
deviation of 2. Then,
7
3. The Standardized Normal distribution
Any normal distribution (with any mean and standard deviation combination) can be transformed into the
standardized normal distribution (Z).
The standardized normal distribution (Z) has a mean of 0 and a standard deviation of 1.
This process simplifies calculations and comparisons between different normal distributions.
8
3. The Standardized Normal distribution
Translate from X to the standardized normal (the “Z” distribution) by subtracting the mean of X and
dividing by its standard deviation:
X
Z
The Z distribution always has mean = 0 and standard deviation = 1.
Why Standardize?
Comparison across different scales: It allows comparison of variables measured in different units
(e.g., height in cm, weight in kg) by bringing them into the same scale.
Easier analysis: Many statistical techniques require or perform better when data is standardized.
9
3. The Standardized Normal Distribution
Z
0
10
3. The Standardized Normal Distribution
Example
If X is distributed normally with mean of 100 and standard deviation of 50, the Z value for X = 200 is:
X μ $200 $100
Z 2.0
σ $50
This says that X = 200 is two standard deviations (2 increments of 50 units) above the mean of 100.
11
3. The Standardized Normal Distribution
100 200 X
(μ = 100, σ = 50)
0 2.0 Z (μ = 0, σ = 1)
Note that the shape of the distribution is the same, only the scale has changed. We can express the
problem in the original units of X or in standardized units (Z).
12
3. The Standardized Normal Distribution
Z =- 0,9 Z = + 0,9
13
Exercise 1
14
3. The Standardized Normal Distribution
15
4. Evaluating Normality
It is important to evaluate how well the data set is approximated by a normal distribution.
There are several ways to evaluate the normality of a dataset, including graphical methods,
descriptive statistics…
The normal distribution is bell shaped (symmetrical) where the mean is equal to the median.
16
4. Evaluating Normality
a. Graphical Methods
These methods help visually assess how closely the data align with a normal distribution.
Histogram: Plotting a histogram of the data is a simple way to visually inspect its distribution. If the
data is approximately normally distributed, the histogram should show a bell-shaped curve.
Box Plot: A box plot can give an indication of normality by showing the symmetry of the data. In a
normal distribution, the median line should be centered within the box, and the whiskers should be
of roughly equal length on either side.
b. Descriptive Statistics
Do the mean, median and mode have similar values?
17
4. Evaluating Normality
Do approximately 80% of the observations lie within mean ±1.28 standard deviations?
18
5. Review
19
5. Review
Properties
Suppose X ~ N (µx ;σx) and Y ~ N (µY ;σY) ; a and k are constant numbers
20
Exercise 2
A large group of students took a test in Physics and the final grades have a mean of 70 and a standard
deviation of 10. If we can approximate the distribution of these grades by a normal distribution, what
percent of the students.
Q1) scored higher than 80?
Q2) should pass the test (grades≥60)?
Q3) should fail the test (grades<60)?
Q4) determine the symmetrical interval (midpoint is the mean) that contains 95% of the scores.
Q5) determine the symmetrical interval (midpoint is the mean) that contains 80% of the scores.
Q6) if we take 5 students with replacement what is the probability that the sum of the 5 grades is
higher than 350?
21
Exercise 3
You work for food-processing industry. You know that the weight of potatoes packet is a normally
distributed random variable whose mean is 200 grams and whose standard deviation is 10 grams. You
randomly and independently select 10 packets, the total weight of your 10 packets is defined by the
variable TW.
1- Write the equation of the variable TW. Is TW a random variable ? Explain Why?
3- What’s the probability of the total weight to be higher than 2100 grams?
22