Session 41 - Normal Distribution
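-> Tails: the curve extends indefinitely in both directions
-> Asymptotic in nature: the curve approaches the x-axis but never touches it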
-> Lots of points near the mean and very few far away
The normal distribution is characterized by two parameters: the mean (μ) and the
standard deviation (σ). The mean represents the centre of the distribution, while the
standard deviation represents the spread of the distribution.
Denoted as: X ~ N(μ, σ)
Why is it so important?
https://samp-suman-normal-dist-visualize-app-lkntug.streamlit.app/
Equation in detail:
Equation: f(x) = (1 / (σ√(2π))) · e^(−(x − μ)² / (2σ²))
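As a rough illustration, the normal density can be evaluated term by term and compared with Python's built-in implementation. The helper name normal_pdf and the values μ = 0, σ = 1 are assumptions chosen for the sketch, not part of the session material.

```python
import math
from statistics import NormalDist

def normal_pdf(x, mu, sigma):
    """Density of a normal distribution with mean mu and standard deviation sigma.

    f(x) = 1 / (sigma * sqrt(2*pi)) * exp(-(x - mu)^2 / (2 * sigma^2))
    """
    coeff = 1.0 / (sigma * math.sqrt(2.0 * math.pi))   # normalising constant
    exponent = -((x - mu) ** 2) / (2.0 * sigma ** 2)   # squared distance from the mean
    return coeff * math.exp(exponent)

mu, sigma = 0.0, 1.0                         # illustrative parameters (assumed)
peak = normal_pdf(mu, mu, sigma)             # density is highest at the mean
reference = NormalDist(mu, sigma).pdf(mu)    # standard-library value for comparison
print(peak, reference)
```

The coefficient sets the height of the curve so that the total area is 1, and the exponent makes the density fall off with the squared distance from the mean.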
Suppose the heights of adult males in a certain population follow a normal distribution
with a mean of 68 inches and a standard deviation of 3 inches. What is the probability
that a randomly selected adult male from this population is taller than 72 inches?
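One way to work the height question is to standardise 72 inches into a z-score and take the area to the right of it. The sketch below uses the standard library's NormalDist in place of a printed table; the variable names are illustrative.

```python
from statistics import NormalDist

mu, sigma = 68.0, 3.0      # mean and standard deviation from the problem statement
x = 72.0                   # height threshold in inches

z = (x - mu) / sigma                     # standardise: z = (x - mu) / sigma
p_taller = 1.0 - NormalDist().cdf(z)     # area to the right of z under the standard normal

print(f"z = {z:.2f}, P(X > 72) = {p_taller:.4f}")
```

Here z is about 1.33, so roughly 9% of adult males in this population would be taller than 72 inches.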
A z-table gives the area under the standard normal curve to the left of a given z-score.
https://www.ztable.net/
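The entries of a z-table can be reproduced from the error function, which gives a feel for what the table stores. This is a minimal sketch for non-negative z only; the helper names phi and z_table_row are hypothetical.

```python
import math

def phi(z):
    """Area under the standard normal curve to the left of z."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def z_table_row(z_first, n_cols=10):
    """One table row: z_first is z to one decimal, columns add the second decimal."""
    return [round(phi(z_first + col / 100.0), 4) for col in range(n_cols)]

row = z_table_row(1.3)     # covers z = 1.30, 1.31, ..., 1.39
print(row[3])              # entry for z = 1.33
```

Reading a printed table works the same way: pick the row for the first decimal of z, then the column for the second decimal.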
For a normal distribution X ~ N(μ, σ), what percentage of the population lies between the
mean and 1 standard deviation, 2 standard deviations and 3 standard deviations?
1. Symmetry
The normal distribution is symmetric about its mean, which means that the probability of
observing a value above the mean is the same as the probability of observing a value below
the mean. The bell-shaped curve of the normal distribution reflects this symmetry.
3. Empirical Rule
The normal distribution has a well-known empirical rule, also called the 68-95-99.7 rule,
which states that approximately 68% of the data falls within one standard deviation of the
mean, about 95% of the data falls within two standard deviations of the mean, and about
99.7% of the data falls within three standard deviations of the mean.
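The 68-95-99.7 figures can be checked from the cumulative distribution function. The sketch below uses the standard normal, since the percentages do not depend on μ or σ. It also reports the one-sided share between the mean and k standard deviations above it, which by symmetry is half of the two-sided value.

```python
from statistics import NormalDist

std_normal = NormalDist()            # mean 0, standard deviation 1

within = {}                          # share within k standard deviations of the mean
one_sided = {}                       # share between the mean and mean + k*sigma
for k in (1, 2, 3):
    within[k] = std_normal.cdf(k) - std_normal.cdf(-k)
    one_sided[k] = within[k] / 2.0   # symmetry: half the mass lies on each side
    print(f"k={k}: within = {within[k]:.4f}, mean to +k sigma = {one_sided[k]:.4f}")
```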
• What is skewness?
Skewness measures how asymmetric a distribution is. A normal distribution is a bell-shaped,
symmetrical distribution with a specific mathematical formula that describes how the data is
spread out, so its skewness is zero. Skewed data is not symmetrical, which means it is not
normally distributed.
In a symmetrical, single-peaked distribution, the mean, median, and mode are all equal. In
contrast, in a skewed distribution they generally differ, and the distribution has a longer
tail on one side than the other.
Skewness can be positive, negative, or zero. A positive skewness means that the tail of
the distribution is longer on the right side, while a negative skewness means that the tail
is longer on the left side. A perfectly symmetrical distribution has zero skewness.
The greater the skew, the greater the distance between mean, median and mode.
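The sign conventions above can be checked on small samples. This sketch computes the moment-based skewness coefficient (third central moment divided by the 1.5 power of the second) using only the standard library. The three data sets are made up for illustration and the function name skewness is an assumption.

```python
from statistics import mean, median

def skewness(data):
    """Moment-based sample skewness: m3 / m2**1.5 (population moments)."""
    n = len(data)
    m = mean(data)
    m2 = sum((x - m) ** 2 for x in data) / n   # second central moment (variance)
    m3 = sum((x - m) ** 3 for x in data) / n   # third central moment
    return m3 / m2 ** 1.5

symmetric = [1, 2, 3, 4, 5]                        # made-up symmetric sample
right_skewed = [1, 2, 2, 3, 3, 3, 4, 4, 5, 20]     # one large value stretches the right tail
left_skewed = [-x for x in right_skewed]           # mirror image: long left tail

for name, data in [("symmetric", symmetric),
                   ("right", right_skewed),
                   ("left", left_skewed)]:
    print(name, round(skewness(data), 3), "mean:", mean(data), "median:", median(data))
```

In the right-skewed sample the single large value pulls the mean above the median, and the mirrored sample shows the opposite. scipy.stats.skew, with its default settings, computes the same coefficient if SciPy is available.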
• Python Example
• Interpretation
• Outlier detection
• Assumptions on data for ML algorithms -> Linear Regression and GMM
• Hypothesis Testing
• Central Limit Theorem