Assignment 2 State Arman
Assignment 2 State Arman
02
Normal distribution:
The normal distribution is a continuous
probability distribution that is symmetrical around its mean, most of the
observations cluster around the central peak,
In a normal distribution, data is symmetrically distributed with no skew. When
plotted on a graph, the data follows a bell shape, with most values clustering around
a central region and tapering off as they go further away from the center.
Normal distributions are also called Gaussian distributions or bell curves because of
their shape.
The mean determines where the peak of the curve is centered. Increasing
the mean moves the curve right, while decreasing it moves the curve left.
• Around 68% of values are within 1 standard deviation from the mean.
• Around 95% of values are within 2 standard deviations from the
mean.
• Around 99.7% of values are within 3 standard deviations from the
mean.
The empirical rule is a quick way to get an overview of your data and check
for any outliers or extreme values that don’t follow this pattern.
If data from small samples do not closely follow this pattern, then other
distributions like the t-distribution may be more appropriate. Once you
identify the distribution of your variable, you can apply appropriate
statistical tests.
• Law of Large Numbers: As you increase sample size (or the number
of samples), then the sample mean will approach the population
mean.
• With multiple large samples, the sampling distribution of the mean is
normally distributed, even if your original variable is not normally
distributed.
You can use parametric tests for large samples from populations with any
kind of distribution as long as other important assumptions are met. A
sample size of 30 or more is generally considered large.
The formula for the normal probability density function looks fairly
complicated. But to use it, you only need to know the population mean and
standard deviation.
For any value of x, you can plug in the mean and standard deviation into
the formula to find the probability density of the variable taking on that
value of x.
• f(x) = probability
• x = value of the variable
• μ = mean
• σ = standard deviation
• σ2 = variance
_________________________________________________________________