Central Limit Theorem in Statistics
Last Updated :
27 Jul, 2025
One of the most basic principles in statistics, the Central Limit Theorem (CLT) describes how the sample mean distribution changes with increasing sample size.
If the sample is sufficiently large (usually n > 30), then the sample means' distribution will be normally distributed regardless of the underlying population distribution, whether it is normal, skewed, or otherwise.
All types of mean distributions tend to converge to a Normal Distribution as the sample size increases.
This is crucial since, even if population distribution is unknown, statisticians are able to draw inferences about the population based on the sample data. Larger samples are more accurate because CLT also proves that the distribution of the sample mean will have the mean as the population mean, and the standard deviation will reduce with increasing sample size. This theorem forms the basis for many. All types of mean distributions tend to converge to a Normal Distribution as the sample size increases.
The Central Limit Theorem in Statistics states that as the sample size increases and its variance is finite, then the distribution of the sample mean approaches the normal distribution, irrespective of the shape of the population distribution.
Let us assume we have a random variable X. Let σ be its standard deviation, and μ be the mean of the random variable.
- Now, as per the Central Limit Theorem, the sample mean \overline{X} will approximate a normal distribution, which is given as \overline{X} ⁓ N(μ, σ/√n).
- The Z-score of the random variable \overline{X} is given as Z =\dfrac{\overline x - \mu}{\frac{\sigma}{\sqrt n}} . Here \overline x is the mean\overline X .
The image of the formula is attached below.

Central Limit Theorem Proof
Let the independent random variables be X1, X2, X3, . . . . , Xn which are identically distributed and where their mean is zero(μ = 0) and their variance is one(σ2 = 1).
The Z score is given as, Z = \dfrac{\overline X - \mu}{\frac{\sigma}{\sqrt n}}= \frac{\sqrt{n} (\bar{X}_n - \mu)}{\sigma}
where \bar{X}_n = \frac{1}{n} \sum_{i=1}^n X_i. \:
Here, according to Central Limit Theorem, Z approximates to Normal Distribution as the value of n increases.
i.e. Z_n \xrightarrow{d} \mathcal{N}(0,1) \quad \text{as} \quad n \to \infty
Let m(t) be the Moment Generating Function of Xi
⇒ M(0) = 1
⇒ M'(1) = E(Xi) = μ = 0
⇒ M''(0) = E(Xi2) = 1
The Moment Generating Function for Xi/√n is given as E[etXi/√n]
Since, X1 X2, X3 . . . Xn are independent, hence the Moment Generating Function for (X1 + X2 + X3 + . . . + Xn)/√n is given as [M(t/√n)]n
Let us assume as function
f(t) = log M(t)
⇒ f(0) = log M(0) = 0
⇒ f'(0) = M'(0)/M(0) = μ/1 = μ
⇒ f''(0) = (M(0).M"(0) - M'(0)2)/M'(0)2 = 1
Now, using L' Hospital Rule we will find t/√n as t2/2
⇒ [M(t/√n)]2 = [ef(t/√n)]n
⇒ [enf(t/√n)] = e^(t2/2)
Thus the Central Limit Theorem has been proved by getting Moment Generating Function of a Standard Normal Distribution.
Central Limit Theorem Example
Let's say we have a large sample of observations and each sample is randomly produced and independent of other observations. Calculate the average of the observations, thus having a collection of averages of observations. Now as per the Central Limit Theorem, if the sample size is adequately large, then the probability distribution of these sample averages will approximate to a normal distribution.
Assumptions of the Central Limit Theorem
The Central Limit Theorem is valid for the following conditions:
- The drawing of the sample from the population should be random.
- The drawing of the sample should be independent of each other.
- The sample size should not exceed ten percent of the total population when sampling is done without replacement.
- Sample Size should be adequately large.
- CLT only holds for a population with finite variance.
Steps to Solve Problems on Central Limit Theorem
Problems of Central Limit Theorem that involves >, < or between can be solved by the following steps:
- Step 1: First identify the >, < associated with sample size, population size, mean and variance in the problem. Also there can be 'betwee; associated with range of two numbers.
- Step 2: Draw a Graph with Mean as Centre
- Step 3: Find the Z-Score using the formula
- Step 4: Refer to the Z table to find the value of Z obtained in the previous step.
- Step 5: If the problem involves '>' subtract the Z score from 0.5; if the problem involves '<' add 0.5 to the Z score and if the problem involves 'between' then perform only step 3 and 4.
- Step 6: The Z score value is found along \overline X
- Step 7: Convert the decimal value obtained in all three cases to decimal.
Mean of the Sample Mean
According to the Central Limit Theorem:
- If you have a population with a mean μ, the mean of the sample means (also called the expected value of the sample mean) will be equal to the population mean:
E(\bar{X}) = μ
Standard Deviation of the Sample Mean
The standard deviation of the sample mean (often called the standard error) describes how much the sample mean is expected to vary from the true population mean. It is calculated using the population standard deviation σ and the sample size n:
σXˉ = \frac{\sigma}{\sqrt{n}}
\sigma_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}} (For categorical data, the standard error for proportions is calculated using the true population proportion p)
Central Limit Theorem Applications in Computer Science
- Measuring latency/response times of systems (e.g., web servers, databases).
- The average latency over many requests converges to a normal distribution.
- Enables use of confidence intervals and parametric tests (t-tests) to compare system optimizations.
A/B Testing & Experimentation
- Comparing conversion rates between two website versions.
- User conversions are Bernoulli trials (0/1), so the average conversion rate (proportion) is approximately normal for large samples.
- Validates statistical tests (e.g., Z-tests) to determine if differences are significant. Without CLT, comparing proportions would be less straightforward.
Monte Carlo Simulations
- Estimating complex values (e.g., π, financial risks, graphics rendering) via random sampling.
- The simulation output (e.g., mean of samples) becomes normally distributed around the true value.
- Provides error bounds (e.g., "estimate ± 2 standard errors") and justifies increasing samples to reduce error.
Machine Learning (ML) & Statistics
- Used for model evaluation. Accuracy/F1-scores of ML models over test sets converge to normality, enabling comparison via confidence intervals.
- Stochastic Gradient Descent (SGD): Batch gradients are averages of random samples → approximately normal noise.
- Feature Engineering: Aggregated features (e.g., mean user interactions per day) often become Gaussian-like, simplifying assumptions for models (e.g., linear regression).
Also Check
Central Limit Theorem in Data Science & Machine Learning
Central Limit Theorem Solved Examples
Example 1. The male population's weight data follows a normal distribution. It has a mean of 70 kg and a standard deviation of 15 kg. What would the mean and standard deviation of a sample of 50 guys be if a researcher looked at their records?
Given: μ = 70 kg, σ = 15 kg, n = 50
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence, \mu _{\overline{x}} = μ = 70 kg
Now, \sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}} = 15/√50
⇒ \sigma _{\overline{x}} ≈ 2.1 kg
Example 2. A distribution has a mean of 69 and a standard deviation of 420. Find the mean and standard deviation if a sample of 80 is drawn from the distribution.
Given: μ = 69, σ = 420, n = 80
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence, \mu _{\overline{x}} = μ = 69
Now, \sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}}
⇒ \sigma _{\overline{x}} = 420/√80
⇒ \sigma _{\overline{x}} = 46.95
Example 3. The mean age of people in a colony is 34 years. Suppose the standard deviation is 15 years. The sample of size is 50. Find the mean and standard deviation of the sample.
Given: μ = 34, σ = 15, n = 50
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence, \mu _{\overline{x}} = μ = 34 years
Now, \sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}}
⇒ \sigma _{\overline{x}} = 15/√50
⇒ \sigma _{\overline{x}} = 2.12 years
Example 4. The mean age of cigarette smokers is 35 years. Suppose the standard deviation is 10 years. The sample size is 39. Find the mean and standard deviation of the sample.
Given: μ = 35, σ = 10, n = 39
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence, \mu _{\overline{x}} = μ = 35 years
Now, \sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}} = 10/√39
⇒ \sigma _{\overline{x}} = 1.601 years
Example 5. The mean time taken to read a newspaper is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of size 70. Find its mean and standard deviation.
Given: μ = 8.2, σ = 1, n = 70
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence, \mu _{\overline{x}} = μ = 8.2 minutes
Now, \sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}} = 1/√70
⇒ \sigma _{\overline{x}} = 0.11 minutes
Example 6. A distribution has a mean of 12 and a standard deviation of 3. Find the mean and standard deviation if a sample of 36 is drawn from the distribution.
Given: μ = 12, σ = 3, n = 36
As per the Central Limit Theorem, the sample mean is equal to the population mean.
Hence, \mu _{\overline{x}} = μ = 12
Now, \sigma _{\overline{x}}=\frac{\sigma }{\sqrt{n}} = 3/√36
⇒ \sigma _{\overline{x}} = 0.5
Example 7. You want to estimate the mean income of a population with a margin of error of $5, assuming the population standard deviation is $50, and you want a 95% confidence level. What sample size do you need?
Given: Z= 1.96 (for 95% confidence level), σ = 50, E = 5
As per the Central Limit Theorem, the formula to calculate the sample size is.
Hence, n = \left( \frac{E}{Z \times \sigma} \right)^2
n = \left( \frac{5}{1.96 \times 50} \right)^2 = \left( \frac{5}{98} \right)^2 = (19.6)^2
n= 384.16 (Round up to the nearest whole number)
n=385
The required sample size is 385.
Example 8. Given that the population proportion p=0.40p = 0.40p=0.40 and the sample size n = 100, calculate the standard error for the sample proportion \hat{p}.
Given: n=100, p=40% or .40.
As per the Central Limit Theorem, the formula to calculate standard error for proportions.
\sigma_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}} = \sqrt{\frac{100}{0.40(1 - 0.40)}} = \sqrt{\frac{100}{0.24}} = 0.04899
\sigma_{\hat{p}}=0.04899
\sigma_{\hat{p}} \approx 0.04899
Related Articles
Practice Problem Based on Central Limit Theorem
Question 1. Given that the population mean is 50 and the population standard deviation is 10, find the Z-score for a sample mean of 52, when the sample size is 25.
Question 2. If the population has a standard deviation of 15, and you take a sample of 50 from this population, calculate the standard error of the sample mean.
Question 3. A population has a mean of 100 and a standard deviation of 20. You take a sample of 36. Calculate the 95% confidence interval for the sample mean.
Question 4. The average height of adult women in a population is 160 cm with a standard deviation of 10 cm. What is the probability that a random sample of 25 women has a mean height greater than 162 cm?
Answer:-
- 1
- 2.12
- [93.47, 106.53]
- 0.1587
Central Limit Theorem in Statistics | Formula, Derivation, Examples & Proof
Central Limit Theorem in Statistics | Formula, Derivation, Examples & Proof
Central Limit Theorem (CLT) in Machine Learning
Similar Reads
Maths Mathematics, often referred to as "math" for short. It is the study of numbers, quantities, shapes, structures, patterns, and relationships. It is a fundamental subject that explores the logical reasoning and systematic approach to solving problems. Mathematics is used extensively in various fields
5 min read
Basic Arithmetic
What are Numbers?Numbers are symbols we use to count, measure, and describe things. They are everywhere in our daily lives and help us understand and organize the world.Numbers are like tools that help us:Count how many things there are (e.g., 1 apple, 3 pencils).Measure things (e.g., 5 meters, 10 kilograms).Show or
15+ min read
Arithmetic OperationsArithmetic Operations are the basic mathematical operationsâAddition, Subtraction, Multiplication, and Divisionâused for calculations. These operations form the foundation of mathematics and are essential in daily life, such as sharing items, calculating bills, solving time and work problems, and in
9 min read
Fractions - Definition, Types and ExamplesFractions are numerical expressions used to represent parts of a whole or ratios between quantities. They consist of a numerator (the top number), indicating how many parts are considered, and a denominator (the bottom number), showing the total number of equal parts the whole is divided into. For E
7 min read
What are Decimals?Decimals are numbers that use a decimal point to separate the whole number part from the fractional part. This system helps represent values between whole numbers, making it easier to express and measure smaller quantities. Each digit after the decimal point represents a specific place value, like t
10 min read
ExponentsExponents are a way to show that a number (base) is multiplied by itself many times. It's written as a small number (called the exponent) to the top right of the base number.Think of exponents as a shortcut for repeated multiplication:23 means 2 x 2 x 2 = 8 52 means 5 x 5 = 25So instead of writing t
9 min read
PercentageIn mathematics, a percentage is a figure or ratio that signifies a fraction out of 100, i.e., A fraction whose denominator is 100 is called a Percent. In all the fractions where the denominator is 100, we can remove the denominator and put the % sign.For example, the fraction 23/100 can be written a
5 min read
Algebra
Variable in MathsA variable is like a placeholder or a box that can hold different values. In math, it's often represented by a letter, like x or y. The value of a variable can change depending on the situation. For example, if you have the equation y = 2x + 3, the value of y depends on the value of x. So, if you ch
5 min read
Polynomials| Degree | Types | Properties and ExamplesPolynomials are mathematical expressions made up of variables (often represented by letters like x, y, etc.), constants (like numbers), and exponents (which are non-negative integers). These expressions are combined using addition, subtraction, and multiplication operations.A polynomial can have one
9 min read
CoefficientA coefficient is a number that multiplies a variable in a mathematical expression. It tells you how much of that variable you have. For example, in the term 5x, the coefficient is 5 â it means 5 times the variable x.Coefficients can be positive, negative, or zero. Algebraic EquationA coefficient is
8 min read
Algebraic IdentitiesAlgebraic Identities are fundamental equations in algebra where the left-hand side of the equation is always equal to the right-hand side, regardless of the values of the variables involved. These identities play a crucial role in simplifying algebraic computations and are essential for solving vari
14 min read
Properties of Algebraic OperationsAlgebraic operations are mathematical processes that involve the manipulation of numbers, variables, and symbols to produce new results or expressions. The basic algebraic operations are:Addition ( + ): The process of combining two or more numbers to get a sum. For example, 3 + 5 = 8.Subtraction (â)
3 min read
Geometry
Lines and AnglesLines and Angles are the basic terms used in geometry. They provide a base for understanding all the concepts of geometry. We define a line as a 1-D figure that can be extended to infinity in opposite directions, whereas an angle is defined as the opening created by joining two or more lines. An ang
9 min read
Geometric Shapes in MathsGeometric shapes are mathematical figures that represent the forms of objects in the real world. These shapes have defined boundaries, angles, and surfaces, and are fundamental to understanding geometry. Geometric shapes can be categorized into two main types based on their dimensions:2D Shapes (Two
2 min read
Area and Perimeter of Shapes | Formula and ExamplesArea and Perimeter are the two fundamental properties related to 2-dimensional shapes. Defining the size of the shape and the length of its boundary. By learning about the areas of 2D shapes, we can easily determine the surface areas of 3D bodies and the perimeter helps us to calculate the length of
10 min read
Surface Areas and VolumesSurface Area and Volume are two fundamental properties of a three-dimensional (3D) shape that help us understand and measure the space they occupy and their outer surfaces.Knowing how to determine surface area and volumes can be incredibly practical and handy in cases where you want to calculate the
10 min read
Points, Lines and PlanesPoints, Lines, and Planes are basic terms used in Geometry that have a specific meaning and are used to define the basis of geometry. We define a point as a location in 3-D or 2-D space that is represented using coordinates. We define a line as a geometrical figure that is extended in both direction
14 min read
Coordinate Axes and Coordinate Planes in 3D spaceIn a plane, we know that we need two mutually perpendicular lines to locate the position of a point. These lines are called coordinate axes of the plane and the plane is usually called the Cartesian plane. But in real life, we do not have such a plane. In real life, we need some extra information su
6 min read
Trigonometry & Vector Algebra
Trigonometric RatiosThere are three sides of a triangle Hypotenuse, Adjacent, and Opposite. The ratios between these sides based on the angle between them is called Trigonometric Ratio. The six trigonometric ratios are: sine (sin), cosine (cos), tangent (tan), cotangent (cot), cosecant (cosec), and secant (sec).As give
4 min read
Trigonometric Equations | Definition, Examples & How to SolveTrigonometric equations are mathematical expressions that involve trigonometric functions (such as sine, cosine, tangent, etc.) and are set equal to a value. The goal is to find the values of the variable (usually an angle) that satisfy the equation.For example, a simple trigonometric equation might
9 min read
Trigonometric IdentitiesTrigonometric identities play an important role in simplifying expressions and solving equations involving trigonometric functions. These identities, which include relationships between angles and sides of triangles, are widely used in fields like geometry, engineering, and physics. Some important t
10 min read
Trigonometric FunctionsTrigonometric Functions, often simply called trig functions, are mathematical functions that relate the angles of a right triangle to the ratios of the lengths of its sides.Trigonometric functions are the basic functions used in trigonometry and they are used for solving various types of problems in
6 min read
Inverse Trigonometric Functions | Definition, Formula, Types and Examples Inverse trigonometric functions are the inverse functions of basic trigonometric functions. In mathematics, inverse trigonometric functions are also known as arcus functions or anti-trigonometric functions. The inverse trigonometric functions are the inverse functions of basic trigonometric function
11 min read
Inverse Trigonometric IdentitiesInverse trigonometric functions are also known as arcus functions or anti-trigonometric functions. These functions are the inverse functions of basic trigonometric functions, i.e., sine, cosine, tangent, cosecant, secant, and cotangent. It is used to find the angles with any trigonometric ratio. Inv
9 min read
Calculus
Introduction to Differential CalculusDifferential calculus is a branch of calculus that deals with the study of rates of change of functions and the behaviour of these functions in response to infinitesimal changes in their independent variables.Some of the prerequisites for Differential Calculus include:Independent and Dependent Varia
6 min read
Limits in CalculusIn mathematics, a limit is a fundamental concept that describes the behaviour of a function or sequence as its input approaches a particular value. Limits are used in calculus to define derivatives, continuity, and integrals, and they are defined as the approaching value of the function with the inp
12 min read
Continuity of FunctionsContinuity of functions is an important unit of Calculus as it forms the base and it helps us further to prove whether a function is differentiable or not. A continuous function is a function which when drawn on a paper does not have a break. The continuity can also be proved using the concept of li
13 min read
DifferentiationDifferentiation in mathematics refers to the process of finding the derivative of a function, which involves determining the rate of change of a function with respect to its variables.In simple terms, it is a way of finding how things change. Imagine you're driving a car and looking at how your spee
2 min read
Differentiability of a Function | Class 12 MathsContinuity or continuous which means, "a function is continuous at its domain if its graph is a curve without breaks or jumps". A function is continuous at a point in its domain if its graph does not have breaks or jumps in the immediate neighborhood of the point. Continuity at a Point: A function f
11 min read
IntegrationIntegration, in simple terms, is a way to add up small pieces to find the total of something, especially when those pieces are changing or not uniform.Imagine you have a car driving along a road, and its speed changes over time. At some moments, it's going faster; at other moments, it's slower. If y
3 min read
Probability and Statistics
Basic Concepts of ProbabilityProbability is defined as the likelihood of the occurrence of any event. It is expressed as a number between 0 and 1, where 0 is the probability of an impossible event and 1 is the probability of a sure event.Concepts of Probability are used in various real life scenarios : Stock Market : Investors
7 min read
Bayes' TheoremBayes' Theorem is a mathematical formula used to determine the conditional probability of an event based on prior knowledge and new evidence. It adjusts probabilities when new information comes in and helps make better decisions in uncertain situations.Bayes' Theorem helps us update probabilities ba
13 min read
Probability Distribution - Function, Formula, TableA probability distribution is a mathematical function or rule that describes how the probabilities of different outcomes are assigned to the possible values of a random variable. It provides a way of modeling the likelihood of each outcome in a random experiment.While a Frequency Distribution shows
13 min read
Descriptive StatisticStatistics is the foundation of data science. Descriptive statistics are simple tools that help us understand and summarize data. They show the basic features of a dataset, like the average, highest and lowest values and how spread out the numbers are. It's the first step in making sense of informat
5 min read
What is Inferential Statistics?Inferential statistics is an important tool that allows us to make predictions and conclusions about a population based on sample data. Unlike descriptive statistics, which only summarize data, inferential statistics let us test hypotheses, make estimates, and measure the uncertainty about our predi
7 min read
Measures of Central Tendency in StatisticsCentral tendencies in statistics are numerical values that represent the middle or typical value of a dataset. Also known as averages, they provide a summary of the entire data, making it easier to understand the overall pattern or behavior. These values are useful because they capture the essence o
11 min read
Set TheorySet theory is a branch of mathematics that deals with collections of objects, called sets. A set is simply a collection of distinct elements, such as numbers, letters, or even everyday objects, that share a common property or rule.Example of SetsSome examples of sets include:A set of fruits: {apple,
3 min read
Practice