Statistic and Probability
Statistic and Probability
HAZEL BRIGOLE
MERIAM CUICO
MARIAN CUICO
JM MONTEZA
ALMER JAGOLINA
Statistic and Probability
Objectives:
The students that are having a hard time to understand the lessons in
Statistic and Probability will learn it here easily.
It provides understandable examples.
Help the students to understand the lesson and to know how to solve
them.
We ask the Grade 11 students about what are the lessons in Statistic and
Probability they are having a hard time to understand. And here are their
common answers:
Variance and Standard Deviation
Normal Distribution
The Probability Notation Under Normal Curve
Sampling Distribution of Sample Means
Means and Variance of Sampling Distribution
Central Limit Theorem
Variance and Standard Deviation
The standard deviation is derived from variance and tells you, on average, how far each
value lies from the mean. It’s the square root of variance. Both measures reflect variability in a
distribution, but their units differ:
Standard deviation is expressed in the same units as the original values (e.g.,
meters).
Variance is expressed in much larger units (e.g., meters squared).
Since the units of variance are much larger than those of typical value of data set, is
harder to interpret the variance number intuitively. That’s why standard deviation is often
preferred as a main measure of variability.
µ= ΣX · P(X) σ 2
= Σ (X- µ)2P(X)
= 31/10
=2.69 is the variance
= 3.1
√2.69
Note: The no. of items and the probability are already given, all you have to do is to calculate
what is asked; X · P(X), X - µ, (X- µ)2, (X- µ)2 · P(X)
Normal Distribution
Normal distribution is a continuous probability distribution where in values live in a
symmetrical fashion mostly situated around the mean.
The normal distribution is a probability distribution, so the total area under the curve is
always 1 or 100%. The formula for the normal probability density function looks fairly
complicated. But to use it, you only need to know the population mean and standard deviation.
Find here some normal distribution word problems or some applications of the normal
distribution.
Example #1
Suppose the current annual salary of all teachers in the United States have a normal
distribution with a mean of 51000 dollars and a standard deviation of 6000 dollars. Find the
probability that the annual salary of a randomly selected teacher would be between 42000 and
65000.
Solution
The probability that the annual salary of a randomly selected teacher is between 42000
and 65000 is given by the area under the normal curve between x = 42000 and x = 65000.
z = 14000/6000
= 2.33
The required probability is given by the area under the normal curve between z = -1.5 and z =
2.33.
This is obtained by adding the area between z = -1.5 and z = 0 and the area between z = 0 and z
= 2.33.Using the standard normal distribution table, we see that the area between z = -1.5 and z
= 0 is 0.4332 and the area between z = 0 and z = 2.33 is 0.4901. P (42000 < x < 65000) = P (-1.5 <
z < 2.33) = 0.4332 + 0.4901 = 0.9233
This means that about 92.33% of all teachers in the USA earn between 42000 and 65000.
The Probability notation its symbols used to denote the Probability of certain events.
The Table of areas under the normal curve is also known as Z-Table.
P (a < z <b) denotes the probability that the z-score is between a and b
P (z >a) denotes the probability that the z-score is greater than a
P (z < a) the probability that the z-score is less than a
Where a and b are z-score values.
For example:
To denote the area between z=1 and z=2, we used the notation: P (z < z < 2) = 0.1359
Entries in the table give the area under the curve between the mean and z standard deviations
above the mean. For example, for z = 1.46, the area under the curve between the mean (0) and
z is 0.4279
A.
Draw a normal curve.
Locate the z – value.
Draw a line through the z-value.
Shade the required region.
B.
Consult the z-table and find the area that corresponds to Z= 1.46.
Z= 1.46 corresponds to a area of 0.4279
C.
Examine the graph and use probability notation to form an equation showing the appropriate
operation to get the required area.
The graph suggests subtraction. The required area is equal to 0.5 – 0.4279 = 0.0721.
That is, P (z < 1.46) = 0.5 – 0.4279 = 0.0721
D.
Make a statement indicating the required area.
The proportion of the area above z= 1.46 is 0.0721 or 7.21%
In statistics, population is the entire set of items from which you draw data for a statistical
study. It can be a group of individuals, a set of items, etc. It makes up the data pool for a study.
Generally, population refers to the people who live in a particular area at a specific time.
Sample - refers to a smaller, manageable version of a larger group. It is a subset containing the
characteristics of a larger population.
Sampling- is the process of getting a sample.
Parameter- population’s particular characteristic.
Types of Sampling:
In this case each individual is chosen entirely by chance and each member of the population has
an equal chance, or probability, of being selected. One way of obtaining a random sample is to
give each individual in a population a number, and then use a table of random numbers to
decide which individuals to include.
2. Systematic sampling
Individuals are selected at regular intervals from the sampling frame. The intervals are chosen
to ensure an adequate sample size. If you need a sample size n from a population of size x, you
should select every x/nth individual for the sample.
3. Stratified sampling
In this method, the population is first divided into subgroups (or strata) who all share a
similar characteristic. It is used when we might reasonably expect the measurement of interest
to vary between the different subgroups, and we want to ensure representation from all the
subgroups.
4. Clustered sampling
In a clustered sample, subgroups of the population are used as the sampling unit, rather
than individuals. The population is divided into subgroups, known as clusters, which are
randomly selected to be included in the study.
5. Convenience sampling
Convenience sampling is perhaps the easiest method of sampling, because participants are
selected based on availability and willingness to take part.
1. Determine the number of sets of all the possible random samples that can be drawn from the
given population by using the formula, NCn where N is the population size and n is the sample
size.
2. List all the possible samples and compute the mean of each sample.
3. Construct the sampling distribution.
Example:
Population consists of the numbers 2, 4, 9, 10 and 5.let us list all possible sample size of 3
from this population and compute the mean of each sample.
a. Determine the number of sets of all possible random samples that can be drawn from the
given population by using formula NCn. where N is the population size and n is the sample
size.
C = N! /n! (N-n)!
N n C = N! /n! (N-n)!
N n
N= 5 =5! / 3! (5-3)!
n=3 =5·4·3·2·1 / 3·2·1 (2·1)
= 120/1
= 10 *There are 10 possible samples of size 3 that can
be drawn from the given data.
b. List all the possible samples and compute the mean of each sample.
Sample Mean
2,3,9 5.00
2,4,10 5.33
2,4,5 3.67
2,9,10 7.00
2,9,5 5.33
2,10,5 5.67
4,9,10 7.67
4,9,5, 6.00
4,10,5 6.33
9,10,5 8.00
EXAMPLE:
μ= Σ X
N
= 1+2+3+4+5
5
= 3.00
So the mean of the population is 3.00
1 1 – 3 = -2 (-2)2 = 4 N
2 2 – 3 = -1 (-1)2 = 1 = 10/5
3 3–3=0 (0)2 = 0 =2
4 4–3=1 (1)2 = 1
5 5–3=2 (2)2 = 4 So, the variance of the
population is 2.
Σ (X- μ)2 = 10
C =10
5 2
Samples Mean
1,2,3 2.00
1,2,4 2.33
1,2,5 2.67
1,3,4 2.67
1,3,5 3.00
1,4,5 3.33
Step 2,3,4 3.00 5.Construct the sampling distribution of the
sample 2,3,5 3.33 means.
2,4,5 3.67
Sample Mean
3,4,5X̅ Frequency
4.00 Probability P(X̅ )
2.00 1 1/10
2.33 1 1/10
2.67 2 2/10
3.00 2 2/10
3.33 2 2/10
3.67 1 1/10
4.00 1 1/10
Total 10 1.00
Step 6.Compare the mean of the sampling distribution of the sample means (X- μ).
Step 7.Compute the variance (σ2) of the sampling distribution of the sample means.
a) Subtract the population mean (μ) from each sample mean (X̅). Label this as X̅- μ
b) Square the difference. Label this as (X̅- μ)2.
c) Multiply the results by the corresponding probability. Label this as P (X̅) · (X̅- μ)2.
d) Add the results.
=0.334 or 0.33
In probability theory, the central limit theorem establishes that, in many situations, for
identically distributed independent samples, the standardized sample mean tends towards the
standard normal distribution even if the original variables themselves are not normally
distributed.
Given a population with a finite mean μ and a finite non-zero variance σ2, the sampling
distribution of the mean approaches a normal distribution with a mean of μ and a variance of σ2
/ N as N, the sample size, increases.
EXAMPLE:
A. What is the probability that a randomly selected college student will complete the
examination in less than 45 minutes?
N= 47.2
σ= 8
X= 45
If what is asked is less than , then used its symbol “< “, P(X< _ )
and if it is more than, use its symbol “> “,P( X> _ )
So we will used, P(X< 45)
X – N/σ
X – N/σ
= 45 – 47.2 / 9
= -0.24
-0.24
P(X< 45)
= P(X< - 0.24)
Find the z-score
= 0.5000 – 0.0948, where 0.5000 is constant
= 0.4052
The probability that randomly selected college student will complete the examination
in less than 45 minutes is 0.3446 or 34.46 %.