0% found this document useful (0 votes)
35 views15 pages

Statistic and Probability

The document provides information about statistical concepts that students commonly struggle with such as variance, standard deviation, normal distribution, and sampling distributions. It defines these concepts and provides examples to help explain them. Variance and standard deviation measure variability in a distribution. The standard deviation is the square root of variance. A normal distribution is a continuous probability distribution that is symmetrical around the mean. Notation is used to describe probabilities under the normal curve. A sampling distribution of sample means shows how sample means are distributed when multiple samples are taken from a population. Steps for constructing a sampling distribution include determining possible samples, calculating the mean of each, and constructing the distribution.

Uploaded by

Ferly Taburada
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
35 views15 pages

Statistic and Probability

The document provides information about statistical concepts that students commonly struggle with such as variance, standard deviation, normal distribution, and sampling distributions. It defines these concepts and provides examples to help explain them. Variance and standard deviation measure variability in a distribution. The standard deviation is the square root of variance. A normal distribution is a continuous probability distribution that is symmetrical around the mean. Notation is used to describe probabilities under the normal curve. A sampling distribution of sample means shows how sample means are distributed when multiple samples are taken from a population. Steps for constructing a sampling distribution include determining possible samples, calculating the mean of each, and constructing the distribution.

Uploaded by

Ferly Taburada
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 15

FERLY TABURADA

HAZEL BRIGOLE
MERIAM CUICO
MARIAN CUICO
JM MONTEZA
ALMER JAGOLINA
Statistic and Probability
Objectives:
 The students that are having a hard time to understand the lessons in
Statistic and Probability will learn it here easily.
 It provides understandable examples.
 Help the students to understand the lesson and to know how to solve
them.

We ask the Grade 11 students about what are the lessons in Statistic and
Probability they are having a hard time to understand. And here are their
common answers:
 Variance and Standard Deviation
 Normal Distribution
 The Probability Notation Under Normal Curve
 Sampling Distribution of Sample Means
 Means and Variance of Sampling Distribution
 Central Limit Theorem
Variance and Standard Deviation

The standard deviation is derived from variance and tells you, on average, how far each
value lies from the mean. It’s the square root of variance. Both measures reflect variability in a
distribution, but their units differ:

 Standard deviation is expressed in the same units as the original values (e.g.,
meters).
 Variance is expressed in much larger units (e.g., meters squared).

Since the units of variance are much larger than those of typical value of data set, is
harder to interpret the variance number intuitively. That’s why standard deviation is often
preferred as a main measure of variability.

EXAMPLE: Finding the variance and standard deviation:

NO. of Probability X · P(X) X-µ (X- µ)2 (X- µ)2 · P(X)


items P(X)

1 3/10 1 · 3/10 1 – 3.1 = -2.1 (-2.1)2 = 4.41 4.41 · 3/10 =1.323

2 1/10 2 ·1/10 2 – 3.1 = -1.1 (-1.1)2 = 1.21 1.21 · 1/10 = 0.121

3 1/10 3 · 1/10 3 – 3.1= -0.1 (-0.1)2 = 0.01 0.01 · 1/10 = 1/1000

4 2/10 4 · 2/10 4 – 3.1 = 0.9 (0.9)2 = 0.81 0.81 · 2/10 = 0.162

5 3/10 5 · 3/10 5 – 3.1 = 1.9 (1.9)2 = 3.61 3.61 · 3/10 = 1.083

µ= ΣX · P(X) σ 2
= Σ (X- µ)2P(X)
= 31/10
=2.69 is the variance
= 3.1
√2.69

= 1.64, is the standard


variation

Note: The no. of items and the probability are already given, all you have to do is to calculate
what is asked; X · P(X), X - µ, (X- µ)2, (X- µ)2 · P(X)

Normal Distribution
Normal distribution is a continuous probability distribution where in values live in a
symmetrical fashion mostly situated around the mean.

The normal distribution is a probability distribution, so the total area under the curve is
always 1 or 100%. The formula for the normal probability density function looks fairly
complicated. But to use it, you only need to know the population mean and standard deviation.

Find here some normal distribution word problems or some applications of the normal
distribution.

Example #1

Suppose the current annual salary of all teachers in the United States have a normal
distribution with a mean of 51000 dollars and a standard deviation of 6000 dollars. Find the
probability that the annual salary of a randomly selected teacher would be between 42000 and
65000.

Solution

The probability that the annual salary of a randomly selected teacher is between 42000
and 65000 is given by the area under the normal curve between x = 42000 and x = 65000.

For x = 42000, z = 42000 – 51000/6000


z =-9000/6000
 = -1.5

For x = 65000, z = 65000 – 51000/6000

z = 14000/6000
 = 2.33
The required probability is given by the area under the normal curve between z = -1.5 and z =
2.33.

This is obtained by adding the area between z = -1.5 and z = 0 and the area between z = 0 and z
= 2.33.Using the standard normal distribution table, we see that the area between z = -1.5 and z
= 0 is 0.4332 and the area between z = 0 and z = 2.33 is 0.4901. P (42000 < x < 65000) = P (-1.5 <
z < 2.33) = 0.4332 + 0.4901 = 0.9233

This means that about 92.33% of all teachers in the USA earn between 42000 and 65000.

The Probability Notations, Under the Normal Curve


The following notations for a random variable are used in various solutions concerning the
normal curve. Mathematical notations are convenient forms of lengthy expressions.

 The Probability notation its symbols used to denote the Probability of certain events.
 The Table of areas under the normal curve is also known as Z-Table.
P (a < z <b) denotes the probability that the z-score is between a and b
P (z >a) denotes the probability that the z-score is greater than a
P (z < a) the probability that the z-score is less than a
Where a and b are z-score values.

For example:
To denote the area between z=1 and z=2, we used the notation: P (z < z < 2) = 0.1359

Entries in the table give the area under the curve between the mean and z standard deviations
above the mean. For example, for z = 1.46, the area under the curve between the mean (0) and
z is 0.4279

Example: Above z = 1.46

Here are the steps to be followed:

A.
 Draw a normal curve.
 Locate the z – value.
 Draw a line through the z-value.
 Shade the required region.

B.
Consult the z-table and find the area that corresponds to Z= 1.46.
 Z= 1.46 corresponds to a area of 0.4279

C.
Examine the graph and use probability notation to form an equation showing the appropriate
operation to get the required area.

 The graph suggests subtraction. The required area is equal to 0.5 – 0.4279 = 0.0721.
That is, P (z < 1.46) = 0.5 – 0.4279 = 0.0721

D.
Make a statement indicating the required area.
 The proportion of the area above z= 1.46 is 0.0721 or 7.21%

SAMPLING DISTRIBUTION OF SAMPLE MEANS


A sampling distribution of sample means is a frequency distribution using the means
computed from all possible random samples of a specific size taken from a population. The
means of the samples are less than or greater than the mean of the population.

In statistics, population is the entire set of items from which you draw data for a statistical
study. It can be a group of individuals, a set of items, etc. It makes up the data pool for a study.
Generally, population refers to the people who live in a particular area at a specific time.
Sample - refers to a smaller, manageable version of a larger group. It is a subset containing the
characteristics of a larger population.
Sampling- is the process of getting a sample.
Parameter- population’s particular characteristic.

Types of Sampling:

1. Simple random sampling

In this case each individual is chosen entirely by chance and each member of the population has
an equal chance, or probability, of being selected. One way of obtaining a random sample is to
give each individual in a population a number, and then use a table of random numbers to
decide which individuals to include.

2. Systematic sampling

Individuals are selected at regular intervals from the sampling frame. The intervals are chosen
to ensure an adequate sample size. If you need a sample size n from a population of size x, you
should select every x/nth individual for the sample.

3. Stratified sampling
In this method, the population is first divided into subgroups (or strata) who all share a
similar characteristic. It is used when we might reasonably expect the measurement of interest
to vary between the different subgroups, and we want to ensure representation from all the
subgroups.

4. Clustered sampling

In a clustered sample, subgroups of the population are used as the sampling unit, rather
than individuals. The population is divided into subgroups, known as clusters, which are
randomly selected to be included in the study.

5. Convenience sampling

Convenience sampling is perhaps the easiest method of sampling, because participants are
selected based on availability and willingness to take part.

STEPS IN CONSTRUCTING THE SAMPLING DISTRIBUTION OF THE MEANS

1. Determine the number of sets of all the possible random samples that can be drawn from the
given population by using the formula, NCn where N is the population size and n is the sample
size.
2. List all the possible samples and compute the mean of each sample.
3. Construct the sampling distribution.

Example:
Population consists of the numbers 2, 4, 9, 10 and 5.let us list all possible sample size of 3
from this population and compute the mean of each sample.

a. Determine the number of sets of all possible random samples that can be drawn from the
given population by using formula NCn. where N is the population size and n is the sample
size.

C = N! /n! (N-n)!
N n C = N! /n! (N-n)!
N n
N= 5 =5! / 3! (5-3)!
n=3 =5·4·3·2·1 / 3·2·1 (2·1)
= 120/1
= 10 *There are 10 possible samples of size 3 that can
be drawn from the given data.
b. List all the possible samples and compute the mean of each sample.

Sample Mean
2,3,9 5.00
2,4,10 5.33
2,4,5 3.67
2,9,10 7.00
2,9,5 5.33
2,10,5 5.67
4,9,10 7.67
4,9,5, 6.00
4,10,5 6.33
9,10,5 8.00

c. Construct the sampling distribution.

Sample Mean Frequency Sample Frequency Probability


3.67 1 Mean P(x)
3.67 1 1/10= 0.10
5.00 1
5.00 1 1/10= 0.10
5.33 2
5.33 2 2/10= 0.20
5.67 1
5.67 1 1/10= 0.10
6.00 1
6.00 1 1/10= 0.10
6.33 1 6.33 1 1/10= 0.10
7.00 1 7.00 1 1/10= 0.10
7.67 1 7.67 1 1/10= 0.10
8.00 1 8.00 1 1/10= 0.10
Total 10 Total n=10 1.00

Mean and Variance of Sampling Distribution


Recognize that the sampling distribution of the sample means is definitely the probability
distribution of the sample means, which also implies that the samples is that random variable in
this probability distribution. But since the sampling distribution of the means is the probability
distribution of the random variable X, we could perhaps calculate its mean and variance.

EXAMPLE:

Consider a population consisting of 1,2,3,4 and 5. Suppose samples of size 3 are


drawn from this population. Compute the mean and the variance of the sampling distribution
of the sample means.

Here are the Steps to be followed:

Step 1.Compute the mean of the population.

 μ= Σ X
N
= 1+2+3+4+5
5
= 3.00
So the mean of the population is 3.00

Step 2.Compute the variance of the population (σ)

X X-μ (X -μ)2 σ = Σ (X- μ)2


2

1 1 – 3 = -2 (-2)2 = 4 N
2 2 – 3 = -1 (-1)2 = 1 = 10/5
3 3–3=0 (0)2 = 0 =2
4 4–3=1 (1)2 = 1
5 5–3=2 (2)2 = 4 So, the variance of the
population is 2.
Σ (X- μ)2 = 10

Step 3.Determine the number if possible samples of size n=2


 Use the formula NCn.

Here N=5 and n=2.

C =10
5 2

So, there are 10 possible samples of size 2 that can be drawn.

Step 4.List all possible samples and their corresponding mean.

Samples Mean
1,2,3 2.00
1,2,4 2.33
1,2,5 2.67
1,3,4 2.67
1,3,5 3.00
1,4,5 3.33
Step 2,3,4 3.00 5.Construct the sampling distribution of the
sample 2,3,5 3.33 means.
2,4,5 3.67
Sample Mean
3,4,5X̅ Frequency
4.00 Probability P(X̅ )
2.00 1 1/10

2.33 1 1/10

2.67 2 2/10
3.00 2 2/10

3.33 2 2/10
3.67 1 1/10

4.00 1 1/10

Total 10 1.00

Step 6.Compare the mean of the sampling distribution of the sample means (X- μ).

Follow these steps:

a) Multiply the sample mean by the corresponding probability.


b) Add the results.
Sample Mean X̅ Probability P(X̅ ) X̅ · P(X̅ )
2.00 1/10 0.20
2.33 1/10 0.23
2.67 1/5 0.53
3.00 1/5 0.60
3.33 1/5 0.67
3.67 1/10 0.37
4.00 1/10 0.40
Total 1.00 3.00 3.00 is the mean of sampling
distribution.

Step 7.Compute the variance (σ2) of the sampling distribution of the sample means.

Follow these steps:

a) Subtract the population mean (μ) from each sample mean (X̅). Label this as X̅- μ
b) Square the difference. Label this as (X̅- μ)2.
c) Multiply the results by the corresponding probability. Label this as P (X̅) · (X̅- μ)2.
d) Add the results.

X̅ P (X̅) X̅- μ (X̅- μ)2 P (X̅) · (X̅- μ)2


2.00 1/10 -1.00 1.00 0.100
2.33 1/10 -0.67 0.45 0.045
2.67 1/5 -0.33 0.11 0.022
3.00 1/5 0.00 0.00 0.000
3.33 1/5 0.33 0.11 0.022
3.67 1/10 0.67 0.45 0.045
4.00 1/10 1.00 1.00 0.100
Total 1.00 0.334

σ = Σ P (X̅) · (X̅- μ)2


2

=0.334 or 0.33

So, the variance of the sampling distribution is 0.33.

Recognize that the sampling distribution of the


sample means is
definitely the probability distribution of the
sample means, which also
implies that the sample is the random variable
in this probability
distribution. But since the sampling
distribution of the means is the
probability distribution of the random variable
X, we could perhaps
calculate its mean and variance.
Recognize that the sampling distribution of the
sample means is
definitely the probability distribution of the
sample means, which also
implies that the sample is the random variable
in this probability
distribution. But since the sampling
distribution of the means is the
probability distribution of the random variable
X, we could perhaps
calculate its mean and variance.
Central Limit Theorem

In probability theory, the central limit theorem establishes that, in many situations, for
identically distributed independent samples, the standardized sample mean tends towards the
standard normal distribution even if the original variables themselves are not normally
distributed.

The central limit theorem states that:

Given a population with a finite mean μ and a finite non-zero variance σ2, the sampling
distribution of the mean approaches a normal distribution with a mean of μ and a variance of σ2
/ N as N, the sample size, increases.

EXAMPLE:

The arrange time it takes a group of college students to complete a certain


examination is 47.2 minutes. The standard deviation is 9 minutes. Assume that the variable is
normally distributed standard.

A. What is the probability that a randomly selected college student will complete the
examination in less than 45 minutes?

Here are the 5 steps to be followed:

Step 1.Identify the given information.

N= 47.2

σ= 8

X= 45

Step 2.Identify what is asked.

 If what is asked is less than , then used its symbol “< “, P(X< _ )
 and if it is more than, use its symbol “> “,P( X> _ )
So we will used, P(X< 45)

Step 3.Identify the formula to be used

X – N/σ

Step 4.Solve the problem


Solve the problem using the formula,

X – N/σ

= 45 – 47.2 / 9

= -0.24

Draw a normal curve,

-0.24

P(X< 45)
= P(X< - 0.24)
Find the z-score
= 0.5000 – 0.0948, where 0.5000 is constant
= 0.4052

Step 5.State the final answer.

The probability that randomly selected college student will complete the examination
in less than 45 minutes is 0.3446 or 34.46 %.

Recognize that the sampling distribution of the


sample means is
definitely the probability distribution of the
sample means, which also
implies that the sample is the random variable
in this probability
distribution. But since the sampling
distribution of the means is the
probability distribution of the random variable
X, we could perhaps
calculate its mean and variance.

You might also like