Statistics & Probability
Statistics & Probability
●PROBABILITY
bb
★ contains all possible outcomes of the
experiment.
1
STATISTICS & PROBABILITY
bb
●NOTE!!
★ ALWAYS SIMPLIFY THE ANSWER
★ HISTOGRAM SHOULD BE SHADED
★ THE SUM OF ALL PROBABILITIES SHOULD
EQUAL TO 1
★ PROBABILITIES SHOULD BE CONFINED
lc
BETWEEN 0 AND 1 (NEGATIVE IS NOT
ALLOWED)
2
STATISTICS & PROBABILITY
̂ ) (read as X-hat)
set. It is symbolized as (X
★ Types of modes: unimodal, bimodal, trimodal, &
multimodal
bb ●Mode
●Median
★ median is the middle value in a set of
lc
quantities.
★ separates an ordered set of data into two equal
parts. Half of the quantities are located above
the median and the other half is found below it.
3
STATISTICS & PROBABILITY
MEAN OF A DISCRETE RANDOM VARIABLE
4
STATISTICS & PROBABILITY
★ *Many random variables are either normally ■ Inflection point is the point at which a
distributed or, at least approximately normally change in the direction of curve at mean
distributed. EXAMPLE: height, weights and minus standard deviation and mean plus
examinations standard deviation.
●The properties of normal distribution ■ Note that each inflection point of the
★ 1. The graph is a continuous curve and has a normal curve is one standard deviation
domain -∞ < X < ∞. This means that X may away from the mean.
increase or decrease without bound. ★ 7. Every normal curve corresponds to the
★ 2. The graph is asymptotic to the x-axis. The “empirical rule” (also called the 68 - 95 99.7%
value of the variable gets closer and closer but rule):
will never be equal to 0 . ■ About 68.3% of the area under the curve
★ 3. The highest point on the curve occurs at x = falls within 1 standard deviation of the
µ (mean). The mean (µ) indicates the highest mean (half = 34.15)
peak of the curve and is found at the center. ■ About 95.4% of the area under the curve
■ The median and mode of the distribution falls within 2 standard deviations of the
are also found at the center of the graph. mean (half = 47.7)
This indicates that in a normal
bb ■ About 99.7% of the area under the curve
distribution, the mean, median and mode falls within 3 standard deviations of the
are equal . mean. (half = 49.85)
■ Graph THE STANDARD NORMAL DISTRIBUTION
5
STATISTICS & PROBABILITY
★ 3: If you are looking for the area between two z ●Three important things to remember
scores, simply subtract the corresponding ★ First, a probability value corresponds to an
areas to arrive at the answer area under the normal curve.
●The z-score ★ Second, in the Table of Areas Under the
★ The z-score is an essential component in Normal Curve, the numbers in the extreme left
standard normal distribution. This allows us to and across the top are z scores, which are the
describe a given set of data by finding the distances along the horizontal scale. The
z-scores. This leads us to a question of how numbers in the body of the table are areas or
z-scores are identified probabilities.
★ Given a normal random variable X with mean ★ Third, the z –scores to the left of the mean are
(µ) and standard deviation ( 𝜎), each value of x negative values.
of the variable can be transformed into ●Example 1
z-scores using the formula, ★ Having an obtained score of 85 in a recently
concluded unit test in Science, John wanted to
know how he fared in comparison with his
classmates. His teacher told him that he scored
bb at the 90th percentile. What is the
corresponding z –score of the 90th percentile?
●Steps ★ Procedure:
★ 1: Write the formula ■ STEP 1: Draw the appropriate normal
★ 2: Substitute the given values curve
★ 3: Perform the operation
★ 4: Write the corresponding z –score
●NOTE: answer must be in percent form and 2
decimal places. If the answer is 95.5 make it
95.50
6
STATISTICS & PROBABILITY
●Example 2 IDENTIFYING THE DIFFERENT RANDOM SAMPLING
★ A score in 96th percentile. Where is the score
TECHNIQUE
under the normal curve?
●Definition of terms
★ Procedure:
★ Population – the set of all possible values of a
■ STEP 1: Draw the appropriate normal
variable.
curve
★ Sample – It consists of one or more data drawn
■ STEP 2: Split 96% or 0.9600 into 0.5000
from the population.
+ 0.4600
★ Random Sampling – it is a sampling method of
■ STEP 3: Shade 0.5000 of the sketch of choosing representatives from the population
the normal curve in Step 1 wherein every sample has an equal chance of
■ STEP 4: Find the area of 0.4600 in the being selected. Accurate data can be collected
body of z –table. If it cannot be found in using random sampling techniques.
the table, get the area value nearest to it. ★ Probability Sampling – the sampling
■ STEP 5: Identify the corresponding techniques that involve random selection.
z–score of the found area. ★ Non–Probability Sampling – the sampling
★ It is between the values of 0.4599 techniques that do not involve random
bb ★ The corresponding z –score of 0.4599 selection of data.
is 1.75. ●Different types of RANDOM sampling
★ Therefore the z–score that ★ SIMPLE RANDOM SAMPLING – is the most
corresponds to 96th percentile on the basic random sampling wherein each element
normal curve is 1.75. in the population has an equal probability of
●Example 3 being selected.
★ Find the upper 10%of the normal curve.
★ Solution:
■ Express the given percentage as
probability 10% or 0.1000
■ Using the upper side of the mean, find
the remaining area. 0.5000 – 0.1000 =
0.4000
■ Find the area of 0.4000 in the body of
the z –table. The nearest value is 0.3997
★ Therefore the upper 10% is above z = ★ SYSTEMATIC RANDOM SAMPLING – this can
lc
1.28 be done by listing all the elements in the
●Example 4 population and selecting every kth element in
★ The results of a nationwide aptitude test in your population list.
mathematics are normally distributed with 𝝁=
80 and 𝝈=15. What is the percentile rank of a
score of 84?
★ Solution:
■ Convert raw score of 84 to z –score form
𝑥−µ 84 − 80
𝑧 = σ
= 15
= 0. 27
■ Find the area that corresponds to z =
0.27 = 0.1064 ★ STRATIFIED RANDOM SAMPLING – is a
random sampling wherein the population is
■ Get the total area below z = 0.27 0.5000
divided into different strata or divisions. The
+ 0.1064 = 0.6064
number of samples will be proportionately
★ The percentile rank of the score 84 in
picked in each stratum that is why all strata are
a test is 60.64%.
represented in the samples.
7
STATISTICS & PROBABILITY
★ SNOWBALL SAMPLING – defined as a non
probability sampling technique in which the
samples have traits that are rare to find. This is
a sampling technique, in which existing
subjects provide referrals to recruit samples
required for a research study.
ESTIMATION OF PARAMETERS
●Parameter
★ It is a descriptive population measure. It is a
measure of the characteristics of the entire
population (a mass of all the units under
consideration that share common
characteristics) based on all the elements
within that population.
■ ex. All people living in one city, all male
teenagers worldwide, all elements in a
shopping cart, and all students in a
classroom.
8
STATISTICS & PROBABILITY
●Statistic
★ It is a number that describes the sample. It can
be calculated and observed directly.
★ The statistics is a characteristic of a
population or sample group. You will get the
sample statistic when you collect the sample
and calculate the standard deviation and the
mean.
★ ex. Researchers interviewed 70% of Covid 19
survivors.
●Definition of Terms
★ Parameter
■ The measurement or quantity that
describes the population
★ Statistic
■ The measurement or quantity that
bb describes the sample
●Population Mean
●Population Variance
●Sample Variance