0% found this document useful (0 votes)
180 views4 pages

Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation

This document provides information on various statistical concepts including frequency distribution tables, measures of central tendency (mean, median, mode), measures of dispersion (range, variance, standard deviation), measures of relative position (z-scores, percentiles, quartiles), the normal distribution, and linear regression. It includes definitions, formulas, and examples for calculating each of these statistical metrics.

Uploaded by

Jmazing
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
180 views4 pages

Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation

This document provides information on various statistical concepts including frequency distribution tables, measures of central tendency (mean, median, mode), measures of dispersion (range, variance, standard deviation), measures of relative position (z-scores, percentiles, quartiles), the normal distribution, and linear regression. It includes definitions, formulas, and examples for calculating each of these statistical metrics.

Uploaded by

Jmazing
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

FREQUENCY DISTRIBUTION TABLE

A frequency distribution is often used to organize raw data. It is a table that lists observed events and
frequency of occurrence of each observed event.

Illustration: Consider the table which lists the number of laptop computers owned by families in each of
40 homes in a subdivision.
2 0 3 1 2 1 0 4 2 1 1 7 2 0 1 1 0 2 2 1
3 2 2 1 1 4 2 5 2 3 1 2 2 1 2 1 5 0 2 5

Question: How to find the mean of this frequency distribution table?


Recall:
Formula for the weighted mean:
∑ 𝑤𝑥
𝑊𝑒𝑖𝑔ℎ𝑡𝑒𝑑 𝑚𝑒𝑎𝑛 = ∑𝑤
.
We can use the formula of the weighted mean to find the mean of the frequency distribution, the only
change is that the weights, 𝑤1 , 𝑤2, 𝑤3 , … , 𝑤𝑛 will be 𝑓1 , 𝑓2, 𝑓3 , … , 𝑓𝑛 so we have
∑ 𝑓𝑥
𝑚𝑒𝑎𝑛𝑓𝑑 =
∑𝑓
.
Example: Find the mean of the previous distribution table.
Example: Find the mean, the median, and all the modes for the data in the given frequency distribution.

Age of Science fair contestants


Age Frequency
7 3
8 4
9 6
10 15
11 11
12 7
13 1

Measure of Dispersion: Range, Variance, Standard Deviation


Consider the following situation:
Suppose we have two soft-drink dispensing machines that should produce 8 oz. of soft-drink
cup.
Soda dispensed (ounces)
Machine 1 Machine 2
9.52 8.01
6.41 7.99
10.07 7.95
5.85 8.03
8.15 8.02
The range of a set of data values is the difference between the highest data value and the lowest data
value.
𝑅𝑎𝑛𝑔𝑒 = 𝑥ℎ𝑖𝑔ℎ𝑒𝑠𝑡 − 𝑥𝑙𝑜𝑤𝑒𝑠𝑡

Example: Find the range of the numbers of ounces dispensed by Machine 1 and Machine 2.

Another way to measure the dispersion of a data set is called the variance.
Population Variance:
𝒙−𝛍
𝝈𝟐 =
𝑵

N= number of data values


𝛍 = 𝐩𝐨𝐩𝐮𝐥𝐚𝐭𝐢𝐨𝐧 𝐦𝐞𝐚𝐧

Sample Variance:
𝒙 − 𝐱̄
𝒔𝟐 =
𝒏−𝟏
N= number of data values
𝐱̄ = 𝐩𝐨𝐩𝐮𝐥𝐚𝐭𝐢𝐨𝐧 𝐦𝐞𝐚𝐧

Procedure for computing the Variance:


1. Determine the mean of the data set
2. For each number, calculate the deviation (difference) between the number and the mean of the
data set.
3. Calculate the square of each of the deviations and find the sum of these squared deviations.
4. If the data came from a population, divide the sum by n, which is the number of the data values.
If the data came from a sample then divide the sum by n-1.
Example:
1. The following numbers were obtained by sampling a population 2, 4, 7, 12, 15. Find the variance
of the sample.
2. A group of mountaineers went on hiking to Mt. Pulag to study the different species of plants
existing in that area. The ages of the mountaineers are 34, 35, 45, 46, 49, 32. What is the
variance of their ages?

Another way to measure the spread of a data set is the standard deviation.
Standard deviation is a measure of the dispersion of a data set from its mean. A large standard
deviation means that the data values are far from the mean and small standard deviation means that
they are closer to the mean.
Population Standard Deviation: Sample Standard deviation:
𝒙−𝛍 𝒙−𝐱̄
𝝈 = √𝝈𝟐 = √ s= √𝒔𝟐 = √
𝑵 𝒏−𝟏

Example: Find the standard deviation of the two example from the Variance.
Measures of Relative Position:
z-scores:
Standard score- derived value that expresses how far a given raw data is from some reference
data point, usually the mean, in terms of standard deviation unit.
Example: z-score, t-score, stanines

The z-score for a given data value, x, is the number of standard deviations that x is above or below the
mean of the data set.
Illustration: Consider data values 6 and 20 in a data set with 12 to be its mean and 4 to be its standard
deviation. Get the z-scores of 6 and 20.

kth Percentile:
A value x is called the kth percentile of a data set such that k% of the are lesser or smaller than x
and (100-k)% of the data are greater or larger than x.
e.g Jay’s score in a Math exam is 65 with 80th percentile rank.

To find the kth percentile,


𝑘(𝑛 + 1)
𝑃𝑘 = 𝑡ℎ 𝑖𝑡𝑒𝑚
100

n- number of data values


Quartile:
The three numbers 𝑄1 , 𝑄2 , and 𝑄3 that partition a ranked data set into four (approximately)
equal groups are called the quartiles o the data set.
To find the kth quartile,
𝑘(𝑛 + 1)
𝑄𝑘 = 𝑡ℎ 𝑖𝑡𝑒𝑚
4

Steps to find the kth percentile or kth quartile:


1. Arrange the data set from lowest to highest.
2. Count the number of values.
3. Substitute k and n in the formula
4. Find the item in the ordered data set.
Example: Consider a data set: 188, 168, 174, 198, 186, 170, 180, 182, 186, 176. Find the 18 th and 80th
percentile. Also, find 𝑄1 , 𝑄2 , and 𝑄3 .

NORMAL DISTRIBUTION
- a bell-shaped curved called normal curve that is symmetric about a vertical line through the
mean of the data set. The normal curve will cover up 3 standard deviations on the right and 3
standard deviations on the left.
The Normal curve has the following characteristics:
1. The curve is symmetric.
2. The value of the mean, median, and mode are the same.
3. The tails are asymptotic to the horizontal line and they extend to infinity.
4. The area under the curve is 1 or the probability under the curve is 100%.
Empirical rule:
1. Almost 99.7% of the data will fall within 3 standard deviations of the mean.
2. 95% of the data will fall within 2 standard deviations of the mean.
3. 68% of the data will fall within 1 standard deviations of the mean.
The Standard Normal Distribution
- a normal distribution that has a mean of 0 and a standard deviation of 1.

Area under the Standard Normal Curve:


Find the area under the curve at the ff:
a. P(z < -1.32)
b. P(z > -1.32)
c. P(1.20 < z < 2.24)

Regression Equation: 𝒚 = 𝒂 + 𝒃𝒙
- Used to predict the behavior of a variable
- Explains the amount of variations observable in the independent variable x.
Given n number of data pairs (x,y), to find a and b, we have
(∑ 𝑦)(∑ 𝑥 2 ) − (∑ 𝑥)(∑ 𝑥𝑦)
𝑎=
𝑛(∑ 𝑥 2 ) − (∑ 𝑥)2
𝑛(∑ 𝑥𝑦) − (∑ 𝑥)(∑ 𝑦)
𝑏=
𝑛(∑ 𝑥 2 ) − (∑ 𝑥)2

Example: Find the regression equation of the the following ordered pairs: (1,25), (2,30), (3,32), (4,45),
(5,50).

You might also like