Stat Chapter 3
Stat Chapter 3
Objectives
𝑥𝑖 4 3 1 2
𝑤𝑖 4 4 3 2
𝑥 𝑖 𝑤𝑖 16 12 3 4
x w w1 x1 w2 x2 wn xn w x i i
𝟏𝟔+𝟏𝟐+𝟑+𝟒
w1 w2 wn w i
x̄ 1𝑛1 + x̄ 2𝑛2 + ⋯ + x̄ 𝑘 𝑛𝑘
observations taken together is given by
𝑛1 + 𝑛 2 + ⋯
X̄c =
+ 𝑛𝑘
Example: There are two classes, Class A and Class B. Class A has 30 students
with an average score of 70 on a test. Class B has 20 students with an average
score of 80. What is the combined average score for both classes?
Solution:
If a constant c is added to or subtracted from every value in the data set, the
mean increases or decreases by that constant:
If each value in the data set is multiplied by a constant k, the mean is also
multiplied by k: New Mean=k × Old Mean.
Question 1: If the mean of a data set is 50, what will the new mean be if a constant
value of 5 is added to every value in the data set?
Given mean = 50 and constant = 5; New mean = 50 + 5 = 55.
The mid range
The midrange (MR) is defined as the sum of the lowest and highest
values in the data set divided by 2.
MR = Lowest value + Highest value
2
Example: Find the midrange (MR) for the following data:
11, 13, 20, 30, 9, 4, 15
Solution: The lowest value is 4, and the highest value is 30, then
MR = 4 + 30 = 34 = 17
2 2
Note that, this measure (MR) is weak as a measure of central ten-
dency since it is depends only on two values among of all values in
the data set.
Mean for Grouped data
If data are given in the form of continuous frequency distribution,
the
sample𝑘mean can be computed as
= , 𝑥𝑖 𝑓𝑖 - is the product of mid-
σ𝑖=1 𝑥 𝑖 𝑓 𝑖 𝑥 1 𝑓1 +𝑥 2 𝑓2 + …
𝑘
σ𝑖=1+𝑥
�𝑖 𝑘 𝑓 𝑘 𝑓1 +𝑓2 + …+
x̄
𝑓𝑘
= freq. � point &
xi 61 63 65 67 69 71
x1 x2 x3 x4 x5 x6 x7
Median=374
Since there are 7 values in this data set, so the fourth
term a 7+ 1 = 4k in the ranked data is the median.
Therefore the median is
median = ( )th item= = 4th item = 374
Cont…
Median of Even Numbers
Step 1: Arrange the data either in ascending or in descending
order.
Step 2: If the number of observations (say n) are even, then
identify (n/2)th and [(n/2) + 1]th observations.
Step 3: The average of the above two observations (which
are identified in step 2) is the median of the given data.
Cont…
Example: Find the median for the data
set:
8, 12, 7, 17, 14, 45, 10, 13, 17, 13, 9, 11
x Solution:
x1 x x2 x 3
First,
x x x we
4 x x rank
5 x x6
the data in
7 8 9 10 11 12
7
increasing order:
8 9 10 11 12 13 13 14 17 17 45
Frequency 7 17 16 6 4 50
48 = 3(Mean – 12);
16 = Mean – 12;
Mean = 28.
B. Measures of dispersion
• An average can represent a series only as best as a
single figure can, but it certainly cannot reveal
the entire story of any phenomenon under study
• It shows the degree by which numerical data tend
to spread around an average value/mean .
• Averages do not tell anything about the scatterness
of observations within the distribution.
• In order to measure the degree of scatter, the
statistical device called measures of dispersion are
calculated.
03/08/2025 By: Menberu T. 23
Range = highest value – lowest value
It shows the difference b/n the highest value
and the lowest value, hence it is the weakest
measure of dispersion
Variance
First calculate the mean, then deduct the
mean from each value in the group square
the result and divide the result by the
number of values.
The variance is used as a measure of how far a
set of numbers are spread out from each other.
It describes how far the numbers lie from the
mean (expected value).
03/08/2025 By: Menberu T. 24
n 2
( x x)
i 1
i
Var ( x)
n
Standard deviation
The most reliable measurement of the degree
to which the data is spread around the mean
Putting the variance in square root
23, 36, 44, 45, 53, 55, 56, 56, 62, 70, since n is order, therefore
Median = 53 + 55/2 = 54
• where,
• Find the variance and the standard deviation for the following frequency dist
of a sample:
Class Frequency fm
5–9 2
10 – 14 4
15 – 19 7
20 – 24 3
25 – 29 1
30 – 34 3
Total 20
3.5 12.25
• Variance= = 1105/19=58.158
• Standard deviation=7.626
SD
CV
Mean
CV = (0.20/0.50)*100% = 40%
2. Covariance
Covariance between X and Y refers to a measure
of how much two variables change together.
Covariance indicates how two variables are
related.
A positive covariance means the variables are
positively related, while a negative covariance
means the variables are inversely related.
The formula for calculating covariance of sample
data is shown below.
03/08/2025 By: Menberu T. 34
Note: for population (N) and
for sample (n-1)
Often the numbers have no
meaning. Thus we focus on the
sign.
10 18 -4 6 -24 16 36
30 6 16 -6 -96 256 36
8 12 -6 0 0 36 0
16 15 2 3 6 4 9
6 9 -8 -3 24 64 9
Cov (X,Y)= ∑(xi- )(Yi- )/n= -90/5 = - ∑(xi- )2∑
18 = 376 (Yi- )2
= 90
r (x, y) = ∑(xi- )(Yi- )/∑(xi- )2∑ (Yi- )2 = -90/33, 840 =
-90/183 = -0.49
03/08/2025 By: Menberu T. 39
D. Shape of Frequency
Distribution
Skewness
It refers to symmetry or asymmetry of the
distribution.
A distribution is symmetric if its left half is a
mirror image of its right half.
The skewness value can be positive or
negative.
A symmetric distribution with a single peak and
a bell shape is known as a normal
distribution.
The
( x x)
i
fourth standardized
KU i 1 moment is
(n 1) S 4
defined as