0% found this document useful (0 votes)
9 views30 pages

Statistics 2

Uploaded by

zyah.laurence
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views30 pages

Statistics 2

Uploaded by

zyah.laurence
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 30

Fundamentals of Statistics

Dr. Srivaramangai
Measures of • An attempt to find single figure to describe
Central Tendency whole figures

Averages

Mathematical Averages of
• Individual Data Averages Position
• Discrete Series
• Continuous Series
Arithmeti Geometri Harmonic Percentile
Median Mode Deciles Quartiles
c Mean c Mean Mean s

Simple Weighted
Mean Mean
Geometric Mean
• is the average value or mean which signifies the central tendency of
the set of numbers by finding the product of their values
• The Geometric Mean (G.M) of a series containing n observations is
the nth root of the product of the values.
For example, consider the
given data set, 4, 10, 16, 24
Here n= 4
Therefore, the G.M = 4th root
of (4 ×10 ×16 × 24)
= 4th root of 15360
G.M = 11.13
Application of Geometric Mean

• It should be only applied to positive values and often used for the set
of numbers whose values are exponential in nature and whose values
are meant to be multiplied together.
• It is used in stock indexes. Because many of the value line indexes
which is used by financial departments use G.M.
• It is used to calculate the annual return on the portfolio.
• It is used in finance to find the average growth rates which are also
referred to the compounded annual growth rate.
• It is also used in studies like cell division and bacterial growth etc.
Examples
Question 2 : Find the geometric mean of the following data.
Question 3: Find the geometric mean of the following grouped data for the
frequency distribution of weights.
Harmonic Mean
• The Harmonic Mean (HM) is defined as the reciprocal of the average
of the reciprocals of the data values.
• It is based on all the observations, and it is rigidly defined.
• Harmonic mean gives less weightage to the large values and large
weightage to the small values to balance the values correctly.
• In general, the harmonic mean is used when there is a necessity to give
greater weight to the smaller items. It is applied in the case of times
and average rates.
Find the harmonic mean for data 2, 5, 7, and 9.
Solution:
Given data: 2, 5, 7, 9
Step 1: Finding the reciprocal of the values:
½ = 0.5
⅕ = 0.2
1/7 = 0.14
1/9 = 0.11
Step 2: Calculate the average of the reciprocal values obtained from step 1.
Here, the total number of data values is 4.
Average = (0.5 + 0.2 + 0.143 + 0.11)/4
Average = 0.953/4
Step 3: Finally, take the reciprocal of the average value obtained from step 2.
Harmonic Mean = 1/ Average
Harmonic Mean = 4/0.953
Harmonic Mean = 4.19
Hence, the harmonic mean for the data 2, 5, 7, 9 is 4.19.
Calculate the harmonic mean for the following data:

The formula for weighted harmonic mean is


HMw = N / [ (f1/x1) + (f2/x2) + (f3/x3)+
….(fn/xn) ]
HMw = 42 / 7.879
HMw = 5.331
Therefore, the harmonic mean, HMw is 5.331.
Average of Positions
• Defined by its position of which it represents the whole set of samples.
This is an effective measure for handling nominal kinds of data.
• While in the case of mathematical averages the calculated value might
not be in the series of the respective data set but in the case of
positional averages, the calculated average value must be a value that
lies within the set of observed data.
• The median is the middle value in a set of data that has been arranged
from smallest to largest. Half the values are smaller than or equal to
the median, and half the values are larger than or equal to the median.
Median Grouped Data
Mode
• The value of variable which occurs most frequently in data is called Mode. The concept
of mode is often used in determining sizes. As an example, the most common shoe size is
6 or the most common shirt size is 42.
• It is a very appropriate measure of central tendency for nominal data.
• Calculation of Mode - Ungrouped Data:
• In this case mode is obtained by inspection.
• The blood pressure of 9 patients is as follows: 86, 87, 80, 86, 76, 86, 90, 88, 86. Calculate its mode.
• The mode value is 86, as it occurs maximum times (i.e.4 times)
• In certain cases there may not be a mode or there may be more than one mode.
• Example :- a) 40, 44,57,78,84 (no mode)
• b) 3, 4, 5, 5, 4, 2, 1 (modes 4 and 5)
• c) 8, 8, 8, 8, 8 (no mode)
• A series of data, having one mode is called ‘unimodal’ and a series of data having
two modes is called ‘bimodal’. It may also have several modes and be called
‘multimodal’.
Calculation of Mode - Grouped Data:

In case of grouped data, modal class is determined by inspection or by preparing grouping and
analysis tables. Then we apply the following formula.
Examples
The mode of data set A = {14, 15, 16, 17, 15, 18, 15, 19}, for example, is 15 because just one value repeats itself. As a
result, it's a unimodal data set.

Bimodal: Set A = {2,2,2,3,4,4,5,5,5} has a mode of 2 and 5, because both 2 and 5 are repeated three times in the
provided set.

Because all four values in the given set recur twice, the mode of data set A = 100, 80, 80, 95, 95, 100, 90, 90,100,95
is 80, 90, 95 and 100. As a result, it's a multimodal dataset.
➢ If the ratio of the mode to the median is 2 : 3 for a dataset. Find the ratio of mode to mean.
The empirical relationship between mean mode and median is
Mode = 3 Median – 2 Mean.
Let the modal value be 2x, and the median be 3x, then
2x = 3.3x – 2 Mean
⇒ 2 Mean = 9x – 2x
⇒ Mean = 7x/2
∴ Mode : Mean = 2x/(7x/2) = 4/7 = 4 : 7.

From the given frequency distribution table there are two modes, 15 and 25.
Hence, the dataset is bimodal.
➢ For any given data, the mean is 45.5, and the median is 43. Find the modal value.

We know that,
Mode = 3 Median – 2 Mean
∴ Mode = 3 × 43 – 2 × 45.5
= 129 – 91 = 38.
Mode = 38.

➢ Find the mode of the following data distribution:

The modal class is 60-70. Now,


l = 60
fk = 18
fk – 1 = 14
fk + 1 = 10
h = 70 – 60 = 10

= 60 + [(18 – 14)/(2 × 18 – 14 – 10)] × 10


= 60 + [4/12] × 10
= 60 + 10/3
= 63.333 (approx).
Percentiles
• are the values which divide the arranged data into hundred equal parts. There are 99 percentiles i.e.
P1, P2, P3, ……..,P99.
• The 50th percentile divides the series into two equal parts and P50 = D5 = Median.
The scores for some candidates in a test are 40, 45, 49, 53, 61, 65, 71, 79, 85, 91. What will
be the percentile for the score 71? Use the percentile formula.
The scores for some candidates in a test are 40, 45, 49, 53, 61, 65, 71, 79, 85, 91. What will
be the score with a percentile value of 90?

It is obvious that we need to find the score “x” with percentile 90 for the given data set.
Thus, No. of. Scores below the value “x” i.e. n =?
Total no. of. Scores i.e. N = 10
Given percentile value, P = 90
The formula for percentile is given as below:
Deciles
• Deciles are the partition values which divide the arranged data into ten equal parts. There are nine deciles i.e.
D1, D2, D3…….. D9 and 5th decile is same as median or Q2, because it divides the data in two equal parts.

Find the D6 for the following data

11, 25, 20, 15, 24, 28, 19, 21

x(n+1)10
Quartile
• There are three quartiles, i.e. Q1, Q2 and Q3 which divide the total data into four equal parts when
it has been orderly arranged.
• Q1, Q2 and Q3 are termed as first quartile, second quartile and third quartile or lower quartile,
middle quartile and upper quartile, respectively
1.First Quartile(Q1)=((n+1)/4)th Term also known as the lower quartile.
2.The second quartile or the 50th percentile or the Median is given as: Second
Quartile(Q2)=((n+1)/2)th Term
3.The third Quartile of the 75th Percentile (Q3) is given as: Third
Quartile(Q3)=(3(n+1)/4)th Term also known as the upper quartile.
4. The interquartile range is calculated as: Upper Quartile – Lower Quartile.
Find the quartiles of the following data: 4, 6, 7, 8, Find the Quartiles of the following age:-
10, 23, 34. 23, 13, 37, 16, 26, 35, 26, 35
First, we need to arrange the numbers in increasing order.
Therefore, 13, 16, 23, 26, 26, 35, 35, 37
Number of items, n = 8
Here the numbers are arranged in the ascending Lower quartile, Q1 = [(n+1)/4] th item
order and number of items, n = 7 Q1 = 8+1/4 = 9/4 = 2.25th term
Lower quartile, Q1 = [(n+1)/4] th item From the quartile formula we can write;
Q1= 7+1/4 = 2nd item = 6 Q1 = 2nd term + 0.25(3rd term-2nd term)
Median, Q2 = [(n+1)/2]th item Q1= 16+0.25(23-26) = 15.25
Q2= 7+1/2 item = 4th item = 8 Similarly,
Upper Quartile, Q3 = [3(n+1)/4]th item Median, Q2 = [(n+1)/2]th item
Q3 = 3(7+1)/4 item = 6th item = 23 Q2 = 8+1/2 = 9/2 = 4.5
Q2 = 4th term+0.5 (5th term-4th term)
Q2= 26+0.5(26-26) = 26
And,
Upper Quartile, Q3 = [3(n+1)/4]th item
Q3 = 3(8+1)/4 = 6.75th term
Q3 = 6th term + 0.75(7th term-6th term)
Q3 = 35+0.75(35-35) = 35

You might also like