Exercises On Introduction To Ststistics
Exercises On Introduction To Ststistics
SIMPLIFIED
A Quantitative Methods practice book for BBA II Semester
Students of Bangalore University
Compiled by
Lenin Arumanayagam,
Contains Concepts, Formulae, Exercises and Assignments Freelance Faculty
Table of Contents
3 Diagrammatic Representation 5
5 Measures of Variation 17
6 Measures of Skewness 25
8 Index Numbers 35
9 Formulae 39
10 Assignments 43
Characteristics of Statistics
1. Statistics are numerical facts
2. Statistics are aggregate of facts
3. Statistics are affected to a great extent by multiplicity of factors
4. Statistics are either enumerated or estimated with reasonable standard of accuracy
5. Statistics are collected in a systematic manner and for a predetermined purpose
6. Statistics should be capable of being placed in relation to each other
Functions of Statistics
1. Presents facts in simple forms
2. Reduces the complexity of data
3. Facilitates comparison
4. Testing hypothesis
5. Formulation of policies
6. Forecasting and estimating
7. Derives valid inferences
Limitations of Statistics
1. Statistics does not study the qualitative phenomenon
2. Statistics does not study the individual changes
3. Statistics results are true only in general and on an average
4. Statistics can be misused by ignorant and wrongly motivated persons
5. Statistics does not reveal the entire story
6. Statistics is liable to be misused
Scope of Statistics
Statistics and Planning; Statistics and Business; Statistics and Economics; Statistics and Administration;
Statistics and Business Management; Statistics and Research; Statistics and Mathematics; Statistics and
Science. Scope of Statistics in Business: Marketing; Production; Finance; Banking; Investment; Purchase;
Accounting; Control
Data Collection
Data: Facts and figures collected for a specific purpose, processed and used to help decision-making.
Census: The method of collection of data in which every unit of the population is included. This method is
accurate and reliable but expensive, time consuming and involves much labor.
Sample: A sample is a group of units selected from a larger group (the population) for specific investigation.
Primary Data: Data originally collected for the first time directly from the source using surveys are called
primary data. It may be obtained through direct observation, interviews, questionnaires, etc.
Secondary Data: Data already collected by someone other than the user are called secondary data. They may
be obtained from newspapers, agencies, journals, records, reports, etc.
Classification of data is a process of arranging data into sequences and groups or classes according to their
attributes and or characteristics. It refers to the sorting out of a heterogeneous mass data into a number of
homogeneous groups and sub-groups.
Tabulation is defined as the orderly or systematic presentation of numerical data in rows and columns,
designed to facilitate comparison between the figures.
Parts of a Table: Table number, Title, Titles of rows, columns, sub-rows and sub-columns, Totals, Footnotes
and Source.
Types of Classification
Qualitative Classification: a classification in which data are classified according to attributes or qualities.
Generally the qualitative phenomena are not measurable. E.g. Classification based on marital status, gender
etc.
Quantitative Classification: A classification in which data are classified according to quantities that are
measurable such as age, weights, marks, wages, etc.
Two-Way Table: A table which is used to categorize the data based on two or more attributes.
Exercise 2.1
1. Draw a blank table to present the following information regarding the students of a college according to:
2. The total number of accidents in Southern Railway in 1960 was 3,500 and it decreased by 300 in 1961 and
by 700 in 1962. The total number of accidents in meter gauge section showed a progressive increase from
1960 to 1962. It was 245 in 1960, 346 in 1961 and 428 in 1962. In the meter gauge section, the number of
non-compensated cases were 49 in 1960, 77 in 1961, and 108 in 1962. The number of compensated cases
in the broad gauge section were 2,867, 2,687 and 2,152 in those years in order. Tabulate the data.
3. Present the following information in a suitable form supplying the figures not directly given:
In 1975, out of a total of 4,000 workers in a factory, 3,300 were members of a trade union. The number of
women workers employed was 500 out of which 400 did not belong to any union. In 1974, the number of
workers in the union was 3,450 of which 3,200 were men. The number of non-union workers was 760 of
which 330 were women.
4. Following data gives the number of children in 50 families. Construct a suitable frequency table.
4 2 0 2 3 2 2 1 0 2 3 5
1 1 4 2 1 3 4 2 6 1 2 2
2 1 3 4 1 0 2 4 3 0 1 3
6 1 0 1 1 3 4 1 0 1 2 2
2 5 (Answer: 6, 13, 14, 7, 6, 2, 2)
5. Following are the weights of 50 college students in kg. Construct a frequency table.
42 42 46 54 41 37 54 44 38 45 47 50
58 49 51 42 46 37 42 39 54 39 51 58
47 51 43 48 49 48 49 41 41 40 58 49
49 59 57 52 56 38 45 52 46 40 51 41
6. Following are figures of income (x) and percentage expenditure on food (y) in 25 families. Construct a
bivariate (two-way) frequency table.
X 550 623 310 420 600 225 310 640 512 690
Y 12 14 18 16 15 25 26 20 18 12
X 680 300 425 555 325 202 255 492 587 643
Y 13 25 16 51 23 29 27 18 21 19
X 689 523 317 384 400
Y 11 12 18 17 19
Line Diagrams: These diagrams are used when there is a large number of values of variable with variations in
their values within a small range
Simple Bar Diagram: These diagrams are suitable for individual observations and time series. The bars have
the uniform width.
Multiple Bar Diagram: These diagrams are used when two or more phenomena and a number of attributes
are compared with each other. Different shades may be used to identify the various attributes or periods.
Sub-divided (Component) Bar Diagrams: These diagrams are used when two or more components are present
in a single phenomenon
Sub-divided Percentage Bar Diagrams: These are sub-divided diagrams which are used to depict the values of
variable in percentage. All the bars are equal in height representing the value as 100%.
Pie Charts (Circular Diagram): This is a pictorial representation of statistical data with several components in a
circular form. Pie charts consist of a circle sub-divided into several sectors by radius.
Pictograms: It is a representation in which pictures are used to represent the data. Each full diagram
represents a certain quantity.
Histograms: Histogram is a device of graphical representation of a frequency distribution. It is constructed by
erecting a set of rectangles on each interval on the horizontal axis. The height of the rectangle represents the
frequency of the class interval.
Frequency Polygon: A line graph connecting the midpoints of each class in a data set, plotted at the height
corresponding to the frequency of the class. It can also be drawn by joining the midpoints of the top of the
vertical bars of a histogram.
Frequency Curve: A frequency polygon with smoothed curve to eliminate the accidental irregularities in the
data.
Ogive Curve: This is a graphical representation of cumulative frequency distribution of a continuous series.
There are two types of Ogive Curves: 1. More than Ogive and 2. Less-than Ogive
Exercise 3.1
1. Draw a simple bar diagram from the following data relating to the number of small scale industrial units in
various states in the year 2008
No. of SS Units 10 12 15 15 18 25 22
2. Present the following data of results of BBM students in statistics examination of Bangalore University
held in June 2006, 2007 and 2008 using multiple bar diagram:
3. Represent the following data using sub-divided bar diagram and percentage sub-divided bar diagram:
5. Draw a pie diagram to represent the following data of investment pattern in the state budget (in ₹ crore):
Arithmetic Mean
A mathematical representation of the typical value of a series of numbers, computed as the sum of all the
numbers divided by the count of all numbers in the series.
Formulae
Σx Σd
̅ =
Individual Series: X ̅ =A+
;X
n n
Σfx Σfd
̅ =
Discrete Series: X ̅ =A+
;X
N N
Σxw
Weighted Arithmetic Mean: ̅
X =
Σw
n1 x̅1 + n2 x̅2
̅ (1,2) =
Combined Arithmetic Mean: X
n1 + n2
Exercise 4.1
1. Find the AM of 5, 8, 10, 15, 24 and 28 (Answer: 15)
2. The wages of 9 workers are: 150, 80, 120, 60, 75, 125, 95, 115, 130. Find the mean wages. (Answer: 105.5)
3. In the city, 30 members were surveyed as to how many domestic appliances they had purchased and the
replies were as under. Prepare a frequency table and find the mean. (Answer: 2.83)
1, 2, 5, 1, 2, 1, 4, 2, 3, 4, 2, 4, 3, 2, 6, 3, 2, 4, 3, 6, 2, 2, 3, 3, 7, 2, 3, 0, 2, 1
4. Find the mean runs scored by a batsman during his career using direct method and shortcut method
(Answer: 46):
x 10 20 30 40 50 60 70 80 90
f 7 18 15 25 30 20 16 7 2
5. Compute the mean of the following data using direct method and shortcut method (Answer: 13.54):
x 9 10 11 12 13 14 15 16 17 18
f 1 2 3 6 10 11 7 3 2 1
6. Calculate the mean from the following data using direct, shortcut and step-deviation methods (Answer:
36.36):
CI 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80
f 5 10 25 30 20 10 5 5
7. Calculate the mean wages from the following data (Answer: 73.44):
Wages 48 – 56 56 – 64 64 – 72 72 – 80 80 – 88 88 – 96 96 – 104
No. of Workers 8 3 11 14 5 7 2
8. Calculate the mean from the following data using direct, shortcut and step-deviation methods (Answer:
49.3):
CI 10 – 19 20 – 29 30 – 39 40 – 49 50 – 59 60 – 69 70 – 79 80 – 89
f 5 9 14 20 25 15 8 4
9. Calculate the mean marks from the following data (Answer : 43.7):
10. Calculate the mean sales from the following data (Answer: 28.73):
Frequency 4 20 35 55 62 67
11. A college wanted to give monthly scholarship to B.Com students securing 50% and above marks in the
following manner:
Percentage of Marks 50 – 55 55 – 60 60 – 65 65 – 70 70 – 75
Scholarship (₹) 25 30 35 40 45
The percentage of marks of 20 students who were eligible for scholarship are given below:
52, 62, 51, 71, 54, 53, 51, 50, 57, 64, 56, 54, 69, 63, 65, 59, 58, 68, 57, 62
Calculate the average monthly scholarship payable to the students. (Answer: 31.5)
12. A limited company wants to pay bonus to the members of its staff as under:
Salary (₹ ‘000) 100 - 120 120 – 140 140 – 160 160 – 180 180 – 200 200 - 220 Above 220
Actual salaries of the members of the staff are as follows, in rupees: 200, 180, 185, 195, 218, 187, 160,
250, 198, 190, 168, 170, 178, 175, 140, 120, 148, 165, 155, 145, 125, 110, 162, 130, 150
What is the total bonus paid? What is the average bonus paid per staff? (Answer: 78.4)
13. From the following data of calculation of AM, find the missing value. Mean value is 126.3 (Answer: 120):
f 5 8 12 22 10 7 6
14. The AM of the following frequency distribution is 67.45 inches. Find the missing frequency. (Answer: 126):
Height (Inches) 60 – 62 63 – 65 66 – 68 69 – 71 72 - 74
F 15 54 ? 81 24
15. The mean of the following data is 67.45. Find the missing frequencies (Answer: 42, 27).
CI 60 – 62 63 – 65 66 – 68 69 – 71 72 - 74 Total
F 5 18 - - 8 100
16. The mean of the following data is 25. Find the missing frequencies (Answer: 10, 10).
x 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 Total
f 5 - 15 - 5 45
17. Find the weighted arithmetic average price of coal purchased by an industry (Answer: 50.36):
Price per ton (₹) 42.50 51.25 50.00 52.00 44.25 54.00
No. of tons 25 30 40 50 10 45
18. The mean weight of 25 male workers in a factory is 63 kg, and the mean weight of 35 female workers in
the same factory is 55 kg. Find the combined average weight of the 60 workers in the factory. (Answer:
58.33)
19. The arithmetic mean of a group of 80 boys is 10 years, and that of second group of 20 boys is 15 years.
Find the arithmetic mean of the two groups taken together. (Answer: 11)
Median
Median is the middle value of the distribution, and therefore it is called the positional average. So, the place of
median in a series is such that, an equal number of items lie on either side of it.
Merits of Median
1. Median is especially useful in case of open ended classes, since it is not necessary that the value of all
items be known.
2. Median is not influenced by extreme values.
3. In a markedly skewed distribution, median is especially useful.
4. The value of median can be determined graphically, whereas the value of mean cannot be graphically
ascertained.
Formulae
n th n th
(n+1) th ( 2 ) term + (2 +1) term
Individual Series: M = [ ] term when n is odd and M = [ ] when n is even.
2 2
(n+1) th
Discrete Series: M = [ ] term
2
N
− c.f.
Continuous Series: M = L + 2
xi
f
Exercise 4.2
1. Find the median: 43, 62, 15, 80, 56, 72, 34, 8, 25 (Answer: 43)
2. The wages of 9 workers are: 150, 80, 120, 60, 75, 125, 95, 115, 130. Find the median. (Answer: 115)
3. Find the median: 36, 5, 19, 26, 6, 28, 56, 18, 63, 4 (Answer: 22.5)
4. Find the median: 105, 89, 93, 142, 112, 136, 82, 97, 128, 135, 110, 104 (Answer: 107.5)
5. In a class of 15 students, 5 failed in a test. The marks of those who passed were, 9, 6, 7, 8, 9, 6, 5, 4, 7 and
8. Calculate the median marks of the 15 students.
x 10 20 30 40 50 60 70 80 90 100
f 10 16 18 13 6 3 8 4 6 8
Wages 5 10 15 20 25 30
Frequency 7 12 37 25 22 11
CI 10 – 19 20 – 29 30 – 39 40 – 49 50 – 59 60 – 69 70 – 79 80 – 89
f 5 9 14 20 25 15 8 4
CI 1 – 10 11 – 20 21 – 30 31 – 40 41 – 50 51 – 60 61 – 70 71 - 80 81 - 90
f 3 7 13 19 14 11 9 9 5
Values above 10 20 30 40 50 60
No. of students 50 40 25 16 10 2
15. In a group of 1000 wage earners, the monthly wages of 4% are below ₹60 and those of 15% are under
₹62.50. 15% earned ₹95 and over, and 5% got ₹100 and over. Find the median wage (Answer: 78.75).
16. 10% of the workers in a factory employing a total of 1000 workers, earn between ₹5 and 9.99, 30%
between ₹10 and ₹14.99, 250 workers between ₹15 and 19.99 and the rest ₹20 and above. What is the
median wage? (Answer: 17)
17. Compute the median after amending the table (Answer: 14):
X f x f
Less than 5 7 20 – 25 20
5 – 15 38 30 and above 1
15 and above 35
Mid values 115 125 135 145 155 165 175 185 195
Frequencies 6 25 48 72 116 60 38 22 3
Frequencies 12 23 40 65 17 3
Wages 0 – 20 20 – 40 40 – 60 60 – 80 80 – 100
No. of Students 5 13 24 39 52 60
22. Marks of 100 students are given below. If median is 33, find the missing frequencies. (Answer: 17, 16)
Marks 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
No. of Students 12 15 - 20 - 10 10
23. Find the missing frequencies if the value of median is 36.5 and N = 120. (Answer: 30, 11)
Class Interval 20 – 25 25 – 30 30 – 35 35 – 40 40 – 45 45 – 50 50 – 55 55 – 60
Frequencies 8 15 28 - 22 - 4 2
Mode
According to A M Tuttle, “Mode is the value which has the greatest frequency in the neighborhood.” Just as
median, mode too is a positional average. So, the most frequent or the item which is repeated maximum
times in the series is the mode of the series.
Merits of Mode
1. Mode is not affected by extremely large or small items.
2. Mode can be determined in open-ended classes without assuming the class limits.
3. The value of mode can be determined graphically, whereas, the value of mean cannot be ascertained.
Formulae
Individual Series: The variable that occurs most frequently.
Discrete Series: The value which has the greatest frequency in the neighborhood.
∆1
Continuous Series: Z or M0 = L + x i; ∆1 = |f1 – f0| and ∆2 = |f1 – f2|
∆1 + ∆2
Exercise 4.3
1. Find the mode: 3, 5, 7, 5, 9, 7, 5, 7, 6, 3, 9, 5, 6, 6, 3
2. Find the mode: 54, 66, 42, 64, 44, 86, 104, 94, 100, 80, 72, 64, 64, 44, 64, 72, 54, 54, 48, 52, 50
3. Find the mode: 122, 234, 638, 420, 512, 234, 270, 420, 900, 195, 360
x 1 2 3 4 5 6
f 2 8 11 18 9 7
x 8 16 24 32 40 48
f 2 4 20 19 10 5
x 2 4 6 8 10 12 14
f 6 8 16 16 12 6 4
Wages 48 – 56 56 – 64 64 – 72 72 – 80 80 – 88 88 – 96 96 – 104
No. of Workers 8 3 11 14 5 7 2
CI 10 – 19 20 – 29 30 – 39 40 – 49 50 – 59 60 – 69 70 – 79 80 – 89
f 5 9 14 20 25 15 8 4
Attendance below 5 10 15 20 25 30 35 40 45
No. of students 29 224 465 582 634 644 650 653 655
10. Twenty percent of the workers in a firm employing a total of 2000 workers earn less than ₹2.00 per hour,
440 earn from ₹2.00 to ₹2.24 per hour, 24% earn from ₹2.25 to ₹2.49 per hour, 370 earn from ₹2.50 to
₹2.74 per hour, 12% earn from ₹2.75 to ₹2.99 per hour and the rest ₹3.00 or more per hour. Set up a
frequency table and calculate the modal wage. (Answer: 2.3117)
CI 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80
f 4 12 24 32 32 16 8 2
f 7 9 10 6 13 10 13 10
15. Locate the mode using Histogram, Frequency polygon and smoothed frequency curve (Answer:50.71):
Class Interval 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
16. Locate the mode using Histogram, Frequency polygon and smoothed frequency curve (Answer: 24.44):
Class Interval 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60
Frequencies 14 23 35 20 8 5
17. Find the mean, median and mode (Answer: Mean = 151.29, Median = 149.6, Mode = 146.211):
Mid values 115 125 135 145 155 165 175 185 195
90, 78, 86, 51, 96, 104, 51, 78, 50, 72, 49, 77, 90, 74, 69, 70, 68, 69, 104, 80, 79, 54, 79, 73, 58, 91, 78, 67,
50, 84, 76, 110, 53, 74, 40, 60, 42, 82, 41, 76, 84, 76, 42, 65, 60, 77, 61, 75, 115, 81
b. ̅
X = 12, Z = 13, M = ?
2. If the two sets of data are expressed in different units, such as quintals of sugar versus tons of
sugarcane, or if the average size is very different, such as the manager’s salary versus worker’s wages,
relative measures should be used. Relative measures of dispersion are also called a coefficient of
dispersion.
Range
Definition
Range is defined as “The difference between the two extreme items of the distribution” or the difference
between the largest and smallest items of the distribution.
Merits of Range
1. Range is simple to understand
2. It is easy to calculate
3. It gives a quick rather than an accurate picture of variability.
Limitations of Range
Uses of Range
1. It is useful in studying the variations in the prices of shares and stock, gold, jewelry etc.
2. In weather forecasts, range is used to determine the difference between the maximum and minimum
temperature.
3. It is used in industries for statistical quality control.
Formulae
Range: L – S (Where L = Largest variable and S = Smallest variable)
L−S
Coefficient of Range: Individual & Discrete Series Continuous Series
L+S
N
Interquartile Range: IQR = Q3 – Q1 (n + 1) th − c. f.
Q1 [
4
] term L+ 4 xi
Q3 − Q1 f
Quartile Deviation: QD =
2 3N
3(n + 1) th − c. f.
Q3 − Q1 Q3 [ ] term L+ 4 xi
Quartile Deviation: CQD = 4 f
Q3 + Q1
Exercise 5.1
1. Compute the range and coefficient of range of the following series and state which is more dispersed.
2. Find the range and coefficient of range of the following distribution (36, 0.75):
x 6 12 18 24 30 36 42
f 7 18 15 25 30 20 16
3. Compute range and coefficient of range of the following series (Answer: 80, 1):
CI 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80
F 5 10 25 30 20 10 5 5
4. From the following data, calculate the Quartile Deviation and its Coefficient (Answer: 19.75, 0.339)
30, 43, 48, 89, 54, 25, 84, 61, 67, 37, 72, 80
5. Calculate the Quartile Deviation and its Coefficient from the following data (Answer: 1.5, 0.0244):
X 58 59 60 61 62 63 64 65 66
F 15 20 32 35 33 22 20 10 8
6. Compute the Quartile Deviation and its Coefficient from the following data (Answer: 5, 0.25):
Wages 5 10 15 20 25 30
Frequency 7 12 37 25 22 11
7. Calculate the Quartile Deviation and its Coefficient from the following data (Answer: 5.208, 0.2643):
8. Calculate the Quartile Deviation and its Coefficient from the following data (Answer: 2.273, 0.2039):
CI 5–7 8 – 10 11 – 13 14 – 16 17 – 19
f 14 24 38 20 4
Mean Deviation
Meaning
It is the average difference between the items in a distribution and the mean of that series.
4. Since deviations are taken from a central value, comparison about the formation of different
distributions can easily be made.
Formulae
MD MD MD MD MD MD
Coefficient of MD 𝑜𝑟 𝑜𝑟 𝑜𝑟
x̅ M x̅ M x̅ M
Exercise 5.2
1. Calculate mean deviation & Coefficient of mean deviation using mean and median (Answer: 0.1193):
2. Calculate mean deviation & Coefficient of mean deviation using mean and median (Answer: 0.38, 0.43):
90, 280, 65, 60, 50, 120, 100, 110, 70, 80, 75
3. Compute the mean deviation and its coefficient using mean and median (Answer: 7.66, 7.6 & 0.38, 0.38):
x 5 10 15 20 25 30 35 40
f 16 32 36 44 28 18 12 14
4. Compute the mean deviation and its coefficient using mean and median (Answer: 1.53, 1.49 & 0.407,
0.372):
No of Home
0 1 2 3 4 5 6 7
Appliances
No. of Families 14 21 25 43 51 40 39 12
5. Compute the mean deviation and its coefficient using mean and median (Answer: 11.33 & 0.252):
Marks 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80
No. of Students 4 6 10 20 10 6 4
6. Compute the mean deviation and its coefficient using mean and median (Answer: 7.6, 7.296 & 0.196,
0.194):
Mid Values 22.5 27.5 32.5 37.5 42.5 47.5 52.5 57.5 62.5
Frequency 6 12 17 28 12 10 8 5 2
7. Compute the mean deviation and its coefficient (Answer: 40.417 & 0.425):
No. of Workers 4 10 20 40 50 56 60
Standard Deviation
Standard Deviation is the square root of the means of the squared deviations from the arithmetic mean. SD is
also known as Root Mean Square Deviation for this reason. It is the most widely used measure of variation.
Formulae
Ʃd2 Ʃfd2
Direct Method σ=√ d = x − x̅ σ=√ d = x − x̅ or m − x̅
n N
Exercise 5.3
1. Calculate the standard deviation of the marks of 11 students (Answer: 60.49):
90, 280, 65, 60, 50, 120, 100, 110, 70, 80, 75
2. Calculate the SD and Coefficient of Variation using direct method and shortcut method (Answer: 23.066 &
59.91%):
3. Following are the runs scored by two batsmen X and Y in ten innings. Find who is a better scorer and who
is more consistent (Answer: CV(X) = 84.072%; CV(Y) = 82.707%):
X 100 22 0 36 82 45 7 13 65 14
Y 97 12 40 96 13 8 85 8 56 16
x 10 20 30 40 50 60
f 8 12 20 10 7 3
5. The following table gives the age distribution of boys and girls in a high school. Find which of the two
groups is more variable in age. (Answer: CV(boys) = 7.85%; CV(girls) = 7.34%)
Age 13 14 15 16 17
No. of boys 12 15 15 5 3
No. of girls 13 10 12 2 1
6. The goals scored by teams A and B in a few football matches are as follows. Which team is more
consistent? (Answer: CV(A) = 124.94%; CV(B) = 108.97%)
Goals 0 1 2 3 4
Marks 20 – 29 30 – 39 40 – 49 50 – 59 60 – 69 70 – 79 80 – 89 90 - 99
No. of Students 5 12 15 20 18 10 6 4
8. Compute the coefficient of variation from the following data (Answer: 152.77%):
No. of shops 4 10 22 28 38 56 40 24 18 10
9. Find which class is more consistent in scoring marks, from the following table (Answer: 24.99 & 23.53):
Marks 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
Class A 7 10 20 18 7
Class B 5 9 21 15 6
10. Following data relates to the wages of workers in factories A and B. which factory wages are more
variable (Answer: CV(A) = 54.14%; CV(B) = 49.89?
Wages up to (₹) 5 10 15 20 25 30
11. The number of employees, average wages per employee, and the variance of wages for two factories is
given below. (Answer: 2.5% and 4.71%)
Factory A Factory B
No. of
50 100
employees
Average wages ₹120 ₹85
Variance 9 16
In which factory is there greater variation in the distribution of wages/employees? Which factory pays
more?
12. Mean and standard deviation of the following continuous series are 31 and 15.9 respectively. The
distribution after taking step deviations is as follows. Determine the class intervals. (Answer: i = 10, CI = 0
– 10, 10 – 20 etc.).
d' -3 -2 -1 0 1 2 3
f 10 15 25 25 10 10 5
Symmetrical Distribution
A distribution is symmetrical if the Mean = Median = Mode
A distribution is positively skewed if Mean > Median > Mode
A distribution is negatively skewed if Mean < Median < Mode
Uses of Skewness
1. Skewness is a measure to study whether a distribution is symmetrical or not.
2. Many models assume normal distribution; i.e., data are symmetric about the mean. The normal
distribution has a skewness of zero. But in reality, data points may not be perfectly symmetric. So, an
understanding of the skewness of the dataset indicates whether deviations from the mean are going
to be positive or negative.
Skewness:
1. It tells us about the direction of the variation or departure from the symmetry
2. It indicates to what extent and in what direction the distribution differs from the symmetry
3. It depends on the dispersion to some extent.
4. It is based on the averages of the first order (Mean, Median and Mode) and second order (SD)
Formulae
X̅ − M0
For unimodal distribution: Karl Pearson’s Coefficient of Skewness, Skp =
σ
̅ − M)
3(X
For bimodal distribution: Karl Pearson’s Coefficient of Skewness, Skp =
σ
Exercise 6.1
1. Calculate Karl Pearson’s and Bowley’s Coefficients of Skewness (Answer: 0.3453 & 0.2):
23, 45, 12, 28, 23, 19, 27, 23, 28, 30
2. Calculate Karl Pearson’s and Bowley’s Coefficients of Skewness (Answer: 0.162 & 0.164):
112, 75, 140, 89, 112, 98, 134, 129, 98, 121, 136
3. Calculate Pearson’s and Bowley’s Coefficients of Skewness (Answer: – 0.2445 & 0):
f 35 40 48 100 125 87 43 22
x 4 8 12 16 20 24 28 32 36
f 18 21 20 9 7 20 22 17 8
6. Calculate Karl Pearson’s and Bowley’s Coefficients of Skewness. (Answer: –0.0518 & –0.0165)
Class Interval 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80
Frequency 6 12 22 48 56 32 18 6
7. Calculate Karl Pearson’s and Bowley’s Coefficients of Skewness (Answer: 0.401 & 0.3750):
Marks Above 0 10 20 30 40 50 60 70 80 90
8. Calculate Karl Pearson’s and Bowley’s Coefficients of Skewness (Answer: –0.2078 & –0.058):
Mid Value 21 27 33 39 45 51 57
Frequency 18 22 40 50 38 12 4
9. Compute Karl Pearson’s and Bowley’s Coefficients of Skewness (Answer: –0.310 & –0.2314):
CI 3–7 8 – 12 13 – 17 18 – 22 23 – 27 28 – 32 33 – 37 38 – 42
f 7 9 10 6 13 10 13 10
10. a. In a distribution, Mean = 65, Median = 70, Skp = – 0.6. Find i) Mode and ii) CV. (Answers: 80, 38.46%)
b. Skp = – 0.7, σ = 6, M = 12.8. Find the Mean and CV. (Answers: 11.4, 52.63%)
Probable Error
It is the value that helps determine the reliability of the value of the correlation coefficient in the condition of
random sampling. It helps interpret the correlation coefficient.
Methods of Interpretation
1. If r < 6PE, the value of r may not be significant.
2. If r > 6PE, the value of r is significant or practically certain.
3. Using the limits of population, we get the range within which population correlation lies. ρ = r ± PE
Formulae
Σdx.dy
Using Actual Mean: r =
√Σdx2 x Σdy2
Σdx.Σdy
Σdx.dy −
Using Assumed Mean: r = 2
N
2
√Σdx2 − (Σdx) x √Σdy2 − (Σdy)
N N
1 − r2
Probable Error: P. E = 0.6745 x
√N
Exercise 7.1
1. Compute the coefficient of correlation from the following data: (Ans.: +0.9243)
Internal Marks 25 30 22 12 19 24
External Marks 56 68 40 24 28 60
2. Compute the coefficient of correlation from the following data: (Ans.: +0.6051)
X 6 8 9 14 17 28 24 31 7
Y 10 12 15 15 18 25 22 26 28
3. Compute the coefficient of correlation from the following data: (Ans.: +0.8818)
X 45 55 56 58 60 65 68 70 75 80
Y 56 50 48 60 62 64 65 70 74 82
4. Compute the coefficient of correlation from the following data: (Ans.: – 0.7327)
X 43 44 46 40 44 42 45 42 38 40 42 57
Y 29 31 19 18 19 27 27 29 41 30 26 10
5. Calculate the coefficient of correlation between age and playing habits of students: (Ans.: – 0.9895)
Age 15 16 17 18 19 20
6. The following table gives the distribution of the total population and those blind among them. Calculate the
coefficient of correlation and probable error. (Ans.: 0.898)
Age 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 – 80
Blind Persons 55 40 40 40 36 22 18 15
7. Calculate ‘r’ between age and failure of candidates in the results of B.Com students: (Ans.: 0.7745)
Age 13 – 14 14 – 15 15 – 16 16 – 17 17 – 18 18 – 19 19 – 20 20 – 21 21 – 22 22 – 23
Candidates
200 300 100 50 150 400 250 150 25 75
Appeared
b. If r = +0.9668 and PE = 0.01463, find N (Answer: 9) [Hint: Round off answer to the closest whole number]
Exercise 7.2
1. Two ladies ranked seven brands of lipsticks as follows. Find the degree of agreement (Ans.: 0.786):
Lady 1 1 3 2 7 6 4 5
Lady 2 2 1 4 6 7 3 5
2. In a beauty competition, two judges ranked 12 participants as follows. What is the degree of agreement
between them? (Ans.: – 0.4546)
X 3 4 1 5 2 10 6 9 8 7 12 11
Y 6 10 12 3 9 2 5 8 7 4 1 11
3. Compute the rank correlation from the following data (Ans.: 0.8322):
X 60 34 40 50 45 41 22 43 42 66 64 46
Y 75 32 35 40 45 33 12 30 36 72 41 57
4. From the marks scored in accountancy and statistics by 12 students, compute rank correlation (Ans.: 0):
Accountancy 60 15 20 28 12 40 80 20
Statistics 10 40 30 50 30 20 60 30
X 48 33 40 9 16 16 65 24 16 57
Y 13 13 24 6 15 4 20 9 6 19
6. Compute the rank correlation between the length of service and order of merit (Ans.: 0.7937):
Length of Service 5 2 10 8 6 4 12 2 7 5 9 3
Order of Merit 6 12 1 9 8 5 2 10 3 7 4 11
7. Ten competitors in a voice contest are ranked by three judges in the following order. Find which pair of
judges have the nearest approach to common liking in voice (Ans.: -0.212, -0,297, 0.6364; Judges 1 & 3):
Judge 1 1 6 5 10 3 2 4 9 7 8
Judge 2 3 5 8 4 7 10 2 1 6 9
Judge 3 6 4 9 8 1 2 3 10 5 7
Regression
The statistical tool with the help of which we are in a position to estimate or predict the unknown values of
one variable from known values of another variable is called regression.
Correlation vs. Regression
1. Correlation coefficient is a measure of degree of co-variability between two variables, but regression
analysis helps to predict the value of one variable given the value of the other.
2. The cause and effect relation is clearly indicated more through regression analysis than by
correlation, which is more a tool of ascertaining the degree of relationship between the variables.
Formulae
Equation X on Y: (X − ̅
X) = bxy (Y − ̅
Y)
̅) = byx (X − X
Equation Y on X: (Y − Y ̅)
Exercise 7.3
1. Find the Regression Equations (Answer: X = 1.3Y – 4.4 & Y = 0.65X + 4.1):
X 2 4 6 8 10
Y 5 7 9 8 11
2. A panel of judges P & Q graded seven dramatic performances by awarding marks as follows. Obtain the
two Regression Equations: (Answer: X = 0.75Y + 14.5 & Y = 0.75X + 5.75)
Performance 1 2 3 4 5 6 7
Marks by P 46 42 44 40 43 41 45
Marks by Q 40 38 36 35 39 37 41
3. Following Table shows the exports of raw cotton and the imports of manufactured goods into India for
seven years.
Exports 42 44 58 55 89 98 60
Imports 56 49 53 58 67 76 58
Obtain the two Regression Equations and estimate the imports when export in a particular year was ₹ 70
crore. (Answer: 62.03; X = 2.198Y – 67.244 & Y = 0.391X + 34.651)
4. The advertisement expenses and sales data of ABC company are as follows:
Sales (₹ Crore) 10 11 13 15 16 19 14
Find:
Find:
b. The advertisement budget if the company desires to achieve the target sales of ₹ 1,00,000
(Answer: ₹ 19,800)
6. Coefficient of correlation between the ages of brothers and sisters in a community was found to be 0.8.
Average age of the brothers was 25 and that of sisters 22 years. Their variances were 16 and 25
respectively.
Find:
a. The expected age of the brother when sister’s age is 12 years. (Answer: 18.6 years)
b. The expected age of the sister when brother’s age is 33 years. (Answer: 30 years)
7. a. 𝐼𝑓 𝑟 = 0.42, 𝜎𝑦 = 16.8 𝑎𝑛𝑑 𝜎𝑥 = 10.8, 𝑓𝑖𝑛𝑑 𝑏𝑥𝑦 𝑎𝑛𝑑 𝑏𝑦𝑥 (Answers: 0.269 & 0.653)
6. Factor Reversal Test: The index must permit interchanging the prices and quantities without giving
inconsistent results. The two results multiplied together should give a true value ratio. This is given by
Σ p1 q1
the expression: P01 x Q 01 =
Σ p0 q0
1. It is based on geometric mean which is considered to be the best average for constructing index
numbers
2. It takes into account both, current year as well as base year prices and quantities.
3. It satisfies both Time Reversal Test (TRT) and Factor Reversal Test (FRT).
4. It is free from bias.
Formulae
Σp1
Simple Aggregative Method: P01 = x 100
Σp0
Σp1 q0
Weighted Aggregative Method: P01 = x 100
Σp0 q0
p
ΣI Σ( 1 x 100)
p0
Simple Average of Price Relatives Method: P01 = =
n n
Σp1 q0
CPI/CLI: Aggregate Expenditure Method: P01 = x 100
Σp0 q0
ΣIW p1
Family Budget Method: P01 = where I = x 100 and W = p0 q0
ΣW p0
Σp q Σp1 q1
Fisher’s Ideal Index Number: P01 = √ 1 0 x x 100
Σp q 0 0 Σp0 q1
Exercise 8.1
1. Calculate the price index for 2006, 2007 and 2008 using the simple aggregative method on the basis of 1995
(Answers: 124.37, 139.42, 153.20):
2. Calculate the weighted aggregative index number for the following commodities for the year 2001 and 2008 taking
the year 1991 as the base year (Answers: 130.45, 156.97):
3. Calculate the price index numbers for the following data for 2007 and 2008 using simple average of price relative
method (Answers: 147, 196):
Prices – 2001 10 20 5 2 7
Prices – 2007 16 21 6 3 14
Prices – 2008 18 22 7 5 21
4. Calculate the index number for the following data using simple average of price relative method (Answer: 122.92)
Commodity A B C D E F
Prices – 2008 4 6 2 5 8 10
Prices – 2009 5 6 3 7 9 11
5. Calculate the Consumer Price Index or Cost of Living Index Number using Aggregative Expenditure Method and Family
Budget Method (Answer: 150):
Quantity Price
Item
2005 2005 2010
A 5 8 15
B 2 9 12
C 3 16 20
6. Calculate the CPI using Aggregative Expenditure Method and Family Budget Method (Answer: 118.77):
Quantity Price
Item
2008 2008 2009
E 4 kg 2.00 1.50
7. An enquiry into the budgets of middle class families in Bangalore gave the following information:
What changes in the cost of living index of 2008 have taken place as compared 2007? How much dearness allowance
should be given to a worker who was drawing ₹200 as wages in 2007? (Answers: 126.10 & ₹52.20)
Average wage per month in 2004 is ₹750. What should be the average wage per worker in 2009 in that town so that
the standard of living of the workers does not fall below that of 2004? (Answers: 203 & ₹1,522.50)
9. An enquiry into the budget of the middle class families in a city gave the following information. What changes in the
cost of living figures of 2005 as compared to that of 2002 are seen? (Answer: 102.75)
Rent 15% 30 30
Clothing 25% 75 66
Fuel 10% 25 20
Miscellaneous 21% 40 52
10. The data below show the percentage increase in prices of selected food items and the weights attached to each of
them. Calculate the index number for the food group (Answer: 340, 304.6)
Food Item: Rice Wheat Dal Ghee Oil Spices Milk Fish Vegetables Refreshments
Weights 33 11 8 5 5 3 7 9 9 10
Increase in
180 202 115 212 175 517 260 426 332 279
Price %
Using the above food index and information given below, calculate the cost of living index number:
Weight 60 - 8 9 18
11. The cost of living index number on a certain date was 200. From the base period, the percentage increase in prices
were Rent – ₹60, clothing – ₹250, Fuel and lighting – ₹150, Miscellaneous – ₹120. The weights of different groups
were Food – 60, Rent – 16, clothing – 12, fuel and lighting – 8, and miscellaneous – 4. What was the percentage
increase in food group? (Answer: 72.67)
12. A textile worker earns ₹350 per month. The cost of living index for that particular month is known to be 136. Using
the data given below, find the amounts spent by him on house rent and clothing (Answer: 42, 49):
Expenditure 140 ? ? 56 63
13. Compute Fisher’s Ideal Index Number and prove that it satisfies the Time Reversal Test and Factor Reversal Test
(Answer: 134.41):
Year Commodity: A B C D E
Price 10 12 18 20 22
2008
Consumption 49 25 10 5 8
Price 12 15 20 40 45
2009
Consumption 50 20 12 2 5
14. Compute Fisher’s Ideal Index Number for the following five items (Answer: 266.615):
A 16 40 100 120
B 4 12 30 20
C 2 4 40 50
D 4 10 20 16
E 2 10 80 60
15. Construct Fisher’s Ideal Index Number and prove that it satisfies TRT & FRT (Answer: 165.71):
16. Compute Fisher’s Ideal Index Number and prove that it satisfies the TRT and FRT (Answer: 112.10):
Year Item A B C D E
Price 10 12 20 18 28
Base Year
Value 200 108 260 144 280
17. Compute Fisher’s Ideal Index Number and prove that it satisfies the TRT and FRT (Answer: 219.12):
Year Commodity: A B C D
Price 20 40 10 50
2013
Expenditure 400 160 100 250
Price 50 80 20 100
2014
Expenditure 750 400 240 600
Σx Σfx Σfm
Direct Method ̅ =
X ̅ =
X ̅ =
X
n N N
Σd Σfd Σfd
Shortcut Method ̅
X =A+ ; d = x– A ̅ =A+
X ; d = x − A ̅ =A+
X ; d = m − A
n N N
Σxw
̅ =
Weighted Arithmetic Mean: X
Σw
n1 x̅1 + n2 x̅2
̅ (1,2) =
Combined Arithmetic Mean: X
n1 + n2
Median
n th n th
(n+1) th ( 2 ) term + (2 +1) term
Individual Series: M = [ ] term when n is odd and M = [ ] when n is even.
2 2
(n+1) th
Discrete Series: M = [ ] term
2
N
− c.f.
Continuous Series: M = L + 2
xi
f
Mode
Individual Series: The variable that occurs most frequently.
Discrete Series: The value which has the greatest frequency in the neighborhood.
∆1
Continuous Series: Z or M0 = L + x i; ∆1 = |f1 – f0| and ∆2 = |f1 – f2|
∆1 + ∆2
Mean Deviation
Individual Series Discrete Series Continuous Series
MD MD MD MD MD MD
Coefficient of MD 𝑜𝑟 𝑜𝑟 𝑜𝑟
x̅ M x̅ M x̅ M
Quartile Deviation
Interquartile Range: IQR = Q3 – Q1
Q3 − Q1
Quartile Deviation: QD =
2
Q3 − Q1
Quartile Deviation: CQD =
Q3 + Q1
N
(n + 1) th − c. f.
Q1 [
4
] term L+ 4 xi
f
3N
3(n + 1) th − c. f.
Q3 [
4
] term L+ 4 xi
f
Standard Deviation
Individual Series Discrete & Continuous Series
Ʃd2 Ʃfd2
Direct Method σ=√ d = x − x̅ σ=√ d = x − x̅ or m − x̅
n N
Coefficient of Skewness
X̅ − M0
For unimodal distribution: Karl Pearson’s Coefficient of Skewness, Skp =
σ
̅ − M)
3(X
For bimodal distribution: Karl Pearson’s Coefficient of Skewness, Skp =
σ
𝑄3 + 𝑄1 − 2M
Bowley’s Coefficient of Skewness, SB =
𝑄3 − 𝑄1
Σdx.Σdy
Σdx.dy −
Using Assumed Mean: r = 2
N
2
√Σdx2 − (Σdx) x √Σdy2 − (Σdy)
N N
1 − r2
Probable Error: P. E = 0.6745 x
√N
Regression
Equation X on Y: (X − ̅
X) = bxy (Y − ̅
Y)
Equation Y on X: (Y − ̅
Y) = byx (X − ̅
X)
Index Numbers
Σp1
Simple Aggregative Method: P01 = x 100
Σp0
Σp1 q0
Weighted Aggregative Method: P01 = x 100
Σp0 q0
p
ΣI Σ( 1 x 100)
p0
Simple Average of Price Relatives Method: P01 = =
n n
Σp1 q0
CPI/CLI: Aggregate Expenditure Method: P01 = x 100
Σp0 q0
ΣIW p1
Family Budget Method: P01 = where I = x 100 and W = p0 q0
ΣW p0
Σp q Σp1 q1
Fisher’s Ideal Index Number: P01 = √ 1 0 x x 100
Σp q 0 0 Σp0 q1
2. In 2012, the total number of visitors to the Wonder Land, Bangalore, was 25,000. Among them, there
were 8,600 female visitors from India and 6,500 foreign visitors out of which 3,500 were female visitors.
In 2013, the total number of visitors increased by 20% and that of Indian visitors increased by 10%.
Among them, there were 8,000 Indian male visitors and 6,000 foreign female visitors. Tabulate the data.
3. A survey of 370 students from Commerce faculty and 130 students from Science faculty revealed that 180
students were studying for only CA examinations, 140 for only Costing examinations and 80 for both CA
and Costing examinations. The rest opted for Part-time Management courses. Of those studying for
Costing, only 13 were girls and 90 boys belonged to Commerce faculty. Out of 80 studying for both CA and
Costing, 72 were from commerce faculty amongst which 70 were boys. Among those that opted for Part-
time Management courses, 50 boys were from Science faculty, and 30 boys and 10 girls were from
Commerce faculty. In all there were 110 boys in Science faculty. Present the above information in a
tabular form.
4. Prepare a frequency distribution from the following figures relating to bonus paid to workers (₹’000)
67 60 69 70 62 63 69 70 58 56 67 54
55 70 60 60 60 65 70 56 57 58 60 59
61 73 69 67 61 60 59 57
5. The following are the marks of 50 students in Statistics. Construct a suitable frequency table:
28 17 48 57 38 59 28 16 78 46 45 86
21 29 49 61 71 46 49 30 76 37 76 36
37 39 46 27 29 31 21 49 29 8 56 46
5 36 71 42 46 56 16 15 22 35 18 22
46 17
6. 25 values of two variables X and Y are given below. Form a two-way frequency table showing the
relationship between the two:
X 12 24 33 22 44 37 26 36 55 48 27
Y 140 256 360 470 470 380 280 315 420 390 440
X 57 21 51 27 42 43 52 57 44 48 48
Y 390 590 250 550 360 570 290 416 380 392 370
X 42 41 69
Production (tons) 45 40 44 41 49 42 55 50
2. Present the following data on profit before tax and after tax using multiple bar diagram:
3. Represent the cost per scooter using sub-divided bar diagram and percentage sub-divided bar diagram:
Weight 58 60 61 62 63 64 65 66
No. of Persons 4 12 24 32 32 16 8 2
Wages 0 – 20 20 – 40 40 - 60 60 - 80 80 – 100
x 0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70 70 - 80
f 5 8 7 12 28 20 10 10
CI 4–7 8 – 11 12 – 15 16 – 19 20 – 23 24 – 27
Frequency 12 23 40 65 17 3
Age below 5 10 15 20 25 30 35
6. 20% of the workers in a firm, employing a total of 4000 workers, earn less than ₹4 per hour, 880 earn
from ₹4 to ₹4.24 per hour, 24% earn from ₹4.25 to 4.49 per hour, 740 earn from ₹4.50 to ₹4.74 per hour,
12% earn from ₹4.75 to ₹4.99 per hour and rest earn ₹5 or more per hour. Calculate the median.
7. Find the median and mode of the following data using Ogive curves and Histogram respectively:
Mid values 115 125 135 145 155 165 175 185 195
Frequency 6 25 48 72 116 60 38 22 3
8. Find the missing frequencies, if total frequency is 120 and median is 36.5:
CI 20 – 25 25 – 30 30 – 35 35 – 40 40 – 45 45 – 50 50 – 55 55 – 60
F 8 15 28 - 22 - 4 2
Answers
(1) 62.24, 62, 62 (2) 46.51 (3) 45, 46.43, 46.67 (4) 15.03, 15.81, 16.87 (5) 8.33 (6) 4.33
(7) 153.79, 154.4 (8) 30, 11
Weight 58 59 60 61 62 63 64 65 66
No. of Persons 15 20 32 35 33 22 20 10 8
2. Find the Interquartile Range, QD, CDQ, and MD (Using mean and median) from the following data:
Value 90 – 99 80 – 89 70 – 79 60 – 69 50 – 59 40 – 49 30 – 39
Frequency 2 12 22 20 14 4 1
3. From the prices of shares of X and Y given below, state which share prices are more stable
X 55 54 53 53 56 68 52 50 51 49
Y 108 107 105 105 106 107 104 103 104 101
5. The life of two types of tyres in a sample survey is given below. Which one has a higher average? Based on
consistency, which one would you prefer?
Type A 10 18 32 40 22 18
Type B 18 22 40 32 18 10
6. Given below is the distribution of boys and girls of a school. Find which group is more variable.
Age in Years 13 14 15 16 17
No. of Boys 12 15 15 5 3
No. of Girls 13 10 12 2 1
Answers
(1) QD: 3, 1.5, 0.024, 1.74; MD: 1.74, 0.028, 1.713, 0.028 (2) QD: 18.02, 9.01, 0.132; MD: 10.41, 0.153, 10.437,
0.152 (3) CV(X) = 9.37%, CV(Y) = 1.91% (4) 16.781% (5) CV(A) = 33.35% & CV(B) = 37.13%
(6) CV(X) = 7.855%& CV(Y) = 7.341%
Age 12 14 15 18 21 24 26 27 31 33
No. of Persons 8 12 24 20 15 24 18 8 6 4
3. Find the Coefficient of Skewness from the following data using Pearson’s and Bowley’s methods:
Size 7 8 9 10 11 12 13 14
Frequency 2 11 36 64 39 30 22 2
Marks Below 80 70 60 50 40 30 20 10
No. of
150 136 120 80 70 70 50 10
Students
5. From the data given below, find the coefficient of skewness using both the methods:
X 23 – 27 28 – 32 33 – 37 38 – 42 43 – 47 48 – 52 53 – 57 58 – 62 63 – 67 68 – 72
F 2 6 9 14 32 16 12 6 2 1
Marks Above 0 10 20 30 40 50 60 70 80 90
No. of
100 89 73 64 52 49 32 20 12 5
students
7. Pearson’s coefficient of skewness is –0.7 and the value of the median and standard deviation are 12.8 and
6 respectively. Determine the value of mean.
8. In a distribution, mean = 65, median = 70 and Skp = – 0.6. Find i) SD, ii) Mode, iii) CV
Answers
(1) 0.155, - 0.1818 (2) – 0.2597, – 0.0909 (3) 0.3665, 1 (4) – 0.7539, –0.3636 (5) 0.0572, 0.0615
(6) – 0.2276, – 0.1858 (7) 11.4 (8) 25, 80, 38.46
Age 20 – 25 25 – 30 30 – 35 35 – 40 40 – 45 45 – 50 50 – 55 55 – 60 60 – 65
Wages (‘000) 9 10 12 11 16 16 18 17 15
3. From the following data find the coefficient of correlation between average profits and average
advertisement expenditure per shop and interpret the result.
No. of Shops 30 45 14 26 12 16 22 35
Total Profits 60,000 135,000 42,000 52,000 36,000 64,000 66,000 105,000
Advertisement
3,000 45,000 7000 13,000 6,000 4,800 8,800 14,000
Expenses
4. With the following data in 6 cities, calculate the Coefficient of Correlation between the density of
population and death rates.
Cities A B C D E F
Density of
200 500 400 700 600 300
Population
Population (‘000) 30 90 40 42 72 24
Analyst A 15 18 12 22 15 21 15 27 16 24
Analyst B 16 19 17 21 19 26 12 16 18 20
6. Using Rank Correlation find out which pair of judges have a nearly common taste in fashion design.
Judge A 1 3 2 5 8 7 9 4 10 6
Judge B 3 5 4 6 7 9 8 1 2 10
Judge c 5 6 2 3 8 7 10 4 1 9
Answers
(1) – 0.975 (2) 0.855 (3) 0.141 (4) + 0.988 (5) 0.4182 (6) 0.3455, 0.7697, 0.2727
Husband’s age 25 28 30 32 35 36 38 39 42 55
Wife’s age 20 26 29 30 25 18 26 35 35 46
Obtain the two regression equations and determine the most likely age of husband when the wife’s age is
25 years.
X 20 24 26 34 36
Y 10 12 14 18 26
3. Find the regression lines for the following data and estimate the value of X when Y = 38.
X 25 28 35 32 36 37 29 39
Y 43 46 49 41 36 32 31 32
4. The heights (in cm) and weights (in kg) of a random sample of 9 adult males are shown below:
Height 177 163 173 182 171 168 174 176 184
Weight 71 67 77 85 69 62 73 78 80
Estimate the height when the weight is 75 and the weight when the height is 180.
5. A study of wheat prices per kg at Mysore and Bengaluru yields the following data:
Mysore Bengaluru
Estimate:
a. The price of wheat at Mysore when the price is ₹ 23.54 at Bengaluru.
b. The Price of wheat at Bengaluru when the price is ₹ 30.5 at Mysore.
Answers
(1) 32.6956 years (2) 32; 17.739 (3) 32.839 (4) 175.315 cm; 78.75 kg (5) ₹19.23; ₹30.855
2. Calculate the price index numbers for the following data for 2007 and 2008 using simple average of price
relative method:
3. Calculate the Consumer Price Index or Cost of Living Index Number using Aggregative Index Number and
Family Budget Method:
A B C D E
Quantity – 2004 50 100 60 30 40
Prices – 2004 6 2 4 10 8
Prices – 2009 10 2 6 12 12
4. The group indices and the corresponding weights for the working class cost of living index numbers in an
industrial city for 2009 and 2010 are as follows:
Group Index
Group Weight
2009 2010
Food 71 370 380
Clothing 3 423 504
Fuel 9 469 336
House Rent 7 110 116
Miscellaneous 10 279 283
Compute the cost of living index number for 2009 and 2010. If a worker was getting ₹3000 per month in
2009, should he be given any extra allowance in 2010 so that he can maintain his 2009 standard of living?
Justify your answer.
5. The following table gives the cost of living index numbers for different groups with their respective
weights for the year 1992 (base year 1982). Calculate the overall cost of living index numbers.
If Mr. Bose got ₹550 in 1982, determine how much he should receive in 1992 to maintain the same
standard of living as in 1982.
6. The relative importance of the following 8 groups of family expenditure is tabulated below. If the corresponding
increase in prices (in %) for February, 1992 compared to January 1992, are 25, 1, 22, 18, 14, 13, 20 and 11, calculate
the CPI:
348 88 97 65 71 35 70 217
7. Compute Fisher’s Ideal Index Number and show that it satisfies the TRT & FRT:
2004 2008
Commodity
Price (₹) Consumption (kg) Price (₹) Consumption (kg)
A 8 6 12 4
B 10 8 12 8
C 14 4 18 4
D 4 6 2 10
E 10 10 14 8
8. Compute Fisher’s Ideal Index Number and prove that it satisfies the TRT & FRT:
Year A B C D
2000 2 4 1 5
Price
2010 5 8 2 10
2000 40 16 10 25
Value
2010 75 40 24 60
9. A worker earns ₹750 per month. The cost of living index for January, 2009 is known to be 160. Using the data given
below, find the amounts spent by him on food house rent.
Answers
(1) 127.27 & 127.57 (2) 135.86 (3) 139.71 (4) 353.2 & 351.58; No (5) 352; 1936.00 (6) 117.49
(7) 124.01 (8) 218.046 (9) 300, 150