Definition of Statistics
Definition of Statistics
INTRODUCTION
Definition of statistics
"Statistics are the classified facts representing the conditions of the people in a state... specially
those facts which can be stated in number or in tables of numbers or in any tabular or
classified arrangement." - Webster.
"Statistics are numerical statement of facts in any department of enquiry placed in relation to
each other."-Bowely.
"Statistics may be defined as the aggregate of facts affected to a marked .extent by multiplicity
of causes, numerically expressed, enumerated or estimated according to a reasonable standard
of accuracy, collected in a systematic manner, for a predetermined purpose and placed in
relation to each other." - Prof. Horace Secrist.
This definition seems to be the most exhaustive 'of all the definitions. By examining
this definition, we can draw the following essentials of "Statistics":
1|Page
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Thus, from the definition of Prof. Horace Secrist and its discussions above, we may
conclude that:
"All statistics are numerical statements of facts but all numerical statements of facts are not
statistics." The numerical statements of facts, to be designed as statistics, must possess
some of the characteristics given in the definition of Prof. Horace Secrist.
B. Statistics as a Science
The term 'Statistics', as a singular noun, means a body of theories and techniques or
methods employed in analyzing the numerical information. It is a branch of
scientific method and deals with the mathematical process to yield finished product.
It is concerned with the collection, classification, tabulation, presentation and
analysis of data relating to a particular field of study.
2|Page
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
"Statistics may be defined as the science of collection, presentation, analysis and interpretation
of numerical data." F.E. Croxton and D.J. Cowden.
This definition is the best of all the above definitions. It is satisfactory and complete
in giving the correct meaning of the term 'statistics'. Thus, a study and analysis of
various definitions of statistics helps us to draw the following essentials of
"Statistics":
Thus, the science of statistics is widely employed as a tool of analyzing the problems
in the field of natural and social sciences. It provides tools and techniques for
research students; it is not studied for its own sake but for the sake of developing
new sciences. It is a method of research in which statistical method (experiment
method in natural sciences) is adopted in studying the problems concerned with the
social sciences.
3|Page
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
The main objective of statistics is to study the population and the variables therein
for the purpose of reducing the population to the possible extent which helps to
make decisions and solve problems. Following are the various objects of statistics:
4|Page
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Therefore, the quantitative data are processed for the purpose of doing something
and making use of them. They are utilized to examine the problems concerning a
field of enquiry in their true perspective, to find out the causes of changes and to
estimate their probable effects. The statistical methods are employed as a tool for
comparison between past and present events to throw light on the reasons of
changes.
5|Page
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
investigation and interpretation. It discloses the hidden facts and enlarges the field of all
the branches of human knowledge.
Importance of Statistics
Statistics has been termed as the "Science of Kings". Indeed in ancient times, it kept
the Kings informed about the man power and riches of domain. Nowadays, it is
treated as the "Arithmetic of Human Welfare". It is indispensable for a clear
appreciation of any problem affecting the welfare of mankind.
In modern times, planning without statistics cannot be imagined. Statistics is the light
bearer that enlightens the way to life's adventure. It unravels the crowded
complexities of life and thought. Without its support man has to wander aimlessly
through this perplexing universe. It discloses casual connection between related
facts. Such a study of statistics is at the bottom of all sound human endeavors.
Statistics are the eyes of administration. All the businessmen and statesmen can tender
sound advice on a problem to their administrative machinery with the help of
adequate data before them to base their judgments.
Statistics are aids to supervision. The present days of impersonal relationship between
the employer and employees, statistics are becoming tools for the supervision of
work in obtaining efficiency of the employees. The supervisors or officers are
provided with accurate and concise information to supervise the work of their
subordinates.
6|Page
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
The scope of statistics is stretched over all those branches of human knowledge in
which a grasp of the significance of large numbers is looked for. Its methods provide
an important manner of measuring numerical changes in complex groups and
judging collective phenomenon. Its scope is, thus, wide, the limiting factor being its
applicability to studies of quantitative aspects alone
"Sciences without statistics bear no fruits; Statistics without sciences have no roots."
There is hardly any field of human knowledge where statistical methods are not
applicable. Thus the significance of statistics has increased from the "Science of
Kings" to the "Science of Universe".
For proper use of statistics, it is essential to keep in mind the following limitations:
7|Page
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Liable to be Mis-used: Statistical tools are dangerous in the hands of those who
do not know their use and deficiencies. They can be misused to serve one's own
purpose. Statistics is the useful servant but only of great value to those who
realize them properly.
Therefore, the use of statistics by the experts, who are well experienced and skilled in
the analysis and interpretation of statistical data for drawing correct and valid
inferences, very much reduces the chances of disgrace to the mass popularity of this
important science.
DATA COLLECTION
SOURCES OF DATA
The sources of data collection are classified, on the basis of the nature of data, into
two groups i.e. Primary and Secondary.
"Primary Data" are those data which are originally collected for the first time, for a
specific purpose by an investigator. They are collected directly from the people to
whom the enquiry is related. They are original in character. They are primary to the
institution which collects them; they are secondary to all other institutions which
refer such data.
"Secondary Data" are those data which are already collected, processed and used by
someone else for their own purpose. They are either published or unpublished. They
are finished products. They are secondary to all those institutions except the one
which has collected them.
It may be observed that the distinction between primary and secondary data is a
matter of degree or relativity only. The same .set of data may be primary in the
hands of one and secondary in the hands of others. The sources of collecting the
data, thus, are also classified as – Primary Sources and Secondary Sources.
8|Page
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
PRIMARY SOURCES
The data which are originally collected for the first time are called "primary data." The
methods of collecting the primary data are as under:
Disadvantages
i. This method is restrictive in nature and is suited only for intensive studies
and not for extensive enquiry.
ii. It is handicapped due to lack of time, money and manpower since the
informants are to be approached at their residence or office at their
convenience.
iii. It requires skilled and experienced investigator for enquiry.
9|Page
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Indirect Oral Interviews: This method consists of the collection of data through
enumerators appointed for the purpose. A small list of questions pertaining to the
subject matter of the enquiry is prepared. These listed questions are then put to the
informants or witnesses who are closely connected or concerned with the subject
matter. The enumerators collect all the answers from the informants and record
them systematically. This method is adopted by the commissions appointed by the
government.
Advantages
Disadvantages
i. This method suffers from lack of personal touch and supervision by the
investigator as the information is collected through the enumerators from the
informants.
ii. The accuracy of data collected much depends upon the nature and skill of the
informants who may give wrong information.
iii. The informants may be prejudiced (biased) persons.
iv . It is very difficult to find the informants who are more concerned with the
subject matter of the enquiry. Some of them even may not really possess the
knowledge in answering the questions.
Advantages
10 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
This method works out to be very cheap and economical for extensive investigations
particularly when the data are obtained by part-time correspondents or agents.
Moreover, the information can be obtained expeditiously since only rough estimates
are required.
Disadvantages
This method suffers from personal prejudices and whims (sudden change of mind)
of the correspondents in different fields of the enquiry and consequently the data so
obtained will not be very reliable.
This method is not suitable for collecting the accurate and exact information.
A very polite covering note or letter explaining the aims and objects of enquiry is
attached. Respondents are also requested to extend their full co-operation by
furnishing information and returning the questionnaire duly filled in, Respondents
are taken into confidence by ensuring them that the information so supplied by
them will be kept strictly confidential and secrete.
In order to ensure quick and better response, the return postal expenses are usually
borne by the investigating organization by sending the self-addressed stamped
envelope. This method is usually used by the research workers, private individuals,
non-official agencies and government departments.
i. Due skill, efficiency, care and experience are required in framing the
questionnaire.
ii. The questions asked must be clear, short, brief, corroborative, non-offending,
courteous, unambiguous and suitable for answering in the form of "Yes" or
"No".
11 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
iii. They should not be too confidential and hurt the feelings of the respondents.
iv . They must be arranged in a logical order.
v. They must suit the general educational level of the respondents.
Advantages
i. This method is by far the most economical method in terms of time, money
and manpower.
ii. It is used for extensive enquiries covering a very wide area.
iii. The information obtained is original and much more authentic.
Disadvantages
A questionnaire is a list of questions which is sent directly to and filled in by the informants.
The enumerators visit the respondents personally with the list of questions
(schedules), ask them the questions therein and record the answers from the
12 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
informants. This method is generally used by the big business houses, large public
undertakings and research institutions.
Advantages
i. The enumerators can explain in detail the objective and scope of the enquiry
to the informants and impress upon them the need and utility of furnishing
the correct information.
ii. This method is useful in extensive enquiries and it yields reliable and
dependable results.
iii. There will be' a good and quick response from the informants whether they
are literates or illiterates.
iv . The enumerators can effectively check the accuracy of the information by
cross-questioning and putting additional questions.
Disadvantages
From all the above sources that primary data can be collected, there are other
methods such as- Local Reports, Enquiries, Telephone Calls and Fax - which are used in
most of the advanced countries for collecting data.
SECONDARY SOURCES
13 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
The data which have already been collected by someone else are called "secondary
data" which have gone through the statistical process at least once. They may be
either finished or semi-finished and published or unpublished.
Statistical data are collected for the purpose of analysis and interpretation. The data
are collected by techniques such as - Census and Sample. The problem of
application of techniques will arise only in-case of collecting the primary data.
1. Census Method: In a census technique of collecting the data every element of the
population is included in the investigation.
14 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Advantages
i. The results from such method are more accurate and reliable.
ii. This method helps in making available the intensive knowledge.
iii. It is useful (suitable) where units of diverse nature constitute universe or
population or mass.
Disadvantages
i. This technique is more expensive and requires more labour and time.
ii. It is the most difficult and giant task to count all the items in the population
as they are always subject to changes.
2. Sample Method: In a sample technique of collecting the data selection of the
part of the universe is made for the purpose of study.
It is very essential to select the adequate number of items so that each one shall have
the characteristics of the mass to the possible extent. A finite subset of the
population is called "sample". The number of units in the sample is called the
"sample size".
Advantages
Disadvantage
The samples selected may not represent the mass and they may be false and
influenced by the personal bias of the selector.
15 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
As the census method is too cumbersome and costly, in most of the cases the
sampling method is used in statistical enquiries. There are two methods of taking
the samples - Random sampling and Non-random sampling.
Random Sampling: It refers to the chance selection but it does not mean that the
selection should be made in a haphazard manner. A few mechanical devices are used
in a sample selection so that every item selected on random basis has equal
probability of entering the sample. It is a blind chance alone that determines
whether one or the other item mayor may not be selected.
The term "random", is not used in the sense of 'hit or miss' method. There shall be no
room for discrimination to be exercised by the investigator. In fact, sampling is a
scientific way of getting items selected from the mass. The selection of the items
should be free from personal bias or judgment. The various methods of sampling
may include:-
Drawing chips from the bowl, rotating the drum and use of a table of random digits - are used
in collecting the data. The most important method adopted nowadays is the "lottery"
draws that are based on purely random sampling.
There are many more other methods of taking the samples. They are as under:
16 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
In a "Double Sampling", samples are taken into two stages. If the first set of sample is
inclusive, the second set is added to the first one to strengthen the results.
In a "Multiple sampling", a sample procedure is carried out in several stages - the first
stage, the second stage and so on. For example, in the first stage a few districts from
the state are selected. In the second stage, a few divisions are selected from the
sample districts. In the third stage, a few villages are selected from the sample
divisions. In the next stage, a few families are selected from the sample villages. In
this process, the size of the sample goes on reducing at each stage.
In a "Convenience Sampling", a sample is obtained from the readily available lists such
as telephone directory, ration card-holders' list and voters' list.
In a "Sequential sampling", a number of samples are drawn from the universe one after
another depending on the results of previous samples. If the first sample is not
acceptable, the lot from which the first samples are made is rejected. Then the
second lot is taken from which again samples are selected. After each lot of items, we
have to decide whether to accept or reject or to reserve the judgement or continue
the process until a final decision is taken.
Thus, the choice of sampling technique depends upon the nature of the universe,
the sampling unit, the size of sample and the other factors.
PRINCIPLES OF SAMPLING
The following are the theories or principles or laws of the statistical methods:
17 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
random, such items are almost sure, on the whole, to possess the characteristics
of the population.
Law of Statistical Regularity: The law indicates that a moderately large number
of items chosen at random from a large group are almost sure to possess, on an
average, the characteristics of the large group or population. The ''Theory-of
Probability" gave rise to the "Law of Statistical Regularity". The former tells us of
mathematical expectation of happening and not happening of events having
equal chances in a sample selection. The latter tells us that random selection
from the universe is likely to give a representative character of the population.
Law of Inertia of Large Numbers: It is a corollary to the "Law of Statistical
Regularity". It states that, other things being equal, larger the size of the sample, it
is likely that the results will be more accurate.
STATISTICAL ERRORS
After the collection and editing of the data, there still remains the difference
between the true value and the estimated value. Such errors are called statistical
errors which arise due to inadequacy of data, unintentional acts of the investigators,
random sampling and editing of the data.
Statistical errors are of two types sampling errors and non-sampling errors.
1. Sampling Errors: These errors arise due to drawing inference about the
population or universe on the basis of samples or observations, because samples
are seldom perfect miniature (very small in kind) of the population. These
sampling errors may be classified, further, as biased and unbiased (purposeful and
unpurposeful).
"Biased Errors" exist due to biased views of the informants and investigators,
defective measurements, defective sampling and prejudiced treatment of data
18 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
collected. These errors are constant, cumulative, persistent and systematic. They are
not compensating and they move in the same direction. They can be checked by
examining their sources carefully and taking proper precautions at each stage in the
process of investigation.
"Unbiased Errors" exist naturally and suddenly without any bias or prejudice. They
arise due to the principles of statistics - "Theory of Probability", "Law of Statistical
Regularity" and "Law of Inertia of Large Numbers". The excess errors in one
direction are almost balanced by the excess errors in the other direction. Such
errors, therefore, are compensating. They are non-cumulative and compensating.
They lie in both the directions - positive and negative.
Therefore in the sampling enquiry we find both the sampling and non-sampling
errors
Non-sampling Errors: These errors occur in the sampling enquiry and the census
enquiry as well. But in the census enquiry we find only non-sampling errors. Non-
sampling errors include both statistical errors and mistakes. They creep in during the
process of collecting the information.
i. They arise due to the negligence of the investigators in asking the questions or
in recording the answers.
ii. They also arise due to the lack of knowledge in the investigation.
iii. They are also the result of the approximation, faulty definitions and defective
system of collecting the data.
They increase as the number of units included in the census enquiry. They are
greater in a census enquiry than in a sample.
All the statistical errors can be measured absolutely or relatively. So the statistical
errors may be either absolute or Relative.
a. Absolute Error: It is the difference between the true value and the estimated
value. It is the difference between the origin figure and the approximated figure.
To put it in a formula form:
19 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
CLASSIFICATION OF DATA
The data collected in any statistical enquiry are voluminous, huge and in a raw form.
They are unwieldy and uncomprehensible. Some process of condensation is
necessary to make the data, organized so that they could be easily understood.
There are three steps in organizing the raw data - Classification, Tabulation and
Analysis.
20 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
"Classification is the process of arranging data into sequences and groups according to their
common characteristics, or separating them into different but related parts". - Secrist
"Classification is a scheme for breaking a category into a set of parts, called classes, according
to some precisely defined differing characteristics possessed by all the elements of the category".-
Tuttle. A.M.
Thus, the process of classification involves the arrangements of the data into
different classes which are to be determined depending upon the nature, objectives
and scope of the enquiry.
Objectives of Classification
To present the facts in a simple form: Classification makes the unorganized mass of
data simple, brief, logical and understandable. It helps in understanding the
structure and nature of data.
To bring out points of similarity and dissimilarity: Classification brings out clearly the
points of similarity and dissimilarity existing among the data. Different attributes
of data are placed in different classes.
To facilitate comparison: Classification helps in making comparisons between the
two series which enables one to draw inferences. Without classification it is very
difficult to have a comparative study and take decisions
To bring out relationship: Classification helps to establish relationship between the
two series. With the help of such' a relationship, one can conclude about the
cause and effect relationship.
To present a mental picture: Classification makes one to have a mental picture of
the mass data. It gives a clear picture of the complex data in a condensed form.
To prepare the basis for tabulation: Classification makes the process of tabulation
easy. Without classification it is very difficult to tabulate the data.
21 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Functions of Classification
Bases of Classification
The basis or the criteria with respect to which the data are classified primarily
depend on the objectives and the purpose of the enquiry. Generally, the data are
classified as under:
22 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
4. "Arbitrary classification" refers to those attributes which are not clearly defined
and in which no demarcation can be made. For example, tall persons and short
persons literates and illiterates, young and old and so on. Such attributes require
specific definitions. They are called "notional attributes".
5. Quantitative Classification: Under this method data are classified according to
quantitative measurements like age, height, marks, weight, prices, production,
income, wages, expenditure, sales and profits. The quantitative phenomenon
under study is known as 'variable', hence this classification is also sometimes
called "classification by variables"
Types of variables
1. Discrete Variable: is a variable which cannot take all the possible values within a
given specified range. It takes only integers (whole numbers) characterized by
jumps and gaps between one value and the next. The values, generally, change
with particular gap constantly. Thus a variable, which shows definite breaks
between one value and the other succeeding it, is called a "Discrete Variable."
2. Continuous Variable: The variable which takes all the possible values, integer as
well as fractional, is called a continuous variable. It is capable of passing from any
given value (integer and fractional) to the next value by infinitely small
gradations.
Therefore, values in a discrete variable are obtained just by counting and the values
in a continuous variable are obtained through specified measurements.
Following are the two examples of series based on discrete and continuous variables.
23 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Seriation
Seriation is concerned with the logical listing or arrangement of the data into a
particular sequence. The data are listed orderly in different categories. The things or
attributes are measured, counted or weighted and then they are placed one after
another in the process of seriation. Thus, 'seriation' means a logical or systematic
arrangement of data with reference to time or area or condition.
"Individual Series" is one in which data are shown as they are observed. The
values of variable are small in size or limited in number. The observations are just
recorded and presented as they are collected. Generally, they are in raw form and
unorganized.
"Condition Series" is one in which data are arranged with reference to the
physical condition such as age, height and weight with respect to their occurrence
(frequency) at a given time. A frequency distribution is to be formed to reduce
the size of the data collected.
24 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
The values of variable are smaller in size as compared to the number of observations.
Because, each value is occurring in the series repeatedly. There is a definite gap
between the variables which makes the series discontinuous in values. There are no
fractional values at all. Each value takes a definite frequency against it according to
the repetitions of the same in the series.
2. Grouped Series: When the values of variable in the data are individually distinct
and integers, we can have the discrete form of frequency distribution. It is
possible only when the size of the values of the variable is smaller as compared to
the number of observations. If the values' of variable and the number of
observations are both large in size, we can have the grouping of the values for the
purpose of condensing the data.
Grouped series are classified into different classes (class-intervals) with two limits for
each class-interval i.e. lower limit and upper limit.
FREQUENCY DISTRIBUTION.
"Frequency" is a number of times each value of variable occurs in the series. It refers
to the number of repetitions of a particular value of variable. It is shown, with the
25 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
help of counting, against the particular value or class. So it is the rate of occurrence
of a particular event or thing or value.
On the basis of statistical series) we can construct the three types of frequency
distributions as under:
"Tally bars" or "Tally marks" are small vertical bars scored parallel to each other. They
are put opposite to the particular value or class to facilitate the counting of the
frequencies.
Grouped Frequency Distribution: under this, the entire range of the values of variable is
to be condensed by dividing it into a suitable groups or classes. Each class having
two limits i.e. lower limit and upper limit.
Note: The inclusive method is not in general usage. That is why the inclusive method
is converted into the exclusive method for all the purposes. There is a gap of 1
between the upper limit of the preceding class and the lower limit of the succeeding
class. This gap is adjusted by a correction factor, computed as under
𝑠𝑢𝑐𝑐𝑒𝑑𝑖𝑛𝑔 𝑙𝑜𝑤𝑒𝑟 𝑙𝑖𝑚𝑖𝑡 − 𝑝𝑟𝑒𝑐𝑒𝑑𝑖𝑛𝑔 𝑐𝑙𝑎𝑠𝑠 𝑢𝑝𝑝𝑒𝑟 𝑙𝑖𝑚𝑖𝑡
𝑐𝑜𝑟𝑟𝑒𝑐𝑡𝑖𝑜𝑛 𝑓𝑎𝑐𝑡𝑜𝑟 =
2
26 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
The magnitude of the class interval is 2, before the adjustment, but it is 3 after the
adjustment. However, the mid-value in both the cases remains the same.
Continuous Frequency Distribution: under this, the entire range of values of variable is
condensed by dividing it into suitable groups or classes. Each class having two limits
i.e. lower limit and upper limit. In this distribution, there will not be a gap of 1
between the upper limit of the preceding class and the lower limit of the succeeding
class. Because both the limits are same.
27 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Class interval: It is a size of each group of the data beginning with the lower limit and
ending with the upper limit. Thus, it is a group constituted by the two limits. E.g. of
inclusive method of class intervals are 10-19, 20-29, 30-39 and so on. For exclusive
method of class intervals are 10-20, 20-30, 30-40 and so on.
n = 1 + 3.222 log N
n = number of class intervals
N = number of observations
Accordingly, on the basis of the above Sturges Rule, we can determine the
magnitude of the class interval as under:
𝑅𝑎𝑛𝑔𝑒 𝐿−𝑆
𝑖= =
𝑁𝑜. 𝑜𝑓 𝑐𝑙𝑎𝑠𝑠𝑒𝑠 1 + 3.222 𝑙𝑜𝑔 𝑁
Where;
L = Largest Value,
S = Smallest Value.
When we apply the Sturges Formula, the number of classes are determined within
the range of 4 and 20 in general. It is a fairly reasonable range within which we get
the number of class intervals. However, the number of classes should be a whole
number (integer) preferably 5 or multiples of 5 such as 10, 15, 20 and so on. Such
integers are readily perceptible to the mind and quite convenient for numerical
computations in the analysis of data. Similarly, the magnitude of the class intervals
should also be taken preferably in 5 or least common multiples of 5 such as 0-5 or
15-25 or 20-40 and so on.
28 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Empty Class Interval: It is the class interval which is having no frequency. It means
there are no observations against this class interval.
Class Limits: They are the two ends by which the class interval is constituted. Each
class interval is having two limits - lower limit and upper limit. The lowest value is
the lower limit and the highest value the upper limit.
Class Boundaries: These are the new corrected class limits in a grouped series which
are formed by converting inclusive method into the exclusive method.
1
𝑢𝑝𝑝𝑒𝑟 𝐶𝐵 = 𝑢𝑝𝑝𝑒𝑟 𝑐𝑙𝑎𝑠𝑠 𝑙𝑖𝑚𝑖𝑡 + 𝑑
2
1
𝐿𝑜𝑤𝑒𝑟 𝐶𝐵 = 𝑙𝑜𝑤𝑒𝑟 𝑐𝑙𝑎𝑠𝑠 𝑙𝑖𝑚𝑖𝑡 − 𝑑
2
Where; d= the difference between the lower class limit of the succeeding class and
the upper class limit of the preceding class.
Magnitude and Width of the Class: The "magnitude" is the difference between the lower
limit and the upper limit of the class interval, whereas the "width" is the difference
between the lower boundary and the upper boundary of the class interval.
Example of inclusive method of class interval and its conversion into the exclusive method as
under:
29 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Example 1
84, 91, 58, 72, 44, 87, 76, 43, 83, 40, 73, 86, 77, 75, 73, 71, 54, 46, 55, 43,33, 76,
94, 65, 74, 50, 65, 80, 57, 73, 36, 33, 91, 53, 63, 59, 47, 29, 37, 11, 82, 40, 27, 84,
53, 19, 35, 72, 44, 19, 51, 67, 58, 76, 38, 16, 37, 74, 46, 50, 18, 59, 27, 92, 13, 45,
61, 86, 39, 78, 23, 12, 71, 62, 22, 41, 38, 27, 66, 51, 29, 63, 47, 39, 19, 22, 35, 39,
80, 37.
Exclusive method
Inclusive method
Example 2:
57, 59, 60, 61, 60, 63, 67, 70, 74, 80, 50, 59, 60, 63, 62, 61, 61, 65, 70, 76, 80, 89,
54, 59, 63, 60, 64, 65, 65, 86, 75, 80, 90, 92, 56, 60, 60, 64, 66, 66, 69, 68, 78, 84,
86, 96, 99, 60.
Example 3:
Group the following data in a frequency polygon by taking a class interval 5 – 10
100, 150, 149, 105, 109, 101, 160, 200, 105, 205, 210, 111, 114, 131, 121, 123,
133, 129, 149, 148, 152, 157, 159, 167, 168, 161, 171, 173, 181, 189, 182, 191,
30 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
192, 199, 197, 201, 209, 202, 207, 195, 185, 165, 155, 145, 125, 175, 115, 109,
105, 201, 176, 177, 168, 169, 159, 157, 100.
Example 1: Data: 23, 58, 62, 62, 63, 65, 67, 71, 71, 72, 80, 82, 82, 82.
143, 160, 154, 159, 172, 165, 162, 171, 146, 165, 176, 145, 182, 175, 186, 160,
158, 167, 172.
Required:
Types of graphs
Graphs are generally classified into two categories on the basis of the characteristics
of data - Graphs of Time series and Graphs of Frequency Distribution.
Graphs of Time Series (Historical)
Time series or historical series stand for the numerical record of the changes in a
variable during a given period of time. Time units are placed on the OX-axis and the
values of variable are measured on the OY-axis. There are three types of time series
graphs as under;
One- Variable Graph.
Two - Variable Graph.
31 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Example: the following is the production of rice (in '000' tonnes) in Mbale in
different years.
Two- variable graph: in this graph, there are factors on the Y-axis and the time is
measured on X – axis.
Example: the following data relates to foreign trade of Uganda during the seven years.
32 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Three Variable Graph: In three variable graph, there are three factors on the Y-axis
and the time is measured on X-axis. Following is the data relating to Income and
Expenditure of the Sports Club villa for five years.
33 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Histogram
Frequency Polygon
Frequency Curve
Ogive Curves
The height of rectangle represents the frequency of the respective class interval. The
height should be measured on Y-axis from the point of the ‘upper-class limit’ of the
34 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
concerned class interval. The area of all the rectangles joined together represents the
total frequencies.
When the class intervals are not equal, the density of the frequency has to be
calculated. Frequency density refers to the concentration of frequency in a unit of
value of the total size of the class interval. In unequal class intervals, the frequencies
of each class are adjusted to the width of the interval with the lowest size as follows:
Examples
58 × 5
= = 14
10
35 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Example 2
The following data relates to marks obtained by 2nd year BECON students in
statistics for economics exam.
12, 30, 29, 22, 35, 39, 40, 42, 40, 16, 30, 18, 12, 5, 15, 3, 4, 9, 45, 50, 0, 5, 6, 8, 15,
25, 22, 3, 4, 9, 12, 5, 9, 3, 4, 30, 32, 29, 1, 8
Required;
Solution:
36 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Example
Weekly wages (in shs) 20-25 25-30 30-35 35-40 40-45 45-50
No. of workers 6 18 25 14 10 8
37 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Example
From the following information, determine the mode graphically and verify the
results.
Weekly wages (shs) 10-15 15-20 20-25 25-30 30-40 40-60 60-80
No. of workers 7 19 27 15 12 12 8
Solution:
Mode, Z
𝑑1
𝑚𝑜𝑑𝑒, 𝑧 = 𝑙𝑚 + ×𝑐
𝑑1 + 𝑑2
38 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
27 − 19
20 + × 5 = 22
(27 − 19) + (27 − 15)
Construct a Histogram from the following and then locate the mode
Steps:
The area of the frequency polygon is equal to the area of the histogram, as the area
left outside is geometrically equal to the area included in it.
Profits per shop (in 0-50 50-100 100-150 150-200 200-250 250-300
39 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
‘000’ shs)
No. of shops 12 18 27 20 17 6
Required;
Draw a histogram and show the frequency polygon for the following data:
8 × 20
=4
20
40 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Frequency curve: This can be drawn on the same lines as a frequency polygon but the
mid-points at the height of class interval rectangles will be joined smoothly and by
rounding off the top through the trend principle. It means that the difference
between frequency polygon and frequency curve is of joining the points.
Ogive curves: These curves refer to a continuous form of the cumulative frequency
curves-less than cumulative frequency curve and more than cumulative frequency
curve. This method of drawing the curves is the best among other types as it serves
41 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
many purposes. Ogive curves are based on the cumulative frequencies. That is why
they are also called "Cumulative Frequency Curves".
Types of Ogives
Less than Ogive: It consists in plotting the 'less than' frequencies against the upper
limit of the class interval or boundaries. The points so obtained are joined by a
smoothed curve. It is an increasing curve sloping upward from left to right of the
graph and it is in the shape of an elongated-(S).
More than Ogive: It consists in plotting the 'more than' frequencies against the lower
limit of the class interval or boundaries. The points so obtained are joined by a
smoothed curve. It is a decreasing curve sloping downward from left to right of the
graph and it is in the shape of an elongated upside down - (s).
Example 1; draw an ogive and from it read the median and quartiles
To find the partitioned values we have to convert the data into a less than frequency
distribution
Marks 20 30 40 50 60 70 80
No. of students 21 40 100 142 166 184 201
42 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Example 2
Draw two ogives from the following data and locate the median value.
From the ogive curves we have to convert the data into two cumulative frequencies.
43 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Example 3
Draw a ‘Less Than Ogive’ from the following and locate the median
Draw two ogives from the following data and locate the median
Draw a less than ogive and find the value of quartiles for the following;
44 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
It is very difficult to understand mass of data. In order to get a concise and complete
picture of large data, it is essential to obtain a figure, which should represent the
whole data. By the help of such a figure, the data are compared and understood
easily. A figure which represents the whole data is known as an average or a measure
of central tendency. An average removes all the unnecessary details of the data and
gives a concise picture of the huge data under investigation.
45 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Although an average is useful in studying the complex data and is very widely used
in almost all the spheres of human activity, it is not without limitations that restrict
its scope and applicability.
The extreme values, if any, will affect the averageable figure disproportionately.
The composition of the data cannot be viewed with the help of the average.
The average does not represent always the characteristics of individual items.
The average gives only a representative figure of the mass but fails to depict the
entire picture of the data.
In spite of the limitations, the statistical averages still are useful measures which play
an important role in analyzing the mass data as there is no alternative left for
statistical analysis.
Broadly speaking, there are five types of statistical averages which are commonly
used in practice. They are;
Mathematical averages:
Positional averages:
Median
Mode
This is the quotient of the sum of the values divided by their number. It’s obtained
by dividing the sum of the values of the items in a variable by their number. It is
expressed as follows:
46 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
∑𝑥
Individual observations/series: 𝑚𝑒𝑎𝑛, 𝑋̅ =
𝑛
∑ 𝑓𝑥
Discrete/continuous series: 𝑚𝑒𝑎𝑛, 𝑋̅ = ∑𝑓
N is No. of observations.
The mean can be calculated by the help of two methods, i.e. Direct method &
Short-cut method but much emphasis in this handout is placed on direct method.
49, 62, 11, 74, 55, 80, 56, 64, 34, 42, 56, 67, 10, 68, & 46
Solution:
∑𝑥
𝑚𝑒𝑎𝑛, 𝑋̅ =
𝑛
46 + 62 + 11 + 74 + 55 + 80 + 56 + 64 + 34 + 42 + 56 + 67 + 10 + 68 + 46
𝑋̅ =
15
781
= = 52.1 𝑚𝑎𝑟𝑘𝑠
15
Discrete series:
In these series, the value of each of the item is multiplied by the corresponding
frequency and the total of products is divided by the total number of frequencies.
47 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Worked Examples:
Marks 15 25 35 45 55 65 75 85 95
No. of students 1 2 3 4 5 7 12 16 10
Solution:
15 1 15
25 2 50
35 3 105
45 4 180
55 5 275
65 7 455
75 12 900
85 16 1,360
95 10 950
∑𝑓 = 60 ∑𝑓𝑥 = 4,290
∑ 𝑓𝑥
∴ 𝑋̅ =
∑𝑓
4,290
=
60
𝑋̅ = 72 𝑚𝑎𝑟𝑘𝑠.
Example II
In the city, 30 members were surveyed as to how many domestic appliances they
have purchased, the replies were as follows:
1, 2, 5, 1, 5, 2, 1, 4, 2, 3, 4, 2, 4, 3, 2, 6, 3, 2, 4, 3, 6, 2, 2, 3, 3, 7, 2, 3, 0, 2
0 / 1 0
48 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
1 /// 3 3
2 //// //// 10 20
3 //// // 7 21
4 //// 4 16
5 // 2 10
6 // 2 12
7 / 1 7
∑𝑓 = 30 ∑𝑓𝑥 = 89
∑ 𝑓𝑥
∴ 𝑋̅ =
∑𝑓
89
=
30
𝑋̅ = 2.9667 𝑚𝑒𝑚𝑏𝑒𝑟𝑠.
Continuous series:
The method of calculating the mean from continuous series is exactly the same as
that of discrete series with the exception that in continuous series, mid-points of
various class intervals are determined against the class intervals. These mid-point
values are then multiplied by the corresponding frequencies.
Example:
4,890
=
250
𝑋̅ = 19.56 ≈ 20
GEOMETRIC MEAN
This is the ‘n’th root of the product of the ‘n’ quantities of a series. In the field of
business & management, various problems often arise relating to “Average
percentage” rate of change over a period of time. In such cases, geometric mean is
suitably applied. It involves obtaining log values of the items and the average of these
values determined. It is obtained as follows:
Individual series:
∑𝑙𝑜𝑔𝑥
𝐺. 𝑀 = 𝐴𝑛𝑡𝑖 − log
𝑛
Discrete/continuous series:
∑𝑓𝑙𝑜𝑔𝑥
𝐺. 𝑀 = 𝐴𝑛𝑡𝑖 − log
∑𝑓
f is frequencies
Individual series:
50 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Solution:
𝒙 𝒍𝒐𝒈𝒙
75 1.8751
125 2.0969
350 2.5441
790 2.8976
1250 3.0969
1740 3.2405
2000 3.3010
∑𝑙𝑜𝑔𝑥 = 19.0521
∑𝑙𝑜𝑔𝑥
𝐺. 𝑀 = 𝐴𝑛𝑡𝑖 − log
𝑛
19.0521
𝐺. 𝑀 = 𝐴𝑛𝑡𝑖 − log
7
𝐴𝑛𝑡𝑖 − log 𝑜𝑓 2.7217
∴ 𝑮. 𝒎. = 𝟓𝟐𝟕
Discrete Series:
Yield in maize mill (in tonnes) of 6 farms in Soroti District is given below:
Solution:
51 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
∑𝑓𝑙𝑜𝑔𝑥
𝐺. 𝑀 = 𝐴𝑛𝑡𝑖 − log
∑𝑓
166.498
= 𝐴𝑛𝑡𝑖 − log
81
= 𝐴𝑛𝑡𝑖 − log 𝑜𝑓 2.0555
𝐺. 𝑀 = 114 𝑇𝑜𝑛𝑛𝑒𝑠
Continuous series:
89.8937
= 𝐴𝑛𝑡𝑖 − log
65
= 𝐴𝑛𝑡𝑖 − log 𝑜𝑓 1.38298
𝐺. 𝑀 = 24.2
52 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
HARMONIC Mean
This is the total number of items of a variable divided by the sum of the reciprocals
of the values of the variable. It’s a specialized average which solves problems
involving variables expressed in ‘time rates’ that vary according to time-km per hour,
hour per km, units per day, price per unit & so forth.
Note: harmonic mean, is suitable only when time factor is variable and the act being
performed remains constant. Therefore, harmonic mean provides the same results as
that of the weighted arithmetic mean when the act being performed remains the
same otherwise not. It is denoted as;
Individual series:
𝑛
𝐻. 𝑚 =
∑𝑅𝑥
𝑓= frequencies
Worked examples:
Solution:
𝒙 𝟏
𝑹𝒙 =
𝒙
40 0.025
80 0.0125
53 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
90 0.0111
120 0.0080
150 0.0070
180 0.0060
200 0.0050
∑𝑅𝑥 = 0.0746
𝑛
𝐻. 𝑚 =
∑𝑅𝑥
7
= = 94
0.0746
Discrete series:
𝒙 20 25 30 35 40 45 50
𝒇 3 8 12 15 10 7 5
Solution:
𝒙 𝟏 𝒇 𝟏
𝑹𝒙 = 𝒇
𝒙 𝒙
20 0.050 3 0.150
25 0.040 8 0.320
30 0.030 12 0.360
35 0.029 15 0.435
40 0.025 10 0.250
45 0.022 7 0.154
50 0.020 5 0.100
∑𝑓 = 60 1
∑𝒇 = 𝟏. 𝟕𝟔𝟗
𝑥
∑𝑓
𝐻. 𝑚 =
∑𝑓𝑅𝑥
60
𝐻. 𝑚 = = 34
1.769
Continuous series:
54 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Size Mid-points 𝟏 𝒇 𝟏
𝑹𝒙 = 𝒇
𝒙 𝒙 𝒙
0-4 2 0.50 6 3.00
4-8 6 0.16 15 2.40
8-12 10 0.10 22 2.20
12-16 14 0.07 28 1.96
16-20 18 0.06 45 2.70
20-24 22 0.05 25 1.25
24-28 26 0.04 13 0.52
28-32 30 0.03 6 0.18
∑𝑓 = 160 1
∑𝒇 = 𝟏𝟒. 𝟐𝟏
𝑥
∑𝑓
𝐻. 𝑚 =
∑𝑓𝑅𝑥
160
= = 11.3
14.21
POSITIONAL AVERAGES
MEDIAN:
This is the value of that item in a series which divides the array into two equal parts,
one consisting of all the values less than it and the other consisting of all the values
more than it. This means when values of a variable are arranged in an array form in
ascending or descending order of magnitude, the value of the middle item of the
array is the median. It is expressed as follows:
Individual series:
𝑛+1
𝑀𝑒 = 𝑆𝑖𝑧𝑒 𝑜𝑓 ( ) 𝑡ℎ 𝑖𝑡𝑒𝑚
2
55 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Note: when computing the median under individual series, first arrange the data in
ascending order.
Example:
In a company, there are 7 workers whose ages are 20, 25, 29, 19, 35, 40, 55 years.
Find the median age.
Solution:
∴ 𝑚𝑒𝑑𝑖𝑎𝑛 = 19 𝑦𝑒𝑎𝑟𝑠
Example:
56 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Solution:
S/No. Values
1 51
2 65
3 40
4 44
5 46
6 55
7 48
8 62
1 𝑛 𝑛+2
𝑀𝑒 = (𝑆𝑖𝑧𝑒 𝑜𝑓 𝑡ℎ 𝑖𝑡𝑒𝑚 + 𝑡ℎ 𝑖𝑡𝑒𝑚)
2 2 2
1 8 8+2
𝑀𝑒 = (𝑆𝑖𝑧𝑒 𝑜𝑓 𝑡ℎ 𝑖𝑡𝑒𝑚 + 𝑡ℎ 𝑖𝑡𝑒𝑚)
2 2 2
1
𝑀𝑒 = (𝑆𝑖𝑧𝑒 𝑜𝑓 4𝑡ℎ 𝑖𝑡𝑒𝑚 + 5 𝑡ℎ 𝑖𝑡𝑒𝑚)
2
1
𝑀𝑒 = (44 + 46)
2
90
𝑀𝑒 = = 45
2
Discrete series:
In discrete series, the items are first arranged in ascending or descending order of
magnitude and their corresponding frequencies written against them. The
frequencies are cumulated and then the value of the middle item located with
reference to the cumulative frequency which covers the size first. The formula
∑𝑓+1
however, becomes; 𝑆𝑖𝑧𝑒 𝑜𝑓 ( 2
) 𝑡ℎ 𝑖𝑡𝑒𝑚
Example:
x 31 32 33 34 35 36 37 38 39 40
Freq. 5 18 21 26 29 22 15 12 10 4
Solution:
x f Cumulative freq.
31 5 5
32 18 23
33 21 44
34 26 70
35 29 99
36 22 121
37 15 136
38 12 148
39 10 158
40 4 162
∑𝑓 = 𝟏𝟔𝟐
∑𝑓 + 1
𝑀𝑒 = 𝑆𝑖𝑧𝑒 𝑜𝑓 ( ) 𝑡ℎ 𝑖𝑡𝑒𝑚
2
162 + 1
= 𝑆𝑖𝑧𝑒 𝑜𝑓 ( ) 𝑡ℎ 𝑖𝑡𝑒𝑚
2
163
=( ) 𝑡ℎ 𝑖𝑡𝑒𝑚 = 81.5𝑡ℎ 𝑖𝑡𝑒𝑚
2
Thus, 81.5th item lies in 99th cumulative frequency and so, the median = 35
Continuous series:
Under this series, since class intervals are already in the form of an array, size of the
𝑛
median class is determined using; (2 ) 𝑡ℎ 𝑖𝑡𝑒𝑚 with reference to the cumulative
frequency which covers the size first. After locating the median class, the median
value is then determined using interpolation formula below:
𝑛
(2 ) 𝑡ℎ 𝑖𝑡𝑒𝑚 − 𝐶𝑓𝑏
𝑀𝑒 = 𝐿𝑚 + ( )×𝐶
𝑓𝑚
58 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Note: While calculating the median, always make sure that the data under
observation is exclusive data. If it’s inclusive data type, convert it into exclusive first
to obtain class boundaries since class limits are obtained from class boundaries.
Worked examples:
Solution:
17.5 − 13
𝑀𝑒 = 40 + ( )×5
10
59 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
14.5
= 40 + ( )×5
10
14.5 × 5
= 40 + ( )
10
Example II
The data is inclusive, so there is need to convert into exclusive class interval.
80 − 75
𝑀𝑒 = 15.5 + ( )×4
65
5
= 15.5 + ( ) × 4
65
60 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
5×4
= 15.5 + ( )
65
QUARTILES
This divide the entire distribution into four equal parts i.e. 1st Quartile, 2nd Quartile
& 3rd Quartile.
1st Quartile (Lower quartile): this is a value which there are one-fourth of the
items and above which there are three-fourth of the items.
2nd Quartile (Median): this divides the distribution into two halves
3rd Quartile (upper quartile): is a value below which there are three-fourth of the
items and above which there are one-fourth of the items.
Determination of quartiles
Measure SERIES
Individual &discrete Continuous
𝑸𝟏 𝑛+1 𝑛
𝑡ℎ 𝑖𝑡𝑒𝑚 𝑡ℎ 𝑖𝑡𝑒𝑚
4 4
2(𝑛 + 1) 𝑛+1 2𝑛 𝑛
𝑸𝟐 𝑡ℎ 𝑜𝑟 𝑡ℎ 𝑖𝑡𝑒𝑚 𝑡ℎ 𝑜𝑟 𝑡ℎ 𝑖𝑡𝑒𝑚
4 2 4 2
3(𝑛 + 1) 3𝑛
𝑸𝟑 𝑡ℎ 𝑖𝑡𝑒𝑚 𝑡ℎ 𝑖𝑡𝑒𝑚
4 4
Note: always arrange data either in ascending or descending order while calculating
quartiles under individual series.
Example:
23, 48, 34, 68, 15, 36, 24, 54, 65, 75, 92, 10, 70, 61, 20, 47, 83, 19, 77
61 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Solution:
Example:
Shoe sizes 4.0 4.5 5.0 5.5 6.0 6.5 7.0 7.5 8.0
Frequencies 20 36 44 50 80 30 30 16 14
Solution:
62 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
4.0 20 20 𝑛+1
𝑄1 = 𝑡ℎ 𝑖𝑡𝑒𝑚
4
4.5 36 56
5.0 𝑸𝟏 44 100 320 + 1
= 𝑡ℎ 𝑖𝑡𝑒𝑚 = 80.25𝑡ℎ 𝑖𝑡𝑒𝑚
5.5 50 150 4
80.25𝑡ℎ 𝑖𝑡𝑒𝑚 𝑙𝑖𝑒𝑠 𝑖𝑛 100 𝑐. 𝑓; 𝑄1 = 5.0
6.0 𝑸𝟐 80 230
6.5 𝑸𝟑 30 260 𝑄2 =
𝑛+1
𝑡ℎ 𝑖𝑡𝑒𝑚
2
7.0 30 290 320 + 1
= 𝑡ℎ 𝑖𝑡𝑒𝑚 = 160.50𝑡ℎ 𝑖𝑡𝑒𝑚
7.5 16 306 2
14 320 160.50𝑡ℎ 𝑖𝑡𝑒𝑚 𝑙𝑖𝑒𝑠 𝑖𝑛 230 𝑐. 𝑓; 𝑄2 = 6.0
8.0
3(𝑛 + 1)
𝑄3 = 𝑡ℎ 𝑖𝑡𝑒𝑚
4
3(320 + 1)
4
240.75𝑡ℎ 𝑖𝑡𝑒𝑚 𝑙𝑖𝑒𝑠 𝑖𝑛 260 𝑐. 𝑓; ∴ 𝑄3 = 6.5
Computation of Quartiles from Continuous series:
Example:
𝑪𝒍𝒂𝒔𝒔 𝒊𝒏𝒕𝒆𝒓𝒗𝒂𝒍𝒔 𝒇 𝑪. 𝒇
0-10 5 5
10-20 8 13
20-30 7 20
30-40 12 32
40-50 28 60
50-60 20 80
60-70 10 90
70-80 10 100
∑𝑓
= 100
𝑛
𝑄1 = 𝑡ℎ 𝑖𝑡𝑒𝑚
4
63 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
100
= 𝑡ℎ 𝑖𝑡𝑒𝑚 = 25𝑡ℎ 𝑖𝑡𝑒𝑚 = 32 𝑐. 𝑓
4
𝑛
( )𝑡ℎ 𝑖𝑡𝑒𝑚−𝐶𝑓𝑏
4
∴ 𝑄1 = 𝐿𝑚 + ( 𝑓𝑚
)×𝐶
25 − 20
= 30 + ( ) × 10
12
5 × 10
= 30 + ( ) = 34.2
12
𝑛 100
𝑄2 = 𝑠𝑖𝑧𝑒 𝑜𝑓 𝑡ℎ 𝑖𝑡𝑒𝑚 = = 50𝑡ℎ 𝑖𝑡𝑒𝑚 = 60 𝑐. 𝑓
2 2
𝑛
(2 ) 𝑡ℎ 𝑖𝑡𝑒𝑚 − 𝐶𝑓𝑏
= 𝐿𝑚 + ( )×𝐶
𝑓𝑚
50 − 32
∴ 𝑄2 = 40 + ( ) × 10
28
18 × 10
= 40 + ( )
28
= 40 + 6.43 = 46.43
3𝑛 3 × 100 300
𝑄3 = 𝑡ℎ 𝑖𝑡𝑒𝑚 = = = 75𝑡ℎ 𝑖𝑡𝑒𝑚 = 80 𝑐. 𝑓
4 4 4
3𝑛
( 4 ) 𝑡ℎ 𝑖𝑡𝑒𝑚 − 𝐶𝑓𝑏
= 𝐿𝑚 + ( )×𝐶
𝑓𝑚
75 − 60
= 50 + ( ) × 10
20
15 × 10
= 50 + ( )
20
= 50 + 7.5 = 57.5
THE MODE
This is the value which occurs with the maximum frequency. It is the most common
value which receives the highest frequency. Thus, it corresponds to the value of the
variable which occurs most frequently. E.g. manufacturers produce in greater
64 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
quantity those items which are always purchased frequently by customers; this shows
the application of a mode in business. A mode however, is denoted by a symbol ‘Z’.
Individual series:
54, 66, 42, 64, 44, 86, 104, 94, 100, 80, 72, 64, 64, 44, 64, 72, 54, 48, 52, 50. The
mode is 64 as it occurs most frequently.
Continuous series:
Under this, the modal class is located with the help of highest frequency. After
locating the modal class, the following interpolation formula is used to determine
the modal value.
𝑑1
𝑍 = 𝑙𝑚 + ( )×𝑐
𝑑1 + 𝑑2
Example I:
Value 1 2 3 4 5 6 7
Freq. 7 11 31 17 16 5 2
Solution:
Example II
65 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Solution:
2,700
= 25 + ( )×5
2,700 + 2,000
2,700 × 5
25 + ( ) = 25 + 2.9 = 27.9𝑚𝑎𝑟𝑘𝑠
2,700 + 2,000
Exercises:
1. From the following data, estimate the mode and quartiles (1,2 & 3).
66 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
RANGE
This is the difference between the smallest and highest value of the series. It gives an
extremely simple indicator of the variability of a set of observations. Note that,
where data are grouped into a frequency distribution, the range is equal to the
difference between the upper boundary of the highest class and lower boundary of
the lowest class. Thus, range is given by;
𝑅𝑎𝑛𝑔𝑒 = 𝑙𝑎𝑟𝑔𝑒𝑠𝑡 𝑣𝑎𝑙𝑢𝑒 − 𝑠𝑚𝑎𝑙𝑙𝑒𝑠𝑡 𝑣𝑎𝑙𝑢𝑒
Therefore, a distribution with the smallest range is less dispersed and vice-versa.
Examples:
Individual series:
67 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Solution:
A B C
= 14 − 10 = 44 − 40 = 104 − 100
=4 =4 =4
Thus, Series C, is less dispersed & more uniform while; Series (A), s less uniform &
more dispersed.
Discrete series:
Example: from the following distribution, find the range and coefficient of range
X 6 12 18 24 30 36 42
Freq. 20 130 80 60 210 1500 600
Solution:
𝑅𝑎𝑛𝑔𝑒 = 𝐿 − 𝑆 = 42 − 6 = 36
𝐿−𝑆 42 − 6 36
𝐶𝑜𝑅 = == == = 0.8
𝐿+𝑆 42 + 6 48
Continuous series:
Example;
68 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
This measures the difference between the values of quartile one and quartile three.
Q.D is a measure of partition rather than a measure of dispersion. Thus, the smaller
the value of Q.D., the minimum is the dispersion of middle half of the distribution
around the median & vice versa. It is symbolically denoted as:
𝑄3 − 𝑄1
𝑄. 𝐷 =
2
𝑄3 − 𝑄1
𝑐𝑜𝑒𝑓𝑓𝑖𝑐𝑖𝑒𝑛𝑡 𝑜𝑓 𝑄. 𝐷 =
𝑄3 + 𝑄1
Individual series:
From the following marks of 12 students, compute the quartile deviation and its
coefficient.
Solution:
S/No. 1 2 3 4 5 6 7
Marks 12 15 20 28 30 40 50
𝑛+1
𝑄1 = 𝑡ℎ 𝑖𝑡𝑒𝑚
4
7+1
𝑄1 = 𝑡ℎ 𝑖𝑡𝑒𝑚 = 2𝑛𝑑 𝑖𝑡𝑒𝑚 = 15
4
3(𝑛 + 1)
𝑄3 = 𝑡ℎ 𝑖𝑡𝑒𝑚
4
3(7 + 1)
𝑄3 = 𝑡ℎ 𝑖𝑡𝑒𝑚 = 6𝑡ℎ 𝑖𝑡𝑒𝑚 = 40
4
𝑄3 − 𝑄1 40 − 15
∴ 𝑄. 𝐷 = = = 12.5
2 2
𝑄3 − 𝑄1 40 − 15 25
𝑐𝑜𝑒𝑓𝑓𝑖𝑐𝑖𝑒𝑛𝑡 𝑜𝑓 𝑄. 𝐷 = = = = 0.45
𝑄3 + 𝑄1 40 + 15 55
69 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Discrete series:
Compute quartile deviation and its coefficient from the following data:
x 58 59 60 61 62 63 64 65 66
Freq. 15 20 32 35 33 22 20 10 8
Solution:
x Freq. C.f
58 15 15
59 20 35
60 32 67
61 35 102
62 33 135
63 22 157
64 20 177
65 10 187
66 8 195
∑𝑓 = 195
𝑛+1
𝑄1 = 𝑡ℎ 𝑖𝑡𝑒𝑚
4
195 + 1
𝑄1 = 𝑡ℎ 𝑖𝑡𝑒𝑚 = 49𝑡ℎ 𝑖𝑡𝑒𝑚 = 67 𝑐. 𝑓 = 60
4
3(𝑛 + 1)
𝑄3 = 𝑡ℎ 𝑖𝑡𝑒𝑚
4
3(195 + 1)
𝑄3 = 𝑡ℎ 𝑖𝑡𝑒𝑚 = 147𝑡ℎ 𝑖𝑡𝑒𝑚 = 157 𝑐. 𝑓 = 63
4
𝑄3 − 𝑄1 63 − 60
∴ 𝑄. 𝐷 = = = 1.5
2 2
𝑄3 − 𝑄1 63 − 60 3
𝑐𝑜𝑒𝑓𝑓𝑖𝑐𝑖𝑒𝑛𝑡 𝑜𝑓 𝑄. 𝐷 = = = = 0.02
𝑄3 + 𝑄1 63 + 60 123
Continuous series:
From the following data, compute quartile deviation and its coefficient
70 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Wages 4-8 8-12 12-16 16-20 20-24 24-28 28-32 32-36 36-40
Workers 6 10 18 30 15 12 10 6 2
Solution:
27.25 − 16
= 12 + ×4
18
45
= 12 + = 12 + 2.5 = 15
18
3𝑛 3(109)
𝑄3 = 𝑡ℎ 𝑖𝑡𝑒𝑚 = = 81.8𝑡ℎ 𝑖𝑡𝑒𝑚 = 91 𝑐. 𝑓 = 𝑐𝑙𝑎𝑠𝑠 (24 − 28)
4 4
3𝑛
− 𝐶𝑓𝑏
𝑄1 = 𝐿𝑚 + ( 4 )×𝑐
𝑓𝑚
81.8 − 79
= 12 + ×4
12
11.2
= 24 + = 24 + 0.93 = 25
12
𝑄3 − 𝑄1 25 − 15
∴ 𝑄. 𝐷 = = =5
2 2
71 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
𝑄3 − 𝑄1 25 − 15 10
𝑐𝑜𝑒𝑓𝑓𝑖𝑐𝑖𝑒𝑛𝑡 𝑜𝑓 𝑄. 𝐷 = = = = 0.25
𝑄3 + 𝑄1 25 + 15 40
Exercise:
Mean deviation is the average difference among the items in a series from the mean
itself or median or mode of that series. It is concerned with the extent to which the
values are dispersed about the mean or the median or the mode. It is found by the
averaging all the deviations from the central tendency. These deviations are taken
into computations with regard to negative sign (i.e., all the deviations assumed as
positive).
Theoretically, the deviations of items are taken preferably from median instead than
from the mean or the mode. Median is supposed to be the suitable central tendency
for calculating deviations because the sum of the deviations from the median is less
than the sum of the deviations from the mean. It is not a common practice to
calculate the deviation from the mode as its value is sometimes not clearly defined.
In aggregating the deviations, the algebraic negative signs are not taken into account.
It means all the deviations are treated as positive ignoring the negative signs. It’s
then denoted by the following formulae:
Individual series:
̅)
∑(𝒙 − 𝒙
𝜹=
𝒏
72 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
𝜹 𝜹 𝜹
𝒄𝒐𝒆𝒇𝒇𝒊𝒄𝒊𝒆𝒏𝒕 𝒐𝒇 𝜹 = 𝒐𝒓 𝒐𝒓
𝑴𝒆 ̅
𝒙 𝒁
Individual series:
Example:
Required: find the mean deviation and coefficient of mean deviation from the mean
& median.
Solution:
S/No. 𝑥 |𝑥 − 𝑥̅ | |𝒙 − 𝑴𝒆|
1 32 8 7
2 36 4 3
3 36 4 3
4 37 3 2
5 39 1 0
6 41 1 2
7 45 5 6
8 46 6 7
9 48 8 9
∑𝒏 = 𝟗 ̅) = 𝟒𝟎
∑(𝒙 − 𝒙 ∑(𝒙 − 𝑴𝒆) = 𝟑𝟗
∑𝒙 𝟑𝟔𝟎
From the mean ̅=
𝒙 = = 𝟒𝟎
𝒏 𝟗
̅) 𝟒𝟎
∑(𝒙 − 𝒙
∴𝜹= = = 𝟒. 𝟒
𝒏 𝟗
𝒏+𝟏 𝟗+𝟏
From the median 𝑴𝒆 = ( 𝟐
) 𝒕𝒉 𝒊𝒕𝒆𝒎 = 𝟐
= 𝟓𝒕𝒉 𝒊𝒕𝒆𝒎 = 𝟑𝟗
∑(𝒙 − 𝑴𝒆) 𝟑𝟗
∴𝜹= = = 𝟒. 𝟑
𝒏 𝟗
𝜹 𝟒. 𝟑 𝜹 𝟒. 𝟒
𝑻𝒉𝒖𝒔, 𝒄𝒐𝒆𝒇𝒇𝒊𝒄𝒊𝒆𝒏𝒕 𝒐𝒇 𝜹 = = = 𝟎. 𝟏𝟏 𝒐𝒓 = = 𝟎. 𝟏𝟏
𝑴𝒆 𝟑𝟗 ̅ 𝟒𝟎
𝒙
73 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Discrete series:
Example:
Determine mean deviation from the median and mean from the following data:
Wages 20 18 16 14 12 10 6 4
Freq. 2 4 9 18 27 25 14 1
Solution:
𝑥
20 2 2 40 8 8 16 16
18 4 6 72 6 6 24 24
16 9 15 144 4 4 36 36
14 18 33 252 2 2 36 36
12 27 60 324 0 0 0 0
10 25 85 250 2 2 50 50
6 14 99 84 6 6 84 84
4 1 100 4 8 8 8 8
∑𝒇 = 𝟏𝟎𝟎 ∑𝒇𝒙 = 𝟏, 𝟏𝟕𝟎 ̅) = 𝟐𝟓𝟎
∑𝒇(𝒙 − 𝒙 ∑𝒇(𝒙 − 𝑴𝒆) = 𝟐𝟓𝟎
∑𝒇𝒙 𝟏𝟏𝟕𝟎
From the mean ̅
𝒙= = = 𝟏𝟐
∑𝒇 𝟏𝟎𝟎
̅) 𝟐𝟓𝟎
∑𝒇(𝒙 − 𝒙
∴𝜹= = = 𝟐. 𝟓
∑𝒇 𝟏𝟎𝟎
𝜹 𝟐. 𝟓 𝜹 𝟐. 𝟓
𝑻𝒉𝒖𝒔, 𝒄𝒐𝒆𝒇𝒇𝒊𝒄𝒊𝒆𝒏𝒕 𝒐𝒇 𝜹 = = = 𝟎. 𝟐𝟏 𝒐𝒓 = = 𝟎. 𝟐𝟏
𝑴𝒆 𝟏𝟐 ̅ 𝟏𝟐
𝒙
Continuous Series:
Example
74 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
From the following data, calculate mean deviation from the median and the mean
Class 1-5 6-10 11-15 16-20 21-25 26-30 31-35 36-40 41-45
Freq. 7 10 16 32 24 18 10 5 1
Solution:
∑𝒇 ∑𝒇𝒙 ̅)
∑𝒇(𝒙 − 𝒙 ∑𝒇(𝒙 − 𝑴𝒆)
= 𝟏𝟐𝟑 = 𝟐, 𝟓𝟎𝟒 = 𝟖𝟕𝟔. 𝟖 = 𝟖𝟔𝟎
∑𝒇𝒙 𝟐𝟓𝟎𝟒
From the mean ̅=
𝒙 = = 𝟐𝟎. 𝟒
∑𝒇 𝟏𝟐𝟑
̅) 𝟖𝟕𝟔. 𝟖
∑𝒇(𝒙 − 𝒙
∴𝜹= = = 𝟕. 𝟏𝟑
∑𝒇 𝟏𝟐𝟑
𝜹 𝟔. 𝟗𝟗 𝜹 𝟕. 𝟏𝟑
𝑻𝒉𝒖𝒔, 𝒄𝒐𝒆𝒇𝒇𝒊𝒄𝒊𝒆𝒏𝒕 𝒐𝒇 𝜹 = = = 𝟎. 𝟒 𝒐𝒓 = = 𝟎. 𝟑𝟓
𝑴𝒆 𝟏𝟖 ̅ 𝟐𝟎. 𝟒
𝒙
STANDARD DEVIATION
'Standard deviation' is the root of the sum of the squares of the deviations divided by
their number. It is also called "Mean Error Deviation". 'Mean Square Error Deviation' or
"Root Mean Square Deviation". It is a second moment of a dispersion. Since the sum of
the squares of the deviations from the mean is minimum, the deviations are taken only
from mean (but not from median or mode).
75 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Standard Deviation is the root-mean-square average of all the deviations from the
mean. It was proposed by Prof. Karl Pearson in 1893, and it is denoted by 𝜎 (sigma)
1. Individual series:
∑(𝑥 − 𝑥̅ )2
𝜎=√ … 𝐴𝑏𝑠𝑜𝑙𝑢𝑡𝑒 𝑚𝑒𝑎𝑠𝑢𝑟𝑒
𝑛
∑𝑓(𝑥 − 𝑥̅ )2
𝜎=√ … 𝐴𝑏𝑠𝑜𝑙𝑢𝑡𝑒 𝑚𝑒𝑎𝑠𝑢𝑟𝑒
∑𝑓
Individual series:
Ten students of B.A. class have obtained the following marks in Economics major
out of 100. Calculate standard deviation of marks obtained.
Solution:
𝑥 (𝑥 − 𝑥̅ ) (𝑥 − 𝑥̅ )2
7 -10 100
76 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
8 -9 81
10 -7 49
13 -4 16
14 -3 9
19 -2 4
20 3 9
25 8 64
26 9 81
28 11 121
∑𝑥 =170 ∑(𝑥 − 𝑥̅ )2 = 534
∑𝑥 170
𝑥̅ = = = 17 𝑚𝑎𝑟𝑘𝑠
𝑛 10
∑(𝑥 − 𝑥̅ )2 534
∴𝜎=√ = = √53.4 = 7.32
𝑛 10
𝜎 7.32
𝐶𝑉 = × 100 = = 0.43 × 100 = 43%
𝑥̅ 17
The following are marks obtained by two students ‘Miracle’ & ‘Fatumah’ in 10 tests
of 100 marks each.
Tests 1 2 3 4 5 6 7 8 9 10
Miracle 44 80 76 48 52 72 72 51 60 54
Fatumah 48 75 54 60 63 69 72 51 57 66
Required:
Find who is better in studies and if consistency is the criterion for awarding a prize,
who should get the prize?
Solution:
Miracle Fatumah
𝑥 (𝑥 − 𝑥̅ ) (𝑥 − 𝑥̅ )2 𝑥 (𝑥 − 𝑥̅ ) (𝑥 − 𝑥̅ )2
44 -16-9 285.61 48 -13.5 182.25
80 -12.9 166.41 75 -10.5 110.25
76 -9.9 98.01 54 -7.5 56.25
77 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
∑(𝑥 − 𝑥̅ )2 1516.90
∴𝜎=√ = = √151.69 = 12.32
𝑛 10
𝜎 12.32
𝐶𝑉 = × 100 = = 0.202 × 100 = 20.2%
𝑥̅ 60.9
Fatumah:
∑𝑥 615
𝑥̅ = = = 61.5 𝑚𝑎𝑟𝑘𝑠
𝑛 10
∑(𝑥 − 𝑥̅ )2 742.50
∴𝜎=√ = = √74.25 = 8.62
𝑛 10
𝜎 8.62
𝐶𝑉 = × 100 = = 0.140 × 100 = 14%
𝑥̅ 61.5
Therefore, Fatumah is better in studies and she should get the prize since her
average is high and the variance is less.
Exercise:
1. Prices of a particular commodity in five years in two cities are given below:
City A 20 22 19 23 16
City B 10 20 18 12 15
78 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
2. The following are the runs scored by two batsmen Miklos & Ojara
Miklos 70 90 80 50 40
Ojara 70 90 60 50 30
Required: find who is a better run getter & more consistent player.
Discrete series:
The following table gives the age distribution of boys & girls in a University.
Age (years) 20 21 22 23 24
Boys 12 15 15 5 3
Girls 13 10 12 2 1
Required: determine which of the two groups is more variable in age.
Solution:
BOYS GIRLS
𝟐 𝟐
𝑨𝒈𝒆 𝒙 𝑓 𝑓𝑥 𝑑 𝒅 𝒇𝒅 𝑨𝒈𝒆 𝒙 𝑓 𝑓𝑥 𝑑 𝒅𝟐 𝒇𝒅𝟐
20 12 240 -1.44 2.0736 24.8832 20 13 260 -1.2 1.44 18.72
21 15 315 -0.44 0.1936 2.904 21 10 210 -0.2 0.04 0.4
22 15 330 0.56 0.3136 4.704 22 12 264 0.8 0.64 9.6
23 5 115 1.56 2.4336 12.168 23 2 46 1.8 3.24 6.48
24 3 72 2.56 6.5536 19.6608 24 1 24 2.8 7.84 7.84
∑𝑓 ∑𝑓𝑥 ∑𝒇𝒅𝟐 ∑𝑓 ∑𝑓𝑥 ∑𝒇𝒅𝟐
= 50 = 1072 = 𝟔𝟒. 𝟑𝟐 = 38 = 804 = 𝟒𝟑. 𝟎𝟒
BOYS:
∑𝑓𝑥 1072
𝑥̅ = = = 21.44
∑𝑓 50
∑𝑓(𝑥 − 𝑥̅ )2 64.32
∴𝜎=√ = = √1.2864 = 1.1342
∑𝑓 50
𝜎 1.13
𝐶𝑉 = × 100 = = 0.053 × 100 = 5.3%
𝑥̅ 21.44
GIRLS:
∑𝑓𝑥 804
𝑥̅ = = = 21.2
∑𝑓 38
79 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
∑𝑓(𝑥 − 𝑥̅ )2 43.04
∴𝜎=√ = = √1.1326 = 1.0642
∑𝑓 38
𝜎 1.06
𝐶𝑉 = × 100 = = 0.050 × 100 = 5%
𝑥̅ 21.2
Thus, the age of Boys is more variable since it has more variation
Exercise:
X 48 49 50 51 52 53 54 55 56 57
Freq. 3 7 11 14 18 17 13 8 5 4
Continuous series:
A factory produces two types of tyres. In an experiment in the working life of these
tyres the following results are obtained.
Solution:
C.I 𝒙 TYPE A TYPE B
(hrs) 𝑓 𝑓𝑥 𝑑 𝟐
𝒅 𝟐
𝒇𝒅 𝑓 𝑓𝑥 𝑑 𝒅𝟐 𝒇𝒅𝟐
15-17 16 50 800 -4.2 17.64 882 40 640 -3.4 11.56 462.4
17-19 18 100 1800 -2.2 4.84 484 300 5400 -1.4 1.96 588
19-21 20 260 5200 -0.2 0.04 10.4 120 2400 0.6 0.36 43.2
21-23 22 100 2200 1.8 3.24 324 80 1760 2.6 6.76 540.8
23-25 24 80 1920 3.8 14.44 1155.2 60 1440 4.6 21.16 1269.6
∑𝑓 ∑𝑓𝑥 ∑𝒇𝒅𝟐 ∑𝑓 ∑𝑓𝑥 ∑𝒇𝒅𝟐
= 590 = 11920 = 𝟐𝟖𝟓𝟓. 𝟔 = 600 = 11640 = 𝟐𝟗𝟎𝟒
TYPE A:
∑𝑓𝑥 11920
𝑥̅ = = = 20.2
∑𝑓 590
80 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
∑𝑓(𝑥 − 𝑥̅ )2 2855.6
∴𝜎=√ = = √4.84 = 2.2
∑𝑓 590
𝜎 2.2
𝐶𝑉 = × 100 = = 0.109 × 100 = 10.9%
𝑥̅ 20.2
TYPE B:
∑𝑓𝑥 11640
𝑥̅ = = = 19.4
∑𝑓 600
∑𝑓(𝑥 − 𝑥̅ )2 2904
∴𝜎=√ = = √4.84 = 2.2
∑𝑓 600
𝜎 2.2
𝐶𝑉 = × 100 = = 0.113 × 100 = 11.3%
𝑥̅ 19.4
Thus, type A tyres are more stable as their variation is less than that of Type B.
Standard Deviation and Coefficient of Variation possess all those properties, which
a good measure of dispersion should possess. The process of squaring the deviations
eliminates negative signs, and thus makes the mathematical manipulation of figures
easy.
81 | P a g e
Intro. To statistics &for Economics -Emma Charles 0753-236-367/0787-080-333
Exercise:
1. An agent obtained samples of bulbs from the two companies. He had them
tested for durability and got the following results:
2. Find which of the two classes is more consistent in scoring marks from the
following table:
3. Two brands of tyres are tested for their life and the following results were
obtained:
INDEX NUMBERS
82 | P a g e