Handnote On B.Stat 2nd Chapter
Handnote On B.Stat 2nd Chapter
Chapter Two
Collection and presentation of data
Structure
2.1 Collection of data
2.1.1 Data
2.1.2 Necessity of data collection
2.1.3 Types of data
2.1.4 Methods of primary data collection:
2.1.5 Sources of secondary data
2.1.6 Distinctions between primary data and secondary data
2.1.7 Questionnaire
2.2 Presentation of data
2.2.1 Variable
2.2.2 Classification of data
2.2.3 Types of classification
2.2.4 Objects or importance of Classification
2.2.5 Table and tabulation
2.2.6 Type of Tables
2.2.7 Parts of a table
2.2.8 Importance of tabulation
2.2.9 Distinctions between classification and tabulation
2.2.10 Frequency
2.2.11 Frequency distribution
2.2.12 Types of frequency distribution
2.2.13 Some important definitions
2.2.14 Preparation of frequency table
2.2.15 Characteristics of frequency distribution
2.2.16 Different graphs for representing a frequency distribution
2.2.17 Diagrams
2.2.18 Types of diagrams
2.2.19 Role and limitations of diagrams and graphs
2.2.20 Distinctions
2.3 Solved Problems
Data is a plural word of datum. In the view of Layman, data means information. In statistics
the data means mass of information collected from different sources.
2.1.3Types of data
In statistics, data are classified into two broad categories: quantitative data and qualitative
data. This classification is based on the kind of characteristics that are measured.
1. Quantitative data: Quantitative data are those that can be quantified in definite
units of measurement. These refer to characteristics whose successive
measurements yield quantifiable observations. Depending on the nature of the
variable observed for measurement, quantitative data can be further categorized
as continuous and discrete data.
(ii) Discrete data: Discrete data are the values assumed by a discrete variable.
A discrete variable is the one whose outcomes are measured in fixed
numbers. Such data are essentially count data. These are derived from a
process of counting, such as the number of items possessing or not
possessing a certain characteristic.
(i) Nominal data: Nominal data are the outcome of classification into two or
more categories of items or units comprising a sample or a population
according to some quality characteristic. Classification of students
according to sex (as males and females), of workers according to skill (as
skilled, semi-skilled, and unskilled), and of employees according to the level
of education (as matriculates, undergraduates, and post-graduates), all
result into nominal data.
(ii) Rank data: Rank data, on the other hand, are the result of assigning ranks
to specify order in terms of the integers 1, 2, 3... n. Ranks may be assigned
according to the level of performance in a test. a contest, a competition, an
interview, or a show. The candidates appearing in an interview, for example,
may be assigned ranks in integers ranging from I to n, depending on their
performance in the interview.
Again, Any statistical data can be classified under two categories depending upon the
sources utilized. These categories are,
1. Primary data
2. Secondary data
1. Primary data: Primary data is the one, which is collected by the investigator
himself for the purpose of a specific inquiry or study. Such data is original in
character and is generated by survey conducted by individuals or research
institution or any organization.
Example: If a researcher is interested to know the impact of noon meal scheme for
the school children, he has to undertake a survey and collect data on the opinion of
parents and children by asking relevant questions. Such a data collected for the
purpose is called primary data.
2. Secondary Data: Secondary data are those data which have been already
collected and analyzed by some earlier agency for its own use; and later the same
data are used by a different agency. According to W.A. Neiswanger, ‘ A primary
source is a publication in which the data are published by the same authority
which gathered and analyzed them. A secondary source is a publication, reporting
the data which have been gathered by other authorities and for which others are
Business Statistics -I
responsible’.
i) Direct personal observations: In this method the person who collects data
visits the field of survey and collects the information through discussions with
the persons who are either directly or indirectly in touch with the facts under
study. Some times through personal observations by investigator can get
information. In this method of collecting data the investigator should be very
much careful for his public behavior and he should mix with the people of that
locality where he is collecting data.
Merits:
1) Original data are collected.
2) True and reliable data are collected.
3) Response will be encouraging because of personal approach.
Demerits:
1) It is not suitable where the area under survey is very large.
2) It is expensive and time consuming.
3) Untrained investigators will not bring good results.
ii) Indirect oral interviews: Under this method of collecting data the investigator
contacts the third person who is capable of supplying the necessary
information. This method is generally adopted in those cases where the
informants are not interested to respond if investigator approaches directly. For
example if we want to interview a drug addict, he may be interested to supply
information about his own habits. In this case we get the necessary information
from those persons who know him well.
Merits:
1) It is simple and convenient.
Business Statistics -I
Demerits:
1) Interview with an improper person will spoil the results.
2) In order to get the real position, a sufficient number of persons are to be
interviewed.
Merits:
1) Of all the methods this method is most economical.
2) This method is suitable when the area of investigation is very large.
3) It saves time, money and labor.
Demerits:
1) In this method there is no direct contact between the investigator and the
informant, therefore we can be sure about the accuracy of the data.
2) This method is only suitable when the informants are literate.
3) There is a chance of delay in receiving of answers from the informants.
iv) Schedules sent through investigators: It is the most widely used method of
collecting primary data. In this method a number of enumerators are selected
and trained. They are provided with a standardized questionnaire and specific
training and instructions are given to them for filling up the schedules. Each
enumerator will be the in charge of a certain area. The investigator goes to the
informants along with the questionnaire and gets replies to the questions in the
schedules and records their answers. They explain the object and purpose of
the enquiry.
Merits:
1) This method is very much useful where the informants are illiterates.
2) In this method because of direct contact between the investigators and the
Informants, the investigator can get accurate information.
Business Statistics -I
Demerits:
1) This method is more expensive and time consuming.
2) The success of this method is depends upon the trained, intelligent and
qualified investigators.
2.1.5 Sources of secondary data
The various sources of secondary data can be divided into two categories
1) Published sources
2) Unpublished sources.
2) Unpublished sources: All statistical material is not always published. There are
various sources of unpublished data such as records maintained by various
Government and private offices, studies made by research institutions, scholars,
etc. Such sources can also be used where necessary.
1. Primary data are those data 1. Secondary data are those data
which are collected from the which are collected from the
primary sources. secondary sources.
2. Primary data are known as basic 2. Secondary data are known as
data. subsidiary data.
3. The collection of primary data is 3. The collection of secondary data is
more expensive. comparatively less expensive.
4. It takes more time to collect the 4. It takes less time to collect the data.
data. 5. Secondary data are less accurate
5. Primary data are more accurate. than the primary data.
6. Primary data are known as first 6. Secondary data are known as
hand data. second hand data.
7. Primary data are not readily 7. Subsidiary data are readily
available. available.
8. It is required to take much care 8. It is not required to take much care
at the time of collecting data. at the time of collecting data.
2.1.7 Questionnaire
Questionnaire refers to a device for securing answers to questions by using a form which
the respondent fills in himself.
2.2.1 Variable
The meaning of the word variable is ‘which varies’. The term, ‘variable’ refers to the
characteristic that varies in amount or magnitude.
Variables are classified into two broad categories: quantitative variables and qualitative
variables.
ii. Discrete variable: A discrete variable is the one whose outcomes are
measured in fixed numbers. Such data are essentially count data. These are
derived from a process of counting, such as the number of items possessing
or not possessing a certain characteristic.
a) Chronological classification
b) Geographical classification
c) Qualitative classification
d) Quantitative classification
Example:
Country America China Denmark France Canada
Yield of wheat 1900 1500 850 1230 1170
in (kg/acre)
For example, if the population to be classified in respect to one attribute, say sex, then
we can classify them into two namely that of males and females. Similarly, they can also
be classified into ‘employed’ or ‘unemployed’ on the basis of another attribute
‘employment’.
If we classify population simultaneously with respect to two attributes, e.g. sex and
employment, then population are first classified with respect to ‘sex’ into ‘males’ and
‘females’. Each of these classes may then be further classified into ‘employment’ and
‘unemployment’ on the basis of attribute ‘employment’ and as such Population are
classified into four classes namely.
Business Statistics -I
1) Simple and complex tables: The distinction between simple and complex table is
based on the number of characteristics studied.
In a simple table only one character is shown. Hence this type of table is also known
as one-way table. On the other hand in a complex table two or more characteristics
Business Statistics -I
are shown. When two characteristics are shown such a table is known as two-way
table or double tabulation.
Example of two-way
table:
2) General and special purpose tables: General purpose tables sometimes termed as
reference tables or information tables. These tables provide information for general
use of reference. They usually contain detailed information and are not constructed
for specific discussion. These tables are also termed as master tables.
Special purpose tables also known as summery tables which provide information for
particular discussion. These tables are constructed or derived from the general
purpose tables. These tables are useful for analytical and comparative studies
involving the study of relationship among variables.
1. Table number
2. Title of the table
Business Statistics -I
Title: A good table should have a clearly worded, brief but unambiguous title explaining
the nature of data contained in the table. It should also state arrangement of data and the
period covered. The title should be placed centrally on the top of a table just below the
table number (or just after table number in the same line).
Captions or column Headings: Captions in a table stands for brief and self explanatory
headings of vertical columns. Captions may involve headings and sub-headings as well.
The unit of data contained should also be given for each column. Usually, a relatively less
important and shorter classification should be tabulated in the columns.
Stubs or Row Designations: Stubs stands for brief and self explanatory headings of
horizontal rows. Normally, a relatively more important classification is given in rows. Also
a variable with a large number of classes is usually represented in rows. For example,
rows may stand for score of classes and columns for data related to sex of students. In
the process, there will be many rows for scores classes but only two columns for male
and female students.
Body: The body of the table contains the numerical information of frequency of
observations in the different cells. This arrangement of data is according to the description
of captions and stubs.
Footnotes: Footnotes are given at the foot of the table for explanation of any fact or
information included in the table which needs some explanation. Thus, they are meant
for explaining or providing further details about the data that have not been covered in
title, captions and stubs.
Sources of data: Lastly one should also mention the source of information from which
data are taken. This may preferably include the name of the author, volume, page and
the year of publication. This should also state whether the data contained in the table is
of ‘primary or secondary’ nature.
2.2.8 Importance of tabulation
Business Statistics -I
1. It simplifies complex data and the data presented are easily understood.
2. It facilitates comparison of related facts.
3. It facilitates computation of various statistical measures like averages, dispersion,
correlation etc.
4. It presents facts in minimum possible space and unnecessary repetitions and
explanations are avoided. Moreover, the needed information can be easily located.
5. Tabulated data are good for references and they make it easier to present the
information in the form of graphs and diagrams.
Classification Tabulation
1. In classification, the data are 1. Tabulation is a process of arranging
divided into various groups and the classified data In row and columns
subgroups on the basis their with suitable heads and sub-heads.
similarities and dissimilarities.
2. The data are classified at first and
2. Classification is the basis for then tabulated.
tabulation.
3. Tabulation is a process of
3. Classification of data is a process presentation.
of statistical analysis.
2.2.10 Frequency
The number of repetition of a particular value of a variable is called frequency of that
value. For example, consider the ages of 7 students of a class are 19, 20, 19, 21, 20, 19,
21 years respectively. Here, the value 19 is repeated thrice, 20 is repeated twice and 21
is repeated twice. Hence, the frequencies of 19, 20 and 21 are 3, 2 and 2 respectively.
There are definite differences between the variables of different groups of items. Each
class is distinct and separate from the other class. Non-continuity from one class to
another class exists. Such as, the number of rooms in a house, the number of companies
registered in a country, the number of children in a family, etc.
The process of preparing this type of distribution is very simple. We have just to count the
number of times a particular value is repeated, which is called the frequency of that class.
In order to facilitate counting prepare a column of tallies.
In another column, place all possible values of variable from the lowest to the highest.
Then put a bar (Vertical line) opposite the particular value to which it relates.
To facilitate counting, blocks of five bars |||| are prepared and some space is left in
between each block. We finally count the number of bars and get frequency.
Example: In a survey of 30 families in a village, the number of children per family was
recorded and the following data obtained.
4 2 0 2 3 2 2 1 0 2
3 5 1 1 4 2 1 3 4 2
5 1 2 2 2 1 3 4 1 0
We can represent the data in the form of a discrete frequency distribution as follows:
Number of children Tally Marks Frequency
0 ||| 3
1 |||| || 7
2 |||| |||| 10
3 |||| 4
4 |||| 4
5 || 2
Business Statistics -I
Total N = 30
Class Limit: The class limits are the lowest and the highest values that can be included
in the class. For example, take the class 30 – 40. The lowest value of the class is 30 and
highest class is 40. The two boundaries of class are known as the lower limits and the
upper limit of the class. The lower limit of a class is the value below which there can be
no item in the class. The upper limit of a class is the value above which there can be no
item to that class. In statistical calculations, lower class limit is denoted by L and upper
class limit by U.
Class interval: The difference between the lower and upper class limits is called class
interval and is denoted by ‘c’. That is, c = U – L. For example, take the class 30 – 40. Then,
class interval c = 10.
Class mid-point: The central point of a class is called the class mid value or class mid-point.
It is found out by adding the upper and lower limits of a class and dividing the sum by 2. That
is,
𝐿+𝑈
Class mid-point = 2
20+30
For example, if the class interval is 20-30 then the mid-value is = 25.
2
In the above example, the class frequencies are 10, 12, 15, 8 and 5. The total
frequency is equal to 50.
The distribution showing the relative frequency and percentage frequency is given below:
Daily wages (in Tk.) No. of workers Relative frequency Percentage frequency
50 – 100 10 10/50 = 0.2 20
100 – 150 12 0.24 24
150 – 200 15 0.3 30
200 – 250 8 0.16 16
250 – 300 5 0.1 10
Total 50 1 100
class frequency
Frequency density = class interval
Example:
Business Statistics -I
Example: Given below are the marks obtained by 30 students in a class test
32 33 55 47 21 50 27 26 24 33
62 42 38 15 45 32 44 48 68 49
40 52 30 17 44 58 37 48 39 42
Construct a frequency distribution table using suitable class interval.
1. The number of classes should preferably be between 5 and 15. However there
is no rigidity about it.
2. As far as possible one should avoid values of class intervals as
3,7,11,26….etc.preferably one should have class-intervals of either five or
multiples of 5 like 10,20,25,100 etc.
3. The starting point i.e the lower limit of the first class should either be zero or 5
or multiple of 5.
4. To ensure continuity and to get correct class interval we should adopt
“exclusive” method.
5. Wherever possible, it is desirable to use class interval of equal sizes.
However here we shall discuss only some important types of graphs which are more
popular and they are
1. Histogram
2. Frequency Polygon
3. Frequency Curve
4. Ogive
axis’.
The height of each rectangle represents the frequency of the class interval. Each rectangle
is formed with the other so as to give a continuous picture. Such a graph is also called
staircase or block diagram.
However, we cannot construct a histogram for distribution with open-end classes. It is also
quite misleading if the distribution has unequal intervals and suitable adjustments in
frequencies are not made.
20
15
10
2. Frequency Polygon: If we mark the midpoints of the top horizontal sides of the
rectangles in a histogram and join them by a straight line, the figure so formed is called a
Business Statistics -I
Frequency Polygon. This is done under the assumption that the frequencies in a class
interval are evenly distributed throughout the class. The area of the polygon is equal to
the area of the histogram, because the area left outside is just equal to the area included
in it.
20
15
10
3. Frequency Curve: If the middle point of the upper boundaries of the rectangles of a
Business Statistics -I
histogram is corrected by a smooth freehand curve, then that diagram is called frequency
curve. The curve should begin and end at the base line.
20
15
10
frequencies up to (or above) the give value. This accumulated frequency is called
cumulative frequency.
These cumulative frequencies are then listed in a table is called cumulative frequency
table. The curve obtained by plotting cumulative frequencies is called a cumulative
frequency curve or an ogive.
50
40
30
20
10
2.2.17 Diagrams
A diagram is a visual form for presentation of statistical data, highlighting their basic facts
and relationship. If we draw diagrams on the basis of the data collected they will easily
be understood and appreciated by all. It is readily intelligible and save a considerable
amount of time and energy.
7. Index must be given for identification so that the reader can easily make out the
meaning of the diagram.
8. Footnote must be given at the bottom of the diagram.
9. Economy in cost and energy should be exercised in drawing diagram.
1. Line Diagram
Line diagram is used in case where there are many items to be shown and there
is not much of difference in their values. Such diagram is prepared by drawing a
vertical line for each item according to the scale. The distance between lines is
kept uniform. Line diagram makes comparison easy, but it is less attractive.
Variable 1 2 3 4 5 6
Frequency 50 80 30 12 3 1
80 °
60
°
40
°
20
°
10 °°
°
0 1 2 3 4 5 6
Variable
Solution:
Business Statistics -I
Profit
30
25
20
15
Profit
10
0
1995 1996 1997 1998 1999 2000
Solution: Since the multiple bar diagram is used for comparing two or more sets
of statistical data, we can represent the given data by the following multiple bar
diagram.
Business Statistics -I
4500
4000
3500
3000
2500
2000
Family A
1500
Family B
1000
500
0
-500 Food Clothings Education Others Savings
-1000
Solution:
Expenditure Monthly expenditure (in Tk.) Total expenditure
items Family A Family B (in Tk.)
Business Statistics -I
Food 75 95 170
Clothing 20 25 45
Education 15 10 25
House Rent 40 65 105
Others 25 35 60
180
160
140
120
100
Family B
80
Family A
60
40
20
0
Food Clothing Education House Rent Others
Two-dimensional Diagrams
1. Rectangles
2. Squares
3. Pie-diagrams
1. Rectangles
Rectangles are used to represent the relative magnitude of two or more values.
The areas of the rectangles are kept in proportion to the values. Rectangles are
placed side by side for comparison. When two sets of figures are to be represented
by rectangles, either of the two methods may be adopted. We may represent the
figures as they are given or may convert them to percentages and then subdivide
the length into various components. Thus the percentage sub-divided rectangular
Business Statistics -I
2. Squares
The rectangular method of diagrammatic presentation is difficult to use where the
values of items vary widely. The method of drawing a square diagram is very
simple. One has to take the square root of the values of various item that are to be
shown in the diagrams and then select a suitable scale to draw the squares.
In constructing a pie diagram the first step is to prepare the data so that various
components values can be transposed into corresponding degrees on the circle.
The second step is to draw a circle of appropriate size with a compass. The size
of the radius depends upon the available space and other factors of presentation.
The third step is to measure points on the circle and representing the size of each
sector with the help of a protractor.
is not the lower limit of the next class, is the lower limit of the next class, it
it is known as the inclusive method is known as the exclusive method of
of classification. classification.
10 19 10 20
20 29 20 30
30 39 30 40
6. It is widely used.
Problem-1: Given below are the marks obtained by 30 students in a class test
32 33 55 47 21 50 27 26 24 33 Construct a
62 42 38 15 45 32 44 48 68 49 frequency
40 52 30 17 44 58 37 48 39 42 distribution
table using
suitable class interval.
𝑅𝑎𝑛𝑔𝑒
Possible class interval, i =
1+3.322 log 𝑁
53
=
1+3.322 log 30
= 8.97
We will take 10 as the class-interval and the first class as 15 – 25.
Problem-2: The obtained marks of 50 students in Business Statistics (out of 100) are as
follows:
39 31 70 25 55 36 59 42 63 57
64 37 45 65 60 45 49 63 54 53
55 39 33 75 65 42 41 82 52 52
26 61 64 30 58 35 15 65 48 42
18 53 50 20 52 40 55 45 45 46
(i) Construct a frequency distribution table taking the class interval 10 – 20, 20 –
30, and so on.
(ii) Calculate the absolute frequency, relative frequency and percentage frequency.
(iii) Calculate cumulative frequency.
(iv) Construct a ‘less than’ cumulative relative frequency distribution.
(v) Construct a ‘more than’ cumulative relative frequency distribution.
Solution:
(i) Table: Construction of a continuous frequency distribution using exclusive method.
Class Tally marks Frequency
10 – 20 || 2
20 – 30 ||| 3
30 – 40 |||| ||| 8
40 – 50 |||| |||| || 12
50 – 60 |||| |||| ||| 13
60 – 70 |||| |||| 9
70 – 80 || 2
80 – 90 | 1
Business Statistics -I
Total N = 50
(ii) Calculation of the absolute frequency, relative frequency and percentage frequency:
Class Absolute Relative Frequency Percentage
Frequency (Absolute/N) Frequency (RF x 100)
10 – 20 2 2/50 = 0.04 0.04 x 100 = 4
20 – 30 3 3/50 = 0.06 0.06 x 100 = 6
30 – 40 8 0.16 16
40 – 50 12 0.24 24
50 – 60 13 0.26 26
60 – 70 9 0.18 18
70 – 80 2 0.04 4
80 – 90 1 0.02 2
Total N = 50
∴ Range, R = H – L
= 210 – 68
= 142
𝑅𝑎𝑛𝑔𝑒
Possible class interval, i =
1+3.322 log 𝑁
142
=
1+3.322 log 40
= 22.46
10
8
Business Statistics -I
10
Class Frequency
60 – 80 1
80 – 100 9
100 – 120 10
120 – 140 6
140 – 160 6
160 – 180 2
180 – 200 4
200 – 220 2
Total N = 40
Business Statistics -I
10
Less than cf
45
40
35
30
25
20 Less than cf
15
10
5
0
80 100 120 140 160 180 200 220
a) Determination of Q1:
Therefore, Q1 = 100
Determination of Q3:
Therefore, Q3 = 153
Thus, the limit of wages of central 50% of the workers is Tk. 100 to Tk. 153.
c) We get from the ogive curve, the number of workers whose wages lie between Tk.
125 and Tk. 160 is equal to 32 – 22 = 10
Business Statistics -I
Weights
36 – 40 41 – 45 46 – 50 51 – 55 56 – 60 61 – 65 66 – 70
(Kg)
Number
of 2 10 13 15 10 7 3
students
i) Represent the data by less than ogive and more than ogive curve.
ii) Find the number of students whose weights lie between 47 kg and 57 kg.
iii) Estimate the limit of weights of middle 20% of the students.
iv) Estimate the range of weights of middle 20% of the students.
Solution: Table for calculation of ‘less than’ and ‘more than’ ogive curve:
60
50
40
30
20
Business Statistics -I
10
Now we draw a ‘less than’ ogive curve for the solution of (ii), (iii) and (iv).
60
50
40
30
20
10
Determination of P60:
Location of P60 = size of 60N/100 th observation
= size of 60 x 60/100 th observation
= size of 36 th observation
Therefore, P60 = 54.5
Thus, the limit of weights of middle 20% of the students is 49.5 – 54.5.
80
60
40
20
10 °
1 2 3 4 5 6
Solution:
Business Statistics -I
Profit
30
25
20
15
Profit
10
0
1995 1996 1997 1998 1999 2000
Solution: Since the multiple bar diagram is used for comparing two or more sets of
statistical data, we can represent the given data by the following multiple bar diagram.
Business Statistics -I
5000
4000
3000
2000 Family A
Family B
1000
0
Food Clothings Education Others Savings
-1000
Solution:
Expenditure Monthly expenditure (in Tk.) Total expenditure
items Family A Family B (in Tk.)
Food 75 95 170
Clothing 20 25 45
Education 15 10 25
House Rent 40 65 105
Others 25 35 60
180
160
140
120
100
Family B
80
Family A
60
40
20
0
Food Clothing Education House Rent Others
Problem-9: Draw a Pie diagram for the following data of production of sugar in quintals of
various countries.
Country Cuba Australia India Japan Egypt
Production of
Sugar (in 62 47 35 16 6
quintals)
Production
Cuba
Australia
India
Japan
Egypt
Exercise-2
Theoretical
Business Statistics -I
Q.1 What do you mean by data or statistical data? Describe the necessity of data
collection.
Q.2 Discuss the different types of data.
Q.3 Discuss the different methods of primary data collection with merits and demerits.
Q.4 Discuss the important sources of secondary data.
Q.5 Distinguish between primary data and secondary data.
Q.6 What is questionnaire? Write down the main characteristics of a questionnaire
preparation.
Q.7 What is variable? Define the various kinds of variables with examples.
Q.8 Define classification. Describe the different types of classification.
Q.9 Write down the importance of classification.
Q.10 What do you mean by table and tabulation? Describe the different types of table.
Q.11 Describe the different parts of a table. Discuss the importance of tabulation.
Q.12 Distinguish between classification and tabulation.
Q.13 What do you mean by frequency and frequency distribution? Discuss the different
types of frequency distribution.
Q.14 How will you prepare a frequency distribution from raw data? Describe the
characteristics of frequency distribution.
Q.15 (i) Describe the different graphs for representing a frequency distribution.
(ii) Define diagram. Write down the general rules for constructing diagrams.
(iii) Describe different types of bar diagrams.
(iv) What do you mean by pie diagram?
Q.16 Explain clearly the role and limitations of diagrams and graphs in presenting
business data.
Q.17 Distinguish between-
(i) Discrete and continuous frequency distribution
(ii) Inclusive and exclusive method
(iii) Graph and diagram
(iv) Histogram and bar diagram
(v) Histogram and frequency polygon
Mathematical
Problem-1: Given below are the marks obtained by 30 students in a class test
70 40 56 86 42 50 60 44 42 63
34 32 33 64 36 46 80 76 31 40
77 56 46 55 81 57 76 66 46 57
Construct a frequency distribution table using suitable class interval.
Business Statistics -I
Problem-2: Given below are the marks obtained by 30 students in a class test
33 32 55 47 21 50 27 12 68 49
40 17 44 48 62 24 33 42 38 45
26 33 44 48 52 30 58 37 38 45
Construct a frequency distribution table taking a class interval of 10 marks.
Problem-3: The following data shows the number of sick employees in 80 days:
7 5 14 11 6 9 11 10 18 25
13 10 10 11 11 12 13 14 23 19
12 16 18 13 13 10 15 7 23 17
10 12 11 16 15 14 18 9 26 18
17 18 18 17 15 17 21 8 23 12
17 16 19 20 22 10 24 11 15 12
24 15 21 22 19 16 19 14 15 14
25 27 26 24 20 16 15 12 13 13
Problem-4: Arrange the following data in a frequency distribution and hence draw
histogram and frequency polygon.
40 38 44 28 60 21 35 42 40 36
50 67 25 58 30 48 65 35 55 39
72 44 70 55 53 21 76 46 57 67
51 34 41 56 62 42 64 73 38 41
Problem-5: Prepare a frequency table for the following data with width 0f each class
interval is 10. Use exclusive method of classification.
57 44 80 75 0 18 45 28 22 53
51 69 34 22 83 70 20 4 34 84
50 47 10 34 61 66 80 14 66 56
50 47 73 42 33 48 65 57 10 72
75 90 58 46 38 69 10 46 96 64
Problem-6: Prepare a frequency table for the following data with width 0f each class
interval is 20 and mid-point of the first class is 70. Use exclusive method of classification.
Problem-7: The following data gives the number of children in 50 families. Construct a
discrete frequency table.
4 2 0 2 3 2 2 1 0 2
3 5 1 1 4 2 1 3 4 2
6 1 2 2 2 1 3 4 1 0
1 3 4 1 0 1 2 2 2 5
2 4 3 0 1 3 6 1 0 1
Problem-8: In a survey, it was found that 64 families bought milk in the following quantities
in a particular month. Quantity of milk (in liters) bought by 64 Families in a month. Construct
a continuous frequency distribution making classes of 5-9, 10-14 and so on.
19 16 22 9 12 39 19 22
14 23 6 24 16 18 7 17
20 25 28 18 10 24 20 21
10 7 18 28 24 20 14 23
25 34 22 5 33 23 26 29
13 36 11 26 11 37 30 13
8 15 22 21 32 21 31 17
16 23 12 9 15 27 17 21
No. of children 0 1 2 3 4 5
Frequency 10 14 9 6 4 2
Problem-13: Draw a histogram and a frequency polygon from the following data.
Problem-14: Draw a frequency curve and ogive curve from the following data.
Age 5 – 10 10 – 15 15 – 20 20 – 25 25 – 30 30 – 35
No. of
2 12 18 30 12 5
persons
Problem-15: You have conducted a market survey with a sample of size 40 regarding
the acceptability of a new product which your company wants to launch. The score of the
respondents on the appropriate scale are as follows:
40 45 41 45 45 30 39 8 48 25
26 9 23 24 26 29 8 40 41 42
39 35 18 25 35 40 42 43 44 36
27 32 28 27 25 26 38 37 36 35
Problem-16: Construct a frequency distribution and then draw Ogive curve and
histogram from the following data
35 65 29 39 57 65 60 69 87 92 40 35 31 26 38 67 77 55
48 50 27 35 49 55 70 42 82 60 58 43 45 29 55 59 30 35
79 86 22 12 30 42 30 54 40 48
Problem-17: The following data refer to the ages of 50 children recorded in months,
obtained in a nutrition survey by a research team:
17 18 21 11 10 12 13 18 15 16
22 14 15 08 09 17 15 14 10 11
10 13 12 17 16 20 19 10 07 12
16 18 21 19 19 15 09 13 19 18
Business Statistics -I
06 11 15 20 13 24 24 17 12 25
Construct a frequency distribution. Draw a histogram. Also find median and mode of ages
of children graphically.
Problem-18: The following is the number of minutes to commute from home to work for
a group of automobile executives.
77 18 63 84 38 54 50 59 54 56
36 26 50 34 44 41 58 58 53 51
62 43 52 53 63 62 62 65 61 52
60 60 45 66 83 71 63 58 61 71
Problem-20: The following data refer to the number of Advertising Spots Purchased by
Members of the Greater Buffalo Automobile Dealers Association:
Problem-21: The Coordinator of BBA wishes to prepare a report showing the number of
hours per week students spend studying. He selects a random sample of 30 students
and determines the number of hours each student studied last week.
15.0 23.7 19.7 15.4 18.3 23.0 14.2 20.8 13.5 20.7
17.4 18.6 12.9 20.3 13.7 21.4 18.3 29.8 17.1 18.9
10.3 26.1 15.7 14.0 17.8 33.8 23.2 12.9 27.1 16.6
Business Statistics -I
Problem-22: The profits (in lakh Taka) of 40 branches of a super shop are given below:-
34 36 48 49 31 60 34 43
45 38 32 27 60 29 47 36
50 46 30 46 32 30 33 45
49 48 41 53 36 37 37 47
30 46 50 28 35 35 38 36
45 49 30 35 79 86 22 12 15 65
29 39 57 65 60 69 87 92 50 35
31 36 38 67 77 55 48 50 27 35
49 55 70 42 82 60 85 75 55 35
Draw the ogive for the distribution and use it to determine the median wage of a worker.
Problem-24: You have conducted a market survey with a sample of size 40 regarding
the acceptability of a new product which your company wants to launch. The score of the
respondents on the appropriate scale are as follows:
37 39 32 45 45 30 39 8 48 25
26 9 23 24 26 29 8 40 41 42
39 35 18 25 35 40 42 43 44 36
27 32 28 27 25 26 38 40 42 46
30 42 30 54 40 48 15 17 51 42
25 41 30 27 42 36 28 26 37 54
44 31 36 22 30 31 19 48 16 36
40 42 32 21 22 40 33 41 21 35
Draw an ogive (less than) and hence find median from the graph.
Business Statistics -I
Problem-27: The following data show how the people visit the health centre in a
locality:
Type of visit frequently Occasionally Rarely Never
Number of
40 26 18 16
people (‘000)