Chapter 23 Data Handling-I (Classification and Tabulation of Data) - Watermark
Chapter 23 Data Handling-I (Classification and Tabulation of Data) - Watermark
DATA HANDLING-I
Classification and Tabulation of
Data)
23.1 RAW DATA
The word data means
information (its
of observations is the
first step in exact dictionary
meaning is: given facts). Collection
collected by an observer cannot be statistical investigations. The numerical
is called a raw data. For put to any use observations
example, look at
immediately
the following list of
and directly. That is why it
30 students of class VIII in a marks (out of 100) scored by
test:
55,65, 15, 40, 35, 70, 90, 92,
78,64, 88, 78, 76, 55,54, 84, 85, 70,75, 65, 72,80,
We find that each entry in the above list
52,72, 70, 90, 85, 75, 65, 80
Such is a numerical fact
a collection of observations which is called an observation.
gathered initially is called raw data.
23.2 PRESENTATION OF DATA
After collection of data, the
investigator has to find
study their salient features. Such an ways condense them in tabular form
in order to to
data. arrangement is called presentation of
The raw data be
can arranged in
of the following ways
any one
i) Serial order or
ii) Descending order
alphabetical order (ii) Ascending order
The data when put in ascending or descending order of
raw
Let the marks obtained
magnitude is called an array.
by 30 students of class VIII in a class test, out of 50 marks,
according to their roll numbers be :
39, 25, 5, 33, 19, 21, 12, 41, 12, 21, 19, 1, 10, 8, 12, 17, 19, 17, 17, 41,
40, 12, 41,
33, 19, 21, 33, 5, 1, 21.
he data in this form are called raw data or ungrouped data. The above raw data can be
arranged in serial order as follows:
Roll No. Marks Roll No. Marks Roll No. Marks
39 11 19 21 40
2 12 1 22 12
25
3 13 10 23 41
5
4 14 8 24
33 33
15 12 25 19
19
17 26
21 16 21
arhates
Roll No. Marks Roll No. Marks Roll No. Marks talled
7 12 17 19 27 33 data, i s
8 41 18 17 28 T h ep r s
9 12 19 17 29 all obs
10 21 20 41 30 21 interva
Now suppose we wish to judge the standard of achievement of the students. The data in W ec a n
this form do not give us a clear picture of the group. If we arrange them in ascending or
descending order, it gives us a slightly better picture. In ascending order, the data look as
follows:
1,1,5,5, 8, 10, 12, 12, 12, 12, 17, 17, 17, 19, 19, 19, 19, 21, 21, 21, 25, 33, 33, 33,
39, 40, 41, 41, 41.
In descending order, the data look as follows
41, 41, 41, 40, 39, 33, 33, 33, 25, 21, 21, 21, 21, 19, 19, 19, 19, 17, 17, 17, 12, 12,
12, 12, 10, 8, 5,5, 1, 1.
The raw data when put in ascending or descending order of magnitude is called an array The cla
or arrayed data. numbe-
If the number of observations is large, then arranging data in ascending or descending or class
serial order is a tedious job and it does not tell us much except perhaps the minimum(s) Thus,
and maximum(s) of data. So, to make it easily understandable and clear, we can tabulate class 1-
data in the from of a table given below. in clas=
data is
Marks Tally marks No. of students
1 2 23.3 F
5 2 Ereque
8 1 from w
10 ireque
12 A ) Di
17 3 3.3.1
19 4 The pr
21 4 iscret
of tally
25 1
33 3
rariabl
We no
39
second
40 1 abar
41 the giv
In the first column of the table, we write all marks from lowest to highest. We now look at and so
the first value in the given raw data and put a bar (vertieal line) in the second column
opposite to it. Now we see the second value in the given raw data and put a bar opposite to
CorTespr
The
it in the second column. This process is repeated till all observations in the given raw data
are exhausted. The bars drawn in the second column are known as tally marks and to
amilie
facilitate we record tally marks in bunches of five, the fifth tally mark is drawn diagonally
across the first four. For example, N I| = 7. We finally count the number of tally marks The da
corresponding to each observation and write in the third column.
particular
number of
marks is
in the given
ed frequency of the variate. The number of times an
occurs
Dbservation
2 6
3
4 I 3
5 2
the upper limit. The span of the class i.e., the difference between the upper limit and the
lower limit, is known as the class interval. For example, in the class 10-20 the class
intervalis 20-10 = 10.
There are two methods of classifying the data according to the class intervals, v1Z.
i) 'exclusive' method, and (ii) inclusive' method.
EXCLUSIVE METHOD When the class intervals are so fixed that the upper limit of one
class is the lower limit of the next class it is known as the exclusive method of
classification. In this method the upper limit of a class is not included in the class.
Thus, in the class 0 10 of marks obtained by students, a student who has obtained
10 marks is not included in this class. He is counted in the next class 10-20.
For more books join in telegram @freebooksyt
Data
a Handling- (Classification and Tabulation of Data) 23.5
1SIVE METHOD In
this method the classes are imit
s is included in that class. Following
class
exampleso formed thethe
ustratesthat upper
method. o
Wages (Rs.) No. of workers
1000-1099 125
1100-1199 150
1200-1299 200
1300-1399 250
1400-1499 175
1500-1599 100
Total 1000
1r
in the classof1000-1099 we include workers and Rs. 1099.
having wages between Rs. 10001100-1199.
the income a worker is exactly Rs. 1100 he is included in the next class
Exclusive Method Inclusive Method
1 3 9 9 . 5 - 1 4 9 9 . 5
100
23.6
interval w a s 99 but after adjustment t
adjustment
the class
before
It should be noted that
the class-mark or mid.
is 100. class mark. For example,
the
class is called
The mid-value of a
class 1000-1100
is 1050. In fact
value of the limit Upper
limit
Lower +
Class m a r k =-
2
between the upper
and lower limits)
limit+(Difference
or, Class mark Lower
=
DISTRIBUTION
FREQUENCY
CONSTRUCTION OF A DISCRETE
23.4
distribution from the given raw data we use the following
To prepare a discrete frequency
algorithm.
ALGORITHM
Step V Count the number of bars (tally marks) in respect of each value of the variable
and place it in the third columnn.
Step VI Give a suitable title to the frequency distribution table so that it conveys
exactly what the table is about.
Following illustrations will elarify the above algorithm.
ILLUSTRATIVE EXAMPLES
Example 1 Given below are the ages of 25 students of class VIII in a school. Prepare a
discrete frequency distribution.
15,16, 16, 14, 17, 17, 16, 15, 15, 16, 16, 17, 15, 16, 16, 14, 16, 15, 14, 15, 16, 16, 15,
14, 15.
Solution Frequency distribution of ages of 25 students
Age Tally marks
Frequency
14
15
8
16
10
17
3
Total
25
For more books join in telegram @freebooksyt
Jandling-1 (Classification and Tabulation of Data) 23.7
15, 18, 16, 20, 25, 24, 25, 20, 16, 15, 18, 10,
18, 16. 24, 15, 20, 28, 30, 27, a*
18, 28, 27, 25, 24, 24, 18, 18, 25, 20, 16,
15, 20, 27, 28, 29, 10.
Frequency distribution of scores
Variate FrequenCy
Tally marks
15
16
18 6
20
24 5
25
27
28 3
29 1
30
40
Total
EXERCISE 23.1
ii) 37 (iv) 63
ANSWERS
(v) 2 (vi) 8 54, 57 (vii)5
3.1.3.0,2.9, 2.9, 2.8, 2.8, 2.7,2.7, 2.6, 2.5, 2.4, (vii) 51,
i5) 3.1 kg Gii) 2.1 kg (iv) 1.0
kg (v) 15
2.4,2.3,2.2,2.1
(vi) 4 (vii) 4 (vii) 2
Marks No. of
Marks No. of Marks No. of
students students
students
33 2 49
14 34 51 3
16 37 4 52
17 38 2 53 3
19 39 54
21 1 41 57 1
22 42 6 59
27 43 1 61
29 44 62
31 1 47 1 67 1
6 Score :1 2 3 4 5 6
Number of times 5 5 43 4 4
1. Number of accidents: 0 1 2 3 4 5 6
Number of days 2 3 6 3 4 6 6
:12 13 14 15 16 17
&Ages in years)
Number of students 4 13 8 2 2 1
ILLUSTRATIVE EXAMPLES
Example 1 The water tax bills (in rupees) of 30 houses in a locality are given below.
Construct a grouped frequency distribution with class size of 10.
30, 32, 45, 54, 74, 78, 108, 112, 66, 76, 88, 40, 14, 20, 15, 35, 44, 66, 75, 84, 95, 96,
102, 110, 88, 74, 112, 14, 34, 44.
Solution Here, the maximum and minimum values of the variate are 112 and 14
respectively.
Range = 112-14 =98
24-34 2
34-44 3
44-54 3
54-64
64-74 2
74-84 5
84-94 3
94-104 3
104-114 4
Total 30
data by inclusive
Example2 Form a grouped frequency distribution from the following
method taking 4 as the magnitude of class intervals.
15
31, 23, 19, 29, 22, 20, 16, 10, 13, 34, 38, 33, 28, 21, 15, 18, 36, 24, 18,
10
12, 30, 27, 23, 20, 17, 14, 32, 26, 25, 18, 29, 24, 19, 16, 11, 22, 15, 17,
minimum
respectively. and 23.11
maximum values of the
.
Range 38-10 =28 variable are 10 and 38
Since magnitude of class intervals =
4
Number of class intervals [Given
We
= =7
take 10 as the
distribution is to be lower limit of first class
10-13, 14-17, formed by inclusive interval. Since the
frequeney
method. So, the class intervais
In view of the18-21,22-25, 26-29, 30-33, at
above, we obtain the 34-37,38-41.
following
Frequency Distribution frequency distribution
Class interval
Tally marks Frequency
10-13
14-17 TH
18-21
22-25
26-29
5
30-33
4
34-37
2
38-41
1
Total 40
Example 3 The maximum temperatures (in degrees celcius) and relative
percent) for Delhi for the month of August 1998, as reported by Meteoro-
humidity (in
logical department, are given below. Construct a frequency table for each.
Maximum temperatures (in degreecelcius) 32.5, 30.5, 33.8, 31.0, 28.6, 33.9,
33.3,32.4, 30.4, 32.6, 34.7,34.9, 31.9, 35.2, 35.3, 35.5, 36.4, 36.9, 37.0, 34.4, 32.5,
31.4, 34.4, 35.6, 37.3, 37.5, 36.9, 37.0, 36.3, 36.9, 36.7.
Relative humidity (in percent) 90, 97, 92, 95, 93, 95, 93, 85, 83, 85, 83, 77,
83, 77,74, 60, 71, 65, 74, 80, 87, 82, 81, 76, 61, 63, 58, 58, 56, 57,54.
Solution We have,
Lowest temperature 28.6° C
Highest temperature =37.5°C,
=
30-32
32-34
34-36
36-38 10
Total 31
We have,
Maximum humidity =97%, Minimum humidity =
54%
Range =97-54 43
Let us choose the class size 10.
as Then,
Number of classes = 5 Range 43 4.3
Class size 10
Let the lower limit for the first class
interval be 50. Then, various class
intervals are: 50-60, 60-70, 70-80,
80-90, and 90-100 and the frequency
distrihution by inclusive method is as
given below.
Frequency Distribution for Relative Humidity
Relative Humidity (in%)
Talymarks Frequency distribution
50-60
5
60-70
70-80
6
80-90
90-100
Total
31
Example4 The marks obtained by 40 students of
class VIII in an examination are given
below:
,8, 12,0, 8, 16, 12, 5, 23, 2, 16, 23, 2,
24, 1,7,21, 16, 13, 18,
10, 20, 12, 9, 7, 6, 5, 3, 5, 13, 21, 13, 15, 20,
23,7,3, 18, 17, 16.
Present the data in the form of a
frequency distribution using the same cla5s
size, one such class being 15-20 (where 20 is not
included).
Solution The minimum and
maximum marks in the given raw data are 0 and 2*
respectively, It is given that 15-20 is one of the class intervals and the class
S1ze is same. So, the
classes of equal size are: 0-5, 5-10, 10-15, 15-20 and 20-25.
Thus, the frequency distribution is as given below:
Example 5 The weights in grams of 50 oranges picked at random from a consignment are
as follows:
131, 113, 82, 75, 204, 81, 84, 118, 104, 110, 80, 107, 111, 141, 136, 123, 90, 78,30
115, 110, 98, 106, 99, 107, 84, 76, 186, 82, 100, 109, 128, 115, 107, 115, 119, 93,
187, 139, 129, 130, 68, 195, 123, 125, 111, 92, 86, 70, 126
Form the grouped frequency table by dividing the variable range into
intervals of equal width, each corresponding to 20 gms in such a way that the
mid-value of the first class corresponds to 70 gms.
Solution We have,
Size of each class = 20, Mid-value of first class = 70
We know that if a is the mid-value of a class and h is the class size, then the
h
lower and upper limits of the class are a-and a+ respectively.
Here, a = 70 and h = 20
interval
and, the upper limit of the first class 70+10 80
=
23.14
EXERCISE 23.2
examination are given below:
in an
40 students
of class VIII
by 23, 5, 12, 18, 8, 12, 6. &,
15, 19, 24, 16, 3,
obtained
1. The marks 21, 13,
13,
13, 10, 21, 7, 1,
16, 17, 18, 3, 7, 23, 18,
10, 2, 23. 15-20 and 20-25 and prepare a
16, 5, 3, 5, 0, 7, 9, 12, 20, 0-5, 5-10, 10-15,
five groups, namely
Divide the data into
grouped frequency table.
below:
a test a r e given
2. The marks scored by 20 students in
44, 58, 47, 64, 41, 57,
89, 53, 84, 57.
42, 68, 56, 62, 71, 78, 51, 72, 53,
54,
table:
Complete the following frequeney
Tally marks Frequency (No. of children)
(Marks in class intervals)
40-50
50-60
60-70
70-80
80-90 occurs?
What is the class interval in which the greatest frequency
3. The following is the distribution of weights (in kg) of 52 persons:
Weight in kg Persons
30-40 10
40-50 15
50-60 17
60-70 6
70-80 4
(i)What is the lower limit of class 50-60?
ii) Find the class marks of the classes 40-50, 50-60.
(ii) What is the class size?
4. Construct a frequency table for the following weights (in gm) of 35 mangoes using the
equal class intervals, one of them is 40-45 (45 not included):
30, 40, 45, 32, 43, 50, 55, 62, 70, 70, 61, 62, 53, 52, 50, 42, 35, 37, 53, 55, 65, 70, 73, 74, 45,
46,58,59, 60, 62, 74, 34, 35, 70, 68.
G) What is the class mark of the class interval 40-45?
i) What is the range of the above
weights?
(ii) How many classes are there?
5. Construct a frequency table with class-intervals
0-5 (5 not included) of the following
marks obtained by a group of 30 students in
an examination:
0,5,7,10, 12, 15,20, 22, 25, 27,8, 11, 17, 3, 6, 9, 17,
4,50, 16. 19, 21, 29, 31, 35,37, 40, 42, 45, 49,
6. The marks scored by 40 students of class VIII in
mathematics are given below:
81,55,68, 79, 85, 43, 29, 68, 54, 73, 47, 35,
83,62,64,72, 92, 84, 76, 63, 43, 54, 38, 73, 72,
68,
64, 95, 44, 50, 77, 64, 35, 79, 52, 45, 54, 70,
Prepare a frequency distribution with class 52,54.
size of 10 marks.
For more books join in telegram @freebooksyt
a Handling-/ Classification and Tabulation of Data) 23.15
monthly wages (in rupees) of 28 labourers working in a factory, taking one of the class
intervals as 210-230 (230 not included):
220, 268, 258, 242, 210, 268, 272, 242, 311, 290, 300, 320, 319, 304, 302, 318, 306, 292, 254,
278,210,240, 280, 316, 306, 215, 256, 236.
10. The daily minimum temperatures in degrees Celsius recorded in a certain Arctic
- 7.8,-2.9
Represent them as frequency distribution table taking - 19.9 to 15 as the first class
interval.
ANSWERS
THINGS TO REMEMBER
1. Numerical observations collected by an observer, is called a raw data.
2. The number of times an observation occurs in the given data, is called the frequency of the
observation.
3. Away ofpresenting data that exhibits the values of the variable and corresponding frequencies,
is called a frequency distribution.