CH 3 Org of Data Together With
CH 3 Org of Data Together With
I
lf
'l
t
0rgan1sat1on o_f- Data •
Learning Outcomes
After studying this chapter, you should be
able to:
classify the data for further statistical analysis.
distinguish between quantitative and qualitativ
e classification.
prepare a frequency distribution table.
know the technique of forming classes.
discuss the method of tally marking.
differentiate between univariate and bivariate
frequency distributions.
...
data
hnique of classifying the low er lim it of a cla ss.
requency array is a tec takes the up pe r lim it an d the
ch a discrete variable no n- me as ur ab le ch ara
cte ris tic or
or a discrete variable. Su ency 22. At tri bu te ref ers to the
ctional values. In .frequ ur ed nurnericaUy.
tegral values and no t fra h of which ca nn ot be me as
that correspond to eac
y, we have frequencies tion ma y be of tw o kin
ds :
23. Frequency distribu
integral values. en cy dis tri bu tio n
data (a) Univariate fre qu
a technique ofclassifying
.frequency distribution is ws (b) Bivariate fre qu en
cy dis tri bu tio n
frequency distribution sho fre qu en cy
a continuous variable. A uted in fre qu en cy dis tri bu tio n is the
of a variable are distrib A un iva ria te ari ate
ow the different values va ria ble wh ere as a biv
th their corresponding
class dis tri bu tio n of a sin gle bu tio n of
·trerent classes alongwi dis tri bu tio n is the fre qu en cy dis tri
fre qu en cy
uencies.
Jike two variables.
cy distribution, concepts lcu lat ion s are
the formation of.frequen and tribution, sta tis tic al ca
erval, class frequency 24. In a frequency dis
s, class limits, class int ma rk or mi d-v alu es, ins
tea d of
s mid-values are used. based on ly on the cla ss-
servations.
values having two ends
calJed actual values of the ob
• A class is a group of co ns tru cti on of
ste ps tha t are inv olv ed in th e
class limits. 25 . M ai~
two class limits is called contmuous series are:
The difference between the the nu mb er of cla sse s
the class interval/class siz
e. (a) Determination of
ran ge
limits is caJied the mid- (b) Determination of
The average of the class erv als
value (or mid-point). (c) Siz e of class int
36 7 ~ u,,U4® Economics-11
[DoEJ
27. Define class-interval.
dual is a var iab le. between
. (A)·• Age of an indivi ss-width is the difference
JS. Assertion Ans. Class interval or cla en
istic which lower class limit a giv
of
. Va riable means the character the upper class limit and
Reason (R) • class.
changes. ) are tru e and (NCT 2017, 131
sse rtio n (A) and Reason (R
(A) 28 . W ha t is exclusive series?
BOth A •
way that
ns. (a) Reason (R) is the correct explanation ofAssertion
•
we form classes in such a
Ans. Under this method, class
classification is based on coincides with the upper
• (A)·• Chronological the lower limit of a class
J9. Assertion limit of the previous class.
period of time.
ssi fic ati on is ba sed up
on de d' distribution?
Reason (R): Sp ati al cla 29. W ha t is an 'op en -en
the first class
loc atio n. An s. A dis trib uti on in whic~ the low er limit of
geographical ssing.
th As ser tio n (A ) and Re ason (R) are t~ e and and up per limit of the last class are mi
s. (b) Bo
rre ct ex pla na tio n of
Reason (R) is no t the co tions [3 /4 M ar ks ]
Sh ort An sw er Type Ques
Assertion (A) fyi ng ra w da ta with the
te var iab le mo ve in a finite jump. 30. Ex pla in the pu rp ose of cla ssi
o. Assertion (A) : Dis cre
ss is an example of discrete help of an exa mp le. s
a cla allah
disorganised as the Kabadiw
Reason (R) : De sk in
Ans. Ra w data is as highly mb ers e to
om
variable.
) are tru e and jup k, wh ich is quite large and als o ve ry cu
ason (R it is. It caUs for a pro pe
r
Both Assertion (A) and Re of dle. It cannot be use d as
co rre ct ex pla na tio n han for
Reason (R ) is no t the ion bef ore it can be use d
organisation an d presentat
Assertion (A) aft er the da ta ha s bee n
fur the r statistical analysis. Thus, ted
21. Give the meaning of
organisation of data. anised an d the reb y pre sen
in wh ich coU ect ed collected it has to be org
a process
s. Organisation of data is n, in a classified form.
ani sed in suc h a way that in investigatio s mp ris es of old
dat a is org
y for com par iso n and Fo r ex am ple Ka ba diw all ah jun k co
e eas etc.
the mass of data may becom pty gla ss bottles, pla sti cs
newspapers, magazines, em s or
further analysis. households, he seg rag ate
So, aft er co1Jecting it from
wa y he ma ke s his tra de
2. What is raw data? cla ssi fies them itemwise. Th is ise d
m ite mw ise an d in an org an
investigator in its original for easy, as jun k is classified
s. The data coUected by the
is called 'raw data'. way.
of
ssificati on . IDo EJ
ssification wi th the he lp
3. State the meaning of cla 31 . Ex plain chronological cla
of
a technique with the help
s. Classification of data is a sche~ule.
ang ed int o different groups ing to tim e, suc h a
which the collected dat a is arr
cs so An s. Wh en the raw data is classified acc ord
e common character isti ssification'.
or classes, according to•som '
cla ssi fic atio n is kn own as 'Chronological Cla
n
, analysis and interpretatio are classified in asc en din
g or
as to facilitate the tabulation In this classification, da ta nth s
of data. of time such as weeks ' mo '
descending order in terms
bute' . years, decades etc.
. Give an example of 'attri
e qua lity suc h as
basis of som Population of lndJa
s. The data classified on the mp les of an
on etc . are exa
marital status, literacy, religi Year Population (fn crores)
"attribute ".
pulation' a time series? 2011 121.0
S. Why Js the variable 'po
on' is a time series, as it dep
icts 2001 102.7
s. The :ariable 'populati
years. 81.8
a senes of values of different 1991
6• Give the meaning of Mid-value. 1981 68.4
mid point which lies
•· Mi d-value of a class refers to the 1971 54 .6
class limit and the upper
half way between the lower 1961 43.8
class limit of a class.
~ ~® E co no m ic s- 11
38 7
.
32. Di st 1n gu 1s between the di sc re te va ri
· h
. us variable ab le an d Th e difference betw
con ti nuo (NCT 20 15 ; D oE . een th
Ans. A "discrete • \ a class mterval is ca eupper
variable" is the one lled th
or discontinuous va which takes only isolated T he numerical fi emagru.tu and loWer
d
lues. There are ju .
a discrete variabl
e, e.g., number of
m ps in ca se of up pe r limits of a gu re s us ed t0 e of the t'C,....III.
go als sc or ed in a , clas . sp
match. Intem1ediat
e values between Fo r example if th s interval' ecify •'-- 'Ill: t...
2 and 3 are not po
ssible as there ca
l an d 2 or be tw ee n ,
lo w er class limit
e
c 1as s .
m te ar
1.e cau-~
""tla""1a.Jti
nnot be 1.4 or 2. is 15 and rv a is
goals in a match. Si 5 case. the 1S - 19
milarly, the numbe
r of UpPer r ..
family, number of ch ild re n in a 36. Exp llllit 11 19 •
students in a class, lain the 'exclusive
factory in a particu fans pr od uc ed in ' an d ,.
lar year etc. are ex a in classification '
amples of discrete of dat inclusive' lllett.....
variables. a. -·'" 911
Whereas a continuo
us variable is th e Or lNcr
any value in a spec on e w hi ch ca n ta ke D is tin gu is h be tw
ified interval. Tem m et ho d. ee n exclusive
of patients in a ho perature re co rd ed methOd Ind
spital, heights of
a University, wag all B B A students Th er e
es of all workers of Ans. . ar e two methods of cl
asS1'fyin
examples of cont in a f:actory are
so m e c l as s m te rv als, namely, g data lNcr
inuous variables.
33 . St at e w hi ch (a ) Exclusive m
of th e fo ll ow in ethod• Under thi
s
variable and w hi g re pr es en t di sc re te of
. th e cl.ass is excluded Th method,,._
ch represent cont mterval IS the low • e upper lim-i'lit "Ior
(a) H ei gh t of th inuous variable. er limit of th
e st ud en ts of cl as Marks enextclaa,
(b) N um be r of s X I in a sc ho ol
ch il dr en in a fa 0 -1 0 10-20 20
(c) N um be r of
(d ) W ei gh t of
go al s sc or ed in
th e st ud en ts of
m ily
a m at ch W
N o. of Students
e in cl ud e th
2
ose students in fir
5
st group
t
A ns . (a ) Contin cl as s X I in a sc ho marks are O or mor
uous ol e but less than 10. If
(b ) D is cr et e of a student are l the
(c) D is cr et e 0, he is not included
(d) Continuous group bu t in the se in 111
34 . W ha t do yo cond group, i:e., 10
u kn ow ab ou t Fr (b ) Inclusive m -20.
eq ue nc y distribu ethod. Under this
Class lim it s an d tion, lower limit and th me tho d,
C la ss m ar k. e upper limit is includ
Ans. A compreh re sp ec tiv ed·
ensive w ay to clas e class, e.g.,
sify ra w data of a qu
va ri ab le is kn ow antitative Marks
n as a frequency 0 -9
distribution. • N o.
C la ss li m it s ar e of St udents
th e tw o en ds of 2
va lu e is ca lle d 'lo a cl as s. T he lo w We include student
w er cl as s lim it' an es t s in th e first group wlMJaD
d th e highest valu are between 0 and
is ca lle d th e 'u pp e 9. If the ID8lks of a
er cl as s lim it' of l 0, he is included
a distribution. in the next group, i.e
C la ss m ar k refers 37. W ha t is loss .. lit
to class 'mid-point of information in
value of a class. It ' w hi ch is th e middle cla ssified ctata!
is calculated by th
e formula given be (Netlt11
.Cl 'd ~ow. Ans. Ther
C lass m arkl ass rm -p . U pp er lim it + Lo e is an inherent shortcomin
oi nt = w er limit g in the c)as.,ifiell
matter classificatio
35 . W ha t is a 2 n of data summarisC
cl as s interval? H
ow do th e magni making it concise an 5 the -
th e cl as s in te rv tu d ea sy to co mp rehell
al an d class lim de of show the details fo d- ~ :. .
its ge t determined und in the raw data-
A ns . W he n th e ? '{b el e•odd
w ho le ra ng e of of information in cl
values is classifie as si ~ ~w.d a ta
gr ou ps in th e fo rm d in some much is gained by -~
of intervals, th en summariS1ll8 it as 1
is kn ow n as cl as ea ch such interval Once the data are gr .
;n,ttllllll!
s -interval. Fo r ex ouped into ctasseS,
am pl e, w e m ay want ha s no sig nificance in further an
to classify a gr ou statistical caliedlMl!!l!I
p of pe op le acco th e us
Here, th e cl as s inte rd in g to age group. e of class mark instead
rvals m ay be 15 - of tbe actual
19 years, 20 - 24 observation in statis
years, 25 - 29 year tics in vo lv es inc,ieed. •
. s etc. loss of informatio
n.
.
Organisation of Data 39
Long An~er Type Questions [6 Marks] Class Tally Bars Frequency (/J
inclusive method.
chairs etc.
Exclusive method is the method when the class intervals (a) Explain how does classification of data
are so fixed that the upper limit of one class is the lower order to raw data.
limit of the next class. Whereas under inclusive method (b) Give an example each of a discrete variabk 1
is used more in practice so when the data is given in the sense that raw data can be classified thlO'f
inclusive series then it is first arranged in exclusive series frequency distribution in a comprehensive_
of c,.....itllPII
for further statistical enquiry. Once one masters the teehn1ques. 1
_t!,
(a) The method in which both lower and upper limit one can easily construct a frequency 1
of du• are included in same class is known as both for the discrete variable and tho
---- ladulve
method. variable.
(b) Discrete variable - Number of chili
·;Bsdllllve Continuous variable - Height of
Or ga nis ati on of Da ta
41
.
the Economics test:
• al Qu es tio ns
k ht ·ne d by 25 stu de nts of a class in
s o a1
Nu me ricare a frequency arr ay of ma r , 40, JO, 50 , 25, 30 , 30 , 15
, 45
20 35 35' 50 ' 15' 50' 25' 40 , 45 50, 50, 50
,
'
• 44. Pre p 40 25 25 30 ,40 , ' 25 25 30 30 30 30, 30, 35, 35, 40, 40, 40, 40
' ,25 ,25 , ' , ,
30, 15',20 ',20 ,20 , ,
20, 15
20, 15,. 15 Fre qu en cy Ar ray
An s. Array. , '
f
TaUy Ba n
Ma rks 3
Ill 3
15
20
Ill 4
25
/Ill 5
30
tH1 2
35
II 4
40
Ill / I
45
I 3
• Ill
50 "1;f 25
En gli sh tes t:
g are the ma rks ob tai ne d by 20 stu de nts in an . .
45. Followin .
25, 25 , 26 , 26, 30 , 31 , 31 , 34 , 35: 42 , 48
5, 16, 17, 17, 20, 21, 22 , 22, 22 , lus1ve me tho d.
ing cla ss int erv al of 10 usi ng ex clu siv e an d 1nc
Pre pa re a firequency dJS • tri bu tio n tak In clu siv e Me tho d
ve e th o d
s. • E xcIus1• M Ta lly Ba rs f
f C. I.
C.I. Tally Ba rs 1
1 0- 9 I
0- 10 I 3
3 10 - 19 Ill
10 - 20 Ill 9
20 - 30 t/11 Ill / 9 20 - 29 tH1 Ill /
5
30 - 40 ' t/11 5 30 - 39 tH1
40 - 49 II 2
40 - 50 II 2
"1;f = 20
I;f = 20
cy dis tri bu tio n wi th cla ss int erv al
m of exc lus ive fre qu en
en. Pre sen t the m in the for
46. Wa40 ges of. 50 workers are giv
as -50 79 79 79 78
72 76 77 77
87 88 70 60 61 62
90 88 82 86 87 59 60 60 60
95 92 91 63 63 63 63 64 58
78 68 67 67 66 65
54 53 52 52 so
55 55 55 56 57 , 62 , 62 ,
62 62 , 57 58 , 59 , 60 , 60 , 60 , 60 , 61
53, 54, 55, 55, 55
, 56 78 , 79 ,
s. Arr ay: 50, 52, 52, 67 , 68 , 70 , 72 , 76 , 77 , 77 , 78 ,
An 66, 67,
62, 63, 63, 63, 63, 64, 65, 88, 90, 91, 92, 95
82, 86, 87, 87, 88,
79, 79,
str ibu tio n
Exclusive Fre qu en cy Di
Class Tally Ba.n
50 - 60 tH1 tH1 II 12
60 - 70 tH1 tH1 tH1 Ill 18
70 - 80 tH1 tH1 10
80 - 90 tH1 I 6
90 - 100 Ill/ 4
l;f = 50
42 7 ~ r«d® Economics- 11
..
20 - 29, 30 - 39, 40 - 49, 50 - 59, 60 - 69
(b) Form the cumulative frequency distribution also.
r-----.------,i-------r--
Ans. (a) (b) I I Exclusive I
I
. C.I. Tally Bars f C.I. Groups
f
20 - 29 II 2 20 - 29 19.5 - 29.5 2
30 - 39 /Ill 4 30 - 39 29.5 - 39.5 4
40 - 49 tlfl Ill 8 40 - 49 39.5 - 49.5 8 14
50 - 59 tlfl I 6 50 - 59 49.5 - 59.5 6 20
60 - 69 tlfl 5 60 - 69 59.5 - 69.5 5 2S
rf = 25 rf = 2S
48. Convert the following inclusive series into exclusive series:
C.I. 1- 5 6 - 10 11 - 15 16 - 20 21 - 25
No. of Workers 10 15 20 25 30
Ans. Exclusive Series
C.I. f.
0.5 - 5.5 10
5.5 - 10.5 15
I
10.5 - 15.5 20
15.5 - 20.5 25
20.5 - 25.5 30
:Ef = 100
49. From the following frequency distribution, prepare 'less than' and 'more than' cumulative frequency distribudoa:1
11 j
No. of Workers 4
I I
l
12 20 7 s j
Ans.
Wages (f) No. of workers (() c.f.
c.f.
-,
! ;
Oess than)
100- 110 4
(more thu) I l
4 48 l
110- 120 12 .lj
16 44 j;.,
120-130
20 ·il;
36
130-140 32
7
43
140-150 12
5
48
If=48
s
so. Convert the following 'less than' cumulative frequency series Into 'mo~ than' cumulative frequency serfell
Marb (lea tban). 5 10 1 15
.Ctmrdatlve !!equency 12
25 I 35
Org ani sat ion of Da ta 43
C.I. f
ns.
0- IO 16
10 -20 24
20 -30 30
30 -40 18
40 -50 8
50 -60 4
PRACTICE QUESTIONS
es of
Type Questions [1 Mark] 3 • - - - - - and _ _ _ _ _ are the two typ
Objedive/Very Short Answer qualitative classification.
l. In the exclusive method: series include the value of bot
h the upp er
excluded but lower class 4. . .
(a) the upper class limit is hm1t and low er lim it itself.
limit is included. __ _ lim it of a class are
S. __ __ _ limit and __
(b) ~e_l~w~r class limit is
excluded but upper class called class Jimit.
hm1t 1s included. gin al for m is kno wn as
6. A ma ss of dat a in its ori
(c) ~oth the upper class lim
it and lower class limit are
tion.
mcluded. 7. __ __ _ cannot be in frac
(d) None of these largest and the sm alle st
8. Range is the ave rag e of the {True/False)
2• W h'h
ic is' an attribute out of the following? observation.
(b) Weight known as one wa y frequency
(a) Height 9. Univariate frequency is also {True/False)
{c) InteJligence (d) Age distribution.
,4 70 9e ~ ~ ® Eco nom ics- 11
expenditure on food:
end the following monthly271 169
•• • 310 129 335 184 234 245
212 223 256
22. 30 fam1hes m an area ;~5 220 238 278 243 220 238 238
172 229 173
115 159 196 135 .
241 265 298 144
of
(a) Prepare a frequency dist
ribution with the class mterval
.
250, 250 - 300, 300 - 350
100 - 150, 150 - 200, 200 - e on foo is
d
(b) F. d the number of fam
ilies whose monthly expenditur
(ii) more than ? 300 (iii) between ? 200 and t 300
(i;nless tha n? 25 0
ribution-
mto ord.mary frequency dist
23. Convert the following 35 40 45 50
n 25 30 105 120
(a) Les s tha 50 85
8 22
f 50
40
I I I 11 4
(b) I Mo r~ than I 5~
I
given data:
er lim it of the first class and upper limit of last class from the
24. Calculate low
Fre que ncy
Classes
12
Less than IO 15
IO- : 20 4
20 -30 16
30 -40 3
Above 40
50
Total
ONS
NSWERS TO PRACTICE QUESTI
16. (a) Height of a student
is a continuous variable. It is
1. (a) 2. (c) al val ues like 80.85
ld classification so because it can take fraction
3. Simple classification, manifo cm, f0I .62 cm etc.
5. Upper, lower continuous variable. It is so
4. IncJusive (b) Distance covered is a
7. Discrete series values like 48. 4 km ,
6. raw data because it can take fractional
8. False, as range is the differe
nce between the largest and 59.2 km etc.
the smaJiest observation. (c) Nu mb er of students in
a class is a discrete variable
ole num ber s only.
9. True, as data is classified on
the basis of single variable. because it can take val ue of wh
, e.g.. num ber of
but Reason (R) is true It can not tak e fractional values
JO. (d) Assertion (A) is false not 40.5 or 40.6.
Reason (R) is true students can be 40 or 41 but
11. (d) Assertion (A) is false but trib uti on is the fre que ncy
based on quality is cal led 17. Un iva ria te fre que ncy dis
12. A classification of dat a le, wh ere as .Bi var iate
"Qualitative Classification". dis trib utio n of a sin gle var iab
frequency distribution of
d to measure an "attribute" frequency distribution is the
13. A variable is a quantity use
which can take different two variables.
• ofsomething or some persons,
values in different situ ations. 18. (a) (iii ) a class
ation occurs in the given
14. Number of times the observ (b) (i) one
series is called frequen cy.
(c) (iv) All of the above
on the basis of time then it
JS. When the data is classified (d) continuous
cation". For e~ample,
is known as "chronological classifi n in cla ssif yin g raw data.
on the basis of number of 19. Th ere is los s of informatio
we may present population classes an individual
years. On ce the dat a are gro upe d into
the basis of place such nce in fur the r sta tist ica l
When the data is classified on obs erv atio n has no sig nif ica
as country, state, are a etc ., it
is kno wn as "sp atia l when the data are grouped
calculations. The reason being
duction of wheat in ation has no significance
classification". For example, pro into classes an individual observ
different States of lndia. because when dat a are
in futher calculation. This is so
7 ~ """-® Economics-11
CERT EXERCISES
of tbe followln& alternadves is true? (b) The product of upper clasS Umlt1
c1UI midpoint II equal to: lower class limit.
3)e averqe of the upper class llmlt and (c) The rado of the 1:1pper dlll
~' ,_ ,. . . . . llmlt. lower class limit.
(d) None of the above
•
Organisatio n of Data 47
5. Use the data in Table 3.2 that relate to monthly household expend
iture (in on food of 50 h
(1) Obtai n the range of monthly household expenditure
on food. ouseholds •nd
(ii) Divide _the range into appro priate numb er of class
interv als and obtain the frequenc d
expen diture .
Y lstrn,"'-
(ii,) Find the numb er of households whose monthly expen
diture on food is
(a) less than f 2000
(b) more than f 3000
(c) between f 1500 and f 2500
Table 3.2
Month ly Household Expen diture (in Rupees) on Food of 50 House
holds
1904 1559 3473 1735 2760 2041 1612 1753 1855
5090 1085 1823 2346 1523 1211 1360 1110 2152 1111
1218 1315 1105 2628 2712 4248 1812 1264 1183 11'11
4248 1812 1264 1183 1171 1007 1180 1953 1137 20I
1007 1180 1953 1137 2048 2025 1583 1324 2621 361
1397 1832 1962 2177 2575 1293 1365 1146 3222 131
Ans. (l) Range = Hv - Ly = 5,090 - 1,007 = f 4,083
(ii) Frequency distribution of expenditure of 50 households
Expenditure c.r. c.f.
No. of Households •
(in ') (less than) (more than
1,000 - 2,000 33 33 50
2,000 - 3,000 11 44 17
3,000 - 4,000 3 47 6
4,000 - 5,000 2. 49 3
5,000 - 6,000 1 50 1
Total Lf = 50
(iii) (a) There are 33 households whose monthly expend
iture on food is less than f 2,000.
(b) There are 6 households whose monthly expenditure on
food is more than f 3,000:
. In a city 45 families were surveyed for the numbe r of celf. phones
6 they used. Prepa re a frequency array b
their replies as record ed below.
1 3 2 2 2 2 1 2 1 2 2 3 3 3 3
3 3 2 3 2 2 6 1 6 2 1 5 1 5 3
z 4 2 7 4 2 4 3 4 2 0 3 1 4 3
Frequency Array
Numb er of cell phones Tally Bars No. of fa
0 I 1
I tH1 II 7
2 tH1 tH1 tH1 15
3 tH1 tH1 II 12
4 tH1 s
5 II 2
6 II 2
7 I 1
If== 45
Organisation of Data 49
Tally Bars f .
C.I. . , J '
8-14 I
tH1 tH1 II 12
29-35 tH1 I 6
36-42 II 2
:Ef= 60
11. "The
tt quick
.
.
b rown fox J·umPs over the lazy dog " Examin th •
e ers ID each word. Treating the number of le~ers in a ve . e given sentence carefully and note the number of
s. 1 . ariable, prepare a frequency array for this data.
Freque.ncy Array
Wo.-ds
The
Quick /Jfl 5
Brown /Jfl 5
Fox Ill 3
Jumps /Jfl
Over Ill/
The Ill
Lazy
Dog