0% found this document useful (0 votes)
7 views

02 Data Collection & Classification

Uploaded by

Porush Jain
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views

02 Data Collection & Classification

Uploaded by

Porush Jain
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 22

Data Collection &

Classification
Some Definitions
• What is data?
Data are collections of any number of
related observations.
• What is data set?
A collection of data is called a data set.
• What is singular of Data?
Datum. A single observation is datum.
Sources of Data
• Primary Sources
A primary source is one that itself
collects the data. Such data are
original or first-hand data.
• Secondary Sources
A secondary source is one that makes
available data which were collected
by some other agency.
Primary Sources
• Direct Personal Interviews
• Indirect Oral Interviews
• Information from Correspondents
• Mailed Questionnaire Method
• Schedules sent through Enumerators.
Secondary Sources
• Published Sources
• Reports and publications of
• International Bodies e.g. World Bank, WTO,
ILO, UNO etc.
• Central and State Governments
• Ad-hoc Committees and Commissions etc.
• Semi-official Publications of Various Local
Bodies.
-Contd…
Secondary Sources Contd…
• Publications of autonomous and private
Institutes etc.
• Unpublished Sources
• Records maintained by Government and
Private Bodies.
Classification of Data
• Geographical Classification
• Chronological Classification
• Qualitative Classification
• Quantitative Classification
Geographical Classification
In this type of classification data are
classified on the basis of geographical
location differences between the
various items, e.g. states, cities,
regions etc.

-Contd…
Geographical Classification Contd…
Country Human development index
(Year 2002)
Norway 0.956
Australia 0.946
China 0.745
Sri Lanka 0.740
Indonesia 0.692
India 0.595
Bangladesh 0.509
Pakistan 0.497
Chronological Classification
In this type of classification data are
observed over a period of time.

Year Sales (Million Rs.)


2001 10
2002 15
2003 20
2004 25
2005 15
2006 12
Qualitative Classification
In this type of classification data are
classified on the basis of some
attribute or quality such as colour,
sex, literacy, religion etc.
Qualitative Classification Contd…
Population

Males Females

Literates Illiterates Literates Illiterates


Quantitative Classification
In this type of classification data are
classified on the basis of some
characteristics that can be measured, such
as height, weight, income, sales, profits
production etc.

Such type of data generates Frequency


Distribution.
Contd…
Frequency Distribution
A Frequency Distribution refers to
data classified on the basis of some
variable that can be measured such
as prices, wages, age, number of
units produced or consumed.
Elements of Frequency Distribution
• Variable
The term “variable” refers to the
characteristic that varies in amount or
magnitude in a frequency distribution
• Frequency
The term “Frequency” denotes number of
times an item repeats in a frequency
distribution.
Types of variables
• Discrete Variable
A Discrete Variable is that which can vary
only by finite jumps and cannot manifest
every conceivable fractional value, e.g. No.
of rooms, number of persons etc.
• Continuous Variable
A Continuous Variable is that which is
capable of manifesting every conceivable
fractional value within the range of
possibilities, e.g. Height, weight, length etc.
Types of Frequency Distributions
Discrete Frequency Distribution Continuous Frequency Distribution

No. of Children Weight (lbs) No. of


Frequency Persons
0 10 100-110 10
1 40 110-120 15
2 80 120-130 40
3 100 130-140 45
4 250 140-150 20
5 150 150-160 4
6 70 Total 134
Total 680
Formation of Discrete Frequency Distribution

1 0 2 3 4 5 6
7 2 3 4 0 2 5
8 4 5 12 6 3 2
7 6 5 3 3 7 8
9 7 9 4 5 4 3
Elements of Continuous Frequency
Distribution
• Class Limits
Class limits are the lowest and highest values that
can be included in the class.
• Class Intervals
The difference between the upper and lower limit of
a class is known as class interval of that class.
i = U-L

Where, i = Class interval


U = Upper Limit
L = Lower Limit

Contd…
Elements of Continuous
Frequency Distribution Contd…
• Class Frequency
The number of observations corresponding to a
particular class is known as the frequency of that
class or the class frequency.
• Class Mid-Point
It is the value lying half-way between the lower and
upper class limits of a class-interval.
U L
M
2
Where, M = Mid-Point
Methods of Classification of Data

• Exclusive Method
In this method upper limit of one class is the
lower limit of next class and an observation
equal to upper limit is excluded from that class.
• Inclusive Method
In this method an observation equal to upper
limit is included in that class.
Example:
Exclusive Method Inclusive Method

Marks No. of Marks No. of Students


Students
0-9 5
0-10 5
10-19 10
10-20 10
20-29 15
20-30 15
30-39 15
30-40 15
40 & above 5
40-50 5
Total 50
Total 50

You might also like