Classification
Classification
Unit No: 1
Unit Title: Classification & Tabulation of Data
Subject Code: 02OE0004
Subject Name: Statistics and its Applications
Semester : 3 & 5 & 7
Session: 2024-25
Department of Mathematics
The subject of Statistics, as it seems, is not a new discipline but it is as old as the human society itself.
It has been used right from the existence of life on this earth, although the sphere of its utility was very much
restricted. In the old days, Statistics was regarded as the ‘Science of Statecraft’ and was the by product of the
administrative activity of the State. The word Statistics seems to have been derived from the Latin word ‘status’ or
the Italian word ‘statista’ or the German word ‘statistik’ or the French word ‘statistique’, each of which means a
political state. In the ancient times the scope of Statistics was primarily limited to the collection of the following data
by the governments for framing military and fiscal policies :
(i) Age and sex-wise population of the country ;
(ii) Property and wealth of the country ;
Statistics
DEFINITION OF STATISTICS
Statistics is the science which deals with classification and tabulation of numerical facts as the basis for explanation,
description and comparison of phenomenon.
Statistics is the science which deals with the methods of collecting, classifying, presenting, comparing and
interpreting numerical data collected to throw some light on any sphere of enquiry.
Statistics is a method of decision making in the face of uncertainty on the basis of numerical data and calculated
risks.”
Statistics
Statistics in Planning
Statistics in State
Statistics in Economics.
Statistics in Business and Management.
Statistics in Accountancy and Auditing
Statistics in Industry
Statistics in Physical Sciences
Statistics in Social Sciences
Statistics in Biology and Medical Sciences.
Statistics in Psychology and Education
What is Data?
Definition: Facts or figures, which are numerical or otherwise, collected with a definite
purpose are called data.
• Everyday we come across a lot of information in the form of facts, numerical figures, tables, graphs, etc.
• These are provided by newspapers, televisions, magazines and other means of communication.
• These may relate to cricket batting or bowling averages, profits of a company, temperatures of cities,
expenditures in various sectors of a five year plan, polling results, and so on.
• These facts or figures, which are numerical or otherwise, collected with a definite purpose are called data.
What is Data?
Primary Data Vs Secondary Data
Primary Data
Primary data is the data that is collected for the first time through personal
experiences or evidence, particularly for research.
It is also described as raw data or first-hand information.
The mode of assembling the information is costly.
The data is mostly collected through observations, physical testing,
mailed questionnaires, surveys, personal interviews, telephonic
interviews, case studies, and focus groups, etc.
Primary Data Vs Secondary Data
Secondary Data
Secondary data is a second-hand data that is already collected and recorded by
some researchers for their purpose, and not for the current research problem.
It is accessible in the form of data collected from different sources such as
government publications, censuses, internal records of the organisation, books,
journal articles, websites and reports, etc.
This method of gathering data is affordable, readily available, and saves cost
and time.
However, the one disadvantage is that the information assembled is for some
other purpose and may not meet the present research purpose or may not be
accurate.
Discrete Vs continuous data (Quantitative Data)
Discrete data (countable) is information that can only take certain values. These
values don’t have to be whole numbers but they are fixed values – such as shoe
size, number of teeth, number of kids, etc.
Discrete data includes discrete variables that are finite, numeric, countable, and
non-negative integers (5, 10, 15, and so on).
Continuous data (measurable) is data that can take any value. Height, weight,
temperature and length are all examples of continuous data.
Continuous data changes over time and can have different values at different time
intervals like weight of a person.
Nominal Vs Ordinal data (Qualitative Data)
Nominal Data – Nominal data is a basic data type that categorizes data by labeling or naming
values such as Gender, hair color, or types of animal. It does not have any hierarchy.
Nominal Vs Ordinal data
Ordinal Data – Ordinal data involves classifying data based on rank, such as social status in
categories like ‘wealthy’, ‘middle income’, or ‘poor’. However, there are no set intervals between
these categories.
Choice between Primary and Secondary Data:
The investigator must decide at the outset whether he will use primary data or
secondary data in an investigation. The choice between the two depends mainly on
the following considerations:
Nature and scope of the inquiry.
Availability of financial resources.
Availability of time.
Degree of accuracy desired, and
The collecting agency, i.e., whether an individual, an institution or a Government body.
Methods of Collecting Primary Data:
Merits:
The data is more reliable as they are obtained directly.
There is chance of more response in it.
The questions can be adjusted according to the standard of the sources.
There is uniformity in data.
Methods of Collecting Primary Data: