Module 1
Module 1
Deepu Tiwari
Faculty of Biostatistics
Jammu Institute of Ayurveda and Research,
Jammu
Module 1 : Fundamentals of Statistics
Indirect Methods:
The indirect method is used in cases where it is delicate
or difficult to get the information from the respondents
due to unwillingness or indifference. The information
about the respondent is collected by interviewing the
third party who knows the respondent well.
For E.g. information on addiction, marriage proposal,
economic status etc.
Questionnaire Methods:
A questionnaire contains a sequence of questions
relevant to the study arranged in a logical order.
The general guidelines for a good questionnaire
(1) The wording must be clear and relevant to the study.
(2)Ability of the respondents to answer the
questions to be considered.
(3)Ask only the necessary questions so that the
questionnaire may not be lengthy.
(4) Arrange the questions in a logical order.
(5)Questions which hurt the feelings of the
respondents should be avoided.
(6) Calculations are to be avoided.
(7)It must be accompanied by the covering letter
stating the purpose of the survey and guaranteeing the
confidentiality of the information provided.
Local Correspondents Method:
In this method, the investigator appoints local agents or
correspondents in different places. They collect the
information on behalf of the investigator in their locality
and transmit the data to the investigator. This method
is adopted by newspapers and government agencies.
am Manohar Lohia Institute of Medical Sciences,
Enumeration method:
In this method, the trained enumerators or
interviewers take the schedules themselves, contact
the informants, get replies and fill them in their own
hand writing. The voters’ list preparation, information
on ration card for public distribution in India, etc.,
follow this method of data collection.
Secondary Data:
Secondary data is collected and processed by some other
agency but the investigator uses it for his study. They
can be obtained from published sources such as
government reports, documents, newspapers, books
written by economists or from any other source. Before
using the secondary data scrutiny must be done to
assess the suitability, reliability, adequacy, and accuracy
of the data.
Sources of Secondary Data:
The secondary data comes from two main
The published sources include:
(a) Government Publications
(b) International Publications
(c) Publications of Research
institutes
(d) Journals or Magazines or
Newspapers
Classification of Data
‘Knowledge is power, and data is just data. No matter
how much data you have on hand, if you don’t have a
way to make sense of it, you really have nothing at all’ -
Unknown
Classification of Data:
The data that are unorganized or have not been
arranged in any way are called raw data.
Raw data must be presented in a condensed form and
must be classified according to homogeneity for the
purpose of analysis and interpretation.
An arrangement of raw data in an order of
magnitude or in a sequence is called array.
Specifically, an arrangement of observations in an
ascending or a descending order of magnitude is said to
be an ordered array.
Thus, Classification is the process of arranging the
Types of Classification:
The raw data can be classified in various ways
depending on the nature of data.
The general types of classification are:
(i) Classification by Time or Chronological
Classification
(ii) Classification by Space or Spatial Classification
(iii) Classification by Attribute or Qualitative
Classification and
(iv) Classification by Size or Quantitative
Classification.
Rules for Classification:
There are certain rules to be followed for classifying the
data (i)The classes must be exhaustive, i.e., it should be
possible to include each of the data points in one or the
other group or class.
(ii)The classes must be mutually exclusive, i.e.,
there should not be any overlapping.
(iii)It must be ensured that number of classes
should be neither too large or nor too small. (no. of
Tabulation of Data
A logical step after classifying the statistical data is to
present them in the form of tables. A table is a
systematic organization of statistical data in rows and
columns.
Advantages of Tabulation:
(i) is a logical step of presenting statistical data after
classification.
(ii)enables the reader to understand the required
information with ease as the information is contained in
rows and columns with figures.
(iii)enables the investigator to present the data
in a brief or condensed and compact form.
(iv)Comparison is made simple by displaying data to
be compared in a single table.
(v)easy to remember the data points if they are
properly placed in the form of table.
(vi)facilitates easy computation and helps easy
detection of errors and omissions.
am Manohar Lohia Institute of Medical Sciences,
Types of
Table
Statistical tables can be classified under two general
categories, namely, general tables & summary tables.
General tables contain a collection of detailed
information including all that is relevant to the
subject or theme.
Summary tables are designed to serve some specific
purposes. They are smaller in size than general tables,
emphasize on some aspect of data and are generally
incorporated within the text. The summary tables are
also called derivative tables or interpretative tables
because they are derived from the general tables & aims
at analysis and inference.
The statistical tables may further be classified into
two broad classes namely simple tables & complex
tables.
A simple table summarizes information on a single
characteristic and is also called a univariate table.
E.g. The marks secured by a batch of students in a
class test are displayed as