Introduction and Statistical Technologies (1) - 1
Introduction and Statistical Technologies (1) - 1
SWE- 3107
S. M. Shahriar Zaman
Lecturer
Soil, Water and Environment Discipline
1
Introduction and Statistical Terminologies
Definition of Statistics
Importance of Statistics in Biological/Agricultural Sciences
Population and Sample
Data, Variables (Qualitative, Quantitative: Discrete and Continuous)
Descriptive Statistics
Inferential Statistics
Type of Statistical Data
Collection of Statistical Data
2
Introduction and Statistical Terminologies
Definition of Statistics
Statistics is the study of
theories, methods and
techniques developed
and applied for
collection, classification,
presentation, analysis
and interpretation of
data relating to any
sphere of enquiry.
3
Introduction and Statistical Terminologies
4
Introduction and Statistical Terminologies
A population is a complete
set of individuals, objects or
measurements having some
common characteristics,
while a sample is subset or
part of the population
selected to represent the
population.
5
Introduction and Statistical Terminologies
Data
Data are measurements or observations that are collected as a source of
information.
Variable
A variable is a characteristic, often but not always quantitatively
measured, containing two or more values or categories that can vary
from person to person, object to object or from phenomenon to
phenomenon.
Constant
A constant is particular type of variable which does not vary from one
member of a group to another. 6
Introduction and Statistical Terminologies
Types of Variable
Variable
Qualitative Quantitative
Discrete/
Continuous
Discontinuous
7
Introduction and Statistical Terminologies
Types of Variable
1. Qualitative: Not numerical and the values do not result from counting or
measuring.
Example: Hair color, religion, marital status etc.
2. Quantitative: Provide numerical measures of individuals.
Example: Height, weight, days of a week etc.
(a) Discontinuous/Discrete: Generally cannot accept the fractional value and any
random value within its limit. Example: Days of a week, family size etc.
(b) Continuous: Accept the fractional value and any random value within its limit.
Example: Height, weight etc.
8
Introduction and Statistical Terminologies
Descriptive vs Inferential Statistics
SL No. Descriptive statistics Inferential Statistics
1 Concerned with describing the target Make inferences from the sample and
population generalize them to the population
2 Organize, analyze and present the Compares, tests and predicts the future
data in meaningful manner outcomes
3 Final results are shown I form of Final result is the probability scores
charts, tables and graphs
4 Describe the data which is already Tries to make conclusions about the
known population that is beyond the data
available
5 Tools – measures of central tendency Tools – hypothesis test, analysis of
(mean/median/mode), spread of data variance etc.
(range, standard deviation etc.)
9
Introduction and Statistical Terminologies
10
Introduction and Statistical Terminologies
Types of Statistical Data/ Scale of measurement
Scale Characteristics Examples
Nominal Categories are homogeneous, mutually Eye color (black, brown), Religion,
exclusive, and no assumptions about Political affiliation, Place of residence
ordered relationships between categories (Urban or Rural), etc.
made
Ordinal All of the above plus the categories can be Examination grade (A, B, C), Health
rank-ordered Status (poor, average, good), Level of
education (Illiterate, primary,
secondary) etc.
Interval All of the above plus exact differences Temperature (98°C), IQ test score,
between categories are specified and an Calendar time (3 pm, 6 pm) etc.
arbitrary zero point is assumed
Ratio All of the above with exception that a true Height, Weight, Distance etc.
zero point is assumed
11
Introduction and Statistical Terminologies
Collection of Statistical Data
Planning is an essential first step in the assembling or collection
of statistical data. The steps of good planning involves-
Defining a problem
Ascertaining the purpose and scope of inquiry
Choosing the type of inquiry
Fixation of unit of measurement
Determination of the degree of perfection or accuracy
12
Introduction and Statistical Terminologies
Types of Inquiry
Primary vs Secondary Inquiry
Census vs Sample Inquiry
Direct vs Indirect Inquiry
Open vs Confidential Inquiry
Original vs Repetitive Inquiry
Regular vs Adhoc Inquiry
These types of inquiry are not complete altogether, nor these are only
possible ways of classification. The types of inquiries are not always mutually
exclusive. A study might utilize more than one types of inquiry
simultaneously.
13
Introduction and Statistical Terminologies
Statistical Unit
The definition of statistical unit to be un ambiguous and precise
should have some common characteristics which are-
Must be specific , self explanatory and simple
Must be homogeneous and uniform
Must be stable
Must be suitable and appropriate to the purpose of inquiry
Degree of accuracy
The scope and purpose of the inquiry affect the degree of accuracy
The time and cost factor also affect the degree of accuracy
A prompt and timely repost with a tolerable accuracy level may be more useful than a
delayed but more accurate report 14
Introduction and Statistical Terminologies
Sources of Data: Primary Data vs Secondary Data
Primary Data Secondary data
Primary data are those that are collected for Secondary data refer to those data that have
Definition
the first time already been collected by some other person.
These are original because these are These are not original because someone else has
Originality collected by the investigator for the first collected these for his own purpose.
time.
Nature of
These are in the form of raw materials. These are in the finished form
Data
Reliability These are more reliable and suitable for the These are less reliable and less suitable as
and enquiry because these are collected for a someone else has collected the data which may
Suitability particular purpose. not perfectly match our purpose.
Time and Collecting primary data is quite expensive Secondary data requires less time and money;
Money both in the terms of time and money hence it is economical.
No particular precaution or editing is Both precaution and editing are essential as
Precaution required while using the primary data as secondary data were collected by someone else
and Editing these were collected with a definite for his own purpose.
purpose. 15
Introduction and Statistical Terminologies
Method of Field Investigation for Collecting Primary Data
16
Introduction and Statistical Terminologies
Framing a questionnaire
The questions asked should be as few as possible
Questions should be such that they can be understood and answered by
least educated and intelligent respondents
The questions should be carefully worded, precise and unambiguous
The questions should be capable of being answered readily in a simple
way like yes/no type
Questions should be such that they can be answered without any bias
Due regard should be paid to the religious, communal and political
belief of the respondents
17
Introduction and Statistical Terminologies
Framing a questionnaire
Too much private things should not be asked unless specifically
required
Questions should not be too many but sufficient to cover the
information required
Suspicious questions should be avoided and questions should be such
that they can be answered spontaneously and truthfully
Questions should be arranged in logical sequence
Overall design of the questionnaire should be nice and attractive
Sufficient space is to be provided for answers
Detailed instruction is to be provided
18
Introduction and Statistical Terminologies
Selection of Investigators
The quality of investigator will largely determine the reliability of the
information. Thus an investigator/enumerator-
Training of Investigators
Theoretical
Practical
19
Introduction and Statistical Terminologies
Scrutinizing the primary data
The data collected are subjected a close, continuous and rigorous checking.
This aspect is also regarded as the editing of collected information.
The plan for editing should be uniform and standardized to ensure uniformity
in all schedules.
Some mistakes may be corrected by counter-check.
The investigator may also correct some anomalies if he/she is sure of it.
If required, defective and unsatisfactory data may be sent back to the field for
correction.
Totally incomplete and thoroughly erroneous schedule ma have to be
rejected.
At the time of editing all the schedules should be uniform in terms of their
measurement unit.
20
Introduction and Statistical Terminologies
Collection of Secondary Data
A careful review of the secondary data is needed before they are used. It is
necessary to know the following matters in collecting secondary data-
Methods used in the original collection
The purpose for which the original collection was made
Units used in the collection
Reliability and representativeness of the data
Degree of accuracy followed in the collection of data
References
1. An Introduction to Statistics by Muhammad Ali Mian & M.
Alimullah Miyan
2. An Introduction to Statistics and Probability by M. Nurul
Islam
22