Descriptive 01 PDF
Descriptive 01 PDF
Statistics
By;
Mr. Johny Kutty Joseph
Asstt. Professor
Concepts & Definition
• It is to organize, interpret, and
communicate numeric information.
• Logical thinking is required more than
mathematical ability.
• The word statistics comes from the
Italian words Statista means Statement
and a German word Statistik means
political state..
• It is a science of learning from
numbers/data.
• It is a science of collecting, classifying,
analyzing and interpreting the data.
Concepts & Definition
• A branch of mathematics dealing with
the collection, analysis,
interpretation, and presentation of
masses of numerical data. (Merriam-
Webster)
• Statistics is defined as collection,
Presentation, analysis and
interpretation of numerical data”. (
Croxton & Cowden)
• It is the sciences and art of dealing
with figure and facts.
Uses of Statistics
• To make the raw data meaningful.
• To test null hypothesis.
• To test the statistical significance of
data .
• To draw inferences and make the
generalization.
• To estimate parameters.
• Make decisions based on data, and
make predictions.
• It helps in comparison
Biostatistics
• Biostatistics is the branch of statistics
applied to biological or medical
sciences.
• Biostatistics is the methods used in
dealing with statistics in the field of
health sciences such as biology,
medicine, nursing, public health etc.
• Biostatistics is the branch of statistics
applied to biology or medical sciences.
Biostatistics is also called “Biometry”
Data
• Data is defined as factors known or assumed
as facts, making the basis of reasoning or
calculation.
• Broadly there are quantitative and qualitative
data.
• Quantitative data deals with numbers and
things you can measure objectively: Eg; height,
weight, length, temperature, volume, area etc.
It is number value.
• Qualitative data deals with characteristics
and descriptors that can't be easily measured,
but can be observed subjectively. Eg. smells,
tastes, textures, attractiveness, and color.
Data
• Quantitative data;
continuous and discrete.
• Discrete data is a count that can't be
made more precise. For instance, the
number of children in your family is
discrete data, because you are counting
whole, indivisible entities: you can't have
2.5 kids.
• Continuous data could be divided and
reduced to finer and finer levels. Eg;
Height of children made more precise by
Meters-centimeters-millimeters and
Data
• Qualitative data; It is also referred as
attributable data. Binary, Nominal (unordered)
and Ordinal (ordered) data.
• Binary data place things in one of two
mutually exclusive categories: right/wrong,
true/false, or accept/reject.
• Nominal Data: We assign individual items
number or category that do not have an
implicit or natural value or rank. (Gender:
1=male and 2= female)
• Ordinal Data: The items are assigned to
categories that have some kind of implicit or
natural order. Eg. "Short, Medium, or
Tall." Rating from 1 to 5 on scale where 5 is
Scales of Measurement
• Measurement is the process of assigning
numbers or labels to objects, persons,
states, or events in accordance with
specific rules to represent quantities or
qualities of attributes.
• We do not measure specific objects,
persons, etc., we measure attributes or
features that define them.
• It is a system of classifying
measurements according to the nature of
the measurement and the type of
mathematical operations to which they
Scales of Measurement
Data Classification in Science
Nominal Measurement
• The lowest level of measurement also referred
as categorical data.
• It represents characteristics. Eg. Gender,
Language, locality etc. Numerical values may
be given but do not have any mathematical
meaning.
• It act as labels and hence changing order
doesn’t have any significance.
Ordinal Measurement
• It is the second level, in which the scores are
given in such a manner as the number
increases the status/condition also increases
or upgrades.
How often do you feel
back pain ?
No Pain: 1, Mild Pain:
2
Moderate: 3, Severe :
• The limitation of this type4 of data is that
difference between all the 4 options are not
equally measurable or not known.
• It is mainly used to measure non numerical
features such as patient satisfaction, etc.
Interval Measurement
• An interval scale has the characteristics of an
ordinal scale.
• An interval scale permits use of measurement
that enables data to be placed at equally
spaced intervals in relation to the spread of
the variable.
• This measurement has a starting and a
terminating point that is divided into equal
space intervals.
• The problem with interval values data is that
they
Whatdon’t have
is the a true
room zero.
temperature ?
a) -20 to -10; b) -10 to 0; c) 0 to 10 ; d) 10 to
20
Ratio Measurement
• It is the highest level of data.
• A ratio scale is a scale that measures in
terms of equal intervals and an absolute zero
point of origin. It has all the properties of
nominal, interval and ordinal.
• The bio-physiological characteristics such as
age, weight, height are examples.
• The variables that are measured either on
interval or ratio are considered continuous.
• Eg. It can easily be stated that one who
weighs 80 kg is twice heavy as someone who
weighs 40kg.
Comparison of levels
• The levels of measurement forms a hierarchy,
with ratio at the top and nominal at the base.
• The higher the level of measurement precise is
the data.
• It is possible to convert data to lower level but
not the reverse process.
• A ratio may be converted to ordinal but
ordinal cannot be ratio.
Assess the weight of
people
Some psychological
Ordinal Ratio
scales (Likert’s scale)
a.Below 50 a.
are considered
40 to 50
ordinal as well as
b. 50 to 70 b. 50
interval.
to 60
Classification of Statistics
• Descriptive Statistics: It is the
enumeration, organization and graphical
representation of data. It helps to
summarize the meaning of data. Eg.
Demographic variables.
• Inferential Statistics: It is also called as
sampling statistics. It is the inference of
conditions that exist in large set of
observations. Eg. Test the efficiency of a
new hypertensive drug on a particular
population.
Descriptive Statistics
• It is classified as the following
• Frequency distribution and graphical
presentation(measures of condensation).
• Measures of central tendency. (Mean,
Median, Mode)
• Measures of dispersion. (difference) Eg.
Range, Mean deviation, Standard
deviation, Quartile deviation
• Measures of relationship (correlation
coefficient, regression etc.)
Frequency Distribution
• A set of data can be described in terms of
three characteristics. Distribution of
values, central tendency and variability
(dispersion and relationship).
• Distribution of values or frequency
distribution are used to organize the
numeric data.
• It is a systematic arrangement of values
from lower to higher together with count
of number/frequency with which the
value was obtained.
Frequency Distribution
• Observe the below given table for anxiety
scores of 60 patients.
22 24 25 19 24 25 23 23 24 20
25 16 20 25 17 22 24 18 22 23
15 24 23 22 21 24 20 25 18 25
24 23 16 25 30 20 19 21 23 24
19 18 20 21 17 25 22 24 20 17
20 25 21 24 23 19 21 21 25 21
50
40
40
30 30 Population
30 26 Land
20 14
10
0
Asia Africa Europe
Types of Graphs and Diagram
Proportionate bar graph showing worlds
population and land area
100
90 26 30
80
70 14
Europe
60 30
Africa
50
Asia
40
60
30
40
20
10
0
Population Land
Types of Graphs and Diagram
Number of Males
70
60 60
50 50
40 40
Number of Males
30
20 20
15
10
0 0 0
15 - 20 20 - 25 25 - 30 30 - 35 35 - 40
Types of Graphs and Diagram
500 486
456
400 402 417
387
347 342
337
328
300 298 307 298 In Delhi
In Mumbai
200 203
123
100
0
2001 2002 2003 2004 2005 2006 2007
Types of Graphs and Diagram
Number of Males
200
180 185
160
140 135
120
100
Number of Males
80 75
60
40 35
20 15
0
15 - 20 20 - 25 25 - 30 30 - 35 35 - 40
Types of Graphs and Diagram
10
8
6
No. of students
4
2
0
0 50 100 150
Marks obtained out of 100
Types of Graphs and Diagram