20 - Basic Concepts and Terminology in Biostatistics (SepI2020)
20 - Basic Concepts and Terminology in Biostatistics (SepI2020)
Biostatistics
By
Dr. Shaik Shaffi Ahamed
Professor
Dept. of Family & Community Medicine
KKUH
Objectives of this session
• Definition of statistics and biostatistics
• To understand different Levels of measurements
• To understand different Types of data
• To use these concepts appropriately
Statistics is the science
of conducting studies to
collect, organize,
summarize, analyze,
present, interpret and
draw conclusions from
data.
3
What is Statistics?
1. Collecting Data
e.g., Sample, Survey,
Observe, Data Why?
Simulate Analysis
2. Characterizing Data
e.g., Organize/Classify,
Count, Summarize
3. Presenting Data Decision-
e.g., Tables, Charts, Making
Statements
4. Interpreting Results
e.g. Infer, Conclude,
Specify Confidence
Government
Census, Planning,
National Defense
5
Biostatistics is the science
that helps in managing
medical uncertainties
and variability of data
“Biostatistics”
• Statistics arising out of biological sciences, particularly
from the fields of medicine and public health.
• The methods used in dealing with statistics in the fields of
medicine, biology and public health for planning,
conducting and analyzing data which arise in investigations
of these branches.
Basic Concepts
Data : Set of values of one or more variables recorded on
one or more observational units (singular: Datum)
OBS AGE BMI FFNUM TEMP( 0F) GENDER EXERCISE LEVEL QUESTION
1 26 23.2 0 61.0 0 1 1
2 30 30.2 9 65.5 1 3 2
3 32 28.9 17 59.6 1 3 4
4 37 22.4 1 68.4 1 2 3
5 33 25.5 7 64.5 0 3 5
6 29 22.3 1 70.2 0 2 2
7 32 23.0 0 67.3 0 1 1
8 33 26.3 1 72.8 0 3 1
9 32 22.2 3 71.5 0 1 4
10 33 29.1 5 63.2 1 1 4
11 26 20.8 2 69.1 0 1 3
12 34 20.9 4 73.6 0 2 3
13 31 36.3 1 66.3 0 2 5
14 31 36.4 0 66.9 1 1 5
15 27 28.6 2 70.2 1 2 2
16 36 27.5 2 68.5 1 3 3
18 31 21.2 11 70.7 1 1 2
19 36 22.7 8 69.8 0 2 1
20 33 28.1 3 67.8 0 2 1
Definitions for Variables
1. Nominal 2. Ordinal
3. Interval 4. Ratio
Scales of Measurement
Nominal scale variables
• A type of categorical data in which objects fall into
unordered categories.
• Variables that have only two responses i.e. Yes or no, are
known as dichotomies.
Ordinal Scale variables
• Ordinal data is data that comprises of categories that can be
rank ordered.
• Weight
• Age
Hierarchical data order
Nominal Ordinal
data data
Examples of Nominal Data
• Type of car
BMW, Mercedes, Lexus, Toyota, etc.,
• Ethnicity
White British, afro-caribbean, Asian, Arab, Chinese,
other, etc.
• Smoking status
Smoker, non-smoker
Binary Data
• A type of categorical data in which there are only two
categories.
Examples:
• Smoking status- smoker, non-smoker
• Attendance- present, absent
• Result of a exam- pass, fail
• Status of student- undergraduate, postgraduate
Examples of Ordinal Data
• Grades in exam- A+, A, B+ B, C+, C ,D , D+, and fail.
Very satisfied
Somewhat satisfied
Neutral
Somewhat dissatisfied
Very dissatisfied
Quantitative Data
• The objects being studied are ‘measured’ based on some
quantitative trait.
• The resulting data are set of numbers.
• Interval and Ratio scales will be used to measure
quantitative data.
Examples
• Pulse Rate
• Height
• Age
• Exam marks
• Time to complete a Bio-statistics exam
• Number of cigarettes smoked
Quantitative
data
Discrete Continuous
Discrete Data
Only certain values are possible (there are gaps between
the possible values). Implies counting.
Continuous Data
Theoretically, with a fine enough measuring device. Implies
measuring.
Discrete data -- Gaps between possible values
Number of Children
Hb
Examples of Discrete Data
• Number of children in a family
• Number of students passing a stats exam
• Number of crimes reported to the police
• Number of bicycles sold in a day.
Generally, discrete data are counts.
We would not expect to find 2.2 children in a family or 88.5
students passing an exam or 127.2 crimes being reported to
the police or half a bicycle being sold in one day.
Example of Continuous Data
• Age ( in years)
• Height( in cms.)
• Weight (in Kgs.)
• Sys.BP, Hb., Etc.,
Generally, continuous data come from
measurements.
Relationships between Variables
Variables
Categorical Quantitative
Continuous
Ordinal Discrete
Nominal (counting) (measuring)
CONTINUOUS DATA
QUALITATIVE DATA
Examples:
(1) Apgar score based on appearance, pulse, grimace,
activity and respiration is used for neonatal prognosis.
(2) Smoking index: no. of cigarettes, duration, filter or not,
whether pipe, cigar etc.,
(3) APACHE (Acute Physiology and Chronic Health
Evaluation) score: to quantify the severity of condition of
a patient
Data types –
important?
• Why do we need to know what type of data we
are dealing with?
• The data type or level of measurement
influences the type of statistical analysis
techniques that can be used when analysing data.
To conclude
Type of variables in any data set are:
Categorical(Qualitative)
&
Quantitative
Whereas the scales to measure these
two variables are:
Nominal, Ordinal, Interval and Ratio
scales
38