0% found this document useful (0 votes)

63 views53 pages

Introduction To Bio Statistics

Dr. Joseph Kamalesh introduces biostatistics, which involves the design, collection, analysis, and interpretation of data from biological experiments, especially in medicine and agriculture. There are two main types of study designs: experimental studies where the researcher directly assigns exposures, and observational studies where exposures occur naturally. Key study designs include randomized controlled trials, cohort studies, case-control studies, and cross-sectional studies. Biostatistics also examines sources of bias and confounding that can influence study results. Methods to analyze and summarize data include descriptive statistics like measures of central tendency, dispersion, and frequency distributions, as well as inferential statistics like significance tests and regression.

Uploaded by

Joseph Kamalesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views53 pages

Introduction To Bio Statistics

Uploaded by

Joseph Kamalesh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 53

Introduction to biostatistics

Dr. Joseph Kamalesh

defifintion
• Statistics is the science of the collection,
organization, and interpretation of data.
• It deals with all aspects of this, including
the planning of data collection in terms of
the design of surveys and experiments.
• Statistics is closely related to probability
theory, with which it is often grouped
biostatistics
• The science of biostatistics encompasses
the design of biological experiments,
especially in medicine and agriculture; the
collection, summarization, and analysis of
data from those experiments; and the
interpretation of, and inference from,
the results.
Study designs in clinical research

Did researcher
assign exposures?

Experimental Observational
study study
Experimental study

Is allocation
random?

Randomised Non-randomised
control control
Observational study

Is there a
comparison group?

Analytical Descriptive
study study
Analytical study

Direction of the
study

Exposure Outcome Exposure and

↓ ↓ outcome at
Outcome Exposure the same time

CROSS
CASE CONTROL
COHORT STUDY SECTIONAL
STUDY
STUDY
Some definitions….
• Clinical Trial Experimental study in which
the exposure status (e.g. assigned to
active drug versus placebo) is determined
by the investigator.
• Randomized Controlled Trial A special
type of clinical trial in which assignment
to an exposure is determined purely by
chance.
• Cohort Study Observational study in which
subjects with an exposure of interest (e.g.
hypertension) and subjects without the
exposure are identified and then followed
forward in time to determine outcomes
(e.g. stroke).
• Case-Control Study Observational study
that first identifies a group of subjects
with a certain disease and a control group
without the disease, and then looks to back
in time (e.g. chart review) to find exposure
to risk factors for the disease. This type
of study is well suited for rare diseases.
• Cross-Sectional Study Observational study
that is done to examine presence or absence
of a disease or presence or absence of an
exposure at a particular time. Since
exposure and outcome are ascertained at the
same time, it is often unclear if the
exposure preceded the outcome.
• Case Report or Case Series Descriptive
study that reports on a single or a series of
patients with a certain disease. This type of
study usually generates a hypothesis but
cannot test a hypothesis because it does not
include an appropriate comparison group.
bias
• Any systematic error in the design or
conduct of a study that results in a
mistaken estimate of an exposure’s
effect on risk of disease.
Selection Bias
• Bias introduced by the way in which
participants are chosen for a study. For
example, in a case-control study using
different criteria to select cases (e.g.
sick, hospitalized population) versus
controls (young, healthy outpatients)
other than the presence of disease can
lead the investigator to a false conclusion
about an exposure.
Confounding
• This occurs when an investigator falsely
concludes that a particular exposure is
causally related to a disease without
adjusting for other factors that are
known risk factors for the disease and
are associated with the exposure.
Classification of statistics

STATISTICS

DESCRIPTIVE INFERENTIAL
STATISTICS STATISTICS
DESCRIPTIVE STATISTICS

Frequency distribution

Measures of central tendency

Measures of dispersion

Measures of probability
INFERENTIAL STATISTICS

Significance and estimation

Linear regression and correlation

Analysis of variance

Non – parametric tests

FREQUENCY DISTRIBUTION
Frequency distribution
• Representation of data collected can be
done in two ways :
1. Tables
2. Diagrams
tables
• Tabulation is a process by which data of a
long series of observations are
systematically organised and recorded so
as to enable analysis and interpretation
• A set of categories is formed to classify
the data
• The essential requirement of such
classification is that any one of the
observation would definitely fall in one
category and all the observations would
fall in any one category.
Diagrams - histogram
• A histogram is a special type of bar
diagram used to present a frequency
distribution of a characteristic measured
on a continuous scale
• Rectangles are erected over class
intervals to represent the frequencies of
the class intervals.
• In a histogram, the area of each
rectangle represents the frequency of
the corresponding class interval.
Frequency polygon
• A frequency polygon is a variation of a
histogram
• Instead of rectangles erected over class
intervals, points are plotted at the mid-
points of the tops of the corresponding
rectangles in a histogram, and the
successive points joined by straight lines.
Frequency curve
• When the total frequency is large and
when we adopt much narrower class
intervals the frequency polygon will most
often have a much smoother appearance.
• If the total frequency is increased
indefinitely, the frequency polygon will
approach a smooth curve.
• This limiting condition is known as the
frequency curve
Cumulative frequency polygon or ogive
• The number of observations falling below
each specified value is presented
• Similarly the number of observations
falling above each specified value may
also be presented.
• Most often the observations are
presented as a percentage instead of
actual numbers
MEASURES OF CENTRAL
TENDENCY
Measures of central tendency
• In many biological characteristics, the
values of the extent of the observations
are not equal, but it is noticed that a
general tendency of such observations
cluster around a particular level.
• In this situation it may be preferable to
charecterise each group of observations
by such a level which is called the central
tendency of that group
Measures of central tendency

Arithmetic mean

Median

Mode

Position of averages

Geometric mean

Harmonic mean

Percentile
Arithmetic mean
• The arithmetic mean of a group is the
simple arithmetic average of the
observations
• This is calculated by dividing the total of
all observations by the number of
observations.
• In case of grouped data, arithmetic mean
is calculated assuming that each
observation in a class interval is equal to
the midpoint of that class interval
median
• The median is the magnitude of the
observation which occupies the middle
position when all the observations are
arranged in order of their magnitude.
• When there are even number of
observations in the group, the median is
the arithmetic mean of the center two
observations.
mode
• Mode is the most frequently occurring
value.
• The mode of the group is the value
around which all most of the observations
are heavily concentrated
Position of averages
• In a frequency distribution, the measures
of central tendency – mean median and
mode – occupy some definite relative
positions
• This position on a graph is called the
position of averages.
Symmetric distribution
• In a frequency distribution, if the
frequencies are equal on both sides of
the position of averages, then the
distribution is said to be symmetrical.
Skewed distribution
• In a frequency distribution, if the two
sides of the position of averages are
unequal, it is called an asymmetrical or
skewed distribution.
Geometric mean
• The geometric mean is usually more
suitable as a measure of central tendency
if the values change exponentially.
• If there are only 2 observations, then
the GM is the square root of the product
of 2 observations and if there are 3
observations it is the cube root of the
product of the 3 observations…
• Thus if there are n observations, it is the
‘n’th root of the product of the n
observations.
Harmonic mean
• The harmonic mean is used in situations
where the reciprocals of the actual
values seem more useful to determine the
central tendency.
• For example, it has been suggested that
the sensitivity to detect clusters of
observations is increased by measuring
the reciprocal of the distances rather
than the distances directly
percentile
• The value below which a given percentage
of observations occur is called a centile
or percentile.
• The median is called the 50th percentile
or centile.
• The percentiles divide the distribution
into 100ths but sometimes it is more
convenient to divide it into quartiles or
deciles.
MEASURES OF DISPERSION
Measures of dispersion
• The fact that we need an average or a
measure of central tendency shows that
there is variation among the observations
• Variation which is another characteristic
of a group of observations has to be
considered for describing the group more
satisfactorily.
• A single figure for a group relating to its
central tendency does not give any idea
about the variability of the observations.
Measures of dispersion

Range

Interquartile range

Mean deviation

Standard deviation and variance

Coeffecient of variation
range
• The range of a group of observations is
the interval between the smallest and the
biggest observation.
• The value of the range is dependent only
upon the two extreme observations in the
group and does not consider the other
observations.
• The occurrences of rare observations in
the group greatly influences the value of
the range and so it is not considered as
an ideal measure of dispersion.
Interquartile range
• Interquartile range is the interval
between the values of the upper quartile
and the lower quartile
• Upper quartile is the value above which
25% of the observations fall and lower
quartile is the value below which 25% of
the observations fall.
• This measure gives the range which
covers the middle 50% of the
observations in the group
Mean deviation
• The mean deviation is the arithmetic
mean of the deviations of the
observations from the arithmetic mean
ignoring the sign of these deviations.
• The mean deviation is based on all the
observations in the group.
• It is easy to measure this but it is not
widely used as more advantageous
methods are available.
Standard deviation
• The standard deviation is the square root
of the average of the squared deviations
of the observations from the arithmetic
mean.
• The deviation from mean is considered
without its sign in calculating the mean
deviation but in calculating the standard
deviation it is squared.
• The standard deviation is the most
important measure of dispersion.
Standard deviation
• For some frequency distributions there is
a relationship between the range and the
standard deviation.
• Standard deviation together with the
arithmetic mean can describe a frequency
distribution uniquely.
• Standard deviation of a population is
usually denoted by δ and that of a sample
by ‘s’
variance
• The square of the standard deviation is
called variance and it can also be used as
a measure of dispersion.
MEASURES OF PROBABILITY
probability
• One function of statistical methods is to
provide techniques for making inductive
inferences and also for measuring the
degree of uncertainty of such inferences.
• In games of chance for example, the
outcome of a particular trial is uncertain
but the long term outcome is predictable.
• The long term regularity provides us with
a measure of the amount of chance and
that is called probability.
The probability scale
• Chance is measured on a probability scale
having zero at one end and one at the
other end. The top end of the scale
marks absolute certainty and the bottom
end marks absolute impossibility.
• The other points on the probability scale
falling between one and zero indicate the
relative chance of occurrence of the
outcome.
Types of probability
A. a priori or Classical probability : when
the number of outcomes of a certain
trial is limited and known, then the
calculation of probability based on the
limited number of known outcomes is the
classical probability
B. a posteriori or frequency probability:
when the outcome of a trial is always
random and the probability can be only
calculated by the previous observational
or experimental evidence only.
Laws of probability for independent
events
1. Addition law : if an event can occur in
any one of several mutually exclusive
ways, the probability of that event is
the sum of the individual probabilities of
the different ways in which it can occur
2. Multiplication law : the probability of
the simultaneous occurrence of 2 or
more independent events is the product
of the individual probabilities.

16PF® Fifth Edition Sixteen Personality Factor™ Fifth
50% (8)
16PF® Fifth Edition Sixteen Personality Factor™ Fifth
10 pages
Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
100% (1)
Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
33 pages
Research Methodology 2025
No ratings yet
Research Methodology 2025
91 pages
Introduction To Biostatistics
No ratings yet
Introduction To Biostatistics
59 pages
Biostat Aguila Mission Solis
No ratings yet
Biostat Aguila Mission Solis
44 pages
Bio Statistics
No ratings yet
Bio Statistics
72 pages
Basic Biostatistics
No ratings yet
Basic Biostatistics
31 pages
Bio Statistics
No ratings yet
Bio Statistics
55 pages
43hyrs Principles of Statistics 3
No ratings yet
43hyrs Principles of Statistics 3
56 pages
Summarizing Data
No ratings yet
Summarizing Data
67 pages
Basic Biostats Part
No ratings yet
Basic Biostats Part
59 pages
Week 3 - Measures of Central Tendency
No ratings yet
Week 3 - Measures of Central Tendency
4 pages
2statsnotes 1
No ratings yet
2statsnotes 1
24 pages
Organization of Data
No ratings yet
Organization of Data
6 pages
Introduction To Statistics and SPSS
100% (1)
Introduction To Statistics and SPSS
110 pages
Techniques in Geog 1 Complete
No ratings yet
Techniques in Geog 1 Complete
153 pages
Class 1
No ratings yet
Class 1
52 pages
Ipsita Panda-Biostats Assignment
No ratings yet
Ipsita Panda-Biostats Assignment
11 pages
Basics of Statistics
No ratings yet
Basics of Statistics
40 pages
Biostatistics Notes-Numbered
No ratings yet
Biostatistics Notes-Numbered
21 pages
Biostatistics in
No ratings yet
Biostatistics in
75 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
SOC 212 - Introduction and Measures of Location
No ratings yet
SOC 212 - Introduction and Measures of Location
43 pages
11.modelling For Simulation
0% (1)
11.modelling For Simulation
120 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
86 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
4 pages
1 Biostatistics LECTURE 1
100% (1)
1 Biostatistics LECTURE 1
64 pages
MMW Data Management
No ratings yet
MMW Data Management
35 pages
Basic Statistics
No ratings yet
Basic Statistics
52 pages
Statistics Theory
No ratings yet
Statistics Theory
3 pages
Statisti CS: Stati Stics
No ratings yet
Statisti CS: Stati Stics
41 pages
BIOSTAT LESSON 2 - Descriptive Statistics
No ratings yet
BIOSTAT LESSON 2 - Descriptive Statistics
3 pages
Central Tendency Variability and Sampling Distribution PNCH
No ratings yet
Central Tendency Variability and Sampling Distribution PNCH
6 pages
Bio-Statistics: School of Bio-Science and Engineering, 2016
No ratings yet
Bio-Statistics: School of Bio-Science and Engineering, 2016
134 pages
Data Management
No ratings yet
Data Management
81 pages
Basic Concepts of Statistics
No ratings yet
Basic Concepts of Statistics
41 pages
Introduction Into Statistics: Vladimir Kozlov
No ratings yet
Introduction Into Statistics: Vladimir Kozlov
20 pages
Basic Concepts in Biostatistics-1
No ratings yet
Basic Concepts in Biostatistics-1
40 pages
Module 2 - Statistical Foundations
No ratings yet
Module 2 - Statistical Foundations
108 pages
Or Lecture 202209
No ratings yet
Or Lecture 202209
21 pages
1 - 3 - 4 - Class1 - Descriptive Statistics - 4slines - 1trang
No ratings yet
1 - 3 - 4 - Class1 - Descriptive Statistics - 4slines - 1trang
99 pages
Lesson2 - Measures of Tendency
No ratings yet
Lesson2 - Measures of Tendency
65 pages
Statistical Techniques Notes (Monitoring & Evalution - BMEC - Level 4)
No ratings yet
Statistical Techniques Notes (Monitoring & Evalution - BMEC - Level 4)
118 pages
WK 1b Biostat
No ratings yet
WK 1b Biostat
38 pages
Stats For PGDM
No ratings yet
Stats For PGDM
52 pages
Basic Stat
No ratings yet
Basic Stat
46 pages
Basic Statistics Notes
No ratings yet
Basic Statistics Notes
10 pages
Session 3 Week 2
No ratings yet
Session 3 Week 2
31 pages
And Dividing It by Total Number of Values
No ratings yet
And Dividing It by Total Number of Values
3 pages
3rd QTR Stats Reviewer
No ratings yet
3rd QTR Stats Reviewer
24 pages
Basic Concepts in Statistics
No ratings yet
Basic Concepts in Statistics
42 pages
Statistical Foundations - Intro 64zlf
100% (2)
Statistical Foundations - Intro 64zlf
86 pages
Math
No ratings yet
Math
13 pages
Engineering Probability and Statistics
No ratings yet
Engineering Probability and Statistics
42 pages
Biostatistics: DR Priyanka N Maiya
No ratings yet
Biostatistics: DR Priyanka N Maiya
85 pages
Quantitative Methods
No ratings yet
Quantitative Methods
4 pages
Sampling Design and Analysis MTH 494: Ossam Chohan Assistant Professor CIIT Abbottabad
No ratings yet
Sampling Design and Analysis MTH 494: Ossam Chohan Assistant Professor CIIT Abbottabad
34 pages
CH 29 & 30 Statistics & Probability
No ratings yet
CH 29 & 30 Statistics & Probability
65 pages
Drug Development and BE
No ratings yet
Drug Development and BE
19 pages
Anda
No ratings yet
Anda
26 pages
Global Cro Report
No ratings yet
Global Cro Report
9 pages
Assignment 1. Stat
No ratings yet
Assignment 1. Stat
4 pages
The Crucible - Writing Project
No ratings yet
The Crucible - Writing Project
2 pages
RT Vol. 6, No. 3 People
No ratings yet
RT Vol. 6, No. 3 People
1 page
Sodapdf
No ratings yet
Sodapdf
10 pages
Rosen Blatt's Perceptron Model
No ratings yet
Rosen Blatt's Perceptron Model
11 pages
Nacest BLD Hnd22 529
No ratings yet
Nacest BLD Hnd22 529
78 pages
Compilation of Module Answers
No ratings yet
Compilation of Module Answers
46 pages
Group Decision and Negotiation A Process Oriented View Joint INFORMS GDN and EWG DSS International Conference GDN 2014 Toulouse France June 10 13 2014 Proceedings 1st Edition Pascale Zaraté All Chapters Instant Download
100% (1)
Group Decision and Negotiation A Process Oriented View Joint INFORMS GDN and EWG DSS International Conference GDN 2014 Toulouse France June 10 13 2014 Proceedings 1st Edition Pascale Zaraté All Chapters Instant Download
55 pages
3aa5fc367415d40d722c37e70fff9bf0
No ratings yet
3aa5fc367415d40d722c37e70fff9bf0
77 pages
Nutritional Surveillance
100% (1)
Nutritional Surveillance
20 pages
1 s2.0 S0928765517304086 Main
No ratings yet
1 s2.0 S0928765517304086 Main
16 pages
6901-Article Text-26436-1-10-20231230
No ratings yet
6901-Article Text-26436-1-10-20231230
11 pages
Hulp Bij Thesis Spss
100% (3)
Hulp Bij Thesis Spss
7 pages
Axis Bank
No ratings yet
Axis Bank
28 pages
A Continuum of Play Based Learning The Role of The Teacher in Play Based Pedagogy and The Fear of Hijacking Play
No ratings yet
A Continuum of Play Based Learning The Role of The Teacher in Play Based Pedagogy and The Fear of Hijacking Play
17 pages
CBR Statdas
No ratings yet
CBR Statdas
20 pages
Ebook Mantra-445-450
No ratings yet
Ebook Mantra-445-450
6 pages
FNCE 625-12ChairApproved
No ratings yet
FNCE 625-12ChairApproved
8 pages
Patient Safety Culture
100% (3)
Patient Safety Culture
444 pages
Asq Control Chart
No ratings yet
Asq Control Chart
5 pages
Declaration: (External Guide Name
No ratings yet
Declaration: (External Guide Name
49 pages
Research Homework Ks2
100% (1)
Research Homework Ks2
6 pages
Lin, J.-W. (2018) - Effects of An Online Team Project-Based Learning Environment With Group Awareness and Peer Evaluation On Socially Shared Regulation of Learning and Self-Regulated Learning.
No ratings yet
Lin, J.-W. (2018) - Effects of An Online Team Project-Based Learning Environment With Group Awareness and Peer Evaluation On Socially Shared Regulation of Learning and Self-Regulated Learning.
18 pages
Gage R&R
No ratings yet
Gage R&R
24 pages
Personal DevelopmentQ3W5.significant People
No ratings yet
Personal DevelopmentQ3W5.significant People
86 pages
Untitled
No ratings yet
Untitled
13 pages
Transnational Capital in Somalia
No ratings yet
Transnational Capital in Somalia
91 pages
Bars CTR
No ratings yet
Bars CTR
11 pages
Evaluation of Alcoholic and Aqueous Extracts of Nicandra Physalodes Leaves For Diuretic Activity
No ratings yet
Evaluation of Alcoholic and Aqueous Extracts of Nicandra Physalodes Leaves For Diuretic Activity
4 pages
Testbank & Ebook Basic Technical Mathematics With Calculus 12th Edition Washington Instant
No ratings yet
Testbank & Ebook Basic Technical Mathematics With Calculus 12th Edition Washington Instant
18 pages

Introduction To Bio Statistics

Uploaded by

Introduction To Bio Statistics

Uploaded by

Introduction to biostatistics

Dr. Joseph Kamalesh

Exposure Outcome Exposure and

Measures of central tendency

Significance and estimation

Linear regression and correlation

Non – parametric tests

Standard deviation and variance

You might also like