Applied Biostatistics
Applied Biostatistics
Rinki Chaudhary
Assistant Professor
School of Physics, Humanities and Applied Sciences
Shobhit Deemed University, MEERUT - 250110
Content
Introduction
Type of Statical Method
Classification of Biostatistics
Applications of Statistics
Limitations of Statistics
Types of Data
Methods of presentation of data
Central Tendency
What Is Statistics?
1. Collecting Data
Data
e.g., Survey Why?
Analysis
2. Presenting Data
e.g., Charts & Tables © 1984-1994 T/Maker Co.
Statistical
Methods
Descriptive Inferential
Statistics Statistics
Descriptive statistics:
A statistical method that is concerned with the collection, organization,
summarization, and analysis of data from a sample of population.
Inferential statistics:
A statistical method that is concerned with the drawing
conclusions/inferring about a particular population by selecting and
measuring a random sample from the population.
Descriptive Statistics:
Statistical procedures used to summarize, organize, and simplify data.
This process should be carried out in such a way that reflects overall
findings.
Raw data is made more manageable
Raw data is presented in a logical form
Patterns can be seen from organized data
Some statistical summaries which are especially common in descriptive
analyses are:
Measures of central tendency
Measures of dispersion
Measures of association
Cross-tabulation, contingency table
Histogram
Quantile
Inferential Statistics
This branch of statistics deals with techniques of making conclusions about the
population
They encompasses a variety of procedures to ensure that the inferences are sound
and rational, even though they may not always be correct
Stages in statistical investigation
1. Collection of data
• The process of obtaining measurements or counts.
2. Organization of data
• Includes editing, classifying, and tabulating the data collected
3. Presentation of data
• overall view of what the data actually looks like
• facilitate further statistical analysis
• Can be done in the form of tables and graphs or diagrams
4. Analysis of data
• To dig out useful information for decision making
• It involves extracting relevant information from the data (like mean, median,
mode, range, variance. . . )
5. Interpretation of data
• Concerned with drawing conclusions from the data collected and analyzed; and
giving meaning to analysis results
• A difficult task and requires a high degree of skill and experience
Definition of Some basic terms
Population:
• Population is the complete set of possible measurements for which inferences are
to be made.
Census:
• a complete enumeration of the population. But in most real problems it cannot be
realized, hence we take sample.
Sample:
• A sample from a population is the set of measurements that are actually collected
in the course of an investigation.
Parameter:
• Characteristic or measure obtained from a population.
Data:
• Refers to a collection of facts, values, observations, or measurements that the
variables can assume.
Statistics:
• Statistics is a branch of mathematics dealing with data collection, organization,
analysis, interpretation and presentation.
Sampling:
• The process or method of sample selection from the population.
Sample Size:
• The number of elements or observation to be included in the sample.
Applications of Statistics
Types of
Data
Quantitative Qualitative
Data Data
Measured on a numeric 4
scale.
94
Number of defective 5
items in a lot. 3 2
1 2
Salaries of CEOs of
12 1
oil companies.
0 28
Ages of employees at
a company. 7 3
1
© 2011 Pearson Education, Inc
Qualitative Data
Classified into categories.
College major of each
student in a class.
Gender of each employee
at a company.
Method of payment
(cash, check, credit card).
$ Credit
Primary data:
collected from the items or individual respondents
directly for the purpose of certain study.
Secondary data:
which had been collected by certain people or agency,
and statistically treated and the information contained in
it is used for other purpose.
Frequency: number of times that something occurs.
The notation fx is used to denote the frequency or number of times the value x
occurs.
Cumulative relative frequency: sums of all relative frequencies below and including
each category
Methods of presentation of data
1. Numerical presentation
2. Graphical presentation
3. Mathematical presentation
1- Numerical presentation
Tabular presentation (simple – complex)
Total
Table (I): Distribution of 50 patients at the surgical department of
Alexandria hospital in May 2008 according to their blood groups
Blood Frequency %
group
A 12 24
B 18 36
AB 5 10
O 15 30
Total 50 100
Table (II): Distribution of 50 patients at the surgical department
of Alexandria hospital in May 2008 according to their age
Age Frequency %
(years)
20-<30 12 24
30- 18 36
40- 5 10
50+ 15 30
Total 50 100
Complex frequency distribution Table
Lung cancer
Total
Smoking Cases Control
No. % No. % No. %
Smoker 15 75% 8 20% 23 38.33
Non
smoker 5 25% 32 80% 37 61.67
0
Age 60-70 (8%) (20%) 65
25 35 45 55 65
9
8 Female
7 Male
6
Frequency
5
4
3
2
1
0
20- 30- 40- 50- 60-69
Age in years
Histogram Distribution of a group of cholera patients by age
% 35
30
Age (years) Frequency %
25
20 25-30 3 14.3
15
30-40 5 23.8
10
40-45 7 33.3
5
45-60 4 19.0
0
60-65 2 9.5
0 25 30 40 45 60 65
Age (years)
Total 21 100
45
40
35
30
25
20
15
10
5
0
Single Married Divorced Widowed
Marital status
Bar chart
%
45
Male Female
40
35
30
25
20
15
10
5
0
Single Married Divorced Widowed
Marital status
Steps to create a pie-chart
• Construct a frequency table
In continuous distribution, there are given class interval and their corresponding
frequencies. First of all, we find mid values of these classes and treat them as the
variable values. Now we apply the formula (2) for the calculation of arithmetic mean.
Example 3. For the data given in the below Table 2 on systolic BP of 68
patients, patients, calculate the arithmetic mean.
∑ 𝑓 𝑖 𝑥𝑖
9220
𝑖= 1
𝑋= = =135.6 𝑚𝑚h𝑔
𝑁 68
WEIGHTED MEAN
Example: The following table gives the platelets count (in lakh/cmm) from
the analysis of the blood samples on five different in a pathology
laboratory. Find the average platelets count per patient.
Day Platelets count (in lakh/cmm) No. of patient. (w)
(x)
1 0.5 65
2 0.75 80
3 1.00 95
4 1.5 90
5 2.00 70
Solution: The table for the calculation of weighted mean is given by:
1 0.5 65 32.5
2 0.75 80 60
3 1.00 95 95
4 1.5 90 135
5 2.00 70 140
lakh/cmm
Combine Mean
CORRECTED MEAN
Some time there are problems of such type that we used wrong digits
while the actual digits were different, then we replace the wrong digits
with the correct digits and now we can get the correct mean. The
procedure will be clear from the example given below: