Biostatistics Introduction
Biostatistics Introduction
Presentation of facts
Comparison of facts
Establishment of relationship
Measure the effects
Formulate policies in different fields
Test a hypothesis
Forecasting
Functions of Statisticians
Collection of data
Presentation of the collected data
Analysis and interpretation of the results
Making decisions on the basis of such analysis
DATA VERSUS INFORMATION
Data is raw, unorganized facts that need to be processed
Comprehensive Sample
Type of Statistics
Statistics
Sample
Population refers to any well
Sample
Sample is a small group of a
population selected using a
suitable method of sampling
It can be regarded as
representative of the entire Population
population
Descriptive statistics
Descriptive Sample
statistics
describing what is -
in
the sample data
Population
Descriptive Statistics
Frequency
• Is the study of the
procedures which deal with 50
the collection, processing
and summarization of data 25
to make it more informative
and comprehensive. 0
20-29 30-39 40-49 50-59
Population
Inferential Statistics (1)
the
Population based
on the data
collected from
the sample
drawn from the
population
Inferential Statistics (1)
-
Collecting- Making Inférence-
,Hypothèses testing-
,Organizing-
Determining relationship-
Summeraizing- Making prédictions-
Presenting Data-
Definitions
Data
• Are observations made on individuals.
Population:
• Aggregate of individuals of people, things, events with
common characteristic(s), e.g. university students,
hospitals, attacks of diarrhea ),
Variable:
• Is a quality, characteristic or a constituent of an
individual, that can be measured and is subject to
change.
Types of data
Constant
Variables
Variables & Constants
• Constants
Characteristics which does not vary or is fixed
• Variables
A characteristic of a person, object or
phenomenon that can have different values
Data Vs variable
Variable
: Gender
Data
Variable - Examples
Variables Description
In yearsAge
1 = Male 2; = FemaleGender
1 = Present; 2 = AbsentMalaria
Weight in kg
Height in cm
Variable - Examples
Height 167cm
Number of children 3
Sex Male
Ethnic group Chinese
Severity of illness Moderate
Status Alive
Sources of Data
Secondary Data:
• Collected for purposes other than the specific study ( from records
and reports)
Classification of variables
Qualitative variables (categorical ):
Are those for which the value indicates
different groupings
1. Nominal:
• have distinct levels that have no inherent ordering
• Blood group (A, B, AB, O) – occupation (employee
worker self-employed – retired)
• When only two options (gender : male female ) it is
called binary or dichotomous
• Colour (red, blue, green)
Classification of variables
2. Ordinal :
• Have levels that do follow a distinct ordering
• Stage of labour – socioeconomic status.
Categorical variables, ordinal
Ordinal variables:
• These are grouped variables that are ordered or ranked in
increasing or decreasing order:
For example:
• High income (above 500 ,000 SD per month);
• Middle income (200,000-500,000 SD per month); and
• Low income (less than 200,000 SD per month).
• Severity of pain
Quantitative Qualitative
)Numerical data( (Categorical data)
-is un-measurable
Data represented-
- un-expressible in
numerically
magnitude or
e.g. weight and numerically
- it can be
height of study
subjects expressed in
qualities.
Severity of pain
Type of variables
variables
Quantitative
Qualitative
Discrete
The data can Continuous
only have The data can have
known fixed -any value within a
values and can defined range
e.g. Numbers of
be counted -any value in a
boys, girls,
continuous interval of
number of
measurement.
episodes of
e.g. weight, height age,
diarrheal diseases
B.P and blood glucose
per years ,number
level.
of babies
Type of variables
)variables
Quantitative Qualitative
2. Independent
3. Confounding
4. Universal (Background)
Study
• In one study which was designed to determine the
maternal mortality. The aim of the study was to
identify the effect of maternal risk factors such as age,
parity, nutritional status, smoking, weight, occupation
on the prevalence of maternal mortality.
• Quantitative variables
• Frequency distribution tables
• Relative frequency tables
• Histogram chart
Data Presentation
Qualitative variables
• Tables:
• Frequency distribution table
• Cross tabulation
• Graphs:
• Bar chart
• Pie chart
• Pictogram
:Presentation of qualitative data
• 1.Tabulation form :
The basic rule for displaying qualitative data is to:
1.Classify them to categories
2. Then count the number of observations in
each category of the variable and
3. Present the number and percentage in a table.
Data Presentation
Qualitative variables
• Frequency distribution
Forceps 65 10.8%
Caesarian 57 9.5%
Lung cancer
Total
Smokin Cases Control
g No. % No. % No. %
Smoker 38.3
15 75% 8 20% 23
3
Non 61.6
smoker 5 25% 32 80% 37
7
Total 20 100 40 100 60 100
:Presentation of qualitative data
• 2.Graphical form :
1. Bar chart
2. Pie chart
• Bar chart: is used with categories data ;
with nominal data, the bars may be in any order
that seen sensible to research, but with ordinal
data they should be arranged from lowest to highest.
The bars may be horizontal or vertical.
Bar Chart
• Display frequency distribution
• Has columns with same width
• There are spaces between columns
• Bars may be horizontal or vertical
60
50
40
Number of
respondents 30
(% )
20
10
0
Malay Chinese Indian Others
Bar chart
Pie Chart
• Shows frequency distribution
• Represented by the angles in different sectors of a circle
• The total angle 360o represents 100%
7%
Chinese
Malay
31% I ndian
22%
Others
40%
Proportion of ethnic groups in a
Pictogram