Intro To Statistics
Intro To Statistics
magnitude (level) of
associations and test
hypothesis.
Statistics
Frequency Tables
Diagrams (Graphs/charts)
Summary Indices
INFERENTIAL STATISTICS
Statistical inference is the act of
generalizing from a sample to a
population with calculated degree of
certainty. The importance of inference
during data analysis is important
7
MEDICAL STATISTICS/
BIOSTATISTICS
Provides appropriate
methods for:
Collecting,
Organizing,
Analyzing,
Interpreting and
Presenting Medical and
Health data.
BENEFIT OF STATISTICS
Has a central role in biomedical
investigations
Better way of organizing information on a
wider and more formal basis (empirical
evidence) than exchange of anecdotes
and personal experience
Takes into account the intrinsic variation
inherent in most biological processes (e.g.
blood pressure)
BENEFIT OF STATISTICS IN
HEALTH
Measurement of population health
Allow for comparisons of the state of
population.
VARIABLE: is a characteristic or property of
representative manner
Types of variables:
Quantitative & Qualitative
Numerical (Quantitative) variables-
measured on a naturally occurring
numerical scale. Two types-
– continuous i.e. measured on a
continuous scale including fractions
and decimals e.g. age, weight, height
etc
- discrete i.e. can only take limited
numbers of values, usually whole
numbers e.g. episodes of diarrhoea in
a child, number of men in a village.
Types of variables
Categorical (Qualitative) variables
– non numerical e.g. place of birth,
ethnic group, social class, gender
- dichotomous (binary) – has
only 2 possible outcomes e.g. sex
(M/F), survival status (A/D)
- ordered categorical – has a
natural ordering, but not in a
numerical sense e.g. social class (I,
II, III, IV, V)
ELEMENTS OF STATISTICAL
INFERENCE
Population
Sample
One or more variables of
interest
Reliability
RELIABILITY
Measures how good a
inference is.
In using a sample, we
introduce an element of
uncertainty into our
inferences.
As much as possible, it is
important to determine&
report the reliability of each
RELIABILITY
Measure of reliability that
accompanies an inference
separates the science of
statistics from the art of
fortune telling.
Measure of reliability- is a
statement (usually
quantified) about the degree
of uncertainty associated
NATURE OF STATISTICAL
DATA
Primary and Secondary data
- Primary data: Data originally
collected in the process of any
statistical inquiry
- Secondary data: Data collected by
other individual/people/organization
Primary source is preferred to
secondary source.
23
SOURCES OF HEALTH DATA
Census
Vital Registration systems
Institutions (school health,
Epidemiological surveillance -
(infectious diseases, cancer
registries etc).
CENSUS
National census- enumeration of the whole
populace in a country, usually done every ten
years.
The last census in Nigeria was in 206& the popn.
Was 140,003,542
North West zone- 35,786,944 most populous
population rates.
Figure 1: Population pyramid, Oriade
HDSS.
>=105 0.13 0.1
100-104 0.11 0.24
95-99 0.12 0.18
90-94 0.4 0.600000000000001
85-89 0.6 0.730000000000001
80-84 1.29 1.88
A 75-79 1.25 1.29
70-74 2.22 2.63
G 65-69 1.71 2.23
E 60-64 2.47 3.41 FEMALE
55-59 2.05 2.4 MALE
G 50-54 3.82 4.09
45-49 3.62 3.31
R 40-44 5.26 5.24
O 35-39 5.33 5.68
U 30-34 6.15 6.74
P 25-29 6.66 7.81
20-24 7.31 7.98
15-19 10.82 10.12
10_14 12.85 10.95
05_09 14.97 12.81
0-4 10.81 9.59
20 15 10 5 0 5 10 15
Proportionate Percent population by sex
VITAL STATISTICS
Records of vital events- births, deaths,
marriages& divorces- obtained by
registration.
Used for generating birth and mortality
demographic information.
Data on important indicators of
(occupational setting)
Hospital based records
SURVEYS.
Usually ad-hoc, but may be routine.
Popular National surveys-
32
Common quantitative
techniques
Structured Interview using questionnaire
Service Statistics – Information routinely
33
Common Qualitative
Techniques
Focus Group Discussion
In-depth Interview
Observation
- Direct
- mystery client
- ethnological technique
34
Preparing for Data
Collection
Address any ethical concern
Prepare written guidelines for how data will
be collected
Pretest instruments
Modify Questionnaires
Train all staff involved
35
Ethical Concern
Parental Permission
Informed Consent
Voluntary Participation
Confidentiality and Privacy
36
Pilot-Testing Survey Questions
Detects difficult questions
Verifies duration to complete questionnaire
Builds competence in data collector
Uncover problems in field procedures
37
Processing Quantitative
Data
Steps to organize your data for analysis.
* Field editing
* Coding
* Data Entry and Tabulation
* Data Cleaning
38
Field Editing
Involves systematically reviewing
questionnaires for consistencies and
completeness
Systematic organization of data, recording
39
Coding
Process of organizing and assigning
meaning to data eg. CGPA
First Class 1
Second Class Upper 2
Second Class Lower 3
Third Class 4
Pass 5
40
Data Entry
Data will usually be entered into a computer
program prior to analysis.
Statistical Packages with data entry
modules are:
Epi-info (both DOS and Windows Versions)
SPSSPC (DOS)
SPSS Data Entry Builder (Windows)
ISSA etc
41
Data Cleaning
Checking for and correcting errors in data
entry.
Some software packages have built-in-
42
Types of Data Entry Errors
Missing data
Inconsistent data
Out of range values
43
Missing Data
Respondent declines to answer a question
A data collector failed to ask or record a
response
A data entry clerk skips a question
44
Inconsistent Data
A respondent may contradict himself
thereby creating inconsistency in reporting
45
Out-of-Range Values
Impossible or implausible data items eg
30 recorded as number of years of
experience for a respondent who is 25 years
old
46
ANALYSING QUANTITATIVE
DATA
Two Types of Analysis:
* Descriptive
* Inferential
Three levels of Analysis:
* Univariate Level uni = single
variable
* Bivariate Level bi = two variables
* Multivariate Level multi = 3 or more
47
SCALES OF MEASUREMENT
Measurement
The way variables are
measured is very important.
Measurement is the
assignment of numbers to a
variable
Measurement determines the
(non-numerical/qualitative)
Ordinal
(non-numerical/qualitative)
Interval
(numerical/quantitative)
Ratio
(numerical/quantitative)
Scales of Measurement
Nominal scale – lowest level
of measurement. Merely
classifies the measure into
mutually unordered
categories; has no notion of
numerical magnitude e.g.
gender (male, female), blood
group (A, B, AB, O)
Nominal Variable
Classifies persons or objects into two or
more categories
Members of a category have at least one
common characteristic.
We cannot quantify or even rank-order
those category.
For identification purpose, nominal
variables are often represented by
numbers.
The values of the scale have no 'numeric'
meaning in the way that you usually think
about numbers.
52
Examples of Nominal
Variables
53
Scales of Measurement
Ordinal scale – in addition
to its nominal property,
has ability to rank or order
phenomenon. It is defined
by related categories e.g.
grades of pain (mild,
moderate, severe), social
class (I, II, III, IV ,V).
Scales of Measurements
Intervalscale – measurements
are expressed in numbers
except that the starting point is
arbitrary, depending largely on
the units of measurement.
Meanings can be physically
attached to the difference
between 2 measurements on
this scale, but not to their ratios,
as the ratio of any 2 intervals is
Scales of Measurements
Ratio scale – has all the 3
Data
Summary indices (next lecture)
TABULAR PRESENTATION.
Done in form of frequency tables.
Can be for both quantitative and qualitative
data.
Definitions for Frequency Table
Others 75 0.10
Total 746 1.00 61
CLASS BOUNDARIES& INTERVALS
A class or group boundary lies midway between the data values.
For example,
For data in the class or group labelled:
7.1 – 7.3
(a) The class values 7. 1 and 7.3 are the lower and upper limits of the
class and their difference gives the class width.
(b)The class boundaries are 0.05 below the lower class limit and 0.05
above the upper class limit (because the figures are in 1Decimal
place)
(c) The class interval/ width is the difference between the upper and
lower class boundaries.
(d)Question- What are the class boundaries if the figures are between 7
and 8?
Graphical or diagrammatic
presentation of Data
Diagrams give a very clear & understandable
picture of data
Comparison can be made between different
clean.
Simplicity- diagram should convey meaning
clearly& easily.
Scale must be presented along with the
diagram.
Vertical diagram should be preferred to
Horizontal diagrams.
TYPES OF GRAPHICAL
PRESENTATION
Dot Plot
Stem and Leaf Display
Line graphs
Bar chart – simple, multiple, component.
Pie chart.
Histogram
Frequency Polygon (Ogives).
Scatter diagrams
Graphical Presentation cont ....
Dot Plot and Stem and leaf display to
be demonstrated in class on a white
board.
Bar Charts
A bar chart is made up of
columns plotted on a graph.
The columns are positioned
90
89.4%
80
70
60
Weight Category
50
40
30
20
10 7.7%
2.9%
0
Normal Weight Overweight Obese
Figure 3: Weight category of Osun State in-school
adolescents based on location (Multiple Bar Chart).
100
93.1
90
80
74.0
70
60
53.5
% 50 46.5 Urban
Rural
40
30 26.0
20
10 6.9
0
Normal weight Overweight Obese
Weight Category
Figure 1: Proportion of urban & rural school-
children who experienced bullying in a 1-year
period, Osun State, Nigeria, 2009 (Component
Bar chart).
100
10.8 16.2
80
% of students
60
89.2 83.8
40
20
0
Urban Rural
Bullying No bullying
Source: Omisore et al., 2010
PIE CHART
Similar to Bar chart but is in
a circle.
Each category is given a
7.7%
Normal Weight
Overweight
Obese
89.4%
Histograms
Like a bar chart, a histogram is made up of
columns plotted on a graph. There is no
space between adjacent columns.
The columns are positioned over a label
range of values.
The height of the column indicates the size
distributions
Scatter Plot- Positive perfect
correlation.
r = +1.00
Scatter Plot- negative
correlation
r = -0.54
Scatter Plot- no correlation
PRACTICAL SESSIONS
CONCLUSION
We live in a data world- everything we hear,
see or do is often based on data (collected
information).
It’s vital that we learn the rudiments of
into practice.
THANK YOU