Week 1: Nature of Biostatistics
Week 1: Nature of Biostatistics
Inferential statistics can now be used to verify whether Example: In a research problem entitled,
the dice is a fair or not. "The Effect of Technology-based Instruction on the
DATA PRESENTATION AND ANALYSIS (Every Students' Mathematics Achievement".
study or experiment yields a set of data)
The independent variable here is the technology-based
Steps in Statistical Investigations instruction, while the dependent variable is the
1. Identification of the problem academic achievement of students.
- We generate the hypothesis Common statistical terms
2. Collection of data
Constant
3. Presentation of data
4. Analysis of data - Quantities that do not vary e.g. in biostatistics,
5. Interpretation of data mean, standard deviation are considered
constant for a population
Collection of data
Variable
Refers to the different methods and techniques of
gathering data - Characteristics which takes different values for
different person, place or thing such as height,
What is DATA?
weight, blood pressure.
It refers to any information concerning to a population
Categories of Variable
or sample
Qualitative Variable Uses categories or
Types of Data (According to Source)
attributes that are distinguished by some
Primary Data nonnumeric characteristics. ( color of the eyes)
Examples: Interview, registration, experiment, Quantitative Variable It is consist of numbers
questionnaire, etc. representing counts or measurements ( height,
weight)
Secondary Data
Types of Quantitative Variable
Examples: book, journal, newspaper, thesis,
Discrete Data
dissertation, etc.
Types of Data (According to Functional relationship) Quantitative data which can assume a finite or
countable number of values. (Whole numbers: no. of
Independent Data siblings)
It refers to any controlling data Cannot be represented by fractions or decimals
Dependent Data Continuous Data
It refers to any data that is affected by the controlling Quantitative data which can assume an infinity or
data many possible values corresponding to a point on a line
EXAMPLE 1: interval
BIOSTATISTICS AND EPIDEMIOLOGY
Examples of conversion of discrete to continuous data
MEASURING DISCRETE CONTINUOUS 3. Interval
Gas Tank Empty/ full Gas volume in It does not have a true-zero value starting point.
tank
There is no absolute zero, unique origin does not exists
Tree Heights Tall/Medium/Short Meters
Interval data are more powerful than ordinal scale due
Performance Poor/Average/Good Pieces/Hour to equality of intervals
Temperature No. of Days > Average Example:
Freezing Temperatures
Temperature in Fahrenheit
Delivery No of Late Time per
deliveries Delivery Standardized scores
Scrap No. of pieces > Average part 4. Ratio
Max. Length length
A modified interval level which includes zero as a
starting point.
Measurement of Data Highest level
1. Nominal Relation among Data Types
Data that consist of names, label or categories - Nominal and ordinal -> Nonparametric
commonly used number to categorize data. (qualitative data)
Lowest levels of data - Interval and Ration -> Parametric (quantitative
data)
Examples:
Nonparametric statistics may be used to analyze
Gender, marital status, or any alphabetic/numeric code interval and ratio data measurement.
without intrinsic order or ranking
PRINCIPLES OF SAMPLING
SUBJECT CODE
• Population (N)
Physics P
• Is the set of complete collection or totality of all
Chemistry C possible values of the variable.
Mathematics M • Sample (n)
Biology B • A subset or sub-collection of elements drawn from a
population.
2. Ordinal
Measurement which deal with order and rank.
Second level
Examples
Ranks or grades of students; Quality rating of service or
product
EDUCATION CODE Stages in the selection of a sample
• However, the higher the percentage of the sample, Determine the number of samples needed in each:
the higher the validity of the study.
• Sampling techniques
Probability Sampling
Simple random sampling
• In this technique elements of the sample are selected
through lottery.
Systematic sampling
• This technique of sampling is done by taking every
element in the population assignment of number as a
part of the sample.
• To select the systematic sample of n elements from a
The bigger population of N element, we divide the N element in
the population, the lesser percentage of the sample is the population in the n groups of kth element.
taken.
• For a specific calculation of the sample for the
purpose of adequate sampling, the use of the Sloven’s
Formula presented below is advised as given by Pagoso Cluster sampling
Simplified Formula for Proportion • Population under this technique is being divided into
sections (or cluster), randomly select some of these
cluster as the member of the sample size.
Stratified sampling
• In this technique, the population is subdivided into at
least two different subpopulation (or strata) that share
the same characteristics and then the elements of the
sample are drawn from its stratum proportionately
Example
N = Population;
n = sample
e = margin of error or percentage of error
BIOSTATISTICS AND EPIDEMIOLOGY
Non-Probability Sampling
Purposive sampling (Judgmental /
Authoritative)
• In this technique, the elements of the sample are
being selected according to the criteria or rules set.
Quota sampling
• In this technique, the sample size is limited on the
required number or subject in the study
Convenience sampling (Haphazard /
Opportunity)
• In this technique, the sample are being selected from
a particular place at specified time preferred.
Snowballing sampling
• In this technique, the researcher asks respondents to
give referrals to other possible respondents.