Chapter 1 Introduction To Biostat
Chapter 1 Introduction To Biostat
1
REFERENCES
Aryal, U. R. & Shrestha, Y. M. (2015). Biostatistics for
medical sciences. Kathmandu: Makalu Publication
house.
Gupta, S. C. (2012). Fundamental of statistics. Mumbai:
Himalaya Publication House.
Mahajan, B. K. (2016). Biostatistics. New Delhi: Jaypee
Brothers.
Panta, P. R. (2015). A text book of bio-statistics.
Kathmandu: Vidyarthi Pustak Bhandar.
Patel S. P. (2015). Biostatistics. Kathmandu: Samiksha
Publication
Rastogi, V. B. (2015). Biostatistics. New Dehli: MTDCEC
Siegel, S. (1965). Non-Parametric statistics for the
behavioral sciences. New York: McGraw-Hill Publishing
Com. Inc.
Wayne, W. D. (2010). Biostatistics, basic concepts and
methodology for the health sciences: London: Wiley 2
student edition.
REFERENCES
3
REFERENCES
4
REFERENCES
5
REFERENCES
6
REFERENCES
7
REFERENCES
8
REFERENCES
9
REMEMBER
10
UNIT ONE INTRODUCTION
What is the difference between these two
phenomenon?
Days Sun Mon Tue Wed Thu Fri Sat
Temperatu 24° 26° 27° 30° 29° 28° 25°
re
Distribution of Hemoglobin count in 100 ml Blood of
175 children
9.0 – 9.4 2
9.5 – 9.9 5
10.0 – 10.4 7
10.5 – 10.9 20
11.0 – 11.4 32
11.5 – 11.9 40
12.0 – 12.4 25
12.5 – 12.9 16
13.0 – 13.4 10
13.4 – 13.9 8
14.0 -14.4 5
11
14.5 – 14.9 5
CONTD…
12
ORIGIN AND DEVELOPMENT OF STATISTICS
13
CONTD…
1. Descriptive Statistics:
involves methods of collecting,
organizing, picturing and
summarizing information from data.
2. Inferential Statistics:
involves methods of using
information from a sample to draw
conclusions about the population.
16
CONTD…
Includes
Descriptiv -collecting
-organizing
e -summarizing
Statistics -analyzing
-presenting
Statistic
s Includes
-making
inferences
-hypothesis
Inferential testing
Statistics -Determining
relationship
-Making
predictions
17
DEFINITION OF STATISTICS (SINGULAR SENSE)
Croxton & Cowden defined,“ The
collection, classification, presentation,
analysis and interpretation of numerical
data”.
1. Collection of Data
3. Presentation of Data
4. Analysis of Data
5. Interpretation of data
18
Different Steps of Statistical Methods in Singular Sense
Editing Classification
Organisation
Dispersion Correlation
Comprehensive Sample
22
APPLICATION OF BIO - STATISTICS
1. Biostatistics is the application of statistical methods to a
wide variety of fields of biology like:
2. human biology
3. Molecular biology
4. Biochemistry
8. Biomass estimation
9. Statistical ecology
10. Veterinary
11. Microbiology
2. Francis Galton (1822 – 1911) cousin of Charles Darwin called the father of
Bio-statistics
Notable contributions in the field of heredity, eugenics, anthropometry and
psychology
Also contributed on correlation between two variables
KARL PEARSON(1857 – 1936) SIR RONALD AYLMER FISHER FRANCIS GALTON (1822 –
(1890 – 1962) 1911)
26
ROLE OF BIOSTATISTICS IN RESEARCH
1. Calculating averages
2. Variability of measurement
6. Combination of variables
7. Predict variables
27
Difference between Statistics and Bio-statistics
Statistics Bio-Statistics
1. It is the branch of science 1. It is the application of
where we plan, gather, statistical science to analyze
arrange and analyze public health related
information about a particular problems to further
collection of individuals or biomedical research.
objects under investigation.
2. It is the mathematical body 2. It is the biomedical body of
of science. science.
3. It is concerned with general 3. It is concerned with the
data on any discipline: study and analysis of
collection, classification, biological and medical data.
presentation, analysis and
interpretation.
4. It is utilized to reach a 4. It is applied mostly in
conclusion in each and every biological evaluations.
field.
28
CONTD..
5. Statistical activity spans a 5. Bio-statistical activity
broad range of individual spans a broad range of
problem, social problem, scientific inquiry from the
science and technology. broad structure and functions
of human beings. Though the
interactions of human beings
with environment, including
problems of environmental
sanitations, diseases,
organizations of health care
system and financing.
6. After studying statistics, 6. After studying biostatistics,
people get opportunities people get opportunities
every aspects. related to biomedical sector.
7. It is directed to questions 7. It is specially directed to
that arise in almost all field. questions that arise in
biomedical science.
8. It is the vast field of study 8. It has no any sub-divisions.
which has so many sub- 29
Statistical Terms
1: Data
It is a set of facts (entries or observations)
Facts or pieces of information.
General concepts refers to the fact of some existing
information.
Data can be divided into two types
i. Quantitative data & ii. Qualitative data
i: Quantitative data
Those information expressed in quantitative form.
Numerical facts and figures.
32
Difference between Primary and Secondary data
SN Primary Data SN Secondary Data
1 Primary data are original in 1 Secondary data are not original in the
sense that they are sense that, they are collected by
personally collected by the someone other, than the concern
researcher involving researcher so it may or may not fit for
directly/ indirectly by the same research purpose.
himself/herself in the
research concern field so it
is more reliable.
2 Primary data collection is 2 Secondary data are readily available
more expensive and time at less expenses.
consuming.
3 Primary data are collected 3 Secondary data might have been
as per requirement of the collected with different objective.
concern investigators.
33
Methods of Collecting Primary Data
34
STEPS FOLLOWED IN COLLECTING PRIMARY DATA
• Planning the study:
• Reviewing the earlier work
Step • Scope of study
• Determination of statistical units
1 • Method of data collection
• Editing:
Step • Field edition
4 • Office edition
2. Validity:
It is the use of appropriate procedure for data collection.
If a biased scale is used, the repeated measurements will give the same results
but they are not valid because the same mistakes or same performance is used
during measurement.
3. Suitability of data:
It is essential to see whether the collected data are suitable for the purpose of
inquiry or not.
Investigators should conform various terms and units which are clearly defined
and uniform or not.
4. Adequacy of data:
It is necessary to see whether it is adequate or not for the inquiry.
36
Classification
According to Connor, “Classification is
the process of arranging collected
information(either actually or notionally)
in groups and classes according to their
resemblances and affinities and gives
expression to the unity of attributes that
may subsist among a diversity of
individual”.
In other words, classification is grouping
of data on the basis of any common
characteristics they may have.
37
TYPES
There are four types of classification of data:
1. On qualitative basis: Classification of the total population of a
country on the basis of gender, religion, occupation etc.
belongs to this type. This is also known as classification by
attributes.
2. On quantitative basis: Classification of the total population in a
natural countable way. Here, the basis of classification is some
variables, and hence this is also known as classification by
variables.
3. On time basis: Some statistical data are arranged in order of
their time of occurrence. Production of a factory may be shown
by weeks, months, quarters or years. Statistical data classified
according to time as known as time series.
4. On geographical basis: The total population of a country may
be classified by states or districts, exports of a particular
commodity from India may be classified by the country to
which exported. The basis of classification in such cases is by
geographical regions.
38
CODING
Coding means assigning numerals or other
symbols to the categories or responses.
For each question a coding scheme is
designed in such a way that the bulkiness
name of the variable will be represented by
the shortly written code.
Suppose a variable ‘educational status’ can
be coded as:
0 = undergraduate
1 = graduate
2= post graduate
39
DATA PRESENTATION
40
DATA RECORDING
Tabulation of data:
Tabulation is the process of arranging
the data in an orderly manner into rows
and columns.
The purpose of the tabulation is to
simplify the presentation of data and to
facilitate comparison between related
information so collected.
It is the final steps of collection of data
and is a stepping stone to the analysis
and interpretation of facts and figures.
41
DIFFERENT PARTS OF TABLE
1. Table number: In order to identify tables, every
table should be numbered. The number of table is
normally kept at the top of the table either at the
centre above the title or at the side of the table.
2. Title of the table: every table should have suitable
title that indicating the type of data contained.
Title should be short, clear, simple and non-
ambiguous.
3. Caption(column heading): The title of the column
is known as caption. The word caption should also
be simple and brief. Caption are usually written at
the middle of the column in small letters to save
the space. If the different columns are measured
in different units, the units must clearly be
mentioned in the columns.
42
CONTD…
4. Stubs (Row heading): Stubs means row heading. They are at
the extreme left. The function of stubs in the horizontal rows
is same as the function of captions in the vertical columns.
5. Body of the table: This is the most important part of table. It
contains data. The data are entered from the top to bottom
in columns and from left to right in rows.
6. Head note: generally, a head note is kept just below the title
of the table indicating the unit of measurement applicable to
the data displayed. It is normally kept within brackets. It
helps to clearify the points relating the content of the table
that have not been included in the title nor in the caption
and stubs. Example: The unit of measurement such as
thousands of rupees.
7. Foot note: Foot notes are written directly below the body of
the table whenever necessary. Anything in a table which can
not be understood by readers from the table, captions and
stubs should be explained in foot note.
43
CONTD…
Foot note:
Sources:
44
TYPES OF TABLE
1. Simple table: A simple table is one in which the data are
presented according to only one character.
Table No.
Title AREAS OF SAARC COUNTRIES
Head note:(in square kilometers)
Countries Area
Bangladesh 1,43,998
Bhutan 46,500
India 32,87,590
Maldives 9,000
Nepal 1,47,181
Pakistan 7,69,099
Srilanka 65,610
45
CONTD…
2. Complex table:
Data are presented according to two or
more inter-related characteristics
simultaneously.
If two characteristics are shown in a table,
it is said to be two-way table.
If three characteristics are shown in a
table, it is said to three way.
When four or more characteristics are
shown simultaneously it is case of
manifold tabulation.
46
CONTD…
Table No.
Title: STUDENTS OF SECONDARY SCHOOL,
ACCORDING TO GENDER FOR 2011 – 2012
47
CONTD…
Three ways table:
Table No.
Title: DISTRIBUTION OF STUDENTS OF
T. U. ACCORDING TO GENDER, AGE & FACULTY
Age Faculty
Humani Manageme Science Education Others Tota
ty nt l
B G T To B G T To B G T To B G T To B G T To
19-
22
22-
25
27-
abov
e note: B = Boys, G = girls, T = (transgender) , To = Total
Foot
Total Source:
48
Sample and Sampling
Terminologies Definition
1. Population: A group of whole individuals under study.
i. Finite: A population containing finite number of objects
or items. Eg. The students in a certain college.
ii. Infinite A population containing infinite number of
objects or items .
Eg. All married women of 20 to 35 years of age.
2. Sample Number of units drawn from the population for
the study.
3. Census If every items of the population is examined
under study then it is census survey.
4. Sampling Process by which appropriate samples are drawn
from the population.
5. Sampling List of each and every units in the population.
frame
6. Sampling unit Individual unit from the sampling frame.
7. Sample size Number of samples taken for the study.
49
Sample and Sampling
Terminologies Definition
8. Sample statistic Information provided by the sample or A summary
value of the variable for a sample.
9. Parameter A summary value of the variable for a population.
10. Sampling Process by which samples are drawn from the
process population
11. Unit is the object or individual or group that can be
investigated as the source of basic information. For
example: individual patient, small sub-areas of land
etc.
12. Population Number of items in population having same
proportion characteristics. Population proportion size (P) = .
13. sample Number of items in sample having same
proportion characteristics.
14. Sampling ratio Sample proportion size (p) = .
14. Sampling ratio The ratio of sample size of the target population.
For example: Population = 20,000 people, sample
= 200 people. Then, .
50
Notation
Population Sample
Size ‘N’ ‘n’
Mean ‘μ’ ‘X̅’
Standard Deviation ‘σ’ ‘S’ or ‘s’
Proportion ‘P’ ‘p’
Regression ‘’ ‘b’
coefficient
Correlation ‘’ ‘r’
coefficient
51
CENSUS VS SAMPLE
Census Sample
Refers to complete inclusion of Sample is a part of the
elements in the population. population.
52
Variables
Food
Habit
Regular
Weight
Exercise
Blood
Heridity Age
Pressure
53
Variables
Variables are the conditions or characteristics that the
experimenter (researcher) manipulates, controls or observes.
It can be a persons, things, events groups, objects, ideas, feelings
etc.
The variables are the symbol to which numerals or values are
assigned.
For example:
Gender is a variable. The two/three possible values of Gender are
(male/female/transgender)
Education is a variable. The three possible values of education are
(undergraduate, graduate, post graduate)
Other example of the variable are: length of service, employ
attitude, job satisfaction, height, length, weight, income,
expenditure, no. of patient, no. of medicine etc.
Variables are often symbolized by a letters of the English
alphabets such as: x, y, z etc. these letters of alphabet symbolizes
a particular value for a particular variable.
Variables are of two types and they are:
54
CONTD…
i. Numerical Variable (Quantitative): When all the values of the
variable are expressed in numbers then it is called numerical
variable. For example: income, weight, height, length.
Numerical variable also can be further sub divided into:
55
Attributes Variables
Those characteristics that cannot be altered by the experimenter.
An attribute is a specific element or value in a variable.
For example: The variable gender has two attributes: (male and
female).
Satisfaction might be defined as five attributes as:
Nominal
Qualitative
Ordinal
Scales of
measureme
nt
Interval
Quantitativ
e
Ratio
57
a. Nominal Scale
Classify a response from a variable into mutually exclusive and
exhaustive categories i. e. possible values are categories.
This follows for only qualitative classification.
Example:
62