BioStatistics Ganbawale
BioStatistics Ganbawale
Research Methodology
By,
Dr. Rahul Manvendra Ganbawale
B.H.M.S, M.D. (Hom.)
- Hon. Visiting Lecturer Dept. of Biostatistics and
Research Methodology, Dr. J. J. Magdum Homeopathic
Medical College, Post-Graduate Research Institute,
Jaysingpur.
- Member of International Homoeopathic Medical
Society, U.S.A.
BIO – STATISTICS AND RESEARCH METHODOLOGY 2
* Published by,
* Price. Rs.
* Printed at:
BIO – STATISTICS AND RESEARCH METHODOLOGY 3
Dedicated to,
My parents and teachers.
Their words always inspired me.
BIO – STATISTICS AND RESEARCH METHODOLOGY 4
FORWORD
I am happy to write forword to book – Biostatistics and
Research Methodology written by Dr. Rahul Ganbawale. I have gone
through this book. This is a unique book written by Homoeopath on
Biostatistics and Research Methodology, up till now allopath write
these types of books. But as this book is written with Homoeopathic
background, is useful to students who are doing Post Graduation in
Homoeopathy. This will serve purpose for doing thesis and for
appearing examination.
In Homoeopathy many practioners are doing well, but their
results are not published. To flourish Homoeopathy we should
produce our result in Statistics form and also we should understand
Research Methodology. This book will serve this purpose.
This young Homoeopath is sincere; research oriented and will
contribute for Homoeopathy.
I wish best luck for his bright future and congratulate for
writing this book.
FORWORD
ACKNOWLEDGEMENT
Contents
Common Terms used in Biostatistics and Research
Methodology.
List of Symbols and Greek Alphabet.
Part I - Biostatistics
1. Statistics
2. Statistical Data
3. Sampling
4. Central Tendency
5. Dispersion
6. Normal Distribution
7. Measures of Location
8. Probability
9 Statistics and Epidemiology
10. Life Table
11. Errors in Statistics and Research
12. Correlations and Regression
Mathematical Tables
1. Logarithm Table
2. Anti Logarithm Table
3. Areas of Standard Normal Distribution
4. Percentile Value for Students „t‟ Distribution
5. Percentile Value for the Chi-Square Distribution
6. Conversion of Persons „r‟ into corresponding Fishers „z‟
Coefficient.
7. Spearmans Rank difference Correlations
Exercise
References
BIO – STATISTICS AND RESEARCH METHODOLOGY 11
Concept:
It is an overall idea about particular things or phenomena.
Correlation:
It is the relationship or association between two continuous
variables.
Data:
It is collected information on a specific subject, or a set of
values obtained by measurement or counting.
Editing:
Its is a process of modifying by examining and correcting any
errors.
Hypothesis:
A tentative prediction or explanation of the relationship
between two or more variables on the basis of limited
evidence as a starting point for further investigation.
Incidence:
It is the number of new cases arising in a given period in a
specified population.
Interpretation:
It is an art of drawing conclusions and explaining their
significance after a careful analysis.
Intuition:
It is the method, which relies upon its appeal to reason. The
investigator who supports the proposition should show that the
reasons are considered to be correct so that it becomes an
important step in research.
BIO – STATISTICS AND RESEARCH METHODOLOGY 13
Mean:
It is the average of a set of quantities and is calculated by
dividing the sum of quantities by the number of cases.
Measurement:
It is the application of number to the characteristics of objects,
persons, states or events.
Median:
It is the point in a frequency distribution of observed values or
quantities with an equal number of cases on either side of it.
Meta Analysis:
It is a process of combining the result of several clinical
studies on the same subject to derive a definitive conclusion.
Mode:
It is the most frequent observations seen in a series.
Morbidity:
It is defined as any departure, subjective or objective, from a
state of physiological well being.
Mortality:
It means death from any cause. Mortality rate is the number
of deaths in a given area or period, or from a particular cause.
Natality:
It is defined as the rate of birth per unit time per area.
Population:
Population is defined as the total number of individuals of a
given species at a specific area at a particular time, or it is an
entire mass or a group of people or group of quantitative data.
BIO – STATISTICS AND RESEARCH METHODOLOGY 14
Prevalence:
The prevalence of a disease is the number of cases in a given
population at a specific point of time.
Randomization:
It is a procedure by which participants are allocated into study
group and control group.
Rate:
It is a measure of the occurrence of some particular event (e.g.
development of disease or the occurrence of death) in a
population during a given time period.
Ratio:
Ratio is the quantitative relation between two amounts
showing the number of time one value contains or is contained
within the other.
Research:
A scholarly pursuit directed towards systematic search, or
investigation in to and study of materials and sources in order
to establish facts and reach new conclusion.
Research Methods:
The techniques that are used for conduction of a research.
Research Methodology:
It is the way to solve problems systematically; or it is a
planned approach towards observing the reality, defining the
problems, examining its various dimensions, analyzing and
evaluating the information and drawing conclusions there
from.
BIO – STATISTICS AND RESEARCH METHODOLOGY 15
Sample:
It is a selected number of individuals each of which is a part
of whole population.
Sampling:
It is a technique of selecting a sample.
Sampling Frame:
It is a cluster of units from which sample is to be selected.
Scale:
The instrument, which is used, is called scale.
Scientific Research:
It is a systematically controlled, empirical and critical
investigation of hypothetical proposition about the presumed
relations among the natural phenomena; or it is the process of
knowing new facts and verifying old ones by the application
of systematic and scientific methods to the natural
phenomenon.
Sensitivity:
It is the ability of a test to identify correctly those who have
the disease that is true positive.
Specificity:
It is the ability of a test to identify correctly those who do not
have the disease that is true negative.
Screening:
It is testing for infection or disease in population or on
individuals who are not seeking health care.
BIO – STATISTICS AND RESEARCH METHODOLOGY 16
Tabulation:
It is the summarization of results in the form of statistical
tables.
Tenacity:
It is the tendency to continue to believe a proposition through
habit keeping a firm hold on it. Here, we accept a preposition
as true simply because we have always believed it to be true.
Transcription:
It is an act of transcribing or making copy of any kind from
the original.
Validity:
It means acceptability depending on accuracy.
Variable:
It is a quantity which is able to assume different numerical
values as per need. In statistics variable are the characteristics
of an individual which can be measured numerically.
BIO – STATISTICS AND RESEARCH METHODOLOGY 17
LIST OF SYMBOLS
Symbol Meaning
W Set of whole numbers
N Set of natural numbers or Number of observations
Q Set of rational numbers or Quartile
< Is less than
> Is greater than
~ is similar to
Since
Therefore
The absolute value of x
f(x) A Function of x
Varies as
Σ Summation
X Chi
μ Mu
ƒ Frequency
√ Root
A Assumed mean
d Deviation of variables
V Variance
X Mean
P Probability
Ho Null Hypothesis
O Observed value
E Expected value
BIO – STATISTICS AND RESEARCH METHODOLOGY 18
r Coefficient of correlation
D Difference
G Geometric mean
H Harmonic mean
M Median
L Lower limit
C Cumulative frequency
i Width of class interval
Mo Mode
m Mid - point
SE Standard error
df Degree of freedom
fo Observed frequency
fe Expected frequency
pdf Probability density function
Ø Mother Tincture
BIO – STATISTICS AND RESEARCH METHODOLOGY 19
GREEK ALPHABET
Symbol Meaning
λ Lambda
Δ Delta
б Sigma or Standard deviation
p Rho Spearmen‟s rank correlation coefficient
α Alpha
β Beta
γ Gamma
δ Delta
Ψ Psi
BIO – STATISTICS AND RESEARCH METHODOLOGY 20
PART I - BIOSTATISTICS
BIO – STATISTICS AND RESEARCH METHODOLOGY 21
BIO – STATISTICS AND RESEARCH METHODOLOGY 22
Chapter No. 1
STATISTICS
History:
tatistics the word is derived from Latin word „Status‟. In
S German mythology a word „Statistik‟ was used for the
first time political arrangement. It was Gottfried Achenwall who used
his word in the history of medicine.
Pierre C.A. Louis (1787-1872), a pioneer of clinical Statistics,
studied 77 patients of Pneumonia. He counted and compared the
results of patients treated by „Blood Letting‟ and concluded that early
bleeding was associated with reduced survival.
William Bateson and Galton applied Mathematics to
Evolution.
In 1959, Robert Ledley and Lee Lusted published –Reasoning
Foundations of Medical Diagnosis. They were the first to provide a
mathematical analysis of the reasoning process inherent in medical
diagnosis
Wilfrid Card and Jack Good (1973) wrote on Mathematical
Structure of Clinical Medicine and Logical Analysis of Medicine.
Dr. Hahnemann also mention under the foot note of §106 in
his 6th edition of Organon of Medicine - The healing art will then
come near the mathematical sciences in certainty.
Statistics:
In simple words, Statistics is a science of figures which deals
with data in an experimental study.
BIO – STATISTICS AND RESEARCH METHODOLOGY 23
According to Webster:
Statistics are the classified facts representing the conditions of
the people in a state especially those facts which can be stated in
number or in a table of numbers or in any tabular or classified
arrangement.
According to Bowley:
Statistics is a numerical statement of facts in any department
of enquiry placed in relation to each other.
According to Horace Secrist:
By statistics we mean the aggregate of facts affected to a
marked extent by multiplicity of causes, numerically expressed,
enumerated or estimated according to reasonable standards of
accuracy, collected in a systematic manner for a predetermined
purpose and placed in relation to each other.
According to Croxton and Cowden:
Statistics may be defined as a science of collection,
presentation, analysis and interpretation of numerical data.
In brief, Statistics is a method of taking decision on the basis
of numerical data properly collected organized, presented, analyzed
and interpreted.
Biostatistics:
The branch of statistics concerned with data related to living
organisms is known as Biostatistics or Biometry.
Biometry:
In Greek, bios = life and
Matron = measure
BIO – STATISTICS AND RESEARCH METHODOLOGY 24
Characteristics of Statistics:
Statistics is a quantitative expression and not a qualitative
expression.
Statistics is the aggregate of facts.
Statistics is affected by multiplicity of causes.
Statistics is estimated based on reasonable standard of
accuracy.
Statistical data is collected for pre-determined purpose.
Statistical data should be related to each other.
Limitations of Statistics:
It does not study the qualitative aspects.
Individualistic study is very difficult in this science.
Statistical studies are based on average values or results.
It lacks mathematical accuracy.
Statistics is only a means and not an end of study.
Aims of Statistics:
The aim of Statistics is not only to collect numerical
data but also to provide a methodology from which certain
conclusion can be arrived it.
BIO – STATISTICS AND RESEARCH METHODOLOGY 29
Chapter No. 2
STATISTICAL DATA
i) Nominal Data:
The following data indicating the sex, male or female, of the
patients undergoing appendectomy are as follows:
Patients: F,M,F,M,M,F,F,M,F,M,F
M=5
F=6
BIO – STATISTICS AND RESEARCH METHODOLOGY 30
ii) Ordinal:
Example: The researchers have to evaluate the effectiveness
of a newly proved homoeopathic drug for headache versus placebo
treatment.
Pain intensity is as follows:
Experimental Group : 3,4,4,3,3,3,4,2,1,3,2,1,3,4,4,2,3,3,3,3.
Placebo Group : 4,4,4,4,4,3,4,3,2,4,4,2,4,5,3,4,4,4,4,4,4
1 = no pain
2 = mild pain
3 = moderate pain
4 = severe pain
Table No. 2 Pain intensity of patients following placebo and drug
treatment:
Pain Experimental
Placebo Group
Intensity Group
1 2 --
2 3 2
3 10 3
4 5 15
n = 20 N = 20
BIO – STATISTICS AND RESEARCH METHODOLOGY 31
Collection of Data:
There are many sources of collection. Collection of data is the
first stage in Statistics. We can collect materials or data from the
following sources:
Sources of Data
External Data
1) Primary Data:
Primary Data
Primary Data:
The data which is obtained by the investigator either by his
own or through some agency set up for a specific purpose, directly
from the field of enquiry for the first time is called Primary Data. This
is original one. Ideally it should be unprejudiced. Many scientific
researches are based on primary data.
Primary data can be collected by the following methods:
I] Direct Personal Interviews:
Here, the researchers or his agency collect data personally
from persons who are the subject of enquiry.
If the area of an enquiry is limited, we can directly reach to
each person, provided researcher should be unprejudiced, skilled and
pleased. Example: If a teacher wants to know the personality of
students in a class he may interview his students personally one by
one.
ii] Indirect Personal Interview:
If a person refuses to provide information to an investigator,
then we can adopt indirect personal interview method. Here the
information is collected from the persons who are nearest to him.
( His relatives, family members, friends, associates etc.)
BIO – STATISTICS AND RESEARCH METHODOLOGY 33
Secondary Data:
Secondary Data
Published Unpublished
Secondary Data:
The data which is collected by some agency when used by
another, or collected for one purpose when used for another may be
termed as Secondary Data. The main advantage of this system is that
it saves time and money.
Secondary data is either published or unpublished information
Published Data:
1. Govt. Publications:
Many government departments such as Health and Family
Welfare Dept. of central and state government regularly publish
current information with statistics. E.g. Health Statistics, Agriculture
Statistics of India, Indian Trade Journal, Gazette, etc.
2. International Organization:
Many international organizations like WHO (World Health
Organization) UNO (United Nations Organization), UNICEF publish
valuable data annually regarding people‟s health, Trade and
Companies atomic research. Etc. which provide valuable statistical
information about each country and its place in the world standard.
3. Semi-official Publications:
Local bodies such as Municipal Corporation, Gram Panchayat,
etc. publish reports periodically, which provide information about
people‟s health, birth and death rate, sanitation, literacy, etc.
4. Reports of Committees or Commissions:
There are committees and, or commissions of enquiry
appointed by the central and the state government for some special
purpose and study. Their reports have very high values.
BIO – STATISTICS AND RESEARCH METHODOLOGY 36
5. Private Publication:
i) Journal and Newspapers:
Journals like Links, Asian Journal, Journal of Industries etc,
provide valuable statistical information on health, medical status, and
industries etc. commerce. Many newspapers like The Times of India,
The Economic Times on Financial Express regularly publish data on
different fields.
ii) Research Institutes:
These are the most important sources of secondary data. Many
research institutes at national level like ICR, CCRH and various
departments at university level provide data on different subjects
from their research activities.
iii) Private Companies:
Companies in the private sector publish annual reports along
with the balance sheets and profits and loss accounts. These reports
help the government to assess the economic status of the state. The
data related to financial positions of the companies, their annual
productions and sales etc. are considered as secondary data and
become useful to the economists and researchers.
iv) Articles and Reports:
Articles and reports published by various field workers and
social activities provide valuable information on different subjects.
Unpublished Data:
In many departments like, Atomic research departments,
information and technology dept. medical research institutes,
Research workers or scientists have great information regarding their
BIO – STATISTICS AND RESEARCH METHODOLOGY 37
Organization of Data:
Organization of data is nothing but simply classification of
data. Classification is the process of dividing different things into
definite classes according to their similarity and dissimilarity.
Soon after collection of data starts, arrangement should be
made to scrutinize them. After proper scrutiny, the information is
classified and tabulated. The main object of statistical analysis is to
abstract significant facts from a large collection of data. The
statisticians arrange these data according to their characteristics and
affinities. Classification or systematic arrangement of data is called
BIO – STATISTICS AND RESEARCH METHODOLOGY 38
Simple Compound
Geographical Classification :
Collection of data according to area or region of some parts of
a country is called Geographical Classification.
It is further classified as follows:
BIO – STATISTICS AND RESEARCH METHODOLOGY 39
i) Alphabetical Order:
Here, names of the villages, districts, states or countries given
alphabetically.
Example: Countries – America, Burma, China, Denmark, and their
Literacy Rate – 95%, 65%, 60%, 80% respectively.
ii) Descending Order:
Here, names of the countries do not appear in an alphabetical
order but the percentage distribution is according to descending order
(consider above example).
Countries : America, Denmark, Burma, China.
Literacy rate : 95% 80% 65% 60%
Chronological or Temporal Classification :
Classification of data according to time is called chronological
or temporal classification. Time may be in the form of year, month,
weeks, etc. It may be classified either in ascending order or in
descending order as per requirement.
i) Ascending Type:
Example: Population of Kolhapur city for the last 5 years is as
follows:
Year Population
2000 – 2001 60500
2001 – 2002 74200
2002 – 2003 80350
2003 – 2004 90411
2004 – 2005 112350
BIO – STATISTICS AND RESEARCH METHODOLOGY 40
Male Female
This type of classification is also called as ‘Dichotomous’
classification.
Definition: When the classification is done with respect to one
quality or attribute of a group into two classes is called Simple
Classification.
ii) Compound Classification:
When the classification is done with respect to two attributes/
qualities and where several classes are formed then that classification
is called Compound Classification.
Example: When a number of students are classified with respect to
two qualities simultaneously, that is sex and intelligence, and then
they are first classified on the basis of sex- male and female.
BIO – STATISTICS AND RESEARCH METHODOLOGY 41
Male Female
No. of students 10 12 18 20
Here, there are 10 students who have expenditure group of Rs. 50 to
99 and so on.
i) Continuous Data:
A continues data is one for which there is a possible value
between any other two possible values. For e.g. when we consider the
height of students, then the value may be any fraction of a number
and can be measured theoretically to any degree of accuracy. That
means if we measure a height of 70 cm and a height of 71 cm then
there is every possibility that the height of any student may be within
the range of 70cm to 71cm that is 70.2cm, 70.3cm, 70.4cm, 70.6cm.
etc. Thus here we get a continuous variable.
BIO – STATISTICS AND RESEARCH METHODOLOGY 42
Presentation of Data:
An investigator has to collect data for his predetermined
purpose. Soon after collection of data, the arrangements should be
made to edit the information for removing any inconsistency or in
accuracy present. Then he classifies all data in to different groups,
classes or intervals. After classification and tabulation report is
prepared by the investigator describing the purpose of the enquiry,
method of collection of information, definition of terms used, degree
of accuracy, places and time of collection of data, final results and
conclusion, and suggestion, if any for any interpretation of the data.
Diagrammatic presentation
Graphical presentation
Tabular Presentation:
Tabulation
Simple Complex
Here the classified data are put in a table having rows and
columns.
The process by which the classified data are presented in an
orderly manner in rows and columns of a table with their
characteristics are known as „Tabulation‟.
Ideal Table:
The following are the essential features of a table:
A table should be simple, easy to understand.
It should not be overloaded with details of each criterion.
It should be attractive and comprehensive. (That is the
proportion of columns, rows and size should be maintained).
Units of measurements must be mentioned.
E.g. weight in Kg, Hb in percentage, etc.
It should have a suitable title, table number and if required,
footnotes and sources of data should be mentioned.
Columns and rows should be numbered for the convenience of
future reference.
Row Heading
Body
Types of Tabulation:
Simple Tabulation:
In a simple table only one set of data gives information. Here
we find only two columns. In this type of tabulation we study only
one characteristic of a data. Therefore, it is also called as One-way
table.
Example: Different districts of Maharashtra and its population:
Table No.4
Sr. No. Districts Population
1 Kolhapur 115302
2 Nagpur 88900
3 Solapur 76530
4 Pandharpur 70312
BIO – STATISTICS AND RESEARCH METHODOLOGY 46
Complex Tabulation:
Here, more than one set of data are presented in a table. In this
table each numerical figure is the value of the measurement having
the characteristics shown both by column and row headings.
i) Two-way Table:
The data are presented with two characteristics in this complex
table.
Example: The students of a high school can be divided according to
their class and they can be subdivided according to their sex as
follows:
Table No.5
Sr. No. Class Students Total
Males Females
1 6th class
2 7th class
3 8th class
4 9th class
5 10th class
Total
Table No.6
Sr. Class Students Totals
No. Boys Girls
Non - Intelligent
Non - Intelligent
Non - Intelligent
Intelligent
Intelligent
Subtotals
Subtotals
Total
Intelligent
1 6th class
2 7th class
3 8th class
4 9th class
5 10th
class
Total
Table No.7
Students
Boys Girls Total
Sr. Class Religion
Non-Intelligent
Intelligent
Intelligent
Intelligent
Intelligent
Intelligent
Subtotals
Subtotals
No
Total
Non-
Non-
Hindu
th
1 6 Muslim
class Others
Sub
total‟s
Hindu
th
2 7 Muslim
class Others
Sub
total‟s
Hindu
th
3 8 Muslim
class Others
Subtotal
Diagrammatic Presentation:
In occasions tabular representation of statistical data becomes
difficult to understand. In that case, diagrammatic presentation is
appealing to eyes and creates a lasting impression in mind. It revels
the hidden facts of the data. Usually picture presentation of data
makes a clear impression than any other diagrams.
BIO – STATISTICS AND RESEARCH METHODOLOGY 49
Diagrams
An Ideal Diagram:
The diagram should posses the following qualifications
The diagram should be observable by naked eye.
It should be orderly divided considering its accuracy.
It should have suitable heading.
The scale should be mentioned and selected according to the
size of paper and graph.
Geometric instruments must be used.
If necessary, footnotes and sources should be mentioned.
Shades and colours make diagrams more attractive.
It should be easy to comprehend.
Types of Diagrams:
The data can be represented in the form of different
dimensional diagrams as follows:
BIO – STATISTICS AND RESEARCH METHODOLOGY 50
70
60
50
Wt. in kg
40
30
20
10
0
Jan Feb Mar April May June
Months
Fig. 2.1
ii) Subdivided bar Diagram or Component Bar Chart:
In this type, the bar is subdivided into certain parts where the
total height is proportional to different sub-divisions representing the
component parts.
Fig. 2.2 represents such a subdivided bar diagram or a component bar
chart. There are four bars- the first one from the left represents the
total expenditure and its component parts (viz., expenditure modern
medicine, Ayurveda medicine, Homoeopathic medicine and other
system) for the year 2000-2001 and the other three depicting the same
items for the years 2001-2002, 2002-2003 and 2003-2004
respectively. The component parts are indicated by different hatchings
and the total expenditure by the complete bar. Component bar chart is
useful in comparing the part of the item to the whole, or among
different parts.
BIO – STATISTICS AND RESEARCH METHODOLOGY 52
Table No.9
Particulars Expenditure (Rupees in lac)
2000-2001 2001-2002 2002-2003 2003-2004
Modern Medicine 80.4 84.3 86.4 90.2
Ayurveda medicine 67.7 72.2 78.2 80.0
Homoeopathic medicine 43.2 45.4 47.3 50.2
Other System 60.4 62.8 66.4 70.4
Total 251.7 264.7 278.3 29
Fig. 2.2
Fig. 2.3
Literacy rate among people of 3 states Kerala, Maharashtra and
Karnataka in the years 2001 and 2005 is shown by multiple bar
diagram.
Fig.2.4
Here, each division of the bar indicates the average results of
the students of different medical colleges.
Table No.10
Area (acre)
Years Average Total Irrigated Un Total
farm farm irrigated
2003 – 2004 9.6 403.2 12.4 15.4 27.8
2004 - 2005 8.7 311.4 13.6 17.2 30.8
35
30
Area in acres
25
20 Un Irregated
15 Irrigated
10
5
0
2003 - 04 2004 - 05
Years
Fig. 2.5
ii) Squares:
It is used when the ratio between the quantities is very high.
For example, if the ratio between the two quantities is 10:1 the height
of one bar will be 10 times the height of the other bar, which is very
difficult to draw on the graph paper. In that case, we can present the
data in by squares as shown below:
BIO – STATISTICS AND RESEARCH METHODOLOGY 56
A B
Fig. 2.6
A B
Fig. 2.7
Fig. 2.8
Three-Dimensional Diagram:
i) Cubes:
When the ratio of two quantities is very high (e.g., 125: 1)
they are represented by cubes.
Their sides are proportional to cube roots of given quantities.
Example: If the quantities are in the ratio 125:1, sides of cubes will
be in the ratio 3 125 :1, that is 5:1, which is shown in figure as
under :
(A) (B)
Fig. 2.9
BIO – STATISTICS AND RESEARCH METHODOLOGY 59
Pictograms:
When the presentation is in the from of a picture is known as
pictogram. It is also known as Vienna method or ISO type method
because this type of technique was first represented by Dr. Otto
Neurath, who was the resident of Vienna. Nowadays this type of
diagram is widely used because of its quantity of attractiveness.
In a pictogram a symbol represents a certain value
Example: Find bellow table no. 12 and its corresponding fig. 2.10
Table No. 12
Year Consumption of Hom. Medicine (bottles)
2001 7000
2002 6000
2003 9000
2004 8000
2005 9500
BIO – STATISTICS AND RESEARCH METHODOLOGY 60
Fig. 2.10
Cartograms or Maps:
Cartograms are nothing but a geographical representation of
data or information. It is mainly used for comparison of two different
areas under some characteristics.
Example: In India states like Kashmir, Punjab, Utter Pradesh,
Madhya Pradesh, Bihar, West Bengal, Assam is called goiter endemic
areas, which are represented by Cartogram.
Fig. 2.11
BIO – STATISTICS AND RESEARCH METHODOLOGY 61
Graphical Representation:
Graphical representations are made when we have to represent
the data of a frequency distribution over a period of time.
Graphs
Graphs:
1. Line frequency graph.
2. Histogram.
3. Frequency polygon.
4. Frequency curve or smoothed frequency curve.
5. Cumulative frequency curve – Ogive.
1) Line Frequency Graph:
Here two variables are used, one on X – axis and other
9
8
7
6
Cases (Millions)
5
World
4
East Asia
3
2
1
0
-12000 2001 2002 2003 2004 2005
BIO – STATISTICS AND RESEARCH METHODOLOGY 63
2) Histogram:
It is a pictorial of frequency diagram distribution. It consists of
a series of blocks drawn adjacently on the same horizontal baseline.
Here class intervals are mentioned on x-axis and the frequencies on
Y-axis. Therefore the area of each block becomes proportional to the
frequency. The rectangles are drawn on each class interval with height
in proportion to its frequency. The number of such rectangles will be
equal to the number of classes. It is useful in presentation of quantity
to data.
Example: Here we take weights of 127 patients from 30-40 kg. With
class interval of 2. Now we draw histogram using following table
Table No.13
Weight in k.g. Frequency
30-32 25
32-34 10
34-36 30
36-38 46
38-40 16
Total 127
50
40
30
20
10
0
30 32 34 36 38 40
Fig.2.13
BIO – STATISTICS AND RESEARCH METHODOLOGY 64
3) Frequency Polygon:
In this type of frequency distribution the variant values are
plotted on X -axis and corresponding frequencies on Y- axis. Then
the points plotted on the graph paper are joined successfully by
straight lines and the polygon is completed by joining the two
extremities of the rectilinear figure. This is called a Frequency
Polygon. Here we get graph in to two forms.
First we take the first group having 30-32. This group is
having 25 frequencies. Now take mid point of this group (that is 31)
then plot a point corresponding to 31 on X-axis and 25 on y-axis. In
this way all frequencies are marked on the corresponding mid points
of the groups. Then connect all these points with straight line.
It is useful to compare two or more than two distributions on
the same graph paper.
Example: We take weight of 127 patents from 30-40 kg. with class
interval of 2. Now we draw frequency polygon using following table.
Table No.14
Weight in kg frequency
30-32 25
32-34 10
34-36 30
36-38 46
38-40 16
Total 127
BIO – STATISTICS AND RESEARCH METHODOLOGY 65
Fig.2.14
4) Frequency Curve:
If the points plotted for a frequency polygon are joined by a
free hand then we get a curve called „frequency curve‟.
When the class intervals are small and number of observe are large
the frequency polygon lose its angulations and forms a frequency or
normal curve. It provides continuo graph giving the relative frequency
for each value.
If the points plotted for a frequency polygon are joined by a
free hand smooth curve we get a „Frequency curve (The curves drawn
by different individuals will be different.
Example: We draw frequency curve using following table:
Table No.15
Marks No. of Students
10-20 20
20-30 40
30-40 60
40-50 30
50-60 20
BIO – STATISTICS AND RESEARCH METHODOLOGY 66
Fig.2.15
5) Cumulative Frequency Curve or Ogive:
Cumulative frequency is the total number of individuals in
each particular range from lowest value of the characteristic up to and
including any higher group value
Ogive is a graph of the cumulative relative frequency
distribution. Therefore if we want to draw Ogive we should convert
ordinary frequency distribution into relative cumulative frequency.
In drawing Ogive variate values are represented on the X- axis
and the cumulative frequencies are on the Y- axis.
Fig.2.16
BIO – STATISTICS AND RESEARCH METHODOLOGY 68
6. Lorenz Curve:
It is a cumulative percentage curve where percentage of items
is combined with the percentage of other things.
It was Dr. Mare Lorenz who investigated this curve and applied it into
statistics.
It is useful in the study of the degree of inequality in the
distribution. E.g. Growth of babies at different periods of time
40
35
30
25
20
15
10
5
0
2 4 6 8 10
Fig.2.17
BIO – STATISTICS AND RESEARCH METHODOLOGY 70
Chapter No. 3
SAMPLING
Laws of Sampling:
In statistics there are two laws of sampling as under:
i) Law of Statistical Regularity
ii) Law of Inertia
Law of Statistical Regularity:
It states that any group of objects taken from a large
group will tend to possess the same.
Law of Inertia:
It states that when the size of the sample increases, it
gives more representative information.
BIO – STATISTICS AND RESEARCH METHODOLOGY 71
Sampling of Variables :
When we study quantitative characteristic of the items in a
population we get statistics of variables.
In research we can use sampling of variables in 3 ways.
a) To estimate population parameters using a sample.
b) To find out degree of reliability of estimate.
c) To compare the observed of expected values and to find whether
this difference is due to the fluctuations of sampling or real
difference.
Samples are divided into 2 types:
Large Sample:
When the size of population is more than 30 items.
Small Sample:
When the size of sample is less than 30 items.
BIO – STATISTICS AND RESEARCH METHODOLOGY 72
Sampling Design:
While collecting primary data the information may be obtained either
by census method or sample method.
1. Census Method:
In this type each and every functional unit of the population or
group is studied. There fore, this method will provide more reliable
and accurate result. But it requires much time and money compared to
other method. For example, if the average Hb percentage of patients is
to be calculated then the Hb percentage would be obtained from each
and every patient in the Hospital and some total of that percentage of
Hb is to be divided. By total number of patients would get the average
Hb% of patient.
2. Sample Method:
In this type only a small part of the whole population is to be
studied and the conclusions are applicable to the whole population.
For example, a pathologist examines a few drops of blood and draws
conclusion on the blood composition of the whole body.
Sampling Techniques:
There are the techniques of taking a sample from a population.
Sampling Techniques
Probability Sampling:
1. Simple Random Sampling:
Here, a sample is selected randomly in which every item has
an equal chance of being included.
This method is used when population is small and equally
characteristically distributed. Here samples are selected freely.
Therefore, it is also called, Unrestricted Random Sampling.
Practically it is difficulty to select sample from a group. Therefore to
make it simple one can use either lottery method or table of random
number method.
i) Lottery Method:
This is a widely accepted method of selecting a sample from
population.
Procedure: Give each and every item a specific number, say, A, B,
C... or 1, 2, 3... Etc. Then mix all the papers into a box. Shake well
and then draw any paper (folded) from the box randomly one by one;
thus you get samples.
Disadvantages:
1) On the basis of similarities in characteristics division of the whole
population into groups is a very difficult task.
2) Overlapping of groups may be possible sometimes.
3. Systematic Sampling:
When we have a complete study of population if each and
every item stet then we can use this method. Here, we will take every
sampling interval from the sampling frame, which is obtained by
dividing total population by desired a sample size.
Example: Select 50 items from a population of 500 and calculate
sampling interval.
Solution: We know,
= 500
50
= 10.
If, we select a starting number, say, 20, by lottery method,
then every 10th item from onward has to be taken until we get 100
items. (30, 40, 50, 60, 70, 80, 90, 100, etc.)
BIO – STATISTICS AND RESEARCH METHODOLOGY 76
District 1
Maharashtra State District 2
District 3
District 4
District 5
District
5. Multiphase Sampling:
Here, one part of information is collected from the whole
sample and part of information is collected from sub sample.
Example: Leprosy survey. In the first phase, tests like Lepromin test
is done in all cases of sample. Those who are tested + ve in lepromin
test are screened in he second phase by culture of organism which is
more expensive and time consuming than the tests performed in the
first phase.
BIO – STATISTICS AND RESEARCH METHODOLOGY 77
6. Cluster Sampling:
Cluster is a small group of a population, which is randomly
selected. Many national surveys are based on cluster sampling where
villages, schools, colonies, corporation areas are considered as a
cluster.
Here, usually 30 clusters are selected, by random sampling
method and then whole population in the cluster is studied.
Example: If, we want to study the national state of one village we
have to select 30 clusters, which are the representatives of the whole
village.
For this we have to prepare cumulative population list of the
village.
- Here 30 clusters are the sampling intervals.
- Select the random numbers from sampling intervals.
- Total population of village = 2700.
- We have 30 clusters.
2) Judgment Sampling
In this type of method the selection of sample is based on the
judgment of the person concerned. This method depending on the
sampling design and purpose of representative ness. Therefore, it is
also called Purposive Sampling. Here an investigator makes a proper
judgment, which decides the effectiveness of the sampling.
3) Quota Sampling
In this type of method a group is selected by subtracting
randomization from stratified random sampling.
Example: If we want to make sampling from a village where
there are 25% farmers. 50% merchants and 20% workers. Then for
sampling the population of that village, same percentage of these
groups should be represented from the villagers.
BIO – STATISTICS AND RESEARCH METHODOLOGY 79
4) Convenience Sampling
When conventional source is used for sampling, it is called
convenience sampling method. Example: Telephone directory, etc.
This is not a scientific method, because many times some
peoples are not included here. (E.g. poor people are not considered in
such samples.)
5) Sequential Sampling
This sampling method is applied as a process for quality
control, because here samples are drawn one after another from a
population depending on the sample drawn earlier.
If the first sample is acceptable, no new sample is needed and
if the first sample is rejected, 2nd sample should be selected. And if
the 2nd one is doubtful, then the 3rd one will be selected for desired
results.
BIO – STATISTICS AND RESEARCH METHODOLOGY 80
Chapter No. 4
CENTRAL TENDENCY
Characteristics of an Average:
A good average must have following characteristics:
It should be simple in presentations.
It should be easy to comprehend.
It should have a fixed value.
It should not be affected by fluctuations of sample.
BIO – STATISTICS AND RESEARCH METHODOLOGY 81
Arithmetic Mean:
It is the number obtained by dividing the total values of
different items by their number.
Calculation for Arithmetic Mean:
i) Individual Series
ii) Discrete Series
iii) Continuous Series
BIO – STATISTICS AND RESEARCH METHODOLOGY 82
i) Individual Series:
In individual series the arithmetic mean is easy to
calculate. Here, we get the total of values and divide this total by
number of observations.
X = ∑X
N
Where,
X = Arithmetic Mean.
∑X=Sum of the values of observations.
N = No. of observations.
These, calculations are done by 2 methods:
A] Direct Method
b] Shortcut (Assumed Mean) Method
A] Direct Method:
The direct method of arithmetic mean can be used when the
items in a series are less.
Example: Calculate arithmetic mean of following marks in Anatomy
obtained by 10 students from first B.H.M.S class.
Students Marks (X)
A 05
B 10
C 05
D 15
E 20
F 05
G 10
H 10
I 05
J 15
N = 10 ∑ X = 100
BIO – STATISTICS AND RESEARCH METHODOLOGY 83
b] Shortcut Method:
When the items in a series are big or more we must use short cut
or assumed mean method. Here, we can assume any figure as the
mean and deviations from this mean is calculated. Then to get
arithmetic mean, the total of deviations from assumed mean is
calculated from assumed mean and is divided by the number of
observations.
By applying this method we can save much time and energy
without changing answer.
Example: Consider the same data given in previous example.
Here, consider assumed mean „A‟ = 15. Then calculate the
deviations from assumed mean that is X – A = d
I 05 - 10
J 15 00
N = 10 ∑ d = - 50
X = A+ ∑d
N
= 15 + (-50)
10
= 15 + (-5)
= 10
X = 10
Therefore, the average marks in Anatomy are 10.
ii) Discrete Series:
In discrete series we multiply the variable by their respective
frequencies and get the sum of the products and thus total obtained is
divided by the number of observations. That is the total of
frequencies.
X = ∑ ƒx or ∑ƒx
∑ƒ N
Here, we multiply the frequency with the variable X then we get the
sum of product (∑ƒx).
Then divide ∑ƒx by total number of observation
That is ∑ ƒ or N.
BIO – STATISTICS AND RESEARCH METHODOLOGY 86
By Shortcut Method:
Formula:
X = A + ∑ƒd
N
130 32 + 30 + 960
140 38 + 40 + 1520
150 40 + 50 + 2000
N = 179 ∑ ƒ d = 5060
X = A + ∑ƒd
N
= 100 + 5060
7
= 128.26
Therefore the average number of Medical colleges is 128.26
iii) Continuous Series:
In this type, the mid- points of various classes intervals are to
be obtained by following equation.
Mid points = l1 + l 2
2
Solution:
Weights No. of Mid points ƒm
(X) Patients (f) (m)
40 – 50 12 45 540
50 – 60 18 55 990
60 – 70 20 65 1300
70 – 80 24 75 1800
N = 74 ∑ ƒm = 4630
We know formula,
X = ∑ ƒm
N
= 4630
74
= 62.56
Therefore, mean weight = 62.56
ii] Shortcut Method:
Example: Calculate arithmetic mean of previous example by short
cut method.
BIO – STATISTICS AND RESEARCH METHODOLOGY 89
Solution:
Weights No. of Mid point s m – 20 ƒd
(X) Patients (m) (d)
(ƒ)
40 – 50 12 45 25 300
50 – 60 18 55 + 35 630
60 – 70 20 65 + 45 900
70 – 80 24 75 + 55 1320
N = 74 ∑ ƒ d = 3150
Median (Me)
When the set of observations are arranged either in ascending
order or in descending order, the middle most or the central value is
called Median which divides the observation series into two equal
parts. It is denoted by M.
Calculation of Median:
A] When data is ungrouped:
1) When ‘n’ is odd: In this case n + 1 th value is the Median
2
M = n + 1 th term
2
= 05 + 1 th value
2
= 3rd value
Thus 3rd value in data is 11.
Median = 11 Hb%.
Example: When ‘n’ is even:
Find out the Median of following terms:
12, 10, 7, 6, 4, 8
Solution: Let us arrange values in ascending order we get,
4, 6, 7, 8, 10, 12
=7+8
2
= 15
2
Median = 7 .5
B] When the data is grouped:
1] When the series is discrete:
Here, the values of variables are arranged in ascending order
or descending order of magnitudes. Then the median is calculated by
the following Formula:
the
M= n+1
2
BIO – STATISTICS AND RESEARCH METHODOLOGY 92
No. of patients 60 80 40 50 20 30
Hb% 11 12 14 10 09 15
Here, n = 280
th
Median M = n + 1 value
2
th
= 280 + 1 value
2
= 140.5th value.
From above table all items from 130 to 210 have their values 12.
Since 140.5th item lies in this interval.
BIO – STATISTICS AND RESEARCH METHODOLOGY 93
Where,
L = Lower limit of class in which Median lies.
n = Total number of frequencies that is n = ∑ƒ.
ƒ = Frequency of the class in which median lies.
C = Cumulative frequency preceding the Median class.
i = Width of class interval of class in which Median lies.
70 – 80 08
80 – 90 06
Therefore, Median =
Here, n = 20
2
L = 60
f = 10
C = 16
i = 10
BIO – STATISTICS AND RESEARCH METHODOLOGY 95
Merits of Median:
1] It is useful when the extreme values of the series are either not
available or abnormal
2] It is useful in case when the items are not susceptible to
measurement in units. Ex. In cases of intelligence, honesty etc.
3] Median is useful in distributions where, extreme classes are ill-
defined like, less than 10 or more than 80 or 100.
4] It is not affected by (abnormally) larger or lower values.
Demerits of Median:
1] When there is great variation among the items of population it fails
to give satisfactory answer. For example, if the marks obtained by
students are 0, 2, 2, 8, 12, 13, 35, 42, 44, 45, 45 then the median will
be 13. This can never be the representative of all, because there are
many who have obtained marks more than 40.
2] Arranging in ascending order will require much time.
3] It is affected by fluctuations of sampling.
4) Further algebraic calculations are difficult from median.
5] many times it falls between two values therefore can not be
expressed.
BIO – STATISTICS AND RESEARCH METHODOLOGY 96
Mode
When a value in a series occurs most frequently is called
Mode. It has maximum frequency. Mode is also known as „Norm‟.
According to Kenny and Reepura, „the value of the variable which
occurs most frequently in a distribution is called Mode.
Example: In a series – 6, 3, 4, 2, 4, 3, 4, 6, 4 we
Find that 4 occur 4 times. Therefore Mode is 4.
Calculation of Mode:
a) In Simple Series
b) In Discrete Series
c) In Continuous Series
a) Simple Series:
Example: In the Series 5, 2, 3, 5, 2, 5 find out Mode.
Here, 5 occur 3 times. Therefore Mode is 5.
b) Discrete Series:
If the distribution in Series is regular and any one maximum
frequency is known then mode can be calculated by Inspection
method as follows.
Inspection Method:
Example: The percentages of students in B.H.M.S are 55, 60,
50, 62, 64, 60, 61, 60, 60 find the mode.
Solution: All the data is arranging in ascending order.
We get 50, 55, 60, 60, 60, 61, 62, and 64
Then, make a grouped frequency table.
BIO – STATISTICS AND RESEARCH METHODOLOGY 97
Students in % Frequency
50 1
55 1
60 3
61 1
62 1
64 1
Merits of Mode:
It plays important role in business for forecasting process.
Meteorological forecasting is based on modal value.
It can be obtained simply by inspection method.
As it is the item of maximum frequency the same item is used
in every sample of the population.
Demerits of Mode:
In many cases we get Bimodal or Multimodal
values. Therefore single value cannot be
obtained.
Its calculations are not based on all values.
Further algebraic process is difficult to
calculate from mode only.
When there are small differences between the
observations mode should not be used.
BIO – STATISTICS AND RESEARCH METHODOLOGY 100
Chapter No. 5
DISPERSION
The value, which is away from the central value or average, is called
Dispersion. The measures devised for dispersion is known as
Measures of dispersion.
Measures of Dispersion:
Measures of Dispersion are of following types:
Range.
Semi – Inter Quartile Range or Quartile Deviation.
Mean Deviation.
Standard Deviation.
Variance.
Range:
It is the difference between the highest and the lowest terms of a
series of observations.
Range = XH – X L
Features of Range:
It is affected by fluctuations of sampling.
Its value usually increases with the increase in size of the
sample.
It is not useful in accurate studies because it gives a rough
answer.
It is not based on all the observations.
It is changes from one sample to other population.
Quartile Deviation:
The half distance between 75th percentile that is 3rd quartile
(Q3) and 25th percentile that is. First quartile (Q1) is called as Quartile
Deviation.
In normal distribution quartile deviation is called as Probable
Error (PE).
Q = (Q3 - Q1)
2
= 22 + 24 = 23
2
10, 12, 16, 18, 22, 24, 26, 34, 38, 40.
BIO – STATISTICS AND RESEARCH METHODOLOGY 104
= 84
10
= 8.4
= 8.4
23
= 0.3652
M.D. = ∑ IDI
N
BIO – STATISTICS AND RESEARCH METHODOLOGY 105
= 82
10
= 8.2
= 8.2
24
= 0.3416
1] Individual Series:
Example: The 5 patients show weights (in K.G.) as follows:
40, 42, 60, 58, 52 respectively.
Solution :
We have formula,
M.D. = ∑ IDI
N
BIO – STATISTICS AND RESEARCH METHODOLOGY 106
= 36
5
= 7.2
Mean Deviation = 7.2
2] Discrete Series:
Example: Find out Mean Deviation of the following distribution and
its coefficient.
Solution:
M.D = ∑ ƒ IDI
N
Calculation of Median:
Me = size of N + 1 th Item
2
= size of 31 + 1 th Item
2
= 16th item
Median = 3
= 1.74
3
= 0.58
BIO – STATISTICS AND RESEARCH METHODOLOGY 108
3] Continuous Series:
Example: Calculate Mean Deviation from Mean and its
coefficient of the following data:
N = 20
C = 40
X = 4 + 26 X 40
20
X = 56
= 9.2 = 0.1642
56
Weights 50 – 60 60 – 70 70 – 80 80 – 90 90 – 100
No. of patients 45 42 38 22 23
Solution:
Weights No. of C. ƒ. Mid points m -85 ƒ IDI
patients (m) IDI
50 – 60 45 45 55 30 1350
60 – 70 42 87 65 20 840
70 – 80 38 125 75 10 380
BIO – STATISTICS AND RESEARCH METHODOLOGY 110
80 – 90 22 147 85 0 0
90 – 100 23 170 95 10 2300
N = 170 ∑ƒIDI =
4870
Median = me = l1 + N - c. f
2 X i
ƒ
Where, l1 = 80
N/2 = 85
c.f. = 125
ƒ = 38
i = 10
We know me = 80 + 85 – 125 x 10
Formula, 38
= 69.47 weight.
BIO – STATISTICS AND RESEARCH METHODOLOGY 111
We know,
Mean deviation M.D. = ∑ƒIDI
N
Here, ∑ƒIDI = 4870 and N = 170
M.D. = 4870
170
= 28.64
Coefficient of M.D. = M.D.
Median
= 28.64
69.47
= 0.4122
4] Standard Deviation:
It is the square root of the arithmetic average of the squares of
the deviations measured from the mean.
It was first time introduced by Karl Pearson. It is widely used
in Statistics. It is also known as Root Mean Square Deviation. It is
denoted by Greek letter 6 (sigma).
Calculation of Standard Deviation:
A. Individual Series.
B. Discrete Series.
C. Continuous Series.
A] Individual Series:
Standard deviation may be calculated by 2 methods,
i) Actual Mean Method.
ii) Assumed Mean Method.
BIO – STATISTICS AND RESEARCH METHODOLOGY 112
Solution:
Value (X) X –X (x) x2
10 - 19 361
12 - 17 289
14 - 15 225
30 -1 1
26 -3 9
38 9 81
48 19 361
54 25 625
∑X = 232 ∑X 2 = 1952
Steps:
1] Calculate actual Mean of the observations.
2] Obtain deviations of values from the Mean that is calculate (x - x)
to get x.
3] Square the deviations to get ∑X 2
4] Divide ∑X 2 by number of observations.
We, know,
X = ∑X = 232
N 8
= 29.
Here, ∑x 2 = 1952 and N = 8.
6= ∑x 2 = 1952 = 244
N 8
BIO – STATISTICS AND RESEARCH METHODOLOGY 113
6 = 15.62
Calculation:
Stapes:
1] Calculate the deviation of the observations from an assumed mean
(X - A) to get d.
2] Obtain the total of d that is ∑d
3] Square the deviations and donate the total ∑d2
4] Apply the following formula:
6= ∑d2 _ ∑d 2
N N
Where, d = X –A
We know formula,
BIO – STATISTICS AND RESEARCH METHODOLOGY 114
6= ∑ d 2 - ∑d 2
N N
= 533 - 289
= 244
= 15.62
B] Discrete series:
Standard deviation can be calculated by the following methods:
i) Actual Mean Method
ii) Assumed Mean Method
Example: Calculate standard deviation of following data:
Family 1 2 3 4 5
Patients 4 6 5 2 8
Calculation:
Steps:
1] Calculate Mean and take the deviations of the items from actual
mean that is (X- X) to get x.
2] Square the deviations.
3] Multiply deviations with respective frequencies.
4] Make the total of deviations and frequencies that is ∑ƒx2
BIO – STATISTICS AND RESEARCH METHODOLOGY 115
5] Apply formula:
6 = ∑ƒx2
N
Solution:
Family No. of
(X) Patients ƒx X–X x2 ƒx2
(ƒ) (x)
Calculation of Mean:
X = ∑ƒx
N
Here, ∑ƒx = 79 and N = 25
X = 79 = 3.16
25
By Appling formula,
6 = ∑ƒx2
N
Here, ∑ƒx2 = 55.36 and N = 25
6= 55.36
25
= 2.2144
BIO – STATISTICS AND RESEARCH METHODOLOGY 116
= 1.48
Calculation:
Steps:
1] Take deviations of size from an assumed mean to get d.
2] Multiply these deviations by their frequencies
3] Multiply frequencies with squares of deviation
4] Obtain total that is ∑ƒd2
Apply formula,
6= ∑ƒd2 – ∑ƒd 2
N N
Where,
d = (X – A)
Here,
∑ƒd2 = 56 and ∑ƒd = 4 and N = 25
2
6= 56 – 4
25 25
= 2.24 – 0.0256
= 2.2144
= 1.48
C] Continuous series:
Here, also we can calculate standard deviations by two methods.
i) Actual mean method
ii) Assumed mean method
1] Actual mean method:
Steps:
1] Calculate actual mean of series that is X.
2] Take the deviations of mid- points from mean that is find (m – X)
to get x.
3] Square these deviations and multiply them by their frequencies.
4] Obtain ∑ƒx2.
5] Divide ∑ ƒx2 by total number of items.
Apply formula,
6 = ∑ƒx2
N
Where, x = (X – X)
Calculation:
Hb % No. of Mid
(X) patients points ƒm (m–X) ƒx ƒ x2
(ƒ) (m) x
6 – 10 4 8 32 -4 - 16 64
10 – 14 2 12 24 0 0 0
14 – 18 4 16 64 4 16 64
N = 10 ∑ƒm ∑ƒx= ∑ ƒ x2 =
= 120 0 128
Calculation of Mean:
Mean X = ∑ƒm = 120 = 12
N 10
Where, ∑ƒm = 120 and N = 10.
6 = 128
10
= 12.8
= 3.57
Steps:
1] Take the deviations of mid points from an assumed mean to get d.
2] Multiply these deviations by respective frequencies.
3] Calculate the total that is ∑ƒd.
4] Calculate the squares of the deviations that are d2.
5] Multiply these squared deviations by their frequencies that are ƒd2.
6] Obtain the total that is ∑ƒd2
Apply formula:
6= ∑ƒd2 - ∑ƒd 2
N N
We know formula,
6= ∑ƒd2 - ∑ƒd 2
N N
Where, d = (X –A)
∑ƒd2 = 128
∑ƒd = 0
N = 10
BIO – STATISTICS AND RESEARCH METHODOLOGY 120
2
6= 128 - 0
10 10
= 12.8 - 0
= 12.8 = 3.57
5. Variance:
Fisher first used the name variance.
The analysis of variance is the method by which one can
measure the significance of difference between several mean at one
time.
BIO – STATISTICS AND RESEARCH METHODOLOGY 121
Variance = (S.D.) 2
Or
S.D. = Variance
Or
Variance = ∑ (X – X) 2
N
Or
Variance = ∑X2
N
We know formula,
Variance = ∑ x2 = 55
N 5
= 11
Variance = 11
Coefficient of Dispersion:
Coefficient of dispersion is the ratio of a measure of
dispersion to the related measure of central tendency [Mean, Median
or Mode] Therefore we get following expressions:
Importance in Biostatistics:
Measure of dispersion is useful mainly in biological processes
of living organism than the nonliving, physical or chemical sciences,
HB%, RBC number, O2 or CO2 capacity are some of the examples
Where dispersion are widely used.
It is also used when cure or non-curable rate with
same drug varies in different patients even we consider same age and
sex. Example: Same individual shows different pulse rate in different
physiological conditions.
Chapter No. 6
BIO – STATISTICS AND RESEARCH METHODOLOGY 124
NORMAL DISTRIBUTION
Fig. 6.1
The normal curve or normal distribution is an important
concept in statistics For example we collect Hb values of many
BIO – STATISTICS AND RESEARCH METHODOLOGY 125
Y= N e - X 2/2 б 2
б2π
π = 3.14
б = Standard deviation of distribution
X = The deviation of any unit of measurement from the mean.
e = 2.7193, the base of the system of natural logarithms.
Z = ( x – x)
б
BIO – STATISTICS AND RESEARCH METHODOLOGY 127
Skewed Curve:
Any deviation from the mean value to either below or above
the normal distribution value results skewness of the frequency
distribution. We find such two curves as under:
Positively Skewed Curve.
Negatively Skewed Curve.
M= Me=Mo
Fig. 6.2
Positively Skewed Curve
Fig. 6.3
BIO – STATISTICS AND RESEARCH METHODOLOGY 128
Fig. 6.4
A distribution is said to be negatively skewed if the frequency
curve has a longer tail towards the lower values of x, that is, on the
left. In this case, Mean (M) < Median (Me) < Mode (Mo)
Example: When outcomes of an examination show exceedingly
good result the scores are found towards the higher end. Such skewed
is called negatively skewed curve. Here, mean falls to the left of
median.
BIO – STATISTICS AND RESEARCH METHODOLOGY 129
i) J-Shaped Curve:
Fig. 6.5
This typically J-shaped curve is seen during the drug proving
process, where the effects of an acute drug wear off after the initial
phase of pathogenesis. There is an increased frequency of events
happening in the initial phase and they disappear later.
Fig. 6.6
BIO – STATISTICS AND RESEARCH METHODOLOGY 130
Fig. 6.6
ii) Platy Kurtic Distribution
Here, the distribution is spread extensively so that the curve of
distribution looks like a Plateau.
BIO – STATISTICS AND RESEARCH METHODOLOGY 131
Fig. 6.7
Iii) Lepto Kurtic Distribution
Here, the range is very small, so that data is spread within a
close extent. The curve of this distribution looks tall.
Fig. 6.8
Skewness
Skewness means the lack of symmetry or any deviation from
symmetry. Measure of Skewness tells us whether the dispersion of
items from an average is symmetrical or asymmetrical. In skewed
distribution values of mean, median and mode do not coincide.
Skewness or Kurtosis may happen if
1) Selection procedure is prejudiced. For example, inadequate
sample size, homogenous group. etc.
2) Unsuitable statistical tests give wrong interpretation.
BIO – STATISTICS AND RESEARCH METHODOLOGY 132
Chapter No. 7
MEASURES OF LOCATION
2. Quintiles:
They are four numbers, which divide the whole distribution in
to 5 equal parts. So 20% percentiles falling or first quintile will have
20% observations falling to its left and 80% on its right side.
BIO – STATISTICS AND RESEARCH METHODOLOGY 133
3) Deciles:
They are nine in number, which divide the whole distribution
into ten equal parts. That means first decile or 10th percentile will
divide the distribution into 10% and 90%. While 9th decile will
divides into 90% and 10% and first decile will be as median which is
also a second quartile Q2.
Calculation of Percentiles:
A) Graphic Method:
Here, the values are to be calculated from cumulative
frequency group.
Example: Find out the location of a percentile in the range of a
variable from following table.
Weight of groups Frequency of each Cumulative
in kg. group class frequency
20 – 22 4 4
22 – 24 3 7
24 – 26 2 9
26 – 28 4 13
28 – 30 5 18
30 – 32 6 24
32 – 34 8 32
34 – 36 10 42
36 – 38 12 54
38 – 40 8 62
Total = 62
BIO – STATISTICS AND RESEARCH METHODOLOGY 134
B] Arithmetical Method:
Here the values are to be calculated from the cumulative
frequency where we have to find variable group in which the
particular observation lies and then raise the lower value of the
variable of that group proportionately to the value of that particular
observation.
= 34 + 29
10
= 36.9 kg.
= 23.46 kg.
Utility:
1) Location of percentile divides the frequency distribution into
two parts, which is sophisticated for further study.
2) Preparation of a standard percentile like Quartile (Q1) or
median (Q2) etc. Example: For age, sex reasons.
3) It is useful for comparison of one percentile value of a variable
of one sample with another sample drawn from same
population.
4) It is used for study in growths in children.
5) It can be used as a best measure of dispersion.
BIO – STATISTICS AND RESEARCH METHODOLOGY 138
Chapter No. 8
PROBABILITY
OR
BIO – STATISTICS AND RESEARCH METHODOLOGY 139
Where,
n= Experiment (exhaustive) that is total number of cases.
M= Favorable (to A) cases.
A= Event.
Laws of Probability:
1] Addition Law:
If the events are mutually exclusive, then the probability of
happening of any one of them is equal to the sum of the probabilities
of the happening of the separate events that is in other words if, E1,
E2, F3, En be „n‟ events and P (E1), P (E2),……P(En) of their
respective probabilities. Then,
P (E1 + E2 + E3 ……….. En) = P (E1) + P (E2) + P (E3)
Example: If the probability of student a passing the examination is
1/3 and the probability of the student „B‟ passing the same
examination is 1/5 then what is the probability that one of the student
will pass that examination?
P (A + B) = P (A) + P (B)
= 1 + 1 = 8 = 0. 53
3 5 15
= 1 x 1 = 1 = 0. 008264
13 17 121
1] Classical Probability:
It is the calculation of probability in situation where there is
easy to predict.
Example: What is a probability of finding a black pen from a group
of 7 colored pens?
2] Frequency Probability:
It is the calculation of chance of happing in more complex
situations. Therefore these calculations are based on previous
observations and experience. Therefore it is also called as – Empirical
Probability.
It is based on two laws -Addition law and Multiplication law.
3] Conditional Probability:
It is the type of happening of an event, which is based on the
presence or absence of another event as a condition.
Probability Distribution:
There are 4 common probability distributions, which is a
very useful in operation research study.
i) Binominal distribution:
Consider, that a physician examines a certain group of patients
„n‟ items each. The fraction of defective items in each group is
estimated from previous data to equal „p‟. Thus we have to determine
the probability density function (pdf) of the number of defectives in a
group.
BIO – STATISTICS AND RESEARCH METHODOLOGY 142
k
P x=k =λ e-λ
K!
Where k = 1, 2, 3….n.
And λ = the rate (number per unit time) at which the events occur.
iii) Negative exponential distribution:
It is the number of arrivals during a specified period, which
occurs according to a Poisson distribution then the distribution of the
BIO – STATISTICS AND RESEARCH METHODOLOGY 143
Where, E = x = μ (Mean)
Variance x = б2
It is useful in many random phenomena that occur in every day
practice like weights, where N (μ6) represents the normal distribute
on. The average of a sample taken from any distribution can always
be approximated by the normal distribution (as per central Limit
theorem).
BIO – STATISTICS AND RESEARCH METHODOLOGY 145
Chapter No. 9
Measurement in Epidemiology:
The first requirement for any measurement in Epidemiology is
what is to be measure that is collection of data. To minimize errors in
classification of data Epidemiology need clear definition of a disease
it includes:-
a) Measurement of Mortality.
b) Measurement of Morbidity.
c) Measurement of Disability
d) Measurement of Natality.
e) Measurement of Demographic Variables
BIO – STATISTICS AND RESEARCH METHODOLOGY 146
1) Rate:
It is measure of the occurrence of some particular events in a
population during a given time period. It indicates the change in some
events that takes place in a population over a period of time.
Example: Death rate = Number of deaths in one year X 100
Mid- year population
The categories of rates are as follows:
i) Crude Rates:
These are the actual observed rates such as the birth and death
rates.
ii) Specific Rates:
These are the actual observed rates due to specific causes. E.g.
Leprosy occurring in specific groups. (Age, sex group) or during
specific time. (Weekly, monthly, annually) etc.
iii) Standardized Rates:
It is mainly used to compare the death rates of two populations
with different age composition. Standardization is carried out by one
BIO – STATISTICS AND RESEARCH METHODOLOGY 147
2) Ratio:
A ratio is a measure of disease frequency. It denotes a relation
in size between two random quantities. In other words ratio is a result
of dividing one quantity by another.
3) Proportion:
A proportion is a ratio, which indicates the relation in
magnitude of a part of the whole. It is expressed as a percentage.
Ex. The number of males with hypertension at s certain time X 100
The total number of males in a city at the same time
v) Survival rate:
It is the proportion of survivors in a group of patient‟s studied
and followed over a period (say 5 yrs).
It is mainly used to know prognosis of that disease For
example: In Cancer patients who is useful for further medication.
Survival rate = Total number of patients alive after 5 yrs x 100
Total number of patients diagnosed or treated
Measurement of Morbidity:
The WHO expert committee on health statistics noted in its 6th
report that morbidity should be measured in terms of:
a) Person who were ill.
b) The illnesses that these persons experienced.
c) The duration of illness.
Morbidity is commonly measured by morbidity rates / ratios,
frequency, duration and severity. Disease frequency is measured by
Incidence and Prevalence rates.
BIO – STATISTICS AND RESEARCH METHODOLOGY 150
1) Incidence Rate:
It is the number of new cases occurring in a population during
a specified period of time.
Incidence rate
= Number of cases of specific disease during a given time period x 1000
Population at risk during that period
Incidence rate is important for:
i) To control disease.
ii) For research into etiology, pathogenesis and distribution of
diseases.
iii) For preventive and therapeutic measures.
iv) Incidence rate is mostly considered in acute diseases.
Utility of Incidence:
1) It is useful to determine whether a disease of unknown
etiology is communicable or not and in evaluating the
effectiveness of control measures like immunization and
isolation.
2) It is useful in infectious diseases in which the primary care is
effective for only a short period of time.
3) It measures the spread of an infection in a family or
community.
Prevalence:
It is the total number of all individuals who have a disease at a
particular time divided by the population at risk of having the disease
at this point in time or mid-way through the period. Prevalence is of
two types:-
a) Point Prevalence
b) Period Prevalence
a) Point Prevalence:
It is the number of all current cases (old and new) of a disease
at one point in time in relation to a defined population.
Point prevalence =
Number of all current cases (old and new) of a specified
Disease exiting at a given point in time x 100
Estimated population at the same point in time
BIO – STATISTICS AND RESEARCH METHODOLOGY 152
b) Period Prevalence:
It is the frequency of all current cases (old and new) exciting
during a defined period of time (e.g. Monthly prevalence) in relation
to a defined population.
Period prevalence
= Number of exciting cases (old and new) of a specified disease
During a given period of time interval x 100
Estimated mid – interval population at risk
Uses of Prevalence:
1) Prevalence is used to estimate health or disease problems in
community and identify high-risk population.
2) Prevalence rates are useful for administrative and planning
purposes.
3) It is used for rehabilitation purpose.
Risk Factors:
Where the disease agent is not known the etiology is generally
discussed in terms of „Risk factors‟.
BIO – STATISTICS AND RESEARCH METHODOLOGY 153
Screening Tests:
It is an initial examination and not a diagnostic test except
anemia and glucose tolerance test, which are used both for screening
and diagnosis.
1) Mass Screening:
Mass screening is nothing but the screening of a whole
population. E.g. screening for tuberculosis.
2) Selective (High-risk) Screening:
Here, the screening tests are selectively applied to individuals
in high-risk group.
Example: Screening for Cancer of cervix in the lower social groups.
Risk factors especially those of a pathophysiological in nature are
identified and then preventive measures can be applied before actual
disease occurs.
3) Multiphasic Screening:
It is the application of many screening tests to a large number
of peoples at a same time for many diseases.
i) Sensitivity
It is the ability of a test to identify correctly all those who have
the disease. Yerushalny introduces it in 1940.It is a statistical index of
diagnosis. It is expressed in percentages. (E.g. 80%, 90% etc.)
ii) Specificity:
It is the ability of a test to identify correctly those who do not
have the disease.
It is expressed in percentage.
iii) Predictive Value:
It reflects the diagnostic power of the test. It depends upon
sensitivity, specificity and prevalence of a disease.
It is either positive or negative.
The positive test indicates the probability that a patient with a
positive test resulting in disease.
iv) False Percentage:
It is either negative or positive as follows:
a) False Negative:
It means that patients who actually have the disease are told
that they do not have the disease. Therefore here many patients do not
receive proper treatment at right time. The lower the sensitivity, the
larger will be the number of false negative patients.
b) False Positive:
It means that patients who do not have the disease are told that
they have. Here normal healthy peoples may be screened. A screening
test with a high specificity will have few false positives.
BIO – STATISTICS AND RESEARCH METHODOLOGY 158
Calculations:
1) Sensitivity
= diseased peoples whose screening test is positive x 100
(Disease peoples whose + (Diseased peoples
Screening test is positive) whose screening test is negative)
2) Specificity
= Non diseased peoples whose screening test is negative x 100
(Non-diseased peoples whose + (Non diseased peoples
Screening test is positive) whose screening test is negative)
BIO – STATISTICS AND RESEARCH METHODOLOGY 159
Chapter No. 10
LIFE TABLE
Utility:
1] To compare health status of different countries.
2] To identify the handicapped groups.
3] It plays important role in health interventions or levels of
preventions.
4] It provides data for health planning and programs.
Heath Expectancy :
There are many factors like social, economical, and cultural
that affects indirectly on health, which was not included in disability
adjusted life years approach. Therefore WHO Advisory Committee on
Health Research creates a new framework that is International
classification of impairments, disabilities and handicaps to ass‟s
health expectancy considering above-mentioned factors.
BIO – STATISTICS AND RESEARCH METHODOLOGY 162
Chapter No. 11
Theoretical Error:
As the researcher do not have enough theoretical knowledge
and information regarding the area of studies. He may choose
BIO – STATISTICS AND RESEARCH METHODOLOGY 163
Methodological Errors:
During conducting any research by using only one type of
methodology is not sufficient for its internal and external validity.
Sampling Error:
These types of errors can be reduced if appropriate statistical
test is used and samples-representatives are increased in numbers. The
technique which is adopted for sampling plays important role in
experimental studies. (Provided equivalent groups considered)
While the size of sample plays important role in survey type of
studies-Epidemiological Studies.
During a survey we take only a small portion of a whole
population that is a sample where naturally a certain amount of error
will occur this error is called as „Sampling error‟. When sampling
error decreases sample size increases and vice versa.
Sampling errors are as follows:
i) Chance Error:
It occurs due to random sampling. In statistics we can accept it
up to 5% (that is if p < 0.05 it is significant). If it is more than 5%
(that is if p> 0.05, it is not significant) we reject it.
BIO – STATISTICS AND RESEARCH METHODOLOGY 164
Utility:
1. It is used to determine whether the sample is drawn
from known population or not.
2. It is used to find the standard error of difference
between two proportions to know if the observed
difference between the proportions of two samples is
statistically significant or not.
3. During survey it helps to find sample size.
SE (d) = 1 1
X12 + X2 2 N1 + N2
N1 + N2 -2
S.E of difference = P1 q1 + P2 q2 OR
Between two proportions n1 n2
1 + 1
PQ n1 n2
Utility:
It is used to test the significance of difference between two
proportions or ratios to find out if the difference between the two
proportions or ratios has occurred by chance.
Measurement Error:
Human variables are measured with the manifestation of their
behaviors (symptomatology) of an individual, different groups
expresses different traits and behaviors. For example: Measurement
of intelligence, success or failure rate etc. Therefore to avoid these
types of errors
- Large samples should be considered which also automatically
eliminates sample errors.
- Collection of data is done by employing appropriate tests.
- Nature of variables should be defined.
- All instruments which are used should be examined by
researcher carefully in the terms of its reliability, validity
and appropriateness for the study.
Statistical Error:
Many behavioral sciences required statistical
techniques. If an appropriate test is not used for analysis of data and
interpreting conclusion it results into statistical errors. Therefore care
should be taken to avoid these errors. For example:
1) Parametric statistics should be used in samplings procedure,
collection of data.
BIO – STATISTICS AND RESEARCH METHODOLOGY 168
Interpretation Error:
Interpretation is an individualistic entity. Though
Homoeopathic science is based on individualization concept, the
researcher should not state something beyond the statistical data.
His interpretative data must be relevant to concerned variable. For
example: When we the use non-parametric statistics, our results
should be interpreted as a description of a sample only and should
not be interpreted in terms of grand generalization. Scientific
words or terminologies used whenever necessary. Certain
evidence should be used for final interpretation.
Inferential Error:
While applying grand generalization one can use many
individuals as samples only. Few peoples do not always become ideal
representative of whole population. Samples should be taken
considering racial geographical and biological variation. We can use
small samples in experimental studies to show cause and effect
relationship of any disease We can not generalized it because may
diseases was multifactorial in origin, many environmental and the
hosts defense mechanisms and susceptibility plays important role in
development of disease mechanism. Therefore the inferences drawn
should be based on clinical findings and should not come in conflicts.
BIO – STATISTICS AND RESEARCH METHODOLOGY 169
Reporting Error:
Many times most of the errors are seen during reporting
research projects. These are either Bias or Prejudices, false analogy in
selecting a problem and considering methodology or the statistical
techniques are used without considering their assumption. We can
avoid these types of errors with the help statisticians and experts,
who easily identify these types of errors.
BIO – STATISTICS AND RESEARCH METHODOLOGY 171
Chapter No. 12
Types of Correlation:
There are 4 types of correlation. They are as follows:
A. Positive and Negative Correlation.
B. Simple and Multiple Correlations.
C. Partial and Total Correlation.
D. Linear and Non-linear Correlation.
BIO – STATISTICS AND RESEARCH METHODOLOGY 172
Degree of Correlation:
It is the intensity of relationship between two variables.
Classification of Degree of Correlation:
Degree of Correlation
Fig. No.12. 1
Fig. No.12.2
Absence of Correlation:
Here, there is no any relationship exists between variables.
Therefore these variables are not depending on each other.
Coefficient of Correlation:
It is a measure of tendency that is the degree to which the two
variables are interrelated which is measured by a coefficient, is called
Coefficient of correlation.
A] Scatter Diagram:
Here, by simply observation we get some idea about the
presence of correlation. Scatter diagram is in the form of plotted
points on a graph paper. The plotted points may be upward or
downward in direction is called positive or negative scattered
diagrams respectively.
r = + 1.
Here, we observe an upward trend indicate positive
relationship or correlation.
r=-1
Here, we find plotting on graph paper have downward trend indicate
negative correlation.
Degree of Correlation:
If the plotted points are not in a straight line but if we draw a
straight line in a middle of their points – regression line we will find
the points may be nearest to line or not- this kind of scatter diagram is
called degree of correlation. It may be of following types.
Degree of Correlation
Positive Negative
Solution:
Fig. No. 12.12
BIO – STATISTICS AND RESEARCH METHODOLOGY 181
From the above scatter diagram we find that the variables height and
weight are of high degree positive in nature.
Where, x = (X - X)
y = (Y - Y)
б x = Standard deviation of X - axis
б y = Standard deviation of Y - axis
BIO – STATISTICS AND RESEARCH METHODOLOGY 182
r = ∑xy
Nx ∑x 2 x ∑y 2
N N
OR
Students 1 2 3 4 5 6 7 8 9 10
Marks in 15 18 21 24 27 30 36 39 42 48
Anatomy
Marks in 25 25 27 27 31 33 35 41 41 45
Physiology
BIO – STATISTICS AND RESEARCH METHODOLOGY 183
Solution:
Calculation of coefficient of correlation:
Marks in Marks in
2
Anatomy X-X x Physiology Y–Y y2 xy
X x Y y
( X – 30) ( y – 33)
15 - 15 225 25 -8 64 120
18 - 12 144 25 -8 64 96
21 -9 81 27 -6 36 54
24 -6 36 27 -6 36 36
27 -3 9 31 -2 4 6
30 0 0 33 0 0 0
36 +6 36 35 +2 4 12
39 +9 81 41 +8 64 72
42 + 12 144 41 +8 64 96
48 + 18 324 45 + 12 144 216
2 2
∑x = 300 ∑x ∑y = 330 ∑y = ∑xy =
= 480 708
1080
Calculation:
Steps: 1) Calculate arithmetic means of X and Y series.
2) Find out deviation of X series to get x.
3) Square their deviation to get ∑x 2.
4) Find out deviation of Y series to get y.
5) Square their deviation to get ∑y2.
6) Multiply x and y.
7) Find out total that is x and y.
Apply formula, r = ∑xy
∑ x 2 x ∑y 2
BIO – STATISTICS AND RESEARCH METHODOLOGY 184
Calculation of Mean:
X= ∑x = 300 = 30
N 10
Y= ∑x = 330 = 33
N 10
C] Rank Correlation:
It was Charles Edward Spearman (1904) who developed a
formula for obtaining rank correlation. According to him if we want
to find if two characteristics A say smell of a sent and B say its color
are related or not.
Spearman‟s rank correlation coefficient is denoted by p (Rho) and
calculated by following formula.
r or p = 1 - б∑ D2
n (n2 – 1)
Solution:
Anatomy Physiology Rank Difference
(d)
P or R =1– 6 ∑ d2
n (n2 – 1)
=1– 6 x 40
5 (52 – 1)
Marks in 29 24 25 27 30 31
Materia Medica
Marks in 29 19 30 33 37 36
Repertory
Solution:
Sr. Marks in Rank Marks Rank d
No. Materia. R1 Repertory R2 R1 – R2 d2
Medica.
1 29 3 29 5 -2 4
2 24 6 19 6 0 0
3 25 5 30 4 1 1
4 27 4 33 3 1 1
5 30 2 37 1 1 1
BIO – STATISTICS AND RESEARCH METHODOLOGY 187
6 31 1 36 2 -1 1
∑ d2
=8
P = 1 - 6∑d2
n (n2 – 1)
=1- 6x8
6 (62 – 1)
=1- 48
6 (36 – 1)
= 1 - 48
210
= 1 – 0.228
= 0.78
2. Regression:
Regression means change in the measurements of variable character
either positive or negative side, beyond the mean.
Francis Galton introduced this concept. It is used
for estimating the unknown values of one variable from known values
of another.
Regression analysis is a method in which variables can be
determined by regression lines and be indicated by regression
analysis. Regression describes the functional relationship between
dependant and independent variables, which helps us to make
estimates of one variable from another.
Regression Line:
It is a line of points drawn in such a manner, which represent
the average relationship between the two variables such a line, is
called „Regression line‟. Here, it estimates the value of one variable
from the value of the.
When the regression lines show some similarity upward or
downward we can assess same trend or correlation between these two
variables. When the trend is upward the correlation is positive and
when it is downward the correlation is negative. In positive
correlation there may be high or low degree and in case of negative
correlation there may be high or low degree of correlation. If both the
BIO – STATISTICS AND RESEARCH METHODOLOGY 189
Regression Coefficient:
It is a measure of the change in one dependant character
with one unit change in the independent character. It is denoted by
letter „b‟.
As there are two regression lines so there are two regression
equations and therefore two regression coefficients are as follows:
X on Y regression equation = Xc = a + b y.
Y on X regression equation = Yc = a + b x.
Calculation of ‘a’ Value:-
The value of „a‟ can be calculated with the help of following
equation.
∑X = N a + b ∑ Y and ∑ XY = a ∑ Y + b ∑ Y 2
Calculation of ‘b’ Value:-
The value of „b‟ may be calculated with the help of following
equations.
∑Y = N a + b ∑ X and ∑ X Y = ∑ X + b ∑ X 2
BIO – STATISTICS AND RESEARCH METHODOLOGY 192
Coefficient of Correlation:
It is the square root of the product of the two regression
coefficients.
We can calculate coefficient of correlation from the equation
of regression coefficient.
Here, we have to understand two laws:
i) If regression coefficient has negative sign coefficient of correlation
will also be negative.
ii) If regression coefficients have positive sign coefficient of
correlation will also be positive.
It means that, both the regression coefficient have same sign
either + ve or – ve. Never both signs.
A) Normal Equation:
Example: Calculate regression equation of the following data:
X 1 2 3 4 5 6 7 8 9
Y 1 2 3 4 10 13 15 16 17
Solution:
X Y x2 y2 xy
1 1 1 1 1
2 2 4 4 4
3 3 9 9 9
4 4 16 16 16
5 10 25 100 50
6 13 36 169 78
7 15 49 225 105
8 16 64 256 128
9 17 81 286 153
∑X= ∑ Y = 81 ∑ x2 = ∑y2= ∑ xy =
45 285 1069 544
---(2)
Substituting the values Substituting the values
45 = 9 a + 81 b ------ (1) 81 = 9 a + 45 b ------ (1)
544 = 81 a + 1069 b --- (2) 544 = 45 a + 285 b ------ (2)
Multiplying the equation (1) Multiplying the equation (1)
by 9 by 5
405 = 81 a + 729 b --- (3) 405 = 45 a + 225 b ----- (3)
544 = 81 a + 1069 b --- (4) 544 = 45 a + 285 b ---- (4)
b = 0.40 b = 2.31
Substituting value of „b‟ in Substituting value of „b‟ in
st
equation (1) equation (1)st
45 = 9 a + 81 (0.40) 81 = 9 a + 45 (2.31)
45 = 9 a + 32.4 81 = 9 a + 103.95
9 a = 45 – 32.4 9 a = 81 – 103.95
9 a = 12.6 9 a = - 22.95
a = 1.4 a = - 2.55
Now the regression equation Now the regression equation
X on Y is Y on X is
X c = 1.4 + 0.95 Y. Y c = - 2 .55 + 2.31 X.
BIO – STATISTICS AND RESEARCH METHODOLOGY 195
X 1 2 3 4 5 6 7 8 9
Y 1 2 3 4 10 13 15 16 17
Solution:
X Y X–X x2 Y–Y y2 xy
(x) (y)
1 1 -4 16 -8 64 32
2 2 -3 9 -7 49 21
3 3 -2 4 -6 36 12
4 4 -1 1 -5 25 15
5 10 0 0 1 1 0
6 13 1 1 4 16 4
7 15 2 4 6 36 12
8 16 3 9 7 49 21
9 17 4 16 8 64 32
∑X= ∑Y= ∑x= ∑ x2 = ∑y= ∑ y2 = ∑xy
45 81 0 60 0 340 = 139
X - X = r 6 x (y – y ) Y- Y = r 6 y (x – x )
6y 6x
x = 45 = 5 and y = 81 = 9 and
9 9
Hence, Hence,
X – 5 = 0.40 (y – 9) Y – 9 = 2.31 (x – 5)
X – 5 = 0.40 y – 1.44 Y – 9 = 2.31 x – 11.55
X = 0.40 y – 1.44 + 5 Y = 2.31 x – 11.55 + 9
X = 3.56 + 0.40 y Y = -2.55 + 2.31x
X–X бx
= r ( Y– Y ) Y- Y бy
= r ( X– X )
бx бx
r бx =bxy r бy =byx
бy ∑dxdy бx ∑d x x ∑dy
bxy= ∑dxdy- N byx= ∑dxdy- N
∑ d y2 - (∑d y)2 ∑ d x2 - (∑d x)2
N N
= 130– ( 9) ( - 9) = 130 – ( - 9) ( - 9 )
9 9
349 – (- 9)2 69 – (- 9 )2
9 9
= 130 –(-9) = 0.40 = 130 –(-9) = 2.31
349 - 9 69 – 9
X= 45 = 5 Y = 81 = 9.
9 9
Hence, X – 5 = 0.40 (Y – 9) Hence, Y – 9 = 2.31 (X – 5)
BIO – STATISTICS AND RESEARCH METHODOLOGY 198
Chapter No. 1
RESEARCH
According to WHO:
Research is a quest for knowledge through diligent search or
investigation or experimentation aimed at the discovery and
interrelation of new knowledge.
According to C.A Moser:
Systematically investigation to gain knowledge about
phenomena and problems.
According to E. Bogardus:
Research is the investigation of the underlying process
operative in the lives of persons who are in association.
According to P.V. Young:
We may define research as scientific undertaking which, by
means of logical and systematized methods, aims to discover new
facts or verify old facts and to analyze their sequences,
interrelationship, causal explanations and the natural terms which
govern them.
According to F. L. Whitney:
Research includes a study of human group relationship.
BIO – STATISTICS AND RESEARCH METHODOLOGY 203
d. Action Research:
According to Jane Franseth, Action Research is a systematic
examination conducted by individuals or groups studying their own
practices in search of sound answer to unsolved problem in their work
and aimed at improving their own performance on their own jobs.
The concept of action Research is based on the modern
human organization theory. The research has the capacity to solve the
problem and take the decision. Thus they improve and modify their
practices.
Action research is derived from the Social Psychology, where
Kurl Lenin explains the space in terms of person and goal.
The action Hypothesis is formulated on the basis of the cause
of the problem. Action Hypothesis requires one design of research
where one Hypothesis is tested at one time only. The action research
project is evaluated by the researcher himself and no external
evaluation is required.
Methods in Research:
Researchers applied either inductive or deductive logic while
conducting any type of research.
i) Inductive Method: where the conclusions are drawn from
particular to general premises.
ii) Deductive Method: where the conclusions are drawn from
general to particular premises.
Characteristics of Research:
Research aims at finding out the new facts.
BIO – STATISTICS AND RESEARCH METHODOLOGY 205
Research Methods :
These are techniques, which are used for conduction of
research by researcher.
Example: Collection of data, Analysis of data and evaluation of data.
Research Methodology:
It is the way to solve research problem systematically and
logically. Research method is one part of research methodology.
The following are some important methods followed in
research:
i] Historical Methods:
It is concerned with past events. Here the information or data
is systematically collected, analyzed, verified and reconstructed about
past events.
The aim of historical research is to show the importance of
past events in present situation. Here, the research solely depends
upon secondary data such as, library materials like, books, magazines,
Journals, publications, Historical records etc.
Therefore the result depends upon each individualistic
interpretations, reasoning and art. There are different conclusions
from same events. Therefore precautions should to be taken for
accuracy.
Knowledge of past is always beneficial as follows:
1) To be aware of previous mistakes and fallacies.
2) Study of evolution of civilization.
3) Discarded concepts in past became useful today.
4) Arrangement and management of future investigation.
BIO – STATISTICS AND RESEARCH METHODOLOGY 207
Social Research:
It is the systematic method of discovering new facts or
verifying old facts, their sequences, interrelationships, causal
explanations and the natural laws, which govern them. It is based on
the scientific investigation and behavioral science.
Social research therefore attempts to reveal the cause and
effect relationship exiting in various social phenomena.
b) Supportive Environments:
A researcher should provide following things for research:
i) Library and documentation facilities.
ii) Computers and Internet facilities.
iii) Recognition and appreciation.
iv) Financial support.
v) Team support.
vi) Application of research and feedback.
Research Approaches:
Two basic approaches to researches are as follows:
I. Quantitative Approach:
Here, formulation of data is in quantitative form. This
approach can be subdivided into,
i) Inferential Approach.
BIO – STATISTICS AND RESEARCH METHODOLOGY 212
1. Research Problems:
This is the first step in any type of research. The researcher
should start the research problem which he selected for the study.
Here the researcher should also explain the importance of his research
to community or health activities or how his study will be useful for
Research workers, Medical students, Government and other
professionals.
2. Review of Literature:
Dr. Park says, „Those who fail to read history are destined to
suffer the repetition of its mistakes‟. Therefore study of old literatures
is necessary for research. Literature provides us very useful data for
research. Example: We can use source books for descriptive type of
study. Here we can also find out whether or not others have
investigated the same research previously.
3. Research Hypothesis:
Hypothesis is an important step for researcher because here
the researcher predicts the expected results. Therefore it should be
systematically and logically formulated. It guides the researcher
during whole research process by focusing his attention to more
important events. Hypothesis should be formulated only after
BIO – STATISTICS AND RESEARCH METHODOLOGY 215
6) Testing of Hypothesis:
After analysis, Coding and Editing the Hypothesis is to be
tested by tests of significance. The selection of statistical test depends
upon type of research. These tests will prove or disprove our
hypothesis. That is, these tests will decide whether research should be
accepted or rejected. The tests used for this purpose are Chi-square
test (x2), Z – Test, Student (t) test, F-test, ANOVA tests etc.
7) Conclusion:
After testing the hypothesis, the researcher should draw some
conclusion and prepared the project for presentation. Grand
generalization should be done if necessary.
8) Presentation:
This is the last step in research process. In the final report,
Title of our research, Forward, Acknowledgement and Contents
should be presented in first part. The second part should contain
Introduction to research, Actual research process, Research methods
and Methodology, Utility of research and Conclusion.
At the end it should contain Bibliography, Index and
Appendix (if required).
Research in Homoeopathy:
1. Basic or Fundamental Research:
i. Unexplained concepts like Miasms, Potentisation, and Vital
Force. Etc. is to be explained on the basis of Modern Science.
ii. Confirmation of cardinal principles of Homoeopathy
Example: Law of Similia.
2. Pharmacological Research:
Experimental type of research was also done by Dr.
Hahnemann in 1790. It includes:
i. Drug Proving.
ii. Clinical trial of accidental observations of certain drugs which
produces unwanted effects when administered to patients.
iii. Clinical verification and confirmation of symptoms of proved
drugs.
3. Clinical Research:
(i) Animal experimentation to asses Structural or Lethal –
Histopathological changes at Constitutional, Organ, and
Tissue and at Molecular level.
BIO – STATISTICS AND RESEARCH METHODOLOGY 218
Chapter No.2
RESEARCH DESIGN
Clinical Field Community Before and after comparison before and after
Studies without control compassion
Studies with
Control
Experimental Designs
Experimental studies are carried out under the direct control of
the investigator. Experimental studies are of two types:
A) Randomized Controlled Trials.
B) Non – Randomized (non-experimental) Trials.
Select sample
Make exclusions
Randomize
Interpretation of outcome
BIO – STATISTICS AND RESEARCH METHODOLOGY 222
1. Preparation of a Protocol:
The protocol consists of aims and objectives of the study,
criteria for selection of study, size of the sample, control groups, and
the procedure to be applied.
Sometimes, Pilot studies (Preliminary tests) have to be made
before preparation of a protocol to ass‟s efficiency of certain
procedures.
The protocol prevents bias and reduces the sources of errors.
2. Selection of Population:
i) Selection of Target Population:
It is the population to which findings of a trial are applicable.
It is also called as Reference population. It may be whole community,
village, city or a group of workers or students.
ii) Selection of Study Population:
It is actual population, which participates in experiments
derived from target population. It has some characteristics as that of
target population as it is selected randomly from them. During
selection the researcher should fulfill certain ethical criteria‟s.
3. Randomization:
It is a statistical process by which the population is divided
into two groups a study group and a control group. It is the chief
procedure of a control trial. It is necessary that certain variables e.g.
age, sex are to be classified within each group and then randomly
selected for study. Using a table of random numbers usually
BIO – STATISTICS AND RESEARCH METHODOLOGY 223
4. Manipulation:
Here the researcher manipulates the study group by
application or reduction of factors e.g. any medicine, vaccine, dietary
factors etc. as per study. Then final outcomes are measured.
5. Follow Up:
Evaluation of symptoms during follow up is very important
step during study. It may be short or may require many days
depending upon the study. Here one must consider the factors, which
are responsible for attrition like any ones death, migration,
psychological disturbance etc.
Here, examination of both the groups is done under same
circumstances.
6. Interpretation of Outcome:
The last step is the assessment of the outcome of the trial in
the terms of positive or negative conclusions. The result is compared
in both the groups and if differences found it should be tested by tests
of significance.
Many times bias may arise during entire study process.
Therefore to reduce these biases a technique is used called as
„Blinding‟.
BIO – STATISTICS AND RESEARCH METHODOLOGY 224
Blinding:
It can be done in 3 ways as follows:
i) Single Blind Trial:
Here, the participant is not aware whether he belongs to study
or control group.
ii) Double Blind Trial:
Here, both the researcher and participant are not aware of
group and treatment received.
iii) Triple Blind Trial:
Here, the participant, researcher and the person analyzing or
processing the data are not aware of any group or treatment received.
It is an ideal method of blinding.
a) Therapeutic Trial:
According to Austin Hill, Therapeutic trial is a carefully and
ethically designed experiment with the aim of answering some
precisely framed question. It is most rigorous form. It demands
BIO – STATISTICS AND RESEARCH METHODOLOGY 226
Field Trials:
These trials are carried out in a field or in hospitals. Example:
Vaccination or contraceptive trials as like clinical trials.
It requires large number of individuals and more time.
BIO – STATISTICS AND RESEARCH METHODOLOGY 227
Samples
i) Preventive Trials:
Here, primary preventive measures are mostly considered e.g.
trials of different vaccines and chemo prophylactic drugs. These trials
should be applied to groups than to individuals for accurate results.
ii) Risk Factors Trials:
Here the researcher disturbs the series in the development of
disease of those individuals who have risk factors for developing the
disease. For e.g. the risk factor for oral cancer is tobacco chewing.
BIO – STATISTICS AND RESEARCH METHODOLOGY 228
3. Community Trials:
In a community trial the unit of randomization is a group of
peoples or a community. Here, one community receives intervention
(Study group) and the other group should not receive intervention
(control group). Here, we study and compare the results of study
group and control group.
2) Natural Experiments:
Here researcher makes different groups, which are
corresponding to natural phenomena‟s. It is used where experimental
studies are difficult to study in human beings.
Therefore peoples have separated into two groups naturally.
E.g. smokers and non-smokers, where researcher confirmed his
hypothesis regarding lung cancer and smoking.
BIO – STATISTICS AND RESEARCH METHODOLOGY 230
3) Comparison Studies:
These are the community trials having following two groups:
Block Design :
i) Random Block Design (R.B. design):-
Here, the peoples are divided into certain groups or blocks.
The groups have same similarity at certain variables. Then one subject
from each group is selected randomly. Random block design can be
analyzed by two way ANOVA test.
ii) Latin Squares Design (L.S. Design):
This type of design is mainly used in Agricultural fields
because the crop production is mainly depends on environment that is
BIO – STATISTICS AND RESEARCH METHODOLOGY 231
1] Descriptive Design:
i) Cross – Sectional Study Design:
This is an examination or cross section of a population in a
defined area. It is also called as „prevalence studies‟. Here the
investigator measures the disease.
Utility:
It is used to find the prevalence rate of disease.
To formulate the etiological hypothesis.
It is also useful for screening of population groups
for undiagnosed disease.
Exposure No Exposure
After observation
Utility:
It is nothing but background looking study.
It is mainly used for epidemiological studies.
It is used for establishment of cause of disease by
investigating the association between risk factors and
occurrence of disease.
ii) Prospective Cohort Study Design:
It is a case control study design having forward-looking study.
Cohort studies are used to test the hypothesis. (Cohort is a group of
people having common characteristics).
Here first of all we select target population. From this we
select study population by random sampling method. From study
group we select two cohorts one is exposure to risk factors and other
is not exposed. After some period we analyze the data and find how
many peoples developed disease from exposed and non-exposed
peoples.
Target Population
Study population (selected randomly)
Exposed group not exposed group
Disease Disease
C) Experience Surveys:
It is the survey of peoples who had experience with the
problem to be studied. Here the researcher should select skilled
persons for study. The purpose of this survey is to obtain insight into
the problem.
Chapter No. 3
ANALYTICAL STUDIES
Basic Steps:
Following are the basic steps in conducting a case control study:
1) Selection of cases and controls – collection of data.
2) Matching.
3) Measurement of exposure.
4) Analysis and interpretation.
1. Selection of Cases and Controls:
a) Selection of Cases:
The cases may be selected from hospitals or general
population during a specified period of time. The cases should be
representative of all cases in the community.
b) Selection of Controls:
The prerequisite criteria for controls are that it should be free
from any disease. Many diseases have subbed clinical nature therefore
such control individuals excluded.
BIO – STATISTICS AND RESEARCH METHODOLOGY 239
2. Matching:
It is the process by which we select controls in such a way that
they are similar to cases in certain variables which influences disease
process.
Suspected etiological factors should not be matched because
matching lead elimination of etiology.
Types of Matching:
I) Group Matching:
Here groups are matched with each other, which are similar in
their characteristics.
ii) Pairs Matching:
Here, for each case a control is selected mostly similar in
variables such as age, sex, duration, and intensity of symptoms and
grading of disease.
3. Measurement of Exposure:
Information about exposure should be collected in the same
manner both for cases and controls by interviews, questioners and
from previous records.
4. Analysis:
BIO – STATISTICS AND RESEARCH METHODOLOGY 240
B] Cohort Study:
Synonyms: Prospective study, longitudinal study, Incidence study,
forward – looking study.
It is one of the types of observational study, which is
undertaken to obtain additional evidence to support the existence of
the relation between suspected cause and disease.
Meaning of Cohort:
It is a group of people who have common characteristics
within a defined time period.
Example: A peoples exposed to a common infection within a defined
period is called, ‟Exposure Cohort‟.
The basic approach in Cohort studies is to work from cause to
effect. ( In control study – effect to cause)
Indications for Cohort Study:
1) When there is good evidence of an association between
exposure and disease.
2) When exposure is rare but incidence of disease is high.
BIO – STATISTICS AND RESEARCH METHODOLOGY 242
a) General Population:
When the exposure or cause of death is fairly frequent in the
population cohorts should be selected from the general population. If
the population is very large an appropriate sample is taken and the
result can be generalized to the population.
2. Collection of a Data:
Information about exposure may be obtained from:
a) Cohort Members:
Through personal interviews or mailed questionnaires.
b) Review of Records:
Much information can be obtained from medical records.
E.g. Records of Surgery or Medicinal treatment.
c) Medical Examination or Special Test:
Some types of information can be obtained by medical
examination or special tests. E.g. Electrocardiogram, Blood
sugar level etc.
d) Environmental Surgery’s:
Here, information can be obtained form environment where
the Cohort lived or worked.
BIO – STATISTICS AND RESEARCH METHODOLOGY 245
Chapter No. 4
HYPOTHESIS
An Ideal Hypothesis:
A good hypothesis should have following characteristics.
It should be simple and easily comprehensible.
BIO – STATISTICS AND RESEARCH METHODOLOGY 249
Sources of Hypothesis:
Portrait of Hypothesis depends upon its sources. There fore
researcher should pay more attention towards its sources. The
important sources are as follows:
i) Historical background of concerned subject
ii) Doctrine of Analogy explains relation between histories to present
Study.
iii) Observations, Results, Conclusions, Findings of other studies.
iv) Cultural / Empirical / Scientific Theories.
Role of Hypothesis:
1. It serves as a chain between theory and investigation.
2. It helps in holistic study as it links facts and information together
After organization.
BIO – STATISTICS AND RESEARCH METHODOLOGY 250
2) Relational Hypothesis:
These are the propositions, which state the relationship
between two or more variables. They are related with analytical
variables – dependant and independent variables. Here, independent
variable has effect on the dependant variable where the number of
variables, which can be studied and separated, is limited only by
theory. Here we study how the independent variable influences the
dependent variable and how they are related to each other.
BIO – STATISTICS AND RESEARCH METHODOLOGY 251
3) Null Hypothesis:
If there is no difference between the two samples is called
as „Null hypotheses. If we get any difference it is only due to chance.
By applying statistical tests we can either reject or accept the
hypothesis. If null hypothesis is rejected then the alternative
hypothesis must be accepted.
Testing of Hypothesis:
Hypothesis testing is a process of deciding statistically
whether the findings of a research show chance or real effects at a
given level of probability. Therefore hypothesis testing is depending
on probability theory and sampling. It is just a procedure used to
obtain a result.
Descriptive studies do not require testing, as this research
describes the characteristics of certain population only.
Diagram for hypothesis testing:
Calculation of probability
BIO – STATISTICS AND RESEARCH METHODOLOGY 252
Verification of probability
S.E.D. = 62 1 + 622
n1 n2
= 32 + 42
100 100
= 9 + 16
100 100
= 0. 25
S.E.D = 0.5
Z = Observed difference
S.E.D
= 95 - 80
0.5
Z = 30.
BIO – STATISTICS AND RESEARCH METHODOLOGY 257
Chapter No. 5
COMPUTERS IN RESEARCH
History of Computer:
Prof. Charles Babage (Cambridge University) is called
father of modern computers.
1) 1942-1955:
Vacant tubes were being used in computers which controls
electronic excitation. Here thousands of tubes are used. Therefore it
generates large amount of heat. In 1947 Transistor was investigated.
BIO – STATISTICS AND RESEARCH METHODOLOGY 258
2) 1955-1964:
Here transistors are used instead of vacant tubes which
required air conditioned environment which is costly.
3) 1964-1975:
During this period, micro electrons are investigated.
Therefore small electric silicon chips were used.
4) 1975-2006:
Here Small Scale Integration (SSI) technique is used where
100 parts are situated on a small chip is called Medium Scale
Integration. (MSI) Now days there are 30000 parts are situated on a
single silicon chip.
5) 2006-2007:
Japan started research on computer technology. They
investigated a software named as, Prolog (logical programming)
where a man can talk with the computer.
Features of a Computer:
Computer has following features:
Speed: Computer does calculations at very fast speed within 5 nano –
seconds (1 nano – second = 0.000,000,001 sec.)
Accuracy: It has high degree of accuracy and very less error and for
this error computer provides automatic error checking. There is
arrangement of alarm. With this the source of error can be easily
identified.
Memory: A computer has its own memory. It has capacity to
memorize large number of information and when there is need it can
use as it is.
BIO – STATISTICS AND RESEARCH METHODOLOGY 259
Functions of a Computer:
A computer performs many functions as follows:
i] Data Management:
The collected data can be systematically arranged by
computers and whenever necessary they meet the specific demand.
ii] File Management:
The stored information, which is complex in structure,
becomes easily comprehensible by separating them into number of
files.
BIO – STATISTICS AND RESEARCH METHODOLOGY 260
iii] Summarization:
With the help of computers a large mass of data we can
summarized into a short passage within a few seconds.
iv] Data Analysis:
In the process of data analysis, cost and time is considerably
reduced and therefore analysis becomes easily understandable by
using computer.
v] Data Presentation:
With the help of a computer we can present data in many
different ways for e.g. Data can be summarized as Tabulation,
different Graphs, Bar diagrams, Maps, Pictographs, Histograms etc.
vi] Predictive :
We can pretend many future trends and events with available
source of data.
vii] Decision:
Computer can take logical decision on various aspects of
research.
viii] Biostatistics:
We can use Computer in analysis of variance (ANOVA)
calculations. With the help of EDP machine (Electronic Data
Processing) we can analyze our data statistically.
ix] Chaos Theory:
It can be applied in hospitals for ventricular fibrillation and
mental disorders.
BIO – STATISTICS AND RESEARCH METHODOLOGY 261
Utility of Computers:
Important applications of computers in different fields are
summarized as follows :
Government administrative processes.
In the field of Commercial Banks.
Transport and Communication.
Health Department.
Business and Industry.
Scientific Research
Disadvantages of Computer:
The limitations of computer are as follows:
1) A computer should not be regarded as substitute of human
brain. Without instructions of an operator it cannot work.
BIO – STATISTICS AND RESEARCH METHODOLOGY 262
Chapter No. 6
LABORATORY TESTS
Sensitivity = TP TP
X 100
TP+FN TP+FN
BIO – STATISTICS AND RESEARCH METHODOLOGY 266
Specificity = TN TN
X 100
TN+FP FP+TN
our scope and limitations of our science. Following tests are widely
used in clinical practice:
Blood Analysis:
i) Hemoglobin:
Estimation of Hb% is essential for the diagnosis of anemia.
ii) White Blood Cells (W.B.C.):
White blood cell count is essential for the diagnosis of
metabolic disorders like Yellow atrophy of liver, Uremia, Diabetes,
Acidosis, Gout, and Eclampsia etc. In certain skin diseases like Kala
azar basophils may increase. In any suspected case of infection or
allergy to drugs the total and differential leukocyte count must be
examined to find out the type of infection and to differentiate it from
other conditions.
iii) Erythrocyte Sedimentation Rate (ESR):
ESR is increased during pregnancy, and in conditions like
tuberculosis, rheumatoid arthritis, rheumatic fever, and malignancy. It
is of more prognostic than diagnostic.
iv) Examination of Platelets:
Bleeding time, coagulation time, prothrombin time etc. should
be examined in all bleeding disorders.
v) Blood Grouping:
Investigation of blood group and Rh. - typing is necessary for
blood transfusion, in many acute cases and problems of Rh
incompatibility arising from pregnancy.
i) Metabolic Disorders:
Screening tests: - Urinary glucose, fasting blood sugar, two hour
Post – prandial blood glucose etc.
Diagnostic tests: - Oral glucose tolerance test.
ii) Renal Conditions:-
Blood urea, creatinine, uric acid, serum electrolytes and serum
proteins are determined to ass‟s kidney functions.
iii) Liver Conditions:-
Liver function tests determine the presence of any liver
disease, type of liver disease and the extent and progression of liver
disease. It includes evaluation of bile pigments, bilirubin, SGPT and
SGOT.
iv) Heart Conditions:
Serum cholesterol, SGOT, SGPT, CPK, and LDH is of value
in the diagnosis of Myocardial Infarction.
v) Thyroid Conditions:
Examination of Thyroid hormone T3 and T4 for diagnosis of
Thyroiditis, Hyperthyroidism and Hypothyroidism.
vi) Parathyroid Conditions:
Evaluation of serum calcium, phosphorus and alkaline
phosphates is useful for diagnosis of Tetany, Hypoparathyroidism,
Pancratitis, Osteoporosis and Hyperparathyroidism.
Serological Tests:
BIO – STATISTICS AND RESEARCH METHODOLOGY 269
Examination of Urine:
It is useful in renal diseases and also in other diseases e.g.
Sugar in Diabetes Mellitus. Acetone in Diabetic coma and starvation,
Urobilinogen in Hemolysis and in Viral hepatitis.
Bile salts and pigments are present in Jaundice. R.B.C.‟s are
present in calculi, Nephritis and even in Hypertension also. Crystal
and amorphous deposits in urine indicate the nature of calculus, casts
may be present in renal damages. e.g. Hyaline casts indicate passage
of excess of protein through the tubules.
Examination of Stool :
Macroscopically, inspection of faces may diagnose Parasitic
Infestation, Obstructive Jaundice, Diarrhea, Malabsorption, Ulcerative
Colitis, Dysentery and Gastrointestinal tract bleeding.
Microscopically, presence of pus in stool indicates Chronic
Ulcerative Colitis, Chronic Bacillary Dysentery, Localized Abscess
and Fistula. Presence of blood in stool indicates upper GIT bleeding.
E.g. In Gastric / Duodenal Ulcer, Gastritis, Hiatus Hernia. Bleeding
from rectum and anus – Hemorrhoids, Anorectal Fissure etc.
BIO – STATISTICS AND RESEARCH METHODOLOGY 270
Semen Analysis:
Examination of seminal fluid is useful in the cases of Sterility
and Infertility in males.
Examination of Sputum:
The volume, consistency, appearance, colour and odour of
sputum may be helpful for diagnosis of many respiratory diseases.
Microscopic examination and sputum culture are useful for the
diagnosis of Tuberculosis, Respiratory fungal disease, Lung abscess,
and Staphylococcal pneumonia.
Gastric Analysis:
It is an examination of the gastric contents at various phases of
digestion. The amount, colour, odour, character of gastric content may
be helpful for diagnosis of various gastrointestinal diseases.
Gastric lavage sediment is examined for Tubercular bacilli.
Gastric meal test is useful to ass‟s response of the stomach to stimuli
which may helps for diagnosis of gastric Carcinoma and benign
gastric ulcers.
A] Microscopic Examination:
BIO – STATISTICS AND RESEARCH METHODOLOGY 273
3] Immuno Assays:
These methods may be applied to the measurement of the
amount of antigen or antibody in a specimen. Some of the tests are
listed below:
Tests Application
ELISA Test Used for detection of antigens,
(Enzyme linked Immuno Hepten and Antibodies e.g.
Sorbent Assay) Detection of HbsHg, HCG
levels in urine and blood,
Steroids, Hormones, Antibodies
to bacteria, Viruses and DNA.
Radio Immuno Assay Helpful in a laboratory diagnosis
of a Cancer.
Used for the determination of an
Antigen.
Endocrine Function Test:
The laboratory testing of endocrine functions and the
measurement of a specific hormone helps for diagnosis of a disease.
Now days, the Radio Immunoassay and Competitive Protein Binding
techniques were used for the detection of hormones. Some hormones
are given below:
BIO – STATISTICS AND RESEARCH METHODOLOGY 275
Histo-pathological Examination:
It is useful for diagnosis and stages of various tumors and
swellings. Therefore it suggests type and nature of the tumor. It is
BIO – STATISTICS AND RESEARCH METHODOLOGY 276
B) Cardiac Catheterization :
Here a catheter is inserted via vein or artery into the heart
under radiographic fluoroscopic guidance.
i) Coronary Angiography:
It is used to detect stenosis and helps in revascularization
procedures like percutaneous transluminal coronary angioplasty to
diagnose coronary artery disease.
ii) Radio Nuclide Scanning:
Certain radionuclides are used for studying cardiac function.
Here the gamma rays are detected by means of a planer or
tomographic camera and thus images of heart to be reconstructed by
using following techniques.
BIO – STATISTICS AND RESEARCH METHODOLOGY 279
iii) Xylography:
It requires the insertion of a fine needle into the pelvicalyceal
system under ultrasound or radiographic control. It is used to localize
obstruction Retrograde Pyelography can be performed by inserting
catheters into the ureteric orifices at cystoscopy.
iv) Micturating Cystourethrography:
This is used to diagnose vesico-ureteric reflux. Here, the
bladder is filled with contrast medium through a urinary catheter and
films are taken while the patient voids.
It is also used for investigating patients with recurrent urinary
tract infections, renal stones or renal failure of unknown etiology.
v) Renal Arteriography and Venography:
It is used to investigate renal artery stenosis, Hemorrhage, and
Renal tumors. Placing a catheter into the inferior vena cava via the
femoral vein does Venography. It is useful for the diagnosis of renal
vein thrombosis and renal tumors.
vi) Computed Tomography (CT):
It is used to identify lesions within the kidney or cysts with
masses. Now days Spiral CT is used for renal and adrenal images and
renal artery stenosis.
BIO – STATISTICS AND RESEARCH METHODOLOGY 282
Chapter No. 8
Homoeopathic Concept:
Potentisation :
BIO – STATISTICS AND RESEARCH METHODOLOGY 292
Chapter No. 9
MICROSCOPE:
It provides the microphotographs of object with magnification.
This palorthoplan microscope has camera attachment with automatic
exposure control (WILD MPS45).
The result in form of negative of microphotograph on 35mm
Black and White or colour film.
Model - Orthoplan - Polariser microscope.
Make - Ernst Leitz, - West Germany.
BIO – STATISTICS AND RESEARCH METHODOLOGY 295
Magnification 25 x to 250 x
Applications - It is used in the field of Medicine, Life sciences,
Metallurgy, Material sciences, Polymer sciences and Agriculture.
SPECTROPHOTOMETER:
Model and Make: 330 Hitachi Japan
The model 330 is used for measuring transmittance and
absorbance of liquid, solid and gas samples in the visible, ultraviolet
and near infrared region.
The light emitted from the light source passes through two grating
monochromatic for preparing a monochrome beam, and then is split
into two beams. After passing through the sample compartment, the
monochrome beams are converged and then converted into an electric
signal by a detector – Photo multiplier Tube or PbS cell. The electric
signal provided from the detector is amplified by a preamplifier and
immediately converted into a digital variable by 16 bit A/D converter.
The result is displayed directly to the recorder.
Specifications:
Range - 187nm to 2500 nm
Model - Transmittance (%T) - 0 to 100
- Absorption (Abs) - 0 to 6
Applications- It is used in the fields of Chemical,
Pharmaceutical, Medical Science, Industries etc. as follows:
a. Qualitative and Quantitative analysis.
b. Impurities detection, determination of molecular
weight.
c. Dissociation constants of acid and base.
BIO – STATISTICS AND RESEARCH METHODOLOGY 296
ULTRA CENTRIFUGE:
Ultra Centrifuge is an instrument, which separates the
components of the mixture from the liquid samples when rotated at
great speed at predetermined physical condition. SCP 854 has
maximum speed of 85,000 rpm and it can be carried out at users
determined condition. The ultimate vacuum pressure of the rotator
chamber is of the order of 103 Torr (0.1) and temperature can be
controlled between 0 to 450 c with 10 ± tolerance.
Make - HITACHI
Model – SCP 85H
Applications: (In the field of life sciences to study the biological
molecules):
a. It is used for separation of biological molecules
(Nucleic Acid, Proteins, Lipoproteins, and Glycols
etc.)
b. It is used to study density, shape, and weight of
biological molecules.
c. It is used for separation of cells and its organelle.
d. It is used for separation of bacteria.
e. It is used for separation of virus.
BIO – STATISTICS AND RESEARCH METHODOLOGY 297
Sensitivity – 1 ppb.
Lamps available - As, Au, Ca, Cd, Cu, Cr, Fe, Co, Hg, In,
Mg, Mn, Ni, Fd, Se, Zn, K, Ag, Na, Li, Rb, Be, B, Ga, Mo, V, AI,
Ti, Sr, Ba, Y, Zr.
Applications: - It is used for analysis of soils, plant tissues,
inorganic fertilizers, serum and plasma of blood, seawater, natural
water, foodstuff, alcoholic beverages, paints, ores etc.
THERMAL ANALYZER (TG- DTA-DSC):
It measures the heat flow and weight changes associated with
transitions and reactions in material over the temperature range from
RT to 15000 C. These experiments can be carried out in optional
environments like N2, 02, air at pre-decided flow-rate (ml/min) and
heating rate (0C/min).
The gas-switching accessory is used to turn on and off or to
switch between two different purge gases during SDT experiment-
Heat flow accuracy is 2% and temperature and weight accuracy is
1% where as DTA sensitivity is 0.0010C and that weight is 0.1
Ugm. This SDT –2960 controller is connected to PC and the social
software makes the thermal analysis. It stores the data as well as runs
the analysis programs.
Applications: - It is used for melting point, crystallization behavior,
glass transition specific heat, and reaction kinetics etc. of various
substances. It is also used in industries in the filed of Metallurgy,
glass, Ceramics, earth sciences, building material energy food etc.
FLOWCYTOMETER:
BIO – STATISTICS AND RESEARCH METHODOLOGY 301
Bio-Technology:
Biotechnology is an integrated application of Bio-chemical,
Micro-biological and Engineering science to the technological
employment of micro organisms, cell cultures or their components.
BIO – STATISTICS AND RESEARCH METHODOLOGY 302
Genetic Medicine:
This science studies genes their functions and their influence on
humans during health and disease. It plays important role to the
diagnosis, treatment and prevention of diseases.
i) Gene Chips:
These are nothing but DNA chips. They classifies genes which are
active in different diseases Here few cells are focused on a gene-chip
for the scanning and thus elicit and disease for e.g. Cancer. These
chips also predict whether an individual is predisposed to develop
certain disease or not. Gene chips are also used in Geno typing
procedures which help in preventing gene induced diseases like
Neurofibromatosis, Retinoblastoma etc.
ii) Gene Mutation:
BIO – STATISTICS AND RESEARCH METHODOLOGY 303
Nano Technology:
The molecular machine builds universal assembler having
capacity of assembling objects by atom or molecule by molecule with
a Nono Scale Robotic Arm under control of a computer.
Bio-Sensors:
The stable but sensitive Biosensors are operated by ion –
channel in a lipid membrane having biological property of
recognition with a physical transduction .It plays an important role in
converting biochemical events into an electrical signal ( like ECG )
which are sensitive to Pico molar concentrations of proteins. Today
Biosensors are used for measuring blood glucose. For detection of
blood glucose one can use a light emitting reaction couplet with a
receptor or an enzymes for a biochemical substance to a filament
placed in a blood vessel can give the amount of concentration of
glucose.
BIO – STATISTICS AND RESEARCH METHODOLOGY 304
Cloning:
It is making the Similimum or Identical from original one. The
first cloning performed in 1997 (Dolly- Sheep). It consists the union
of somatic cell with an enucleated egg or the transfer of a nucleus of a
somatic cell into an enucleated egg. We can derive somatic cell / egg
from different individuals or from the same individuals.
Indications:
BIO – STATISTICS AND RESEARCH METHODOLOGY 305
1) Reproductive :
Here, cloning done by nuclear transfer of differentiated
somatic cell in other species than humans. Human
reproductive cloning is unethical according to some experts.
2) Therapeutic :
This technique is just like an Auto Transplantation where
cells / tissues / organs which are to be transplanted back into
the same individual who donated already that cell. This
technique reduced chances of transplanted organ rejection.
Chapter No. 11
DRUG PROVING
Human Experiments:
Dr. Wagner got himself injected with radio- labeled drug
(Carfentanil) there by obtaining the first „Positron Emission
Tomography‟ of the opiate receptors in the human brain.
After an invention of X-rays in 1895, Roentgen took
a picture of this own hands.
Forssmann and Cournand passed a cardiac catheter
in their own veins and observed its progress through
the vessels to the heart.
John Hunter got himself injected with the discharge
from a Syphilitic chancre and got the chancre and
Gonorrhea himself.
Head (1905), Trotter and Davis (1909-1913) and
Boring (1916) cut their own sensory nerves in the
forearm to study sensory loss and the restoration of
sensory activity on regeneration.
Lanier (1935) injected alcohol into his own
cutaneous nerve to observe its effects.
James Carroll volunteered to be bitten by infected
mosquitoes and developed yellow fever.
BIO – STATISTICS AND RESEARCH METHODOLOGY 307
2) Proving Team:
Homoeopathic proving is teamwork. Chief fixes the role of
each member in a team. Following persons plays important role, in
proving.
I) Project Director:
He is a chief person of all team. He plans the methodology and
entire proving procedure. Therefore he is called as Master Prover.
Master prover decide protocol of proving, drug substance and its
potency. He should be blinded to the remedy, which is to be proved.
ii) Adviser:
The adviser assists to the Master Prover and provided him the
knowledge of drug substance. Therefore he must be a Botanist,
Zoologist or a Chemist who has thorough knowledge about source of
a drug.
iii) Supervisor:
They monitor the records of the provers and check each
symptom recorded in day book whether it was complete or
incomplete. They are experienced Homoeopaths.
They keep a contact with the prover from first day to last day.
If any change in the state of health of prover seen they immediately
BIO – STATISTICS AND RESEARCH METHODOLOGY 311
informed this altered state and its nature to the chief personnel. They
also need to be blinded from medicine that is proved.
iv) Provers:
Hahnemann said that, „Homoeopathic physician himself is an
ideal prover.‟ His concept of an ideal prover is that it should be
Healthy, Intelligent, Delicate, Sensitive, Irritable, Unprejudiced,
Honest, Trustworthy and Lover of truth.
Guidelines for Selection of a Prover:
Prover should be healthy.
He should be examined carefully and certified by authorized
medical physician.
Some groups of peoples should be made according to
environmental, biological and social variation.
Different age groups should be considered to ascertain the
pathogenic power of a drug.
Both sexes should be considered for proving as per their race,
religion and geographical distribution.
Pre – proving education regarding actual proving and
Homoeopathy is essential.
Hysterical, maniac, impulsive personalities should not be
taken for proving.
Breast feeding, pregnant and immunological deficient persons
should be avoided.
Ideally they should be free from any harassment, hurry,
tension and any disturbances.
3. Methodology:
BIO – STATISTICS AND RESEARCH METHODOLOGY 312
Committee:
There is a separate committee of well-known Homoeopaths
and experts in medical science to assist to chief director and his team.
They organize the publication and account of proving.
Recording of Symptoms:
It is very very important phase of proving. Each prover has
provided a daybook to make a record of all sings and symptoms
(Subjective or Objective) Sensations or any alterations during
proving. Prover should begin to take notes 7 days before taking
remedy.
Each symptom should be as far as possible complete one. How
to note symptom and what should be included and excluded are to be
thought by chief.
Analysis of Record:
This is time consuming stage of proving. Every symptom
should be extracted, collected, analyzed and repertories and included
in Materia Medica. For this process knowledge of repertory and local
languages are essential because without local language knowledge we
cannot convert them in to correct rubrics.
1. Extraction:
Here the recorded symptoms converted into the format of
Materia Medica after extracting of valid symptoms. Then it is to be
compared to control or cross over group.
BIO – STATISTICS AND RESEARCH METHODOLOGY 316
2. Collation:
In this stage all the provings from separate provers are
synthesized and make into single notes under each section modalities
are expressed as per persons holistic nature. Therefore it should be
represented separately. Intensity of each symptom should be clearly
mentioned.
3. Repertorisation:
Each symptom is analyzed interpretated accurately and
converted into repertory language that is into rubrics. If necessary
new rubrics should be investigated and added in case of reproving of
already proved drug. For this process ones logic and art is essential.
4. Theaming the Symptoms:
Theaming of proving is the practical outcome of entire
proving project. The symptoms are studied under generalities and
particulars.
5. Publication:
The whole data should be summarized after discussion with all
the members of project. The conclusions of proving are indexing
according to schema of repertory (e.g. Kent‟s Repertory) the aims and
objects of proving / reproving are mentioned in the introductory part
of the proving article including Materia medica, reperatorization and
all the relevant information of that specific drug.
Registration:
A Centralized authority – recognized by the State government
or the Competent Homoeopathic Body are in charge of Homoeopathic
drug proving.
Each proving should be first pre-registered with the central
registered authority responsible for Homoeopathic drug proving. They
keep a record and checking of provings carried out in different
research institution and then accepted.
only one researcher for verification of all remedies which are listed in
that particular rubric starting from first to last chapter of that
repertory.
4. Radiologist...etc.
Pathogenesis:
1) Mind: Symptoms.
2) Head: Symptoms.
--- “ ---
--- “ ---
--- “--- All Particulars
--- “ ---
10) Generals
B] Format for New Drug Proving:
1) Drug Name :
E. g. Arsenic Album.
2) Proving Conducted at :
E.g. Place of Drug proving.
3) Period of Proving :
E.g. 2005 – 2006.
4) Potencies Proved :
E.g. 200, 30, Q in descending order.
5) Source of Drug :
E.g. Name of Manufacturing Company
6) Provers List.
Place of proving --------- Proving Code: 7004
Sr. Code Name of Age Sex Rx
No. No Provers (Yrs.)
1 700 401 Miss. ----- 10 F ----
2 700 402 Mr. ------ 29 M Control
3 700 403 Miss.-----. 25 F ----
BIO – STATISTICS AND RESEARCH METHODOLOGY 323
Note:
Ordinary or habits of life must observe and ordinary work
maintained.
BIO – STATISTICS AND RESEARCH METHODOLOGY 324
Provers Signature
Chapter No. 12
META ANALYSIS
have been statistically sound, but should have been excluded because
they lacked a fundamental understanding of laws of Homoeopathy.
The use of complex Homoeopathy and Isopathy as merely educated
guesses because here the patients receive remedies that again are not
individualized but are used for such Homoeopathy will not end by
pseudo analysis.
Randomized control trials are not applicable to Homeopathy
because it cures the disease and not the patient. Instead of nosological
diagnosis Homoeopathy advocates person diagnosis.
Randomizatition of patients, placebo controlled trials and
administrating pre-determined medicines or selecting one or a very
few medicines to different patients randomly, do not it to the tenets of
Homeopathy. Trials should be done under similar circumstances.
Condition. Here there are many chances of selecting a wrong remedy.
This should not be the end of our Homoeopathy; instead our
understanding of this subject will continue to grow.
BIO – STATISTICS AND RESEARCH METHODOLOGY 331
Chapter No. 13
SIGNIFICANT TESTS
Rule of Significance:
1) If, the calculated value of „t‟ is higher than the value given at
P = 0.05 (5% level) in the table it is significant.
2) If the calculated value of „t‟ is less than the value given in „t‟
table it is not significant.
Degree of Freedom:
It is the quantity in a series which is one less than the
independent number of observations in a sample is called – Degree of
freedom.
E.g. In unpaired t test df = N – 1 and in paired t test df = N1+ N2 – 2
(Where, N1 and N2 are the number of observations.)
X X– X =x x2
-5 - 5 - 1.5 = - 6.5 42.25
3 3 - 1.5 = 1.5 2.25
4 4 - 1.5 = 2.5 6.25
-2 - 2 - 1.5 = - 3.5 12.25
7 7 - 1.5 = 5.5 30.25
3 3 - 1.5 = 1.5 2.25
0 0 - 1.5 = - 1.5 2.25
2 2 - 1.5 = 0.5 0.25
N=8 ∑ x2 = 98
= S. D. = ∑ x2
N-1
= 98 = 14 = 3.74
7
Now, t = X x N
S.D.
= 1.5 x 2.82
3.74
= 1.13
Degree of freedom =N-1
=8-1
=7
Here, calculated value of „t‟ is less than the given value in t
table hence the difference between the two means is significant.
E.g. 2) The weight of an untreated group of six persons are 60kg,
40kg, 45kg, 50kg, 65kg, 70kg. The weight of another group persons
from the same population other treatment with Phytolacca drug was
BIO – STATISTICS AND RESEARCH METHODOLOGY 336
obtained as, 45kg, 35kg, 40kg, 45kg, 60kg, 65kg and 45 kg. Apply t
test to find out significance of difference between means of two
groups.
Solution:
Calculation of Mean:
Untreated Weight X1 X1 – X1 = x x 12
Persons
1 60 60 - 55 = 5 25
2 40 40 - 55 = -15 225
3 45 45 - 55 = -10 100
4 50 50 - 55 = -5 25
5 65 65 - 55 = 10 100
6 70 70 - 55 = 15 225
∑X1 = 330 ∑x12 = 700
= 55 – 47.85
11.25 6+7
BIO – STATISTICS AND RESEARCH METHODOLOGY 338
6+7
= 7.15
11.25 x 13
13
t = 7.15
11.25 x 1
= 7.15
11.25 x 1
t = 0.635
Here, the calculated value for t. (0.635) is more than that given
in the „t‟ table for degree of freedom N = (N1 + N2 – 2 = 6+7 – 2 =
11).Hence, the difference between two means is not significant.
N
4) Apply formula for „t‟ that is t = X – 0 = X
5 SEd
N
5) Find the degree of freedom.
Example: 1. Two research centers carry out independent estimates of
calcium carbonate content for water made by a certain firm. A sample
is taken from each place and sent to the two centers separately. They
obtain the following results.
Percentage of Calc. Carbonate content in water.
Place No. 1 2 3 4
Center A 8 5 6 3
Center B 6 6 5 4
1 8 6 -2 4
2 5 6 1 1
3 6 5 -1 1
4 3 4 1 1
Total 22 21 -1 7
∑ (D – D )2 = ∑ D2 – (∑ D)2
=7– (-1)2
4
= 7 – (- 2)
4
= 7 – (- 0.5)
= 6.5
S.E of D = ∑(D- D)
N (N -1)
= 6.5
4 (4 -1)
= 6.5
12
= 0.735
BIO – STATISTICS AND RESEARCH METHODOLOGY 341
S.E. of D 0.735
D. f. = 4 –1 = 3.
Since the observed value of the t = (- 0.340) is less then the
value of t at 5% level of significance for 3 df. So it is non significant.
Hence the hypothesis will be accepted that is the testing is reliable.
Example 2. The effect of Synz. Jamb. drug on 8 patients showed
concentration of glucose (mg/hr) after 24hrs as follows:
Here,
∑D = 8.3
N=8
∑D2 = 14.61
D = 8.3 = 1.0375
8
= 14.61 - (8.3)2
8
7
= 14.61 - 68.89
8
7
S. D. = 14.61 - 8.6112
BIO – STATISTICS AND RESEARCH METHODOLOGY 343
= 0.8569
S. D. = 0.9257
Now, standard error of the difference (SED)
. = S D = 0.9257 = 0.9257
N 8 2.8284
S. E. = 0.3272
t = D = 1.0375 = 3.1708
S ED 0.3272
Here, the calculated value for „t‟ exceeds the tabulated „t‟
value at p = 0.05 level with 7df. Therefore the glucose concentration
by the patients after treatment is not significant.
Utility:
It is widely used in the field of Medical science, Agriculture
and Veterinary as follows:
To compare the results of two drugs which is given to same
individuals in the sample at two different situations? E.g.
Effect of Bryonia and Lycopodium on general symptoms like
sleep, appetite etc.
It is used to study of drug specificity on a particular organ /
tissue / cell level. E.g. Effect of Belberis Vulg. on renal
system.
It is used to compare results of two different methods. E.g.
Estimation of Hb% by Sahlis method and Tallquist method.
BIO – STATISTICS AND RESEARCH METHODOLOGY 344
Here, two variances are derived from two samples. The values
in each group are to be normally distributed. Therefore the variation
of each value around its group mean that is error is remain
independent of each value provided the variances within each group
should be equal for all groups.
Calculation for F Test:
Tests of hypothesis about the variance of two populations: -
Steps:-
BIO – STATISTICS AND RESEARCH METHODOLOGY 345
F = 6ˆ12
6ˆ22 if, 6ˆ12 > 6ˆ22
OR
F = 6ˆ22
6ˆ12 if, 6ˆ22 > 6ˆ12
If, H0 is of the form 6ˆ12 < 6ˆ22 then we calculate F = 6ˆ12 / 6ˆ22
which has F – distribution but with n2 - 1 d.f. in the numerator and
n -1 d.f. in the denominator.
2. Calculation of test statistics:
F statistics = 6ˆ12
6ˆ22
3. Set the level of significance α = 0.05 if value of α is not
known to us.
4. Rules for Significance:
If calculated F α table F2 α then accept the null hypothesis H0
and reject H0 if calculated F > table Fα.
Example:
1) Random samples are drawn from the two sets of students and
the following results were obtained.
10 -5 25 14 - 4.5 20.25
12 -3 9 16 - 2.5 6.25
18 3 9 20 1.5 2.25
14 -1 1 18 - 0.5 0.25
16 1 1 21 2.5 6.25
20 5 25 22 3.5 12.25
∑A= ∑ (A – ∑B= ∑ (B – B)2
90 A)2 = 70 111 = 47.5
We know, A = 90 = 15.
6
B = 111 = 18.5 (we know x = ∑ x)
6 n
6ˆ12 = ∑ (A – A)2 = 70 = 14
n1 – 1 5
6ˆ22 = ∑ (B – B)2 = 47.5 = 9.5
n2 – 1 5
Test statistics: F = 6ˆ22 = 9.5 = 0.6785
6ˆ12 14
Conclusion:
The calculation value of F = 0.6785,
< Table value F 0.05 the null hypothesis H0 is accepted.
The two samples have the same variance.
F = Largest Variance
BIO – STATISTICS AND RESEARCH METHODOLOGY 348
Smallest Variance
= 1.47
0.076
df = 13 – 1 = 12
And = 10 – 1 = 9
x2 = ∑ (0 – E)2
E
OR
x2 = ∑ (fo – fe)2
fe
1) If the tabular value is lower than the calculated value then the
results are significant.
2) If fo = fe then the value of x2 will be zero (but due to chance
error this never happens).
Example:
1) There are two factors showing dominance X and Y. Suppose in A
the progeny were in the ratio of XY = 436, Xy = 122, xY = 120 and
xy = 64 out of 646 individuals.
Test the hypothesis that an „A‟ gives 6: 2: 2: 1 isolation.
Solution
Steps:
1) XY = observed = 436.
Expected = 646 x 6 = 352.36
11
O – E = 436 – 352.36 = 83.64
(O – E) 2 = (83.64) 2 = 6995.6496
2) Xy = observed = 122,
Expected = 646 x 2 = 117.45
11
(O – E) 2 = 20.7025
(O – E) 2 = 20.7025 = 0.1762
E 117.45
BIO – STATISTICS AND RESEARCH METHODOLOGY 351
3) xY = observed = 120.
(O – E) 2 = 6.5025
(O – E) 2 = 6.5025 = 0.05536
E 117.45
4) xy = observed = 64.
O – E = 64 – 58.7272 = 5.2728
(O – E) = 5.2728
(O – E) 2 = 27.8024 = 0.4734
E 58.7272
X2 = ∑ (0 – E) 2
E
= 19.85 + 0.1762 + 0.05536 + 0.4734
= 20.554
In all such cases degree of freedom (df) will be n = k – 1
(where k is the number of classes).
Thus we have in this case degree of freedom „n‟ = 4 – 1 = 3
Now, table value of x2 is 7.815 t 0.05 for 3 degree of freedom, which
is much less than the obtained value that is 20.554
BIO – STATISTICS AND RESEARCH METHODOLOGY 352
2. Median Test:
It is used to observe if two groups come from population
having same median. Here, by using this test we will test the
hypothesis of no difference between these two groups.
3. Mann – Whitney (U – Test):
i) For Small Samples:
This test is used for un-correlated data. Here, samples should
be randomly selected and independently drawn. One of the two
samples must have more than nine values.
ii) For Large Samples:
It is a substitute for F test where either N1 or N2 is larger than
20. (Where, N = number of samples)
4. Kruscal Wallis H – Test:
This test is used whether or not a group of independent
samples are from the same or different population. If the number of
BIO – STATISTICS AND RESEARCH METHODOLOGY 353
cases in the sample is from one to five. Specific tables are used in
interpretation of H (Kruscal Wallis table) and if samples contain five
or more cases H is interpreted as Chi-square test.
5. Sign Test:
This test is especially used for correlated data where the scores
are in pairs. Here, it is assumed that the variable is distributed
continuously.
6. Wilcoxan Test:
In this type of test we are having matched pairs and we are
giving ranks to the differences of pairs.
ANOVA Test (Analysis of Variance):
There are four types of ANOVA tests:
i) One way ANOVA.
ii) Two way ANOVA.
iii) Single factor repeated measures design
iv) Nested design
ANOVA test is applied for comparison of Means of several
groups.
Before application of ANOVA test following assumptions
should be considered.
1) Random sampling should be used for selection of variables.
2) Collected samples should be independent.
3) Variables should follow normal distribution.
4) Using variance ratio test for their homogeneity should test
variables.
5) Degree of skew ness may not affect the significance test.
1] One way ANOVA Test:
BIO – STATISTICS AND RESEARCH METHODOLOGY 354
OR
∑x b2 = ∑ (∑X) 2 - (∑X all)2
n N
BIO – STATISTICS AND RESEARCH METHODOLOGY 355
v) Calculation of F:
+ (2nd group – 1)
= ∑ X2 – (∑X)2
N
2) Derive sum of squares of Rows, that is SS rows
= x12 + x22 + x32 - - - xn2 – (∑X) 2
xn N
DEMOGRAPHY
3. Reports :
It includes epidemiological surveys about disease incidence,
prevalence, before and after treatment, morbidity etc. which provides
valuable data about health status in given area. Nutritional surveys
give data regarding dietary status in a community. An epidemiologist
should conduct a survey.
4. Publications :
Periodic publications are helpful in the study of health
statistics like publications of General Registers World Health
Organization, State or National Health Directors etc. Their reports are
published weekly, monthly or annually as per requirement and
demand.
5. Miscellaneous :
It includes other health agencies like Insurance companies,
Industrial companies where morbidity and mortality data was
recorded which provides valuable information regarding death and its
cause.
Presentation of Data :
All the collected data should be arranged logically, tabulated
and presented systematically at District / State / National /
BIO – STATISTICS AND RESEARCH METHODOLOGY 361
Chapter No. 15
Operation Research:
As name suggests it involves – research on Operation /
Activities, which is applied to problems concerned with organization.
It is widely used in various fields like, Transportation, Manufacturing,
BIO – STATISTICS AND RESEARCH METHODOLOGY 363
Chapter No. 16
MEDICAL ETHICS
Historical Background:
The older code of medical ethics is the „Hippocratic Oath‟. It
is now restated in modern style and known as the „Declaration of
Geneva‟. The Medical Council of India as the code of ethics follows
it. Accordingly every applicant, at the time of registration shall submit
the written and signed declaration to the concerned Register.
Declaration of Helsinki:
It is the worlds most widely recognized source of ethical
guidance on Bio-medical research on humans.
BIO – STATISTICS AND RESEARCH METHODOLOGY 367
General principles.
Duties of Homoeopathic practioners to their patients.
Duties of practioners to in profession.
Duties of practioners to in consultation.
Duties of practioners to the public.
Professional misconduct.
A survey of the texts of the various oaths as applicable to
medical practioners, can serve as an introduction to medical ethics.
Each country has its own code of ethics, usually modeled on
the lines of the international code. However, Charak, 4700 yrs. ago
mentioned an oath.
As Homoeopathic drugs are proved on human beings on both
the sexes and at various ages. There fore here only ethics concerned
with humans are considered.
Protocol:
It contains:
i) Aims and objectives of research.
ii) Reasons for conducting the research on humans.
iii) Any risks / hazards in research process should be mentioned.
iv) Consent.
Any new drug or vaccine must be tested on human subjects in clinical
trials only.
Council for International Organization of Medical Sciences
explains certain rules for conducting any research in humans. These
are concerned with following points:
- Ethical justification and scientific validity of biomedical research
Involving human subjects.
- Ethical review comities.
- Ethical review of externally sponsored research.
- Individual informed consent.
BIO – STATISTICS AND RESEARCH METHODOLOGY 370
Statements:
Every physician should report each case of HIV
infected patient to Government.
- He should treat the patient as far as possible.
- He should educate the patients regarding its spread.
- Isolate the patients (if necessary).
Criticisms:
AIDS is not a contagious disease because it is not transmitted
through water, air or through vectors. But AIDS is a communicable
disease that is it spreads from one person to others by sexual contact
or blood. Therefore it is not epidemic disease. Thus isolating these
patients are not logical for prevention or spread of disease.
W.H.O. and U.S.A. committee statement includes following
features:
BIO – STATISTICS AND RESEARCH METHODOLOGY 372
Chapter No. 17
THESIS (DISSERTATION)
Presentation of Thesis:
Thesis writing is compulsory for the completion of M.D.
Course. (As per Curriculum for M.D. (Hom.) Post Graduate Degree
Course Regulations 1989.) Therefore before proceeding to write a
thesis one should pay attention towards following things.
1. Thesis must be typed on computer than typewriter.
2. The paper size should be A4 or letter size having 50 mm thickness
for black and white pages and 70 mm should be used for color
pages.
3. The pages should be numbered. (Except title page)
4. Ideally the volume of thesis should be single one.
5. The binding of thesis should be simplest but attractive one.
6. Students should submit seven copies of their thesis. Four for
university, one for guide or examiner, one for central council of
Homoeopathy and one for self.
7. It should contain minimum errors and mistakes.
BIO – STATISTICS AND RESEARCH METHODOLOGY 374
2. Text Part:
i) Introduction.
ii) Aims and Objectives.
iii) Review of literature.
iv) Materials and Methods.
v) Discussion and Results.
vi) Summary and Conclusion.
vii) Cases (usually 30 cases)
3. Appendix Part:
i) Reference (Arranged alphabetically).
ii) Supplements (if necessary).
For the sake of students the whole thesis is divided into 3 main
Parts:
BIO – STATISTICS AND RESEARCH METHODOLOGY 375
Introductory.
Text part.
Appendix part.
Introductory:
It is the first part of our research work. It consists following
components.
i) Title Page:
Here, we have to mention the Problem, Type of Study and
Purpose of the whole project. This consist of Name of an investigator
his Guide / Co- guide and name of his Institution and University.
ii) Acknowledgement:
Research is a teamwork and therefore many peoples helped us
like Statistician, Computer operator, Guide, Friends and Teachers etc.
Therefore we should acknowledge our thanks and express sincere
gratitude‟s for their immerse support and valuable guidance to all
those who have contributed for this study.
iii) Declaration and Certificates:
In the Declaration part student must mention that the research
has been done by him only and is not submitted to any other
University or published previously.
The certificates are signed by the Guide and or Co-Guide,
Head of Institution or Department and Dean faculty of Homoeopathy
in concerned university.
iv) Bio –Data:
It consist of researchers Full name, Year of admission, P.G.
Course (Regular/External), Appearing month / year, M.D first
examination year, Specialty subject, Correspondence address, Clinic
BIO – STATISTICS AND RESEARCH METHODOLOGY 376
vii) Cases:
Case studies are the essential part of an experimental study.
Student should present at least 30 cases with investigations (before
and after the treatment) and Follow ups in detail. Many Descriptive
types of researches need not require cases. At the end, all cases should
BIO – STATISTICS AND RESEARCH METHODOLOGY 379
Appendix Part:
This is the last part of our thesis. This part is further
subdivided as follows:
I) References:
The References or Bibliography should be arranged
alphabetically. For e.g. first of all you should mention the name of the
author and then title of book, its edition, place and year of publication
volume numbers, Publishers name or company and pages
successively. As far as possible, we should mention large number of
references, which will reflect our depth of knowledge and hard work
taken for it.
ii) Supplements:
This part is not a compulsory part of any thesis. If a student
may feel that something should be added which is not given in
previous part he has an opportunity to present his additions in this
part. For e.g. List of latest equipments or instruments used of research
process. We can make this appendix part more attractive by using
different graphs, maps or diagrams with their significance in research.
LOGARITHMS
BIO – STATISTICS AND RESEARCH METHODOLOGY 380
0 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9
10 0000 0043 0086 0128 0170 5 9 13 17 21 26 30 34 38
0212 0253 0294 0334 0374 4 8 12 16 20 24 28 32 36
11 0414 0453 0492 0531 0569 4 8 12 16 20 23 27 31 35
0607 0645 0682 0719 0755 4 7 11 15 18 22 26 29 33
12 3 7 11
0792 0828 0864 0899 0934 3 7 10 14 18 21 25 28 32
0969 1004 1038 1072 1106 14 17 20 24 27 31
0 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9
51 7076 7084 7093 7101 7110 7118 7126 7135 7143 7152 1 2 3 3 4 5 6 7 8
52 7160 7168 7177 7185 7193 7202 7210 7218 7226 7235 1 2 2 3 4 5 6 7 7
53 7243 7251 7259 7267 7275 7284 7292 7300 7308 7316 1 2 2 3 4 5 6 6 7
54 7324 7332 7340 7348 7356 7364 7372 7380 7388 7396 1 2 2 3 4 5 6 6 7
55 7404 7412 7419 7427 7435 7443 7451 7459 7466 7474 1 2 2 3 4 5 5 6 7
56 7482 7490 7497 7505 7513 7520 7528 7536 7543 7551 1 2 2 3 4 5 5 6 7
57 7559 7566 7574 7582 7589 7597 7604 7612 7619 7627 1 2 2 3 4 5 5 6 7
58 7634 7642 7649 7657 7664 7672 7679 7686 7694 7701 1 2 2 3 4 4 5 6 7
59 7709 7716 7723 7731 7738 7745 7752 7760 7767 7774 1 2 2 3 4 4 5 6 7
60 7782 7789 7796 7803 7810 7818 7825 7832 7839 7846 1 2 2 3 4 4 5 6 6
61 7853 7860 7868 7875 7882 7889 7896 7903 7910 7917 1 2 2 3 4 4 5 6 6
62 7924 7931 7938 7945 7952 7959 7966 7973 7980 7987 1 2 2 3 3 4 5 6 6
63 7993 8000 8007 8014 8021 8028 8035 8041 8048 8055 1 2 2 3 3 4 5 5 6
64 8062 8079 8075 8082 8089 8096 8102 8109 8816 8122 1 2 2 3 3 4 5 5 6
65 8129 8136 8142 8149 8156 8162 8169 8176 8182 8189 1 2 2 3 3 4 5 5 6
66 8195 8202 8209 8215 8222 8228 8235 8241 8248 8254 1 2 2 3 3 4 5 5 6
67 8261 8267 8274 8280 8287 8293 8299 8306 8312 8319 1 2 2 3 3 4 5 5 6
68 8325 8331 8338 8344 8351 8357 8363 8370 8376 8382 1 1 2 3 3 4 4 5 6
69 8388 8395 8401 8407 8414 8420 8426 8432 8439 8445 1 1 2 3 3 4 4 5 6
70 8451 8457 8463 8470 8476 8482 8488 8494 8500 8506 1 1 2 3 3 4 4 5 6
71 8513 8519 8525 8531 8537 8543 8549 8555 8561 8567 1 1 2 3 3 4 4 5 5
72 8573 8579 8585 8591 8597 8603 8609 8615 8621 8627 1 1 2 3 3 4 4 5 5
73 8633 8639 8645 8651 8657 8663 8669 8675 8681 8686 1 1 2 3 3 4 4 5 5
74 8692 8698 8704 8710 8716 8722 8727 8733 8739 8745 1 1 2 3 3 4 4 5 5
75 8751 8756 8762 8768 8774 8779 8785 8791 8797 8802 1 1 2 2 3 3 4 5 5
76 8808 8814 8820 8825 8831 8837 8842 8848 8854 8859 1 1 2 2 3 3 4 5 5
77 8865 8871 8876 8882 8887 8893 8899 8904 8910 8915 1 1 2 2 3 3 4 4 5
78 8921 8927 8932 8938 8943 8949 8954 8960 8965 8971 1 1 2 2 3 3 4 4 5
79 8976 8982 8987 8993 8998 9004 9009 9015 9020 9025 1 1 2 2 3 3 4 4 5
80 9031 9036 9042 9047 9053 9058 9063 9069 9074 9079 1 1 2 2 3 3 4 4 5
81 9085 9090 9096 9101 9106 9112 9117 9122 9128 9133 1 1 2 2 3 3 4 4 5
82 9138 9143 9149 9154 9159 9165 9170 9175 9180 9186 1 1 2 2 3 3 4 4 5
83 9191 9196 9201 9206 9212 9217 9222 9227 9232 9238 1 1 2 2 3 3 4 4 5
84 9243 9248 9253 9258 9263 9269 9274 9279 9284 9289 1 1 2 2 3 3 4 4 5
85 9294 9299 9304 9309 9315 9320 9325 9330 9335 9340 1 1 2 2 3 3 4 4 5
86 9345 9350 9355 9360 9365 9370 9375 9380 9385 9390 1 1 2 2 3 3 4 4 5
87 9395 9400 9405 9410 9415 9420 9425 9430 9435 9440 0 1 1 2 2 3 3 4 4
88 9445 9445 9450 9455 9465 9469 9474 9479 9484 9489 0 1 1 2 2 3 3 4 4
89 9494 9499 9504 9509 9513 9518 9523 9528 9533 9538 0 1 1 2 2 3 3 4 4
90 9542 9547 9552 9557 9562 9566 9571 9576 9581 9586 0 1 1 2 2 3 3 4 4
91 9590 9595 9600 9605 9609 9614 9619 9624 9628 9633 0 1 1 2 2 3 3 4 4
92 9638 9643 9647 9652 9657 9661 9666 9671 9675 9680 0 1 1 2 2 3 3 4 4
93 9685 9689 9694 9699 9703 9708 9713 9717 9722 9727 0 1 1 2 2 3 3 4 4
94 9731 9736 9741 9745 9750 9754 9759 9763 9768 9773 0 1 1 2 2 3 3 4 4
95 9777 9782 9786 9791 9795 9800 9805 9809 9814 9818 0 1 1 2 2 3 3 4 4
96 9823 9827 9832 9836 9841 9845 9850 9854 9859 9863 0 1 1 2 2 3 3 4 4
97 9868 9872 9877 9881 9886 9890 9894 9899 9803 9808 0 1 1 2 2 3 3 4 4
98 9912 9917 9921 9926 9930 9934 9939 9943 9948 9952 0 1 1 2 2 3 3 4 4
99 9956 9961 9965 9969 9974 9978 9983 9987 9991 9996 0 1 1 2 2 3 3 4 4
ANTILOGARITHMS
v 0 1 2 3 4 5 6 7 8 9 Δm 1 2 3 4 5 6 7 8 9
+ ADD
.00 1000 1002 1005 1007 1009 1012 1014 1016 1019 1021 2 0 0 1 1 1 1 1 2 2
.01 1023 1026 1028 1030 1033 1035 1038 1040 1042 1045 2 0 0 1 1 1 1 1 2 2
.02 1047 1050 1052 1054 1057 1059 1062 1064 1067 1069 2 0 0 1 1 1 1 1 2 2
.03 1072 1074 1076 1079 1081 1084 1086 1089 1091 1094 2 0 0 1 1 1 1 1 2 2
.04 1096 1099 1102 1104 1107 1109 1112 1114 1117 1119 3 0 1 1 1 1 2 2 2 3
.05 1122 1125 1127 1130 1132 1135 1138 1140 1143 1146 3 0 1 1 1 1 2 2 2 3
BIO – STATISTICS AND RESEARCH METHODOLOGY 382
.06 1148 1151 1153 1156 1159 1161 1164 1167 1169 1172 3 0 1 1 1 1 2 2 2 3
.07 1175 1178 1180 1183 1186 1189 1191 1194 1197 1199 3 0 1 1 1 1 2 2 2 3
.08 1202 1205 1208 1211 1213 1216 1219 1222 1225 1227 3 0 1 1 1 1 2 2 2 3
.09 1230 1233 1236 1239 1242 1245 1247 1250 1253 1256 3 0 1 1 1 1 2 2 2 3
.10 1259 1262 1265 1268 1271 1274 1276 1279 1282 1285 3 0 1 1 1 1 2 2 2 3
.11 1288 1291 1294 1297 1300 1303 1306 1309 1312 1315 3 0 1 1 1 2 2 2 2 3
.12 1318 1321 1324 1327 1330 1334 1337 1340 1343 1346 3 0 1 1 1 2 2 2 2 3
.13 1349 1352 1355 1358 1361 1365 1368 1371 1374 1377 3 0 1 1 1 2 2 2 2 3
.14 1380 1384 1387 1390 1393 1396 1400 1403 1406 1409 3 0 1 1 1 2 2 2 2 3
.15 1413 1416 1419 1422 1426 1429 1432 1435 1439 1442 3 0 1 1 1 2 2 2 2 3
.16 1445 1449 1452 1455 1459 1462 1466 1469 1472 1467 3 0 1 1 1 2 2 2 2 3
.17 1479 1483 1486 1489 1493 1496 1500 1503 1507 1510 4 0 1 1 2 2 2 3 3 4
.18 1514 1517 1521 1524 1528 1531 1535 1538 1542 1545 4 0 1 1 2 2 2 3 3 4
.19 1549 1552 1556 1560 1563 1567 1570 1574 1578 1581 4 0 1 1 2 2 2 3 3 4
.20 1585 1589 1592 1596 1600 1603 1607 1611 1614 1618 4 0 1 1 2 2 2 3 3 4
.21 1622 1626 1629 1633 1637 1641 1644 1649 1652 1656 4 0 1 1 2 2 2 3 3 4
.22 1660 1663 1667 1671 1675 1679 1683 1687 1690 1694 4 0 1 1 2 2 2 3 3 4
.23 1698 1702 1706 1710 1714 1718 1722 1727 1730 1734 4 0 1 1 2 2 2 3 3 4
.24 1738 1742 1746 1750 1754 1758 1762 1766 1770 1774 4 0 1 1 2 2 2 3 3 4
.25 1778 1782 1786 1791 1795 1799 1803 1807 1811 1816 4 0 1 1 2 2 2 3 3 4
.26 1820 1824 1828 1832 1837 1841 1845 1849 1854 1858 4 0 1 1 2 2 2 3 3 4
.27 1862 1866 1871 1875 1879 1884 1888 1892 1897 1901 4 0 1 1 2 2 2 3 3 4
.28 1905 1910 1914 1919 1923 1928 1932 1936 1941 1945 4 0 1 1 2 2 2 3 3 4
.29 1950 1954 1959 1963 1968 1972 1977 1982 1986 1991 4 0 1 1 2 2 2 3 3 4
.30 1995 2000 2004 2009 2014 2018 2023 2028 2032 2037 5 0 1 1 2 2 3 3 4 4
.31 2042 2046 2051 2056 2061 2065 2070 2075 2080 2084 5 0 1 1 2 2 3 3 4 4
.32 2089 2094 2099 2104 2109 2113 2118 2123 2128 2133 5 0 1 1 2 2 3 3 4 4
.33 2138 2143 2148 2153 2158 2163 2168 2173 2178 2183 5 1 1 2 2 3 3 4 4 5
BIO – STATISTICS AND RESEARCH METHODOLOGY 383
x 0 1 2 3 4 5 6 7 8 9 Δm 1 2 3 4 5 6 7 8 9
+ ADD
.34 2188 2193 2198 2203 2208 2213 2218 2223 2228 2234 5 1 1 2 2 3 3 4 4 5
.35 2239 2244 2249 2254 2259 2265 2270 2275 2280 2286 5 1 1 2 2 3 3 4 4 5
.36 2291 2296 2301 2307 2312 2317 2323 2328 2333 2339 5 1 1 2 2 3 3 4 4 5
.37 2344 2350 2355 2360 2366 2371 2377 2382 2388 2393 6 1 1 2 2 3 4 4 5 5
.38 2399 2404 2410 2415 2421 2427 2432 2438 2443 2449 6 1 1 2 2 3 4 4 5 5
.39 2455 2460 2466 2472 2477 2483 2489 2495 2500 2506 6 1 1 2 2 3 4 4 5 5
.40 2512 2518 2523 2529 2535 2541 2547 2553 2559 2564 6 1 1 2 2 3 4 4 5 5
.41 2570 2576 2582 2588 2594 2600 2606 2612 2618 2624 6 1 1 2 2 3 4 4 5 5
.42 2630 2636 2642 2649 2655 2661 2667 2673 2679 2685 6 1 1 2 2 3 4 4 5 5
.43 2692 2698 2704 2710 2716 2723 2729 2735 2742 2748 6 1 1 2 2 3 4 4 5 5
.44 2754 2761 2767 2773 2780 2786 2793 2799 2805 2812 6 1 1 2 2 3 4 4 5 5
.45 2818 2825 2831 2838 2844 2851 2858 2864 2871 2877 7 1 1 2 3 3 4 5 6 6
.46 2884 2891 2897 2904 2911 2917 2924 2931 2938 2944 7 1 1 2 3 3 4 5 6 6
.47 2951 2958 2965 2972 2979 2985 2992 2999 3006 3013 7 1 1 2 3 3 4 5 6 6
.48 3020 3027 3034 3041 3048 3055 3062 3069 3076 3083 7 1 1 2 3 4 4 5 6 6
.49 3090 3097 3105 3112 3119 3126 3133 3141 3148 3155 7 1 1 2 3 4 4 5 6 6
.50 3162 3170 3177 3184 3192 3199 3206 3214 3221 3228 1 1 2 3 4 4 5 6 7
.51 3236 3243 3251 3258 3266 3273 3281 3289 3296 3304 1 2 2 3 4 5 5 6 7
.52 3211 3319 3327 3334 3342 3350 3357 3365 3372 3381 1 2 2 3 4 5 5 6 7
.53 3388 3396 3404 3412 3420 3428 3436 3443 3451 3459 1 2 2 3 4 5 5 6 7
.54 3467 3475 3483 3491 3499 3508 3536 3524 3532 3540 1 2 2 3 4 5 5 6 7
.55 3548 3556 3565 3573 3581 3589 3597 3606 3614 3622 1 2 2 3 4 5 5 6 7
.56 3631 3639 3648 3656 3664 3673 3681 3690 3608 3707 1 2 3 3 4 5 6 7 8
.57 3715 3724 3733 3741 3750 3758 3707 3776 3784 3793 1 2 3 3 4 5 6 7 8
.58 3802 3311 3819 3828 3837 3846 3855 3864 3873 3882 1 2 3 4 4 5 6 7 8
.59 3890 3899 3908 3917 3926 3936 3945 3854 3963 3972 1 2 3 4 5 6 6 7 8
.60 3981 3990 3999 4009 4018 4027 4036 4046 4055 4064 1 2 3 4 5 6 6 7 8
.61 4074 4083 4093 4102 4111 4121 4130 4140 4150 5159 1 2 3 4 5 6 7 8 9
.62 4169 4178 4188 4198 4207 4217 4227 4236 4246 4256 1 2 3 4 5 6 7 8 9
.63 4266 4276 4285 4295 4305 4315 4325 4335 4345 4355 1 2 3 4 5 6 7 8 9
.64 4365 4375 4385 4395 4406 4410 4426 4436 4446 4457 1 2 3 4 5 6 7 8 9
.65 4467 4477 4487 4498 4508 4519 4529 4539 4550 4660 1 2 3 4 5 6 7 8 9
.66 4571 4581 4592 4603 4613 4724 4634 4645 4656 4667 1 2 3 4 5 6 7 9 10
.67 4677 4688 4699 4710 4721 4732 4742 4753 4764 4775 1 2 3 4 5 7 7 9 10
BIO – STATISTICS AND RESEARCH METHODOLOGY 384
V 0 1 2 3 4 5 6 7 8 9 Δm 1 2 3 4 5 6 7 8 9
+ ADD
.68 4786 4797 4808 4819 4831 4840 4853 4864 4875 4887 1 2 3 4 6 7 8 9 10
.69 4898 4909 4920 4932 4943 4955 4966 4977 4989 5000 1 2 3 5 6 7 8 9 10
.70 5012 5023 5035 5047 5058 5070 5082 5093 5105 5117 1 2 4 5 6 7 8 9 11
.71 5129 5140 5152 5164 5176 5188 5200 5212 5224 5236 1 2 4 5 6 7 8 10 11
.72 5248 5260 5272 5284 5297 5309 5321 5333 5346 5358 1 3 4 5 6 7 9 10 11
.73 5370 5383 5395 5408 5420 5433 5445 5458 5470 5483 1 3 4 5 6 8 9 10 11
.74 5495 5508 5521 5534 5546 5559 5572 5585 5598 5610 1 3 4 5 6 8 9 10 12
.75 5623 5636 5649 5062 5675 5689 5702 5715 5728 5741 1 3 4 5 7 8 9 10 12
.76 5754 5768 5781 5794 5808 5821 5834 5448 5861 5875 1 3 4 5 7 8 9 11 12
.77 5888 5902 5916 5929 5943 5957 5970 5984 5998 6015 1 3 4 5 7 8 10 11 12
.78 6026 6039 6053 6067 6081 6095 6109 6124 6138 6153 1 3 4 6 7 8 10 11 13
.79 6166 6180 6194 6209 6223 6237 6252 6206 6281 6295 1 3 4 6 7 9 10 11 13
.80 6310 6324 6339 6353 6368 6383 6397 6412 6427 6442 1 3 4 6 7 9 10 12 13
.81 6457 6471 6486 6501 6516 6531 6536 6551 6577 6592 2 3 5 6 8 9 11 12 14
.82 6607 6622 6637 6653 6668 6683 6699 6714 6730 6745 2 3 5 6 8 9 11 12 14
.83 6761 6776 6792 6808 6823 6839 6855 6871 6887 6902 2 3 5 6 8 9 11 13 14
.84 6918 6934 6950 6966 6982 6998 7015 7031 7047 7063 2 3 5 6 8 10 11 13 15
.85 7079 7096 7112 7129 7145 7161 7178 7194 7211 7228 2 3 5 7 8 10 12 13 15
.86 7244 7261 7278 7278 7295 7311 7328 7345 7362 7379 2 3 5 7 8 10 12 13 15
.87 7413 7430 7447 7464 7482 7499 7516 7534 7551 7568 2 3 5 7 9 10 12 14 16
.88 7586 7603 7621 7638 7656 7674 7691 7709 7727 7745 2 4 5 7 9 11 12 14 16
.89 7762 7780 7798 9816 7834 7852 7870 7889 7907 7925 2 4 5 7 9 11 13 14 16
.90 7943 7962 7980 7998 8017 8035 8054 8072 8091 8110 2 4 6 7 9 11 13 15 17
.91 8128 8147 8166 8185 8204 8222 8241 8260 8279 8299 2 4 6 8 9 11 13 15 17
.92 8318 8337 8356 8375 8395 9414 9433 9453 9472 8492 2 4 6 8 10 12 14 15 17
.93 8511 8531 8551 8570 8590 8610 8630 8650 8670 8690 2 4 6 8 10 12 14 16 18
.94 8710 8730 8750 8770 8790 8810 8831 8851 8872 8892 2 4 6 8 10 12 14 16 18
.95 8913 8933 8954 8974 8995 9016 9036 9057 9078 9099 2 4 6 8 10 12 15 17 19
.96 9120 9141 9162 9183 9204 9226 9247 9268 9290 9311 2 4 6 8 11 13 15 17 19
.97 9333 9354 9376 9397 0419 9441 9462 9484 9506 9528 2 4 7 9 11 13 15 17 20
.98 9550 9572 9594 9616 9638 9661 9683 9705 9727 9750 2 4 7 9 11 13 16 18 20
.99 9772 9795 9817 9840 9863 9886 9908 9931 9954 9977 2 5 7 9 11 14 16 18 20
BIO – STATISTICS AND RESEARCH METHODOLOGY 385
7. .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
0.0 .0000 .0040 .0080 .0120 .0160 .0199 .0239 .0279 .0319 .0359
0.1 .0398 .0438 .0478 0.517 .0557 .0596 .0636 .0675 .0714 .0753
0.2 .0793 .0832 .0871 .0910 .0948 .0987 .1026 .1064 .1103 .1141
0.3 .1179 .1217 .1255 .1293 .1331 .1368 .1406 .1443 .1480 .1517
0.4 .1554 .1591 .1628 .1664 ..1700 .1736 .1772 .1808 .1844 .1879
0.5 .1915 .1950 .1985 .2019 .2054 .2088 .2123 .2157 .2190 .2224
0.6 .2257 .2291 .2324 .2357 ..2389 .2422 .2454 .2486 .2517 .2549
0.7 .2580 .2611 .2642 .2673 .2703 .2734 .2764 .2794 .2823 .2852
0.8 .2881 .2910 .2939 .2967 .2995 .3023 .3051 .3078 .3106 .3133
0.9 .3159 .3186 .3212 .3238 .3264 .3289 .3315 .5340 .3365 .3389
1.0 .3413 .3438 .3461 .3485 .3508 .3531 .3554 .3577 .3599 .3621
1.1 .3643 .3665 .3686 .3708 .3729 .3749 .3770 .3790 .3810 .3830
1.2 .3849 .3869 .3888 .3907 .3925 3944 .3962 .3980 ..3997 .4015
1.3 .4032 .4049 .4066 .4082 .4099 .4115 .4131 .4147 .4162 .4177
1.4 .4192 .4207 .4222 .4236 .4251 .4265 .4279 .4292 .4306 .4318
1.5 .4332 .4345 .4357 .4370 ..4382 .4394 .4406 .4418 .4429 .4441
1.6 .4452 .4463 .4474 .4484 .4490 .4505 .4515 .4525 .4535 .4545
1.7 .4554 .4564 .4573 .4582 ..4591 .4599 .4608 .4614 .4625 4632
1.8 .4641 .4649 .4656 .4664 .4671 .4678 .4686 .4693 .4699 .4708
1.9 .4713 .4719 .4726 .4732 .4738 .4744 .4750 .4756 .4761 .4767
2.0 .4772 .4778 .4783 .4788 .4793 .4798 .4803 .4808 .4812 .4817
2.1 4821 .4826 .4830 .4834 .4838 .4842 .4846 .4850 .4854 .4857
2.2 .4861 .4864 .4868 .4871 .4875 .4878 .4881 .4884 .4887 .4890
2.3 .4893 .4896 .4898 .4901 .4904 .4906 .4909 .4911 .4913 .4916
2.4 .4918 .4920 .4922 .4925 .4927 .4929 .4931 .4932 .4934 .4936
2.5 .4938 .4940 .4941 .4943 .4945 .4946 .4948 .4949 .4951 .4952
2.6 .4953 .4955 .4956 .4957 .4959 .4960 .4961 .4962 .4963 4964
2.7 .4965 .4966 .4967 .4968 .4969 .4970 .4971 .4972 .4973 .4974
2.8 .4974 .4975 .4976 .4977 .4977 .4978 .4979 .4979 .4980 .4981
2.9 .4981 .4982 .4982 .4983 .4984 .4984 .4985 .4985 .4986 .4986
3.0 .4981 .4987 .4987 .4938 .4988 .4989 .4989 .4989 .4990 .4990
BIO – STATISTICS AND RESEARCH METHODOLOGY 386
Po t.955 t.89 t.975 t.85 t.90 t.80 t.75 t.79 t.60 t.35
1 63.66 31.82 12.71 6.31 3.08 1.376 1.000 .727 .325 .158
2 9.92 6.96 4.30 2.92 1.89 1.061 .816 .617 .289 .142
3 5.84 4.54 3.18 2.35 1.64 .978 .765 .584 .277 .137
4 4.60 3.75 2.78 2.13 1.53 .941 .741 .569 .271 .134
5 4.03 3.36 2.57 2.02 1.48 .920 .727 .550 .267 .132
6 3.71 3.14 2.45 1.94 1.44 .906 .718 .553 .265 .131
7 3.50 3.00 2.36 1.90 1.42 .806 .711 .549 .263 .130
8 3.25 2.90 2.31 1.86 1.40 .889 .706 .546 .262 .130
9 4.03 2.82 2.26 1.83 1.38 .886 .703 .543 .261 .129
10 3.17 2.76 2.23 1.81 1.37 .879 .700 .542 .260 .129
11 3.11 2.72 2.20 1.80 1.30 .876 .697 .540 .260 .129
12 3.00 2.68 2.18 1.78 1.30 .873 .695 .539 .259 .128
13 3.01 2.65 2.16 1.77 1.35 .870 .694 .538 .259 .128
14 2.98 2.62 2.14 1.76 1.34 .868 .692 .537 .258 .128
15 2.95 2.60 2.13 1.75 1.34 .866 .691 .536 .258 .128
16 2.92 2.58 2.12 1.75 1.34 .865 .690 .535 .258 .128
17 2.70 2.57 2.11 1.74 1.33 .863 .689 .534 .257 .128
18 2.88 2.55 2.10 1.73 1.33 .862 .688 .534 .257 .127
19 2.86 2.54 2.09 173 1.33 .861 .688 .533 .257 .127
20 2.84 2.53 2.09 1.72 1.32 .860 .687 .533 .257 .127
21 2.83 2.52 2.08 1.72 1.32 .859 .686 .532 .256 .127
22 2.82 2.51 2.07 1.72 1.32 .858 .686 .532 .256 .127
23 2.81 2.50 2.07 1.71 1.32 .858 .685 .532 .256 .127
24 2.80 2.49 2.06 1.71 1.32 .857 .685 .531 .256 .127
25 2.79 2.48 2.06 1.71 1.32 .856 .681 .531 .256 .127
26 2.78 2.48 2.06 1.71 1.32 .856 .684 .531 .256 .127
27 2.77 2.47 2.05 1.70 1.31 .855 .684 .531 .256 .127
28 2.76 2.47 2.05 1.70 1.31 .855 .683 .530 .256 .127
29 2.70 2.46 2.04 1.70 1.31 .854 .683 .530 .256 .127
30 2.76 2.46 2.01 1.70 1.31 .854 .683 .530 .256 .127
40 2.70 2.42 2.02 1.68 1.30 .851 .681 .529 .255 .126
60 2.66 2.39 2.00 1.67 1.30 .848 .679 .527 .254 .126
120 2.62 2.36 1.98 1.66 1.29 .845 .677 .526 .254 .126
x 2.56 2.33 1.96 1.645 1.28 .842 .674 .524 .253 .126
BIO – STATISTICS AND RESEARCH METHODOLOGY 387
y x2995 x2989 x2975 x2.95 x2.80 x2.75 x2.50 x2.23 x2.10 x2.05 x2.025 x2.01 x2.005
1 7.88 6.68 5.02 3.84 2.71 1.32 .455 .102 .0158 .0039 .0010 .0002 .0000
2 10.6 9.21 7.38 5.99 4.61 2.77 1.39 .575 .211 .103 .0506 .0201 .0100
3 12.8 11.3 9.35 7.81 6.25 4.11 2.37 1.21 .584 .352 .216 .116 .072
4 14.9 13.3 11.1 9.49 7.78 5.39 3.36 1.92 1.06 .711 .484 .297 .207
5 16.7 15.1 12.8 11.1 9.24 6.63 4.35 2.67 1.61 1.15 .331 .554 .412
6 18.5 16.8 14.4 12.6 10.6 7.84 5.35 3.45 2.20 1.64 1.24 .872 .676
7 20.3 18.5 16.0 14.1 12.0 9.04 6.35 4.25 2.83 2.17 1.69 1.24 .989
8 22.0 20.1 17.5 15.5 13.4 10.2 7.34 5.07 3.49 2.73 2.18 1.26 1.34
9 23.6 21.7 19.0 16.9 14.7 11.4 8.34 5.90 4.17 3.33 2.70 2.09 1.73
10 25.2 23.2 20.5 18.3 16.0 12.5 9.34 6.74 4.87 3.94 3.25 2.56 2.16
11 26.3 24.7 21.9 19.7 17.3 13.7 10.3 7.58 5.58 4.57 3.62 3.05 2.60
12 28.3 26.2 23.9 21.0 18.5 14.8 11.3 8.44 6.30 5.23 4.40 3.57 3.07
13 29.8 27.7 24.7 22.4 19.8 16.0 12.3 9.30 7.04 5.89 5.01 4.11 3.67
14 31.3 29.1 26.1 23.7 21.1 17.1 13.3 10.2 7.79 6.57 5.63 4.66 4.07
15 32.8 30.6 27.5 25.0 22.3 18.2 14.3 11.0 8.55 7.26 6.26 5.23 4.60
16 34.3 32.0 28.8 26.3 23.5 19.4 15.3 11.9 9.31 7.96 6.91 5.81 5.14
17 35.7 33.4 30.2 27.6 24.8 20.5 16.3 12.8 10.1 8.67 7.66 6.41 5.70
18 37.2 34.8 31.5 28.9 26.0 21.6 17.3 13.7 10.9 9.39 8.23 7.01 6.26
19 38.6 36.2 32.9 30.1 27.2 22.7 18.3 14.6 11.7 10.1 8.91 7.63 6.84
(Contd.)
BIO – STATISTICS AND RESEARCH METHODOLOGY 388
(Contd.)
y x2995 x2989 x2975 x2.95 x2.80 x2.75 x2.50 x2.23 x2.10 x2.05 x2.025 x2.01 x2.005
20 40.0 37.6 34.2 31.4 28.4 23.8 19.3 15.5 12.4 10.9 9.59 8.26 7.43
21 41.4 38.9 35.5 32.7 29.6 24.9 20.3 16.3 13.2 11.6 10.3 8.90 8.03
22 42.8 40.4 36.8 33.9 30.8 26.0 21.3 17.2 14.0 12.3 11.0 9.54 8.64
23 44.2 41.6 38.1 35.2 32.0 27.1 22.3 18.1 14.8 13.1 11.7 10.2 9.26
24 45.6 43.0 39.4 36.4 33.2 28.2 23.3 19.0 15.7 13.8 12.4 10.9 9.89
25 46.9 44.3 49.6 37.7 84.4 29.2 24.3 19.9 16.5 14.6 13.1 11.5 10.5
26 48.3 45.6 41.9 38.9 35.6 30.4 25.3 20.8 17.3 15.4 13.8 12.2 11.2
27 49.6 47.0 43.2 40.1 36.7 31.5 26.3 21.7 18.1 16.2 14.6 12.9 11.8
28 51.0 48.3 44.5 41.3 37.9 32.6 27.3 22.7 18.9 16.9 15.3 13.6 12.5
29 52.3 49.6 45.7 42.6 39.1 33.7 28.3 23.6 19.8 17.7 16.0 14.3 13.1
30 53.7 50.9 47.0 43.8 40.3 34.8 29.3 24.5 20.6 18.5 16.8 15.0 13.8
40 66.8 63.7 59.3 55.8 51.8 45.6 39.3 33.7 29.1 26.5 24.4 22.2 20.7
50 79.5 76.2 71.4 67.5 63.2 56.3 49.3 42.9 37.7 34.8 32.4 29.7 28.0
60 92.0 88.4 83.3 79.1 74.4 67.0 59.3 52.3 46.3 43.2 40.5 37.5 35.5
70 104.2 100.4 95.0 90.5 85.5 77.6 69.3 61.7 55.3 51.7 48.8 45.5 43.3
80 116.3 112.3 106.6 101.9 96.6 88.1 79.3 71.1 64.3 60.4 57.2 53.5 51.2
90 128.3 124.1 118.1 113.1 107.6 98.6 89.3 80.6 73.3 69.1 65.6 61.8 59.2
100 140.2 135.8 129.6 124.3 118.5 109.1 99.3 90.1 82.4 77.9 74.2 70.1 67.3
BIO – STATISTICS AND RESEARCH METHODOLOGY 389
r z r z r z r z r z r z
.25 .26 .41 .42 .55 .62 .70 .87 .85 1.26 .950 1.83
.26 .27 .42 .44 .56 .63 .71 .89 .86 1.29 .955 1.89
.27 .28 .43 .45 .57 .65 .72 .91 .87 1.33 .960 1.95
.28 .29 .44 .46 .58 .66 .73 .93 .88 1.38 .965 2.01
.29 .30 .45 .47 .59 .68 .74 .95 .89 1.42 .970 2.09
.30 .31 .46 .48 .60 .69 .75 .97 .90 1.47 .975 2.18
.31 .32 .47 .50 .61 .71 .76 1.00 .905 1.50 .980 2.30
.32 .33 .48 .51 .62 .73 .77 1.02 .915 1.53 .985 2.44
.33 .34 .49 .52 .63 .74 .78 1.05 .920 1.56 .990 2.65
.34 .35 .50 .53 .64 .76 .79 1.07 .925 1.59 .995 2.99
.35 .37 .51 .54 .65 .78 .80 1.10 .930 1.62
.36 .38 .52 .55 .66 .79 .81 1.13 .930 1.66
.37 .39 .53 .56 .67 .81 .82 1.16 .935 1.70
.38 .40 .54 .58 .68 .83 .83 1.19 .940 1.78
.39 .41 .55 .60 .69 .85 .84 1.22 .945 1.74
BIO – STATISTICS AND RESEARCH METHODOLOGY 390
N 0.05 0.01
5 0.900 1.000
6 0.829 0.943
7 0.714 0.893
8 0.643 0.833
9 0.600 0.783
10 0.564 0.746
12 0.506 0.712
14 0.456 0.645
16 0.425 0.601
18 0.399 0.564
20 0.377 0.534
22 0.359 0.508
24 0.343 0.485
26 0.329 0.465
28 0.317 0.448
30 0.306 0.432
EXERCISE
BIO – STATISTICS AND RESEARCH METHODOLOGY 391
Treatment B 20 190
41. The table given below shows the data obtained during an
epidemic of cholera.
Attacked Non- Total
Attacked
Inoculated 31 469 500
Not 185 1315 1500
inoculated
No. of patients
Treatment Favorable response No response
New 60 20
Conventional 70 50
Protein intake
(Consumption No. of Families
Unit/Day gm)
15 – 25 20
25 – 35 10
35 – 45 90
45 – 55 80
55 – 65 70
65 – 75 20
75 – 85 30
Total 320
BIO – STATISTICS AND RESEARCH METHODOLOGY 399
54. A Card is drawn from a pack of 52 cards and then 2nd card is
drawn. What is probability that both cards drawn are Queens?
55. Calculate mean, median and mode from following data :-
2, 14, 13, 15, 18, 19, 16, 17, 19, 20
56. 10 Anemic patients were given Fer.Met. Medicine for one
month regularly. Their Hb values were recorded before and
BIO – STATISTICS AND RESEARCH METHODOLOGY 400
6 8
5 8
8 12
4 6
7 11
8 12
6 8
4 7
6 9
5 8
REFERENCES
BIO – STATISTICS AND RESEARCH METHODOLOGY 401
NOTES
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________
__________________________________________