Biostate Simple
Dr.@mit
STATISTICS is the science of compiling, classifying,
and tabulating numerical data and expressing the
results in mathematical and graphical form.
• Variable
– A characteristic that takes different values for
different persons, places or things, such as
height, weight or blood pressure
• Parameter
– A constant that describes a population, e.g. 40% of
the students in a college are girls. This describes the
population, hence it is a parameter.
• Statistic
– A constant that describes a sample, e.g. in a sample
of 200 students from the same college, 45% are girls. This
45% is a statistic, as it describes the sample.
• Attribute
– A characteristic by which the population can
be divided into categories or classes, e.g. gender,
caste, religion
WHAT IS STATISTICS?
• The following essential features of statistics are evident
from various definitions of statistics:
• Biostatistics converts facts into figures and, at the end,
translates the figures back into facts by interpreting the
significance of the results.
• Epidemiology and biostatistics both follow this
facts-figures-facts sequence.
QUANTITATIVE METHODOLOGY
USES OF BIOSTATISTICS
1. To test whether the difference between two populations is real or
a chance occurrence.
PRIMARY: Data obtained by the investigator himself.
SECONDARY: Data that have already been recorded, e.g. hospital records.
Primary data can be obtained using any
one of the following methods:
Two methods
• EXAMPLE
• Number of decayed teeth in 10 children
• 2,2,4,1,3,0,10,2,3,8
• Mean = 35 / 10 = 3.5
• Median: the sorted values are 0,1,2,2,2,3,3,4,8,10, so median = (2 + 3) / 2
• = 2.5
• Mode = 2 (occurs 3 times)
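The three measures for the ten counts listed above can be checked with Python's standard `statistics` module. This is only a small verification sketch; the variable names are illustrative and not part of the slides.

```python
import statistics

# Number of decayed teeth in 10 children (data from the example above)
decayed_teeth = [2, 2, 4, 1, 3, 0, 10, 2, 3, 8]

mean_value = statistics.mean(decayed_teeth)      # sum of values / number of values
median_value = statistics.median(decayed_teeth)  # average of the two middle sorted values
mode_value = statistics.mode(decayed_teeth)      # most frequently occurring value

print("Sum    =", sum(decayed_teeth))
print("Mean   =", mean_value)
print("Median =", median_value)
print("Mode   =", mode_value)
```

With an even number of observations, `statistics.median` averages the two middle values of the sorted list, which is the same rule used in the example.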
Types of variability
• The standard error of difference is 6 whereas the observed difference (24.4 - 16.2)
was 8.2.
• In other words the observed difference between the two groups is less than twice
the S.E. of difference, i.e., 2 x 6.
• There was no strong evidence of any difference between the efficacy of the two
vaccines. Therefore, the observed difference could easily be due to chance.
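The comparison above can be written out as a few lines of arithmetic. This is a minimal sketch using only the figures quoted in the text; the variable names are assumptions, and the "twice the S.E." threshold is the rule of thumb stated above.

```python
# Figures quoted in the text (names are illustrative)
group_a_value = 24.4
group_b_value = 16.2
se_of_difference = 6.0

observed_difference = group_a_value - group_b_value
ratio = observed_difference / se_of_difference

# Rule of thumb from the text: the difference is taken as significant
# only if it is at least twice the standard error of difference.
threshold = 2 * se_of_difference
is_significant = abs(observed_difference) >= threshold

print(f"Observed difference = {observed_difference:.1f}")
print(f"Twice the S.E.      = {threshold:.1f}")
print(f"Difference / S.E.   = {ratio:.2f}")
print("Significant" if is_significant else "Could easily be due to chance")
```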
• A null hypothesis, or hypothesis of no difference
(H0), asserts that there is no real difference between
the sample and the population in the particular matter
under consideration, and that the difference found is
accidental and arises from sampling variation.
ii. Zone of rejection: If the result of a sample falls in the shaded area, i.e. beyond
mean ± 1.96 SE, it is significantly different from the universe value. Hence, the H0 of no
difference is rejected and the alternative hypothesis H1 is accepted. This shaded area is
therefore called the zone of rejection for the null hypothesis.
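The rule "beyond mean ± 1.96 SE" can be sketched as a small check. The function name and the numbers in the usage lines are hypothetical and chosen only for illustration; they do not come from the slides.

```python
def in_zone_of_rejection(sample_value, universe_mean, standard_error, z=1.96):
    """Return True if sample_value lies beyond universe_mean ± z * standard_error."""
    lower_limit = universe_mean - z * standard_error
    upper_limit = universe_mean + z * standard_error
    return sample_value < lower_limit or sample_value > upper_limit


# Hypothetical figures: universe mean 100, standard error 5,
# so the limits are 100 ± 9.8, i.e. 90.2 and 109.8.
reject_far = in_zone_of_rejection(112, universe_mean=100, standard_error=5)
reject_near = in_zone_of_rejection(104, universe_mean=100, standard_error=5)

print("112 falls in zone of rejection:", reject_far)
print("104 falls in zone of rejection:", reject_near)
```

A result of `True` corresponds to rejecting H0; `False` means the sample result lies within the limits and H0 is not rejected.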
• Degree of freedom:
Defined as the number of independent members in
the sample.
EXAMPLE:
(X + Y + Z)/3 = 5
Out of the 3 values, we can choose only 2
freely; the choice of the third depends upon
the fact that the total of the three values must be
15.
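The same idea in a few lines of Python: once the mean is fixed, two values may be picked freely and the third is forced. The two chosen values below are arbitrary, picked purely for illustration.

```python
required_mean = 5
n_values = 3
required_total = required_mean * n_values  # the three values must add up to this

# Two values can be chosen freely (arbitrary choices for illustration)
x = 4
y = 7

# The third value is no longer free: it is fixed by the required total
z = required_total - x - y

degrees_of_freedom = n_values - 1

print("Required total     =", required_total)
print("Forced third value =", z)
print("Degrees of freedom =", degrees_of_freedom)
```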
SIGNIFICANCE OF DIFFERENCE BETWEEN MEANS OF
SMALL SAMPLES BY STUDENT’S t-TEST
• Small samples, or their Z values, do not follow the normal
distribution as large ones do.
Criteria for applying t-test
• 1. Random samples
• 2. Quantitative data
• 3. Variable normally distributed
Unpaired t test
• Test of Proportions
• As an alternative test for finding the significance of the
difference between two or more proportions.
• Test of Association
• The test of association between two events in
binomial or multinomial samples is the most
important application of the test in statistical
methods. It measures the probability of association
between two discrete attributes.
• Two events can often be studied for their
association such as smoking and cancer, treatment
and outcome of a disease, vaccination and
immunity, nutrition and intelligence, etc.
• Test of Goodness of Fit
• Chi-square (χ2) test is also applied as a test of
“goodness of fit”, to determine if actual
numbers are similar to the expected or
theoretical numbers—goodness of fit to a
theory.
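Both the test of association and the test of goodness of fit compare observed numbers with expected numbers. The sketch below assumes the standard Pearson statistic, the sum of (observed - expected)² / expected, which the slides do not spell out. All counts are hypothetical, the function names are illustrative, and 3.84 is the usual tabulated 5% critical value for 1 degree of freedom.

```python
def chi_square_statistic(observed, expected):
    """Pearson chi-square: sum of (O - E)^2 / E over all cells."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def expected_counts(table):
    """Expected cell counts under no association: row total * column total / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


CRITICAL_VALUE_5_PERCENT_1_DF = 3.84  # tabulated chi-square value, 1 degree of freedom

# Goodness of fit (hypothetical counts): theory expects a 25:25 split
observed_fit = [30, 20]
expected_fit = [25, 25]
fit_chi2 = chi_square_statistic(observed_fit, expected_fit)

# Association (hypothetical 2 x 2 table, e.g. rows = exposed / not exposed,
# columns = outcome present / outcome absent)
table = [[20, 30],
         [30, 20]]
expected_table = expected_counts(table)
flat_observed = [cell for row in table for cell in row]
flat_expected = [cell for row in expected_table for cell in row]
association_chi2 = chi_square_statistic(flat_observed, flat_expected)

print("Goodness of fit chi-square =", fit_chi2,
      "| departs from theory:", fit_chi2 > CRITICAL_VALUE_5_PERCENT_1_DF)
print("Association chi-square     =", association_chi2,
      "| significant association:", association_chi2 > CRITICAL_VALUE_5_PERCENT_1_DF)
```

For a 2 x 2 table the degrees of freedom are (rows - 1) x (columns - 1) = 1, and for a two-category goodness-of-fit comparison they are also 1, which is why the same critical value is used for both checks here.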
Analysis of Variance (ANOVA) Test