Lecture Note On Biostatistics
Lecture Note On Biostatistics
Lecture note on
Biostatistics
Biostatistics is the science of data when the focus is on the biological and health sciences.
Summarization of data describing with graph, table and summary measure in statistical idea is called descriptive
statistics.
Drawing conclusion, decision or prediction about population from the sample evidence is inferential statistics.
Data – The numbers derived from measuring or counting/ the raw material of statistics.
Variable - The characteristics which can vary in focus may be people, place , material etc. E.g. height, sex. Age.
Types of variables
1. Qualitative variable - It measures the quality or characteristic of each experiment. eg. Eye color, Gender,
Degree. Presented by graphic. Category: Nominal and Ordinal
2. Quantitative variables – It measures the numerical quantity or the amount of each experiment. Presented
by Central tendency, Measurement of Dispersion.
Two classifications
1. Discrete (no. of children, no. of egg)
2. Continuous (Age, weight, height)
Measurement scales
1. Nominal: Categorized, naming or classifying observations into collectively exhaustive categories. E.g.
Gender (male/female), Status (No IHD, IHD).
2. Ordinal: Categorized and ranked according to criteria. E.g. Degree of pain, Socioeconomic status.
3. Interval: Categorized, ranked and has constant units but not true zero. Distance between two
measurements is known, not true zero (Arbitrary zero). E.g. Temperature degree, IQ level
4. Ratio: Categorized, ranked, constant units and has true zero i.e. zero means that absence of
characteristics. E.g. Height, weight.
In numerical data – measurement of central tendency and measurement of dispersion. Line chart, stem and leaf,
box and whisker can be used.
1
In categorical data – tables, frequency distribution. Histogram, frequency polygon, pie chart, bar graph can be
used.
Properties of mean
Properties of median
Trimmed mean
Estimators that are insensitive to outliers are called robust estimator. Another robust measure and estimator of
central tendency is the trimmed mean.
Range
Variance
Standard deviation
Standard error
Coefficient of variation
Percentile, quartiles
Interquartile range
Range
Variance
(For population)
Standard deviation: The square root of the variance. (Individual true variation). In journals, Mean ± SD , Median
± IQR
Formula
Krutosis
Kurtosis is a measure of the degree to which a distribution is “peaked” or flat in comparison to a normal
distribution whose graph is characterized by a bell-shaped appearance.
Characteristics of normal distribution
1. It is symmetrical about the mean (µ). The curve on either side of µ is mirror image of the other side.
2. The mean, median and mode are all equal.
3. The total area under the curve above X axis is one square unit.
4. 1SD from the mean in both directions, the area is 68.2%, 2SD and 3SD are 95.5% and 99.7%
respectively.
5. The normal distribution is determined by µ(shift axis) and σ degree of flatness). OR Mean shows height
and SD shows width of bell shape.
6. Tails extend indefinitely & never touch the base line.
Coefficient of Variation
If random sample of n observation are drawn from a non - normal population with finite µ & σ, then n is large (n
≥ 30) the sampling distribution of sample mean ̅ is approximately normal distributed. The approximation
become more accurate as n becomes large.
3
Probability
Probability is the likelihood of an event occur and it can be assumed a value between 0 and 1.
I. Subjective probability
Called personalistic (based on belief, experience, prejudices)
Does not rely on the repeatability
Not fully accepted by statistician
E.g. probability of major earthquake, probability of winning lottery.
II. Objective probability (Classical, Relative frequency probability)
Based on equally likely events and long run relative frequency of events.
Is the same for all observers (objective)
Not based on personal belief
E.g. toss a coin, pick a card.
Rules of probability
1. Additional rule
In mutually exclusive events,
P (E1 or E2) = P (E1 U E2) = P (E1) + P (E2)
In not mutually exclusive events,
P (E1 U E2) = P (E1) + P (E2) – P (E1 and E2)
2. Multiplicative rule
If two events are independent,
P (A and B) = P (A) × P (B)
If two events are not independent,
P (A and B) = P (A) × P (B/A) (OR) P (B)× P (A/B)
3. Complementary rule
P (A) = 1- P (not A)
4
Types of probability
1. Marginal probability: the probability of the marginal total is used as numerator and the total group as
the denominator. E.g. P(Male) , P (Eye glass wearing)
2. Joint probability: the probability of an event possessing two characteristics at the same time. E.g. P
(male and E+)
3. Conditional probability: the probability of an event occurring given that another event has occurred.
E.g. P(A/B) =No. of occurrence possessing A and B / marginal total B
Probability Distribution
Estimation
Estimation is the process entails calculating from the data of a sample, some statistic that is offered as an
approximation of the corresponding parameter of the population from which the sample was drawn.
A point estimate is a single numerical value used to estimate the corresponding population parameter.
̅=
An interval estimate consists of two numerical values defining a range of values that, with a specified degree of
confidence, we feel includes the parameter being estimated.
Sample population is the population from which one actually draws a sample.
Confidence Interval An interval estimate provides more information about a population characteristic than does
a point estimate. Such interval estimates are called confidence intervals.
̂ -d <p< ̂ +d
The estimate is a single computed value, but the estimator is the rule that tell us how to compute this value, or
estimate.
5
For example, ̅ = ∑ is an estimator of the population mean,. The single numerical value that results from
evaluating this formula is called an estimate of the parameter .
Margin of error
The maximum likely difference observed between sample mean (̅̅̅ and the population mean (µ) and also called
maximum error of the estimate.
It is denoted by (d).
.
√
The T Distribution
Hypothesis
- A hypothesis may be defined simply as a statement about one or more populations. (OR)
- An “educated guess” based on prior knowledge and observation
- Hypothesis testing is the process of making an inference or generalization on population parameters based
on the results of the study on samples.
6
- The research hypothesis is the conjecture or supposition that motivates the research. Statement about the
expected relationship of the variables.
- Statistical hypotheses are hypotheses that are stated in such a way that they may be evaluated by
appropriate statistical techniques. States there is no relationship between variables.
P Value
– The p-value is defined as the smallest value of α for which the null hypothesis can be rejected.
– If the p-value is less than or equal to α ,we reject the null hypothesis (p ≤ α)
– If the p-value is greater than α ,we do not reject the null hypothesis (p > α)
Chi square
Characteristics
Uses
7
Chi square test of independence
- Testing the null hypothesis that in the population the two criteria of classification are independent.
- A single sample drawn from a single population.
- Observation cross classified on the basis of two variables of interest.
- Calculating expected based on joint probability law.
Advantages
8
– Used for data ranking and classifications.
Disadvantages
– Waste of data
– Laborious when n is large.
Correlation
– Correlation analysis is used to measure strength of the association (linear relationship) between two
variables
o Correlation is only concerned with strength of the relationship
o No causal effect is implied with correlation
Types of coefficients
– (r) is to measure the precision of the linear relationship between two variables.
– It is a measure of how well the data fit a straight line.
9
Uses for correlations
• Prediction
• Validity
• Reliability
• Theory Verifications
Features of Correlation Coefficient, r
– Unit free
– Ranges between –1 and 1
– The closer to –1, the stronger the negative linear relationship
– The closer to 1, the stronger the positive linear relationship
– The closer to 0, the weaker any positive linear relationship
– If r > 0 we have a Positive correlation
– Plus sign means positive correlation. They tend to increase or decrease together.
– If r < 0 we have a Negative correlation
– Minus sign means negative correlation. One tends to increase as the other tends to decrease.
– If r = 0 we have No linear correlation
10
– Linearity: There may be some relationship between x and y even when there is no significant linear
correlation.
– Linearity
– Homogeneity or Restriction of Range
– Outliers
When to use
- Relationship is non-linear.
– This straight line is the linear regression line; it represents how, on average, a change in the X variable is
associated with a change in the Y variable.
– Regression is most often used to study things that can’t be studied in experimental procedures of quasi-
experimental studies.
– Obtain the equation for the line that best fits the sample data.
– Strength of the relationship and the usefulness of the equation for predicting and estimating.
– Satisfactorily to the linear model, use the equation obtained from the sample data to predict and to
estimate.
11
Interpretation of the Slope and the Intercept
– B (slope) is the estimated change in the average value of Y as a result of a one-unit change in X
– The coefficient of determination is the portion of the total variation in the dependent variable that is
explained by variation in the independent variable
– The coefficient of determination is also called r-squared and is denoted as r2
Note. 0 r 2 1
ANOVA
– Analysis of variance (ANOVA) is a technique whereby the total variation present in a set of data is
partitioned into two or more components. Associated with each of these components is a specific source
of variation, so that in the analysis it is possible to ascertain the magnitude of the contributions of each of
these sources to the total variation.
– used for two different purposes:
(1) to estimate and test hypotheses about population variances, and
(2) to estimate and test hypotheses about population means.
Assumptions
Hypothesis
Single mean
• Population is normally or approximately normally distributed with known or unknown variance (sample
size n may be small or large),
• Population is not normal with known or unknown variance (n is large i.e. n≥30).
Two means
Two proportion
Chi Square
– Variables are QUALITATIVE Data (e.g. sex, age group, intensity of exposure, etc.)
– For every cell in the table, the expected frequency cannot be equal to zero
– Not more than 20% of total number of cell in the table should have values <5
– If these are not met, the Fisher’s Exact Test should be used instead of the 2 test.
13
Correlation
(3) The joint distribution of X and Y is a normal distribution called the bivariate normal distribution.
ANOVA
VIVA QUOTES
14
Nominal (Just naming) Eg.
blood Group
Qualitative (Categorized)
By Characteristics
Discrete (Gap+)
Quantitative (Measurable)
Continuous (Gap-) Eg. Blood
glucose Level,Wt. Height
Nominal
Ordinal
By Measurement Scale
Variable
Interval
Ratio
Dependent (Outcome)
By Function
Independent (Exposure)
Univariate
Multivariate
Probability Concepts
Probability distribution
Relationship between probabilities and Random variable (d/t chance factor) summarized by mean.
75 2642.5
16
Mean = = = 35.23 Li = lower level of mean frequency group
SD = √ = 12.96
……………………………………………………………………………………………..
17
Binomial distribution
- is a probability distribution
- is derived from an experiment or a trial can result in only one of two mutually exclusive outcomes.
(Bernoulli trial)
- Probability of success = p
- no. of occurrence = x
- Combination formula =C
= n!/ x! (n-x)!
1) results are one or two possible, mutually exclusive, outcomes. One is denoted as a success and the other is
denoted as a failure
Mean (µ) = n p
Variance ( σ2) = n .p .q
F(X) is used in exactly data and P(X) is used when table apply.
Rule of Combinations
where:
0! = 1 (by definition)
18
Binomial Distribution Formula
p = probability of “success”
1. What is the probability of one success in five observations if the probability of success is .1?
X = 1, n = 5, and p = .1
n!
P( X 1) p X (1 p) n X
X !(n X )!
5!
(.1)1 (1 .1)51
1!(5 1)!
(5)(.1)(.9) 4
.32805
2. The data from the North Carolina State Center for Health Statistics (A-3) show that 14 percent of mothers
admitted to smoking one or more cigarettes per day during pregnancy. If a random sample of size 10 is
selected from this population, what is the probability that it will contain exactly four mothers who
admitted to smoking during pregnancy?
1
N= 10 x= 4 p= 14% = 0.14 P(4) = ? Use P(x) = n C x . (p) x . (q) n-x
ANS . 0.0326
3. Suppose it is known that 10 percent of a certain population is color blind. If a random sample of 25
people is drawn from this population, use Table B in the Appendix to find the probability that:
N = 25 , p= 10% = 0.1
(a) Five or fewer will be color blind. P ( x ≤5 ) = 0.9666 ( in ≤ use direct number)
(b) Six or more will be color blind. P ( x≥6 ) = 1- 0.9666 = 0. 0334( in ≥ , 1- lower number)
19
(d) Two, three, or four will be color blind.
P (2 ≥ x ≤4 ) = P ( x ≤ 4 ) – P ( x ≤ 1 ) = 0.8302
P ( X = x / n, p > .5) = P ( X = n - x, n, 1 – p)
P ( X ≤ x / n, p > .5) = P ( X ≥ n – x/ n, 1 – p)
P ( X ≥ x / n, p > .5) = P ( X ≤ n – x/ n, 1 – p)
4. Assuming that the probability of giving this answer to the question is .55 for any Massachusetts resident,
use Table B to find the probability that if 12 residents are chosen at random: p= .55 , n = 12
P ( X = x / n, p > .5) = P ( X = n - x, n, 1 – p)
P ( X ≤ x / n, p > .5) = P ( X ≥n – x/ n, 1 – p)
P (X ≥ x / n, p > .5) = P ( X ≤ n – x/ n, 1 – p)
EXERCISE 4.3.1
n= 20, p = .24
EXERCISE 4.3.4
N = 15, P =0.32
EXERCISE 4.3.7
20
N= 3, P = 0.19
(a) Exactly 0 =
P (0) = =0.5314 #
(b) P (1) = 0.374 #
(c) P (x > 1) = P (x=2) + P (x=3) = 0.0945 #
(d) P (x≤2) = P (x=0) + P (x=1) + P (x=2) = 0.9931 #
(e) P (x=2 or 3) = P (x≥ 2) = P (x=2) + P (x=3) = 0.0945 #
(f) P (x=3) = 0.0069 #
Poisson Model
𝒒 or P(x) =
Exercise 4.4.1
λ=4
Exercise 4.4.3
λ=5
(a) P (exactly 7) =
(b) P (x >10) = 1 – P (x = 9) = 1- 0.0968 = 0.032 #
(c) P (x= 0) = 0.007 #
(d) P (x <5) = 0.440 #
Exercise 4.4.4
λ = 0.5
Exercise 4.4.5
λ = 13
Exercise 5.3.1
µ = 204 σ= 44 N = 50
Use √
= 6.2225#
Exercise 5.3.3
If the uric acid values in normal adult males are approximately normally distributed with a mean and standard
deviation of 5.7 and 1 mg percent, respectively, find the probability that a sample of size 9 will yield a mean:
(a) Greater than 6 (b) Between 5 and 6 (c) Less than 5.2
µ = 5.7 σ = 1 N = 9
̅
Use Ẕ=
√
Exercise 5.4.1
Use these estimates as the mean m and standard deviation s for the respective U.S. populations. Suppose we select
a simple random sample of size 50 independently from each population. What is the probability that the
difference between sample means _xB _ _xA will be more than 8.
22
=183 = 189 = 37.2 = 34.7 n=50 * by given.
̅ ̅
= = 0.28 = 0.6103 in Table D
√ √
*TO READ THE SUM CAREFULLY AND NOTICE THE WORDS SUCH AS EQUAL MEANS,
VARIANCES*
Remember q = 1-p .
EXERCISE 5.5.1
Smith et al. [A-5] performed a retrospective analysis of data on 782 eligible patients admitted with myocardial
infarction to a 46-bed cardiac service facility. Of these patients, 248 (32 percent) reported a past myocardial
infarction. Use .32 as the population proportion. Suppose 50 subjects are chosen at random from the population.
What is the probability that over 40 percent would report previous myocardial infarctions?
N=50, p=0.32, ̂ q = (1-p) = 0.68 , np = 16 , nq = 34
̂
=0.8869
√
The variance , ̂ ̂
̂ ̂
23
EXERCISE 5.6.1
According to the 2000 U.S. Census Bureau [A-8], in 2000, 9.5 percent of children in the state of Ohio were not
covered by private or government health insurance. In the neighboring state of Pennsylvania, 4.9 percent of
children were not covered by health insurance. Assume that these proportions are parameters for the child
populations of the respective states. If a random sample of size 100 children is drawn from the Ohio population,
and an independent random sample of size 120 is drawn from the Pennsylvania population, what is the probability
that the samples would yield a difference, ^p1 - ^p2 of .09 or more?
̂ ̂
̂ ̂ =
̂ ̂
= (Table D)
√
√
EXERCISE 5.6.3
From the results of a survey conducted by the U.S. Bureau of Labor Statistics [A-9], it was estimated that 21
percent of workers employed in the Northeast participated in health care benefits programs that included vision
care. The percentage in the South was 13 percent. Assume these percentages are population parameters for the
respective U.S. regions. Suppose we select a simple random sample of size 120 northeastern workers and an
independent simple random sample of 130 southern workers. What is the probability that the difference between
sample proportions, ^p1_ ^p2, will be between .04 and .20?
̂ ̂= 0.00225
z2
1
f ( z) e 2
2
Note that
The cumulative probabilities P(Z z) are given in
tables for -3.49 < z < 3.49. Thus, P (-3.49 < Z < 3.49) 1.
For standard normal distribution, P (Z > 0) = P (Z < 0) = 0.5
24
Example 4.6.1:
Example 4.7.1:
The ‘Uptime ’is a custom-made light weight battery-operated activity monitor that records the amount of time an
individual spend the upright position. In a study of children ages 8 to 15 years. The researchers found that the
amount of time children spend in the upright position followed a normal distribution with
Mean of 5.4 hours and standard deviation of 1.3. Find If a child selected at random, then 1-The probability that
the child spends less than 3 hours in the upright position 24-hour period
-------------------------------------------------------------------------
2-The probability that the child spend more than 5 hours in the upright position 24-hour period
3-The probability that the child spends exactly 6.2 hours in the upright position 24-hour period
25
P (X = 6.2) = 0
4-The probability that the child spends from 4.5 to 7.3 hours in the upright position 24-hour period
P ( 4.5 < X < 7.3) = P ( < < ) = P( -0.69 < Z < 1.46 )
EXAMPLE 4.7.2
Diskin et al. (A-11) studied common breath metabolites such as ammonia, acetone, isoprene, ethanol, and
acetaldehyde in five subjects over a period of 30 days. Each day, breath samples were taken and analyzed in the
early morning on arrival at the laboratory. For subject A, a 27-year-old female, the ammonia concentration in
parts per billion (ppb)followed a normal distribution over 30 days with mean 491 and standard deviation
119.What is the probability that on a random day, the subject’s ammonia concentration is between 292 and 649
ppb?
EXERCISE 4.7.1
For another subject (a 29-year-old male) in the study by Diskin et al. (A-11), acetone levels were normally
distributed with a mean of 870 and a standard deviation of 211 ppb. Find the probability that on a given day the
subject’s acetone level is:
µ = 870, σ = 211,
= #
= = 0.62 = 0.7324 #
In the study of fingerprints, an important quantitative characteristic is the total ridge count for the
10 fingers of an individual. Suppose that the total ridge counts of individuals in a certain population
26
are approximately normally distributed with a mean of 140 and a standard deviation of 50. Find the
probability that an individual picked at random from this population will have a ridge count of:
EXERCISE 4.7.4***
Suppose the average length of stay in a chronic disease hospital of a certain type of patient is 60 days with
a standard deviation of 15. If it is reasonable to assume an approximately normal distribution of lengths of
stay, find the probability that a randomly selected patient from this group will have a length of stay:
(a) Greater than 50 days (b) Less than 30 days
(c) Between 30 and 60 days (d) Greater than 90 days
27
Estimation
Case 1
Population is N or ≈ N
is known (N large
Ẕ
or small)
N large Ẕ or t
is unknown
N small t
Case 2
Case 3
Two population are N or ≈ N
is known ( N1, N2
Ẕ
large or small)
N1, N2 large Ẕ or t
Case 4
If population is not normally distributed and n1, n2 are large (n1 ≥ 30 , n2≥ 30) and population variances
is known → z (Central Limit Theoram).
σ
XZ
Confidence Interval for μ (σ Known) n
28
Example 1.
A sample of 11 circuits from a large normal population has a mean resistance of 2.20 ohms. We know
from past testing that the population standard deviation is .35 ohms. Determine a 95% confidence
interval for the true mean resistance of the population.
Solution:
Interpretation
σ
X Z
n We are 95% confident that the true mean resistance
2.20 1.96 (.35/ 11) is between 1.9932 and 2.4068 ohms
2.20 .2068 Although the true mean may or may not be in this
(1.9932 , 2.4068) interval, 95% of intervals formed in this manner will
contain the true mean
If the population standard deviation σ is unknown, we can substitute the sample standard deviation, S
Assumptions
Population standard deviation is unknown. Population is normally distributed. If population is not normal,
use large sample. Use Student’s t Distribution or z distribution substitute the sample SD (s)
S
X t n-1
Confidence Interval Estimate: n
Example 2 A random sample of n = 25 has X = 50 and S = 8. Form a 95% confidence interval for μ. d.f. = n – 1
= 24, so
S 8
X t /2, n-1 50 (2.0639)
n 25
(46.698, 53.302)
Confidence Interval for the difference between two Population Means: (CI)
When the variance is known and the sample sizes is large or small, the C.I. has the form:
12 22 12 22
( x1 x2 ) Z 1 2 ( x1 x2 ) Z
1 n1 n2 1 n1 n2
2 2
29
When the variance is unknown and the sample sizes is large, the C.I. has the form;
12 22 12 22
( x1 x2 ) Z 1 2 ( x1 x2 ) Z
1 n1 n2 1 n1 n2
2 2
When variances are unknown but equal, and the sample size is small, the C.I. has the form:
S p2 S p2 S p2 S p2
( x1 x2 ) t 1 2 ( x1 x2 ) t
1 , ( n1 n2 2 ) n1 n2 1 , ( n1 n2 2 ) n1 n2
2 2
where
(n1 1) S12 (n2 1) S 22
S p2
n1 n2 2
When variances are unknown and not equal and the sample size is small, The CI has the form;
S12 S2 S12 S2
( x1 x2 ) t 2 1 2 ( x1 x2 ) t 2
1 , ( n1 n2 2 ) n1 n2 1 , ( n1 n2 2 ) n1 n2
2 2
where
w1t1 w2t 2
t
1
2 w1 w2
Example 3
The researcher team interested in the difference between serum uric acid level in a patient with and without
Down’s syndrome. In a large hospital for the treatment of the mentally retarded, a sample of 12 individuals with
Down’s Syndrome yielded a mean of mg/ 100 ml. In a general hospital a sample of 15 normal
individuals of the same age and sex were found to have a mean value of . If it is reasonable to assume
that the two population of values are normally distributed with variances equal to 1 and 1.5, find the 95% C.I for
μ1 - μ2
Solution:
Example 4
The purpose of the study was to determine the effectiveness of an integrated outpatient dual-diagnosis treatment
program for mentally ill subject. The authors were addressing the problem of substance abuse issues among
people with severe mental disorder. A retrospective chart review was carried out on 50 patients, the recherché was
interested in the number of inpatient treatment days for physics disorder during a year following the end of the
program. Among 18 patients with schizophrenia. The mean number of treatment days was 4.7 with standard
30
deviation of 9.3. For 10 subject with bipolar disorder, the mean number of treatment days was 8.8 with
standard deviation of 11.5. We wish to construct 95% C.I for the difference between the means of the
populations Represented by the two samples.
S p2 S p2
( x1 x2 ) t
1 ,( n1 n2 2 ) n1 n2
2
NOTES;
When the interval includes zero, the population means may be equal. When not includes zero, the population
means are different.
pˆ (1 pˆ )
pˆ Z1 / 2
n
where
Example 5
A random sample of 100 people shows that 25 are left-handed. Form a 95% confidence interval for the true
proportion of left-handers.
pˆ Z p̂(1 p̂)/n
25/100 1.96 .25(.75)/1 00 Interpretation
.25 1.96 (.0433) We are 95% confident that the true percentage of left-handers in the
(0.1651 , 0.3349) population is between
ˆ (1 P
P ˆ ) Pˆ (1 P
ˆ )
ˆ P
(P ˆ )Z 1 1
2 2
1 2
1 n1 n2
2
31
Example 6
Connor investigated gender differences in proactive and reactive aggression in a sample of 323 adults (68 females
and 255 males). In the sample ,31 of the female and 53 of the males were using internet in the internet café. We
wish to construct 99 % confident interval for the difference between the proportions of adults go to internet café
in the two sampled population.
Solution:
1-α =0.99 → α = 0.01 → α/2 =0.005 → 1- α/2 = 0.995, Z 1- α/2 = Z 0.995 =2.58 , nF=68, nM=255,
aF 31 aM 53
pˆ F 0.4559, pˆ M 0.2078
nF 68 nM 255
Exercises
Exercise 6.2.1
We wish to estimate the average number of heartbeats per minute for a certain population. The average number of
heartbeats per minute for a sample of 49 subjects was found to be 90. Assume that these 49 patients constitute a
random sample, and that the population is normally distributed with a standard deviation of 10.
n= 49, ̅ , σ = 10
̅
√
This interval may be interpreted from both the probabilistic and practical points of view. We are 90 percent
confident that the true population mean µ, is somewhere between 87.65 and 92.35 because, in repeated sampling,
90 percent of intervals constructed in like manner will include the population mean.
Calculate 95% CI for µ, use Ẕ value is 1.96 and 99% for 2.58.
Ans; 87.2, 92.8 for 95% and 86.32, 93.68 for 99%)
32
Exercise 6.3.3
Pedroletti et al. (A-3) reported the maximal nitric oxide diffusion rate in a sample of 15 asthmatic schoolchildren
and 15 controls as mean ± standard error of the mean. For asthmatic children, they reported 3:5 ± 0.4nL/s
(nanoliters per second) and for control subjects they reported 0:7 ± .1nL/s. For each group, determine the
following:
(a) What was the sample standard deviation?
(b) What is the 95 percent confidence interval for the mean maximal nitric oxide diffusion rate of the population?
(c) What assumptions are necessary for the validity of the confidence interval you constructed?
(d) What are the practical and probabilistic interpretations of the interval you constructed?
(e) Which interpretation would be more appropriate to use when discussing confidence intervals
with someone who has not had a course in statistics? State the reasons for your choice.
(f) If you were to construct a 90 percent confidence interval for the population mean from the information given
here, would the interval be wider or narrower than the 95 percent confidence interval? Explain your answer
without actually constructing the interval.
(g) If you were to construct a 99 percent confidence interval for the population mean from the
information given here, would the interval be wider or narrower than the 95 percent confidence
interval? Explain your answer without actually constructing the interval.
(f) Narrower
(g) Wider
Exercise 6.4.1
Iannelo et al. (A-8) performed a study that examined free fatty acid concentrations in 18 lean subjects and 11
obese subjects. The lean subjects had a mean level of 299 mEq/L with a standard error of the mean of 30, while
the obese subjects had a mean of 744 mEq/L with a standard error of the mean of 62.
ո 1 = 18, ̅ = 127.28
ո 2 = 11, ̅ , , = 205.63
33
12 22
( x1 x2 ) Z
1 n1 n2
̅ ̅ √ =√ = 68.876 , By the equation, 2
Exercise 6.4.7
Twenty-four experimental animals with vitamin D deficiency were divided equally into two groups. Group 1
received treatment consisting of a diet that provided vitamin D. The second group was not treated. At the end of
the experimental period, serum calcium determinations were made with the following results:
Treated group: ̅ = 11:1mg/100 ml; s = 1:5
Untreated group: ̅ = 7:8mg/100 ml; s = 2:0
Assume normally distributed populations with equal variances.
, ̅ , ̃ =2
= = 3.13 #
Exercise 6.4.9
The average length of stay of a sample of 20 patients discharged from a general hospital was 7 days with a
standard deviation of 2 days. A sample of 24 patients discharged from a chronic disease hospital had an average
length of stay of 36 days with a standard deviation of 10 days. Assume normally distributed populations with
unequal variances.
Supposing acute case is 1 and 2 is chronic cases.
where
w1t1 w2t 2
t
1
2 w1 w2 ,
= 0.2 , = 4.17 ,
34
For 95% CI = = -1= 19, = 2.0930 , = -1 = 23, = 2.0687 (using Table E)
= = 2.07#
Exercise 6.5.1
Luna et al. (A-14) studied patients who were mechanically ventilated in the intensive care unit of six hospitals in
Buenos Aires, Argentina. The researchers found that of 472 mechanically ventilated patients, 63 had clinical
evidence of ventilator-associated pneumonia (VAP). Construct a 95 percent confidence interval for the proportion
of all mechanically ventilated patients at these hospitals who may be expected to develop VAP.
…………………………………………………………………………………………………………….
35
Hypothesis Testing
Testing a hypothesis about the mean of a population
1. Data: determine variable, sample size (n), sample mean ( ̅ ) , population standard deviation or sample standard
deviation (s). In numerical data, we can know means and in categorical data, can know proportion.
• Case1: Population is normally or approximately normal distributed with known or unknown variance and
sample size n may be small or large.
• Case 2: Population is not normal with known or unknown variance may use z or t test.
• Case I : e.g. we want to test that the population mean is different from 50
• Case II : e.g. we want to test that the population mean is greater than 50
• Case III: e.g. we want to test that the population mean is less than 50
4.Test Statistics
36
5. Distribution of test statistic
If the null hypothesis is true and assumption are met, follow the standard normal distribution
6. Decision rule
If HA: μ ≠ μ0, Reject H 0 if Z >Z1-α/2 or Z< - Z1-α/2 (when use Z - test) Or
Reject H 0 if T >t1-α/2,n-1 or T< - t1-α/2,n-1 (when use T- test)
Note*In t table, if 95% 2 sided, look t 0.975 and 95% 1 sided look t 0.95. Due to lack of –. In t table, use
– for < cases*
8. Statistical decision
We are able to reject the null hypothesis when our calculated value is greater than or equal to the critical
value according to table.
9. Conclusion
We can conclude the HA according to our statistical decision.
Note* Depend on α or other way CI. Look table d of calculated Ẕ value. For upper tail 1-calculated value
and not for lower tail. In 2 sided case, calculated p value x 2 and not for 1 sided case*
Exercise 7.2.3
The purpose of a study by Luglie et al. (A-5) was to investigate the oral status of a group of patients diagnosed
with thalassemia major (TM). One of the outcome measures was the decayed, missing, and filled teeth index
(DMFT). In a sample of 18 patients the mean DMFT index value was 10.3 with a standard deviation of 7.3. Is this
sufficient evidence to allow us to conclude that the mean DMFT index is greater than 9.0 in a population of
similar subjects? Let α = 0.1
Solution
1. Data
Sample patients (n) = 18, The mean DMFT index ( ̅ ) = 10.3, Standard deviation (σ) = 7.3
α= 0.1
2. Assumptions
The sample 18 patients are assumed normally distributed and it has known variance.
3. Hypothesis
:µ≤9
:µ>9
37
4. Test statistics
Since we assume that the population is normally distributed, and since the population variance is known,
our test statistic is given by
̅
Rejection zone
0 1.29
7. Calculation of the test
̅
= = 0.76
√ √
8. Statistical decision
We are not able to reject the null hypothesis since our calculated value is not greater than 1.29.
9. Conclusion
The mean DMFT index is not greater than 9 in the population of similar subjects.
10. P value
The P value for this test is 0.2 or > 0.1.
Exercise 7.2.7
A sample of 25 freshman nursing students made a mean score of 77 on a test designed to measure attitude toward
the dying patient. The sample standard deviation was 10. Do these data provide sufficient evidence to indicate, at
the .05 level of significance, that the population mean is less than 80? What assumptions are necessary?
Solution
1. Data
Sample (n) = 25, mean score ( ̅ ) = 77, sample standard deviation (s) = 10, α= 0.05
2. Assumption
The sample size is small but assumed to be normally distributed.
3. Hypothesis
: µ ≥ 80 , : µ < 80
4. Test statistics
38
-1.7901 0
Exercise 7.2.17
Suppose it is known that the IQ scores of a certain population of adults are approximately normally distributed
with a standard deviation of 15. A simple random sample of 25 adults drawn from this population had a mean IQ
score of 105. On the basis of these data can we conclude that the mean IQ score for the population is not 100? Let
the probability of committing a type I error be .05.
Solution
1. Data
n = 25, standard deviation ( ̅ ,mean (µ) = 105, α = 0.05
2. Assumption
Population of adults are approximately normally distributed and variance is known.
3. Hypothesis
: µ = 100, ≠ 100
4. Test statistics
̅
√
5. Distribution of test statistic.
When the null hypothesis is true, the test statistic follows the standard normal distribution.
6. Decision rule
Let α = 0.05, since we have 2 sided test and so = 0.025 in each tail. The critical values of z are -1:96.
Reject H0 unless -1.96 > Ẕ computed > 1.96. The rejection and non-rejection regions are shown in Figure.
-1.96 0 + 1.96
7. Calculation of test statistics
̅
= = 1.67
√ √
39
8. Statistical decision - Do not reject the since 1.67 fall in non- rejection zone.
9. Conclusion – the mean of the population from which the sample may be 100.
10. P – value
P = 2 x 0.0475 = 0.095 or p value is > 0.05
Exercise 7.3.1
Subjects in a study by Dabonneville et al. (A-9) included a sample of 40 men who claimed to engage in a variety
of sports activities (multisport). The mean body mass index (BMI) for these men was 22.41 with a standard
deviation of 1.27. A sample of 24 male rugby players had a mean BMI of 27.75 with a standard deviation of 2.64.
Is there sufficient evidence for one to claim that, in general, rugby players have a higher BMI than the multisport
men? Let α = 0.01.
Solution
1. Data
Supposing, multisport is 1 and rugby player is 2
So, = 40, ̅ = 22.4, = 1.27
= 24, ̅ = 27.75, = 2.64
2. Assumption
The statistics were computed from two independent samples. Since the population variances are
unknown, we will use the sample variances in the calculation of the test statistic.
3. Hypothesis
: ≥ , : <
4. Test statistics
Since we have small sample and population variance is unknown,
(X1 - X 2 ) - ( 1 2 )0 S 2 (n1 1) S1 (n 2 1) S 2
2 2
T
n1 n2 2
p
S p2 S p2
n1 n2
5. Distribution of test statistic
When the null hypothesis is true, the test statistic is distributed approximately as the standard normal.
6. Decision rule
Let α = 0.01. This is a one-sided test with a critical value of T equal to - 2.388. Reject H0 if T computed ≤
- 2.388.
7. Calculation of test statistics
= 3.6,
(X - X 2 ) - ( 1 2 )0 = = -10.7
T 1
2 2 √
Sp Sp
n1 n2
8. Statistical decision
Reject since T = - 10.7 is in the rejection region.
9. Conclusion
By the calculation, rugby players have a higher BMI than the multisport men.
10. P value
P < 0.0001
Exercise 7.3.5
GarSc~ao and Cabrita (A-13) wanted to evaluate the community pharmacist’s capacity to positively influence the
results of antihypertensive drug therapy through a pharmaceutical care program in Portugal. Eighty-two subjects
with essential hypertension were randomly assigned to an intervention or a control group. The intervention group
40
received monthly monitoring by a research pharmacist to monitor blood pressure, assess adherence to treatment,
prevent, detect, and resolve drug-related problems, and encourage non-pharmacologic measures for blood
pressure control. The changes after 6 months in diastolic blood pressure (pre _ post, mm Hg) are given in the
following table for patients in each of the two groups.
On the basis of these data, what should the researcher conclude? Let α = 0.05.
Solution
1. Data
Supposing intervention group is 1 and control group is 2,
So, = 42, ̅̅̅ = 13.22, = 9.5
=42, ̅̅̅ = 6.44, = 6.8, α = 0.05
2. Assumption
Since the sample size is large and the population variances are unknown, we will use the sample variances
in the calculation of the test statistic.
3. Hypothesis
: = : ≠
4. Test statistics
Since we have large sample and population variance is unknown,
̅ ̅
√ √
6. Decision rule
Let α = 0.05, since we have 2 sided test and so = 0.025 in each tail. The critical values of z are -1:96.
Reject H0 unless -1.96 > Ẕ computed > 1.96.
8. Statistical Decision
Reject since 3.8 > 1.96.
9. Conclusion
The 2 means of the population are not equal.
41
10. P Value = 2 x 0.0001= 0.0002
Exercise 7.4.1
Ellen Davis Jones (A-15) studied the effects of reminiscence therapy for older women with
depression. She studied 15 women 60 years or older residing for 3 months or longer in an assisted living long-
term care facility. For this study, depression was measured by the Geriatric Depression Scale (GDS). Higher
scores indicate more severe depression symptoms. The participants received reminiscence therapy for long-term
care, which uses family photographs, scrapbooks, and personal memorabilia to stimulate memory and
conversation among group members. Pre-treatment and post treatment depression scores are given in the
following table. Can we conclude, based on these data, that subjects who participate in reminiscence therapy
experience, on average, a decline in GDS depression scores? Let α = 0.01
Pre–GDS: 12 10 16 2 12 18 11 16 16 10 14 21 9 19 20
Post–GDS: 11 10 11 3 9 13 8 14 16 10 12 22 9 16 18
Solution
1. Data
Pre–GDS: 12 10 16 2 12 18 11 16 16 10 14 21 9 19 20
Post–GDS: 11 10 11 3 9 13 8 14 16 10 12 22 9 16 18
= Postop – Preop = -1,0, -5, 1, -3, -5, -3, -2, 0, 0, -2, 1, 0, -3, -2,
= - 24
= 1, 0, 25, 1, 9, 25, 9, 4, 0, 0, 4, 1, 0, 9, 4
= 92
2. Assumption
The observed differences constitute a simple random sample from a normally distributed population
3. Hypothesis
: ≥0, <0
4. Test Statistics
6. Decision rule
Let α = 0.01. The critical value of t is 2.624. Reject H0 if computed t is less than or equal to the critical
value. The rejection and non-rejection regions are shown in Figure.
Rejection zone
Non – rejection zone
2.624
7. Calculation of test statistics
n = 15
̅= = - 1.6
= = 3.83
42
= - 3.2
√
8. Statistical decision
Reject the since – 3.2 is in the rejection zone.
9. Conclusion
We may conclude that that subjects who participate in reminiscence therapy decline in GDS depression
scores.
10. P value
P < 0.005
Exercise 7.4.3
The purpose of an investigation by Morley et al. (A-17) was to evaluate the analgesic effectiveness of a daily dose
of oral methadone in patients with chronic neuropathic pain syndromes. The researchers used a visual analogue
scale (0–100 mm, higher number indicates higher pain) ratings for maximum pain intensity over the course of the
day. Each subject took either 20 mg of methadone or a placebo each day for 5 days. Subjects did not know which
treatment they were taking. The following table gives the mean maximum pain intensity scores for the 5 days on
methadone and the 5 days on placebo. Do these data provide sufficient evidence, at the .05 level of significance,
to indicate that in general the maximum pain intensity is lower on days when methadone is taken?
Solution
1. Data
=- 27.4, 3.2, 0.4, -3.6, - 6.6, -13.4, -10.6, -6.4, -1.4,- 13.4, -26.6
= -105.8
= 750.76, 10.24, 0.16, 12.96, 43.56, 179.56, 112.36, 40.96, 1.96, 179.56, 707.56
= 2039.64
2. Assumption
The observed differences constitute a simple random sample from a normally distributed population
3. Hypothesis
: ≥0, <0
43
4. Test Statistics
6. Decision rule
Let α = 0.05. The critical value of t is -1.8125. Reject H0 if computed t is less than or equal to the critical
value. The rejection and non-rejection regions are shown in Figure.
-1.8125
7. Calculation of test statistics
n = 11
̅= = - 9.6
= = 102.2
= - 3.15
√
8. Statistical decision
Reject the since – 3.15 is in the rejection zone.
9. Conclusion
We may conclude that the patients who taken methadone has low pain intensity.
10. P value
P = 0.0007 so, 0.0005 < P < 0.05
Paired Comparison
The objective in paired comparisons tests is to eliminate a maximum number of sources of extraneous variation
by making the pairs similar with respect to as many variables as possible.
di is the difference between pairs of observations ̅ is the sample mean difference is the hypothesized
population mean difference, ̅ = n is the number of sample differences, and sd is the standard deviation of
√
44
The Use of Ẕ
If, in the analysis of paired data, the population variance of the differences is known, the appropriate test statistic
is
If the assumption of normally distributed di’s cannot be made, the central limit theorem may be employed if n is
large. In such cases, the test statistic is Ẕ Equation. with sd used to estimate when, as is generally the case. If
we do not use paired observations, we have 2n - 2 degrees of freedom available as compared to n - 1 when we use
the paired comparisons procedure.
Ellen Davis Jones (A-15) studied the effects of reminiscence therapy for older women with
depression. She studied 15 women 60 years or older residing for 3 months or longer in an assisted living long-
term care facility. For this study, depression was measured by the Geriatric Depression Scale (GDS). Higher
scores indicate more severe depression symptoms. The participants received reminiscence therapy for long-term
care, which uses family photographs, scrapbooks, and personal memorabilia to stimulate memory and
conversation among group members. Pre-treatment and post treatment depression scores are given in the
following table. Can we conclude, based on these data, that subjects who participate in reminiscence therapy
experience, on average, a decline in GDS depression scores? Let α = 0.01.
Solution
1. Data – the data consist of the reminiscence therapy for 15 older women before and after treatment.
4. Test statistics -
45
5. Distribution of test statistic - If the null hypothesis is true, the test statistic is distributed as Student’s t
with n - 1 degrees of freedom.
6. Decision rule - Let α = 0.01. The critical value of t is 2.624. Reject H0 if computed t is greater than or
equal to the critical value. The rejection and non-rejection regions are shown in Figure.
α= 0.01
2.624
= = 1.6
= = = 3.83
= = = 3.137 #
√
8. Statistical Decision
Reject the since 3.137 is in the rejection zone.
9. Conclusion - subjects who participate in reminiscence therapy are decreasing GDS.
10. P value – p < 0.005
Exercise 7.5.1
Jacquemyn et al. (A-21) conducted a survey among gynecologists-obstetricians in the Flanders region and
obtained 295 responses. Of those responding, 90 indicated that they had performed at least one cesarean section
on demand every year. Does this study provide sufficient evidence for us to conclude that less than 35 percent of
the gynecologists-obstetricians in the Flanders region perform at least one cesarean section on demand each year?
Let α = 0.05
Solution
1. Data – the data are obtained from 295 responses of which 90 indicated that they had performed at least
one cesarean section on demand every year. ̂ = = 0.305
46
2. Assumption - The study subjects may be treated as a simple random sample from a population of similar
subjects, and the sampling distribution of ^p is approximately normally distributed in accordance with the
central limit theorem.
3. Hypothesis
: p ≥ 0.35
: p < 0.35
̂
4. Test statistics - Ẕ = 𝒒
√
5. Distribution of test statistic. If the null hypothesis is true, the test statistic is approximately normally
distributed with a mean of zero.
6. Decision rule
Let α = 0.05. The critical value of Ẕ is 1.645. Reject H0 if the computed Ẕ is < 1.645.
7. Calculation of test statistics
̂
Ẕ= 𝒒
= = = - 1.67 #
√ √
8. Statistical decision
Reject since – 1.67 is less than – 1.645.
9. Conclusion - We can conclude that in the sampled population the proportion who are performed CS is
less than 35 percent.
10. P value – p = 0.0475 so it is less than 0.05.
47
48
Analysis of Variance (ANOVA)
Hypotheses of One-Way ANOVA
H 0 : μ1 μ 2 μ 3 μi
Does not mean that all population means are different (some pairs may be the same)
49
k nj
SST ( X ij X ) 2
j 1 i 1
SST ( X 11 X ) 2 ( X 12 X ) 2 ... ( X ij X ) 2
SST
MST
n 1
SST = Total sum of squares
k
SSA n j ( X j X ) 2 (Or) SSB = ( + + )-
j 1
SSA n1 ( x1 x ) 2 n2 ( x2 x ) 2 ... nk ( xk x ) 2
SSA
MSA / Mean Square Among = SSA/degrees of freedom
k 1
k = number of groups or populations
k nj
SSW ( X ij X j ) 2
j 1 i 1
=∑ -( + + )
SSW
MSW / Mean Square Within = SSW/degrees of freedom
nk
F Test Statistic
MSA
F
MSW
50
Degrees of freedom
Example 1
You want to see if three different golf clubs yield different distances. You randomly select five measurements
from trials on an automated driving machine for each club. At the .05 significance level, is there a difference in
mean distance?
OR
SSB = + + -
MSA
F
MSW
F= = 25.275.#
Example 2
N – k =27 (n = 30, k = 3)
51
SSW = SST - SSA = 65.4209
MSW 1 1
Critical Range qc
2 n j n j'
where:
qc = Value from Standardized Range Distribution with k and n - k degrees of freedom for the
desired level of
MSW = Mean Square Within
ni and nj = Sample sizes from groups j and j’
By Example 1,
2. Find the qc value from the appendix table H with k = 3 and (n – k) = (15 – 3) = 12 degrees of
freedom for the desired level of ( = .05 used here):
qc 3.77
4. Compare:
x1 x 2 23.2
x1 x 3 43.4 with Critical range (16.285)
x 2 x 3 20.2
5. All of the absolute mean differences are greater than critical range. Therefore there is a
significant difference between each pair of means at 5% level of significance.
………………………………………………………………………………………………………………
Two-Way ANOVA
Assumptions
Sources of Variation
Xijk = value of the kth observation of level i of factor A and level j of factor B
53
Two Factor ANOVA Equations
Total Variation:
r c n
SST ( X ijk X ) 2
i 1 j 1 k 1
Factor A Variation
r
SSA cn ( X i.. X ) 2
i 1
Factor B Variation:
c
SSB rn ( X . j . X ) 2
j 1
Interaction Variation
r c
SSAB n ( X ij . X i.. X . j . X ) 2
i 1 j 1
Grand Mean
r c n
X
i 1 j 1 k 1
ijk
X
rcn
c n
X
j 1 k 1
ijk
X i..
cn
r n
X ijk
X . j. i 1 k 1
rn
Mean of cell ij
54
n X ijk
X ij .
k 1 n
SSB
MSB Mean square factor B
c 1
SSAB
MSAB Mean square interactio n
(r 1)(c 1)
SSE
MSE Mean square error
rc(n'1)
55
Features of Two-Way ANOVA F Test
Exercise 8.3.3
A remotivation team in a psychiatric hospital conducted an experiment to compare five methods for remotivating
patients. Patients were grouped according to level of initial motivation. Patients in each group were randomly
assigned to the five methods. At the end of the experimental period the patients were evaluated by a team
composed of a psychiatrist, a psychologist, a nurse, and a social worker, none of whom was aware of the method
to which patients had been assigned. The team assigned each patient a composite score as a measure of his or her
level of motivation. The results were as follows:
Level of Remotivation method
initial A B C D E Total Mean
motivation (̅̅̅
Nil 58 68 60 68 64 318 63.6
Very low 62 70 65 80 69 346 69.2
Low 67 78 68 81 70 364 72.8
Average 70 81 70 89 74 384 76.8
Total 263 297 263 318 277 1418
Mean (̅̅̅ 64.25 74.25 65.75 79.5 69.25 70.6
56
Solution
The randomized complete block design is the appropriate design for this remotivation team.
Data
Assumption
Hypotheses
: = 0, : not all =0
Test statistic
Statistical decision
Since our computed variance ratio, 30.22, is greater than 3.26 , we reject the null hypothesis of no treatment
effects on the assumption that such a large V.R. reflects the fact that the two sample mean squares are not
estimating the same quantity.
ANOVA Table
57
Source SS d.f MS V.R
Treatments 632.8 4 158.2 30.22
Blocks 471.2 3 157.06
Residuals 62.8 12 5.233
Conclusion
Yes, the provided data were sufficient to indicate a difference in mean score among methods.
P value.
P < 0.01
………………………………………………………………………………………………………………
58
Chi-Square Tests
Test of Qualitative Variables
– One of the most frequently employed statistical technique
– Commonly utilized for the analysis of COUNT or FREQUENCY data
– The 2 value is a measure of the extent to which pairs of Observed and Expected frequencies agree.
– 2 value is small if “agreement” is HIGH, while if the “agreement” is low 2 is HIGH
– Direction of agreement, whether positive or negative, has no influence since the differences are squared.
Some formulae indicate squaring of absolute values.
– Chi-square assumes values between 0 and infinity.
– Chi-square distribution derived from normal distribution
– For analysis of count data or frequency data
– The Chi-square test statistic is
(Oi Ei ) 2
2
all cells Ei
where:
Oi = observed frequency in a particular cell
Ei = expected frequency in a particular cell if H0 is true
2 for the 2 x 2 case has 1 degree of freedom
Degree of freedom
59
Test for Goodness of Fit
– Use for chi square that test the significance of the distribution of a single variable.
– Uses sample data to test hypotheses about the shape or proportions of a population distribution
– Tests the fit of the proportions in the obtained sample with the hypothesized proportions of the population
– This test enables us to see how well does the assumed theoretical distribution (such as Binomial
distribution, Poission or normal distribution) fit to the observed data.
– The 2 test formula for goodness of fit is:
2 = ∑(o-e) 2 / e
– Chi-square distribution is positively skewed
– Degrees of freedom for Goodness of Fit Test (df = C – 1) . C is the number of categories
– If 2( calculated)> 2 (tabulated), with (k/n-1) d.f, then null hypothesis is rejected otherwise accepted.
– And if null hypothesis is accepted, then it can be concluded that the given distribution follows theoretical
distribution.
Example
Cranor and Christensen (A-1) conducted a study to assess short-term clinical, economic, and humanistic outcomes
of pharmaceutical care services for patients with diabetes in community pharmacies. For 47 of the subjects in the
study, cholesterol levels are summarized in Table 12.3.1. We wish to know whether these data provide sufficient
evidence to indicate that the sample did not come from a normally distributed population. Let α.05
Solution:
1. Data. See Table 12.3.1.
2. Assumptions. We assume that the sample available for analysis is a simple random sample.
3. Hypotheses
H0: In the population from which the sample was drawn, cholesterol levels are normally distributed.
HA: The sampled population is not normally distributed.
4. Test statistic. The test statistic is
2 = ∑(o-e) 2 / e
5. Distribution of test statistics
Approximately as chi-square with k - r degrees of freedom. The values of k and r will be determined
later.
6. Decision rule. We will reject H0 if the computed value of X2 is equal to or greater than the critical value
of chi-square.
60
7. Calculation of test statistic. Since the mean and variance of the hypothesized distribution are not
specified, the sample data must be used to estimate them.
̅ = 198.67
_
8. Statistical decision. When we compare X2 is 10.566 with values of x2 in Appendix Table F, we see that it
is less than x2. = 11.07 so that, at the .05 level of significance, we cannot reject the null hypothesis
that the sample came from a normally distributed population.
9. Conclusion. We conclude that in the sampled population, cholesterol levels may follow a normal
distribution.
10. p value. Since 11:070 > 10:566 > 9:236, .05 < p < .10.
Example
61
The flu season in southern Nevada for 2005–2006 ran from December to April, the coldest months of the year.
The Southern Nevada Health District reported the numbers of vaccine-preventable influenza cases shown in Table
12.3.9. We are interested in knowing whether the numbers of flu cases in the district are equally distributed
among the five flu season months. That is, we wish to know if flu cases follow a uniform distribution.
Solution:
1. Data. See Table 12.3.9.
2. Assumptions. We assume that the reported cases of flu constitute a simple random sample of cases of flu
that occurred in the district.
3. Hypotheses.
H0: Flu cases in southern Nevada are uniformly distributed over the five flu season months.
HA: Flu cases in southern Nevada are not uniformly distributed over the five flu season months.
Let α = .01.
4. Test statistic.
The test statistic is
(Oi Ei ) 2
2 Ei
all cells
8. Statistical decision. Since 97.15, the computed value of X2, is greater than 13.277, we reject, based on
these data, the null hypothesis of a uniform distribution of flu cases during the flu season in southern
Nevada.
9. Conclusion. We conclude that the occurrence of flu cases does not follow a uniform distribution.
10. p value. 0.0001
62
Test of Independence
– Testing the null hypothesis that in the population the two criteria of classification are independent
– A single sample drawn from a single population
– Observations cross-classified on the basis of two variables of interest
– Calculating Ei based on joint probability law
Calculating Expected Frequencies
The null hypothesis here is that the two criteria are independent.
Then,
If A and B are independent,
– P(A and B) = P(A) x P(B)
– The expected frequency = P x total number
– P(A) and P(B) here are marginal probabilities.
– For rows, marginal probability P(r) = row total/grand total
– For columns, marginal probability P(c) = column total/grand total
nad bc
2
2
a c b d a b c d
Example
Observed
Leptospira+ 31 91 122
63
Expected
Test of Homogeneity
– Two or more populations are identified in advance
– An independent sample drawn from each
– Sampled observations placed in appropriate categories of variables of interest
– There is researcher’s manipulation about marginal probabilities (one set of marginal totals is
fixed)
Example
Narcolepsy is a disease involving disturbances of the sleep–wake cycle. Members of the German Migraine and
Headache Society (A-8) studied the relationship between migraine headaches in 96 subjects diagnosed with
narcolepsy and 96 healthy controls. The results are shown in Table 12.5.2.We wish to know if we may conclude,
on the basis of these data, that the narcolepsy population and healthy populations represented by the samples are
not homogeneous with respect to migraine frequency.
64
Solution:
Data. See Table 12.5.2.
Assumptions. We assume that we have a simple random sample from each of the two populations of interest.
Hypotheses.
H0: The two populations are homogeneous with respect to migraine frequency.
HA: The two populations are not homogeneous with respect to migraine frequency.
Let α = .05
Test statistic
The test statistic is
(Oi Ei ) 2
2 Ei
all cells
χ 2 = 0.126
Statistical decision.
Since .126 is less than the critical value of 3.841, we are unable to reject the null hypothesis.
Conclusion.
We conclude that the two populations may be homogeneous with respect to migraine frequency.
p value.
0.05 < p < 0.1
65
Exercise 12.5.1
Refer to the study by Carter et al. [A-9], who investigated the effect of age at onset of bipolar disorder on the
course of the illness. One of the variables studied was subjects’ family history. Table 3.4.1 shows the frequency of
a family history of mood disorders in the two groups of interest: early age at onset (18 years or younger) and later
age at onset (later than 18 years). Can we conclude on the basis of these data that subjects 18 or younger differ
from subjects older than 18 with respect to family histories of mood disorders? Let α = .05.
Data in table
Assumption = example 1.
Hypotheses
H0 is the two population are homogenous
HA is the two population are not homogenous
Test statistics
Distribution of test statistics
d.f = 3
Decision rule
Reject H0 computed χ2 value is ≥ 7.815.
Calculation of teat statistics
Χ2 = 3.622
Statistical decision
Conclusion
P value > 0.1
Exercise 12.5.3
Swor et al. (A-11) examined the effectiveness of cardiopulmonary resuscitation (CPR) training in people over 55
years of age. They compared the skill retention rates of subjects in this age group who completed a course in
traditional CPR instruction with those who received chest-compression–only cardiopulmonary resuscitation (CC-
CPR). Independent groups were tested 3 months after training. Among the 27 subjects receiving traditional CPR,
12 were rated as competent. In the CC-CPR group, 15 out of 29 were rated competent. Do these data provide
sufficient evidence for us to conclude that the two populations are not homogeneous with respect to competency
rating 3 months after training? Let α = .05.
66
Solution
Competent Not competent Total
Traditional CPR 12 15 27
CC – CPR 15 14 29
Total 27 29 56
Exercise 12.5.5
In a simple random sample of 250 industrial workers with cancer, researchers found that 102 had worked at jobs
classified as “high exposure” with respect to suspected cancer-causing agents. Of the remainder, 84 had worked at
“moderate exposure” jobs, and 64 had experienced no known exposure because of their jobs. In an independent
simple random sample of 250 industrial workers from the same area who had no history of cancer, 31 worked in
“high exposure” jobs, 60 worked in “moderate exposure” jobs, and 159 worked in jobs involving no known
exposure to suspected cancer causing agents. Does it appear from these data that persons working in jobs that
expose them to suspected cancer-causing agents have an increased risk of contracting cancer? Let α = .05.
Solution
Cancer No cancer Total
High exposure 102 31 133
Moderate exposure 84 60 144
No known exposure 64 157 223
Total 250 250 500
67
Fisher Exact Test (Exact Probability Test)
– For 2 x 2 tables
– n < 20 (OR) 20 < n < 40 with any expected frequency < 5 (OR)
– Any cell with expected frequency < 1
– A>B then a/A > b/B
Hypotheses The following are the null hypotheses that may be tested and their alternatives.
1. (Two-sided)
H0: The proportion with the characteristic of interest is the same in both populations; that is, p1 = p2.
HA: The proportion with the characteristic of interest is not the same in both populations; p1≠ p2.
2. (One-sided)
H0: The proportion with the characteristic of interest in population 1 is less than or the same as the proportion in
population2. p1 ≤ p2.
HA: The proportion with the characteristic of interest is greater in population 1 than in population 2; p1 > p2.
Test Statistic The test statistic is b, the number in sample 2 with the characteristic of interest.
……………………………………………………………………………………………………………
68
Correlation
69
N XY ( X )( Y )
b
N X 2 ( X )2
a Y bX
Example
A real estate agent wishes to examine the relationship between the selling price of a home and its size (measured
in square feet). A random sample of 10 houses is selected
Dependent variable (Y) = house price in $1000s
Independent variable (X) = square feet
By the calculator, y = a + bx
house price 98.24833 0.10977 (square feet)
70
Measures of Variation or Deviation
SST = total sum of squares (Measures the variation of the Yi values around their mean Y)
SSR = regression sum of squares (Explained variation attributable to the relationship between X and Y)
SSE = error sum of squares (Variation attributable to factors other than the relationship between X and Y)
Coefficient of Determination, r2
SSR regression sum of squares
r2
SST total sum of squares
Inference about the Slope: t Test
t test for a population slope - Is there a linear relationship between X and Y?
Null and alternative hypotheses
H0: β = 0 (no linear relationship)
HA: β ¹ 0 (linear relationship does exist)
Test statistic –
b β
t (d.f = n – 2)
Sb
where:
b = regression slope coefficient
β = hypothesized slope
Sb = standard error of the slope
71
F-Test for Significance
MSR
F
MSE
SSR
MSR
1
SSE
MSE
n2
........................................................................................................................................................................
72
Type of outcome variable determines choice of statistical tests.
Type of outcome Example of outcome variable Type of statistics
Univariate
Bivariate analysis
Dichotomous Death, cancer, intensive care unit admission Binary logistic regression
Rare outcomes Time to rare cancer, number of urinary track infections Poisson regression
and counts
73