Unit IV Update
Unit IV Update
Where represents the estimated standard error of the mean; n equals the sample size;
and s has been defined as
where s is the sample standard deviation; df refers to the degrees of freedom; and
where X represents the sample mean; tconf represents a number (distributed with
n – 1 degrees of freedom) from the t tables, which satisfies the confidence
specifications for the confidence interval; represents the estimated standard
error of the mean.
where
it is the new standard error, are the two population
variances, and n1 and n2 are the two sample sizes.
12. List the steps for calculating t – ratio for two population means or two
independent samples.
Panel I
Requiring the most computational effort, this panel produces values for the two
sample means, X1 and X2, and for the two sample sums of squares, SS1 and
SS2,
Panel IV
Finally, dividing the difference between the two sample means, ,
Panel II
Dividing the sample standard deviation, sD, by the square root of its
Panel III
Finally, dividing the difference between the sample mean, D, and the
null hypothesized value, Dhyp(of zero), by the estimated standard
error, , culminates in the value for the t ratio.
F ratio.
A test statistic has an F-distribution under the null hypothesis is known as an F test.
It is used to compare the statistical models as per the data set available.
Example : If a researcher wants to test whether or not two independent samples have been drawn
from a normal population with the same variability, then he generally employs the F-test.
PART B
Figure 4.2
Hypothesized sampling distribution of t (gas mileage investigation).
Problem 4.1
Find the critical t values for the following hypothesis tests:
(a) two-tailed test, α = .05, df = 12
(b) one-tailed test, lower tail critical, α = .01, df = 19
(c) one-tailed test, upper tail critical, α = .05, df = 38
(d) two-tailed test, α = .01, df = 48
10
t TEST
t Ratio
A replacement for the z ratio whenever the unknown
population standard deviation must be estimated.
11
Problem 4.2
A consumers’ group randomly samples 10 “one-pound”
packages of ground beef sold by a supermarket. Calculate
(a) the mean and (b) the estimated standard error of the mean
for this sample, given the following weights in ounces: 16,
15, 14, 15, 14, 15, 16, 14, 14, 14.
Panel II
Dividing the sample standard deviation, s, by the square root of
the sample size, n, gives the value for the estimated standard
error.
Panel III
Finally, dividing the difference between the sample mean, X,
12
13
Problem 4.3
The consumers’ group suspects that a supermarket makes extra
money by supplying less than the specified weight of 16 ounces in
its “one- pound” packages of ground beef. Given that a random
sample of 10 packages yields a mean of 14.7 ounces and an
estimated standard error of the mean of 0.26 ounce, use the
customary step-by-step procedure to test the null hypothesis at the
.05 level of significance with t.
14
2. Discuss in detail about t-test for two independent samples using the
case study – EPO Experiment.
TWO INDEPENDENT SAMPLES
Observations in each sample are based on different (and unmatched)
subjects.
When samples are independent, observations in one sample are not
paired, on a one-to-one basis, with observations in the other sample.
Sampling Distribution of
Differences between sample means based on all possible pairs of
random samples from two underlying populations.
It represents the entire spectrum of differences between sample means
based on all possible pairs of random samples from the two underlying
populations.
15
Expressed in symbols,
Example 4.5
Find the critical t values for each of the following hypothesis tests:
(a) two-tailed test; α = .05; n1 = 12; n2 = 11
(b) one-tailed test, upper tail critical; α = .05; n1 = 15; n2 = 13
(c) one-tailed test, lower tail critical; α = .01; n1 = n2 = 25
(d) two-tailed test; α = .01; n1 = 8; n2 = 10
Panel I
Requiring the most computational effort, this panel produces values for the
two sample means, X1 and X2, and for the two sample sums of squares,
SS1 and SS2, where
Panel II
Panel III
Panel IV
Finally, dividing the difference between the two sample means,
, and the null hypothesized population mean difference,
(of zero) by the estimated standard error, , generates a
value for the t ratio.
17
p- value
The p-value for a test result represents the degree of rarity of that
result, Given that the null hypothesis is true.
Smaller p-values tend to discredit the null hypothesis and to support
the research hypothesis.
The p-value represents the proportion of area, beyond the observed
result, in the tail of the sampling distribution.
In the left panel of Figure 4.3, a relatively deviant (from zero)
observed t is associated with a small p-value that makes the null
hypothesis suspect, while in the right panel, a relatively non-deviant
observed t is associated with a large p-value that does not make the
null hypothesis suspect.
18
Specified after the test result has been observed, a p-value describes the most
impressive degree of rarity actually attained by the test result.
Example 4.7
Find the approximate p-value for each of the following test results:
(a) one-tailed test, upper tail critical; df = 12; t = 4.61
(b) one-tailed test, lower tail critical; df = 19; t = –2.41
(c) two-tailed test; df = 15; t = 3.76
(d) two-tailed test; df = 42; t = 1.305
(e) one-tailed test, upper tail critical; df = 11; t = –4.23 (Be careful!)
Example 4.8
Indicate which member of each of the following pairs of p-values
describes the more rare test result:
(a1) p > .05 (a2) p < .05
(b1) p < .001 (b2) p < .01
(c1) p < .05 (c2) p < .01
(d1) p < .10 (d2) p < .20
(e1) p = .04 (e2) p = .02
a2, b1, c2, d1, e2
Example 4.9
Treating each of the p-values in the previous exercise separately,
indicate those that would cause you to reject the null hypothesis at
the .05 level of significance.
a2, b1, b2, c1, c2, e1, e2
19
20
5. Discuss in detail about t-test for two related samples with case study.
t TEST
The null hypothesis for two related samples can be tested with a t ratio.
Panel I involves most of the computational labour, and it generates values for the
sample mean difference, D, and the sample standard deviation for the difference scores,
sD.
and then, after dividing the sum of squares, SSD, by its degrees of freedom,
n − 1, extract the square root, that is,
21
Panel II
Dividing the sample standard deviation, sD, by the square root of its sample size, n,
gives the estimated standard error, , that is,
Panel III
22
Example 4.10
An investigator tests a claim that vitamin C reduces the severity of
common colds. To eliminate the variability due to different family
environments, pairs of children from the same family are randomly
assigned to either a treatment group that receives vitamin C or a
control group that receives fake vitamin C. Each child estimates, on
a 10-point scale, the severity of their colds during the school year.
The following scores are obtained for ten pairs of children:
23
24
6. What is f – test? Discuss in detail about the purpose of f-test with the
case study?
In the two-sample case, t reflects the ratio between the observed
difference between the two sample means in the numerator and the
estimated standard error in the denominator.
For three or more samples, the null hypothesis is tested with a new ratio,
the F ratio.
Essentially, F reflects the ratio of the observed differences between all
sample means (measured as variability between groups) in the numerator
and the estimated error term or pooled variance estimate (measured as
variability within groups) in the denominator term, that is,
Except for chance, estimates in both the numerator and the denominator
are similar, and generally, F varies about a value of 1.
If Null Hypothesis Is False
If the null hypothesis is false (because there is a treatment effect due to
different sleep deprivation periods), both estimates still would reflect
25
random error, but the estimate for between groups would also reflect the
treatment effect. In this case,
26
Example 4.11
If the null hypothesis is true, both the numerator and denominator
of the F ratio would reflect only (a) . If the null hypothesis is false, the
numerator of the F ratio would also reflect the (b). If the null
hypothesis is false because of a large treatment effect, the value of F
would tend to be considerably larger than (c).
Example 4.12
Find the critical values for the following F tests:
(a) α = .05, dfbetween = 1, dfwithin = 18
(b) α = .01, dfbetween = 3, dfwithin = 56
(c) α = .05, dfbetween = 2, dfwithin = 36
(d) α = .05, dfbetween = 4, dfwithin = 95
Example 4.13
Find the approximate p-value for the following observed F ratios,
where the numbers in parentheses refer to the degrees of freedom
in the numerator and denominator, respectively.
(a) F (2, 11) = 4.56
(b) F (1, 13) = 11.25
(c) F (3, 20) = 2.92
(d) F (2, 29) = 3.66
27
o When data are quantitative, an overall test of the null hypothesis for more than
two population means is known as analysis of variance.
o An overall test of the null hypothesis for more than two population means.
One-Factor ANOVA
o The simplest type of ANOVA that tests for differences among population means
categorized by only one independent variable.
Table shows two fictitious experimental outcomes that, when analysed with ANOVA,
produce different decisions about the null hypothesis: It is retained for one outcome but
rejected for the other.
TWO SOURCES OF VARIABILITY
Differences between Group Means
Differences of 5, 6, and 4 appear between group means in Outcome A, and these
relatively small differences might reflect only chance.
28
Even though the null hypothesis is true (because sleep deprivation does not affect the
subjects’ aggression scores), group means tend to differ merely because of chance
sampling variability.
It’s reasonable to expect, therefore, that the null hypothesis for Outcome A should not
be rejected.
There appears to be a lack of evidence that sleep deprivation affects the subjects’
aggression scores in Outcome A.
On the other hand, differences of 2, 5, and 8 appear between the group means for
Outcome B, and these relatively large differences might not be attributable to chance.
Instead, they indicate that the null hypothesis probably is false (because sleep
deprivation affects the subjects’ aggression scores). It’s reasonable to expect,
therefore, that the null hypothesis for Outcome B should be rejected.
There appears to be evidence of a treatment effect, that is, the existence of at least one
difference between the population means defined by the independent variable (sleep
deprivation).
Two-Factor ANOVA
A more complex type of analysis that tests whether differences exist among population
means categorized by two factors or independent variables.
Example
For computational simplicity, assume that the social psychologist randomly
o assigns two subjects to be tested (one at a time) with crowds of either zero, two,
or four people and either the nondangerous or dangerous conditions.
The resulting six groups, each consisting of two subjects, represent all possible
combinations of the two factors.*
29
o Finally, the one mean for all three column means—or for both row means—
yields the overall or grand mean (12) for all subjects in the study.
Main Effect
The effect of a single factor when any other factor is ignored.
Graphs for Main Effects
The slanted line in panel A of Figure 4.4 depicts the large differences between column
means, that is, between mean reaction times for subjects, regardless of degree of danger,
with crowds of zero, two, and four people.
The relatively steep slant of this line suggests that the null hypothesis for crowd size
might be rejected.
The steeper the slant is, the larger the observed differences between column means
and the greater the suspected main effect of crowd size.
On the other hand, a fairly level line in panel A of Figure 4.4 would have reflected the
relative absence of any main effect due to crowd size.
The slanted line in panel B of Figure 4.4 depicts the large difference between row
means, that is, between mean reaction times for dangerous and non dangerous
conditions, regardless of crowd size.
The relatively steep slope of this line suggests that the null hypothesis for degree of
danger also might be rejected; that is, there might be a main effect due to degree of
danger.
30
Figure 4.4 depicts the large differences between column means
Example 4.14
A college dietician wishes to determine whether students prefer a particular pizza
topping (either plain, vegetarian, salami, or everything) and one type of crust
(either thick or thin). A total of 160 volunteers are randomly assigned to one of
the eight cells in this two- factor experiment. After eating their assigned pizza, the
20 subjects in each cell rate their preference on a scale ranging from 0 (inedible)
to10 (the best). The results, in the form of means for cells, rows, and columns, are
as follows:
Construct graphs for each of the three possible effects, and use this information to
make preliminary interpretations about pizza preferences. Ordinarily, of course, you
would verify these speculations by performing an ANOVA—a task that cannot be
performed for these data, since only means are supplied.
As suggested in Figure 4.5, F ratios in both a one- and a two-factor ANOVA always
consist of a numerator (shaded) that measures some aspect of variability between groups
or cells and a denominator that measures variability within groups or cells.
In a one-factor ANOVA, a single null hypothesis is tested with one F ratio.
In two-factor ANOVA, three different null hypotheses are tested, one at a time,
with three F ratios: Fcolumn, Frow, and Finteraction.
The numerator of each of these three F ratios reflects a different aspect of variability
between cells:
variability between columns (crowd size),
variability between rows (degree of danger),
interaction—any remaining variability between cells not attributable to either variability
between columns (crowd size) or rows (degree of danger ).
The shaded numerator terms for the three F ratios in the bottom panel of Figure 4.5 estimate
random error and, if present, a treatment effect (for subjects treated differently by the
investigator).
The denominator term always estimates only random error (for subjects treated similarly in
the same cell).
In practice, a sufficiently large F value is viewed as rare, given that the null hypothesis
is true, and therefore, it leads to the rejection of the null hypothesis.
Otherwise, the null hypothesis is retained.
32
33
35
36
10. The F test describes the ratio of two sources of variability: that for subjects treated
differently and that for subjects treated similarly. Is there any sense in the which
the t test for two independent groups can be viewed likewise? (Nov/Dem 2023)
Short Answer
Yes, the t test for two independent groups can be viewed similarly to the F test in the
sense that both are examining variability.
They differ in their specifics: the t test is comparing the mean difference against the
variability within groups, while the F test is comparing the variability between groups
against the variability within groups.
Step by step solution
Step 1: Understanding The F Test
The F test is normally used in the context of Analysis of Variance (ANOVA) to compare
the variances between different groups. It is essentially a ratio of two estimates of variance:
the variance between groups (numerator), and the variance within groups (denominator). If
the between-group variance is significantly greater than the within- group variance, it
would suggest that the means of the groups differ.
Step 2: Understanding The t Test
The t-test for two independent groups is used to compare the means of those groups to
determine if they are significantly different. The t- test is calculated using the mean
difference between the two groups (numerator) and the variability within the groups
(denominator).
Step 3: Identifying the Link between The t Test and The F Test
In the t-test, we are technically comparing variability too, though we are specifically
interested in whether the variability in group means is greater than what we would expect
by chance. In the F test, we're more broadly comparing variability to examine if the
amount of variability between group means is larger than the variability within groups. So
it can be said that there is a link between them, but they're not quite serving the same
purpose.
37
38
12. Brief about TUKEY'S HSD Test. Additionally, explain in brief about two-factor
ANOVA.(Nov/Dem 2023)
Two-factor ANOVA.
39
ANOVA tests for significance using the F test for statistical significance.
The F test is a groupwise comparison test, which means it compares the variance in each
group mean to the overall variance in the dependent variable.
If the variance within groups is smaller than the variance between groups, the F test
will find a higher F value, and therefore a higher likelihood that the difference
observed is real and not due to chance.
A two-way ANOVA with interaction tests three null hypotheses at the same time:
There is no difference in group means at any level of the first independent
variable.
There is no difference in group means at any level of the second independent variable.
The effect of one independent variable does not depend on the effect of the other
independent variable
40
13. A manufacture of a gas additive claims that it improves gas mileage. A random sample of 30
drivers tests this claim by determining their gas mileage for a full tank of gas that contain the
additive (X1) and for a full tank of gas that does not contain the additive(X2). The sample mean
difference, Dbar , equals 2.12 miles (in favour of the additive) , and the estimated standard error
equals 1.50 miles. (Apr/May 2024)
Decision
Since the calculated t (1.413) is less than the critical value (1.699), we cannot reject the null
hypothesis.
This means that, statistically, the gas additive does not significantly improve mileage.
Calculating P-Value
The p-value is the probability of observing a t-score as extreme as the calculated t given that
the null hypothesis is true.
41
While it's typically found using statistical software or a calculator, let's just note that it will
be greater than the significance level of 0.05 since we failed to reject the null hypothesis.
Conclusion:
The null hypothesis that the additive does not improve gas mileage cannot be rejected at the
0.05 significance level since the t-statistic (1.413) is less than the critical value (1.699)
The p-value is greater than 0.05. Special precautions for experimental design includes having a
large and representative sample size, controlling for external factors influencing gas mileage, and
randomizing the order of utilizing the additive and non-additive gas.
14. A library system lends books for periods of 21 days. This policy is being reevaluated in view of
a possible new loan period that could be either longer or shorter than 21 days. To aid in making
this decision, book-lending records were consulted to determine the loan periods actually used
by the patrons. A random sample of eight records revealed the following loan periods in days:
and 16 . Test the null hypothesis with , using the .05 level of significance.A library lends books
period for 21 days. This policy is being revaluated in view of possible, Calculation of Sample
Mean and Sample Standard Deviation
(Apr/May2024)
To begin the t-test, first calculate the sample mean (𝑥¯) and sample standard deviation (𝑠).
The sample mean is the average of the data points, which can be found by summing all of the data
and dividing by the number of data points, which is 8.
The sample standard deviation is a measure of the amount of variance or dispersion in the data set.
Calculation of Sample Standard Error
Once we have the standard deviation, we can calculate the standard error of the mean. The
standard error is the standard deviation divided by the square root of the number of data
points, which is 8 in this case.
42
Calculation of t-value
The calculated t-value, which is the test statistic, can now be calculated using the
formula:𝑡=𝑥¯−𝜇𝑠/𝑛where μ is the assumed population mean (21 days in our case), 𝑥¯ is the
sample mean calculated in step 1, 𝑠 is the standard deviation calculated in step 1, and 𝑛 is the
number of data points.
Conclusion:
The result of the hypothesis test (whether the average loan period is significantly different
from 21 days or not) is determined by comparing the absolute value of the calculated t-
value with the critical t-value from the t-distribution table for a significance level of 0.05
and 7 degrees of freedom.
If the absolute value of the calculated t-value is greater than the critical t-value, it means
there is sufficient evidence to reject the null hypothesis.
If it's less than or equal to the critical t-value, it means there is not enough evidence to
reject the null hypothesis.
43
15. A random sample of 90 college students indicates whether they most desire love, wealth, power, health,
fame, or family happiness. Using the 0.05 level of significance and the following results, test the null hypothesis
that, in the underlying population, the various desires are equally population using chi-square test. (Apr/May
2024)
Where
c = Degrees of freedom
O = Observed Value
E = Expected Value
44
Where
O = Observed Value
E = Expected Value
Null Hyposis:
p1 = p2 = p3 = p4 = p5 = p6 = 1/6
p 1 = p2 = p3 = p 4 = p5 = p6
p1 ≠ p2 ≠ p3 ≠ p4 ≠ p5 ≠ p6 ≠ 1/6
p 1 ≠ p 2 ≠ p 3 ≠ p 4 ≠ p5 ≠ p 6
H0 is not true
H1 is not true
Conclusion:
45
Based on our data, college students do not prefer love, wealth, power, health, fame, and
family happiness equally
Based on our data, college students prefer fame more than family happiness
16. An investigator polls common cold sufferers, asking them to estimate the number of hours of
physical discomfort caused by their most recent colds. Assume that their estimates approximate a
normal curve with a mean of 83 hours and a standard deviation of 20 hours. (Apr/May 2024)
i. What is the estimated number of hours for the shortest-suffering 5 percent? (3)
ii. What proportion of sufferers estimate that their colds lasted longer than 48 hours? (2)
iv. What is the estimated number of hours suffered by the extreme 1 percent either above
or below the mean? (2)
v. What proporation suffered between 1 and 3 days,that is between 24 and 75 hours? (3)
Solution
Iv) What is the estimated number of hours suffered by the extreme 1 percent either above or
below the mean?
Below = 31.48 and above = 13.452 hours, here assume that 1 % is the sum of extreme low and extreme
high.
46
Conclusion :
a) The estimated number of hours for the shortest-suffering 5 percent is 50.1 hours.
b)The proportion of sufferers who estimate their colds lasted longer than 48 hours is 0.9599
(95.99%).
(c) The proportion who suffered for fewer than 61 hours is 0.1357 (13.57%).
(d) The estimated number of hours suffered by the extreme 1 percent either above or below the
mean is between 31.48 and 134.52 hours.
(e) The proportion who suffered for between 1 and 3 days (24 and 72 hours) is 0.2896 (28.96%).
17. Admission to a state university depends partially on the applicant's high school GPA.
Assume that the applicants' GPAs approximate a normal curve with a mean of 3.20 and a
standard deviation of 0.30 .
(i) If applicants with GPAs of 3.50 or above are automatically admitted, what proportion
of applicants will be in this category?
(ii) If applicants with GPAs of 2.50 or below are automatically denied admission, what
proportion of applicants will be in this category?
(iii) A special honors program is open to all applicants with GPAs of 3.75 or better. What
proportion of applicants are eligible?
(iv) If the special honors program is limited to students whose GPAs rank in the upper 10
percent, what will Brittany's GPA have to be for admission to this program?(AprMay 2024)
Solution
(i) If applicants with GPAs of 3.50 or above are automatically admitted, what proportion
of applicants will be in this category?
To find the proportion of applicants with GPAs of 3.50 or above who are automatically
admitted, convert the GPA 3.50 to a z-score first.
The formula for z-score is
o µ𝑍=(𝑋−µ)𝜎,
o where X is the value for which we want to find the Z-score, µ is the Mean, and σ is the standard
deviation.
o Here X=3.50, µ=3.20, and σ=0.30.
o Using these values in the formula, to get Z score.
o . The result gives the proportion of students who have GPAs less than 3.5.
47
o To subtract this from 1 to get the proportion of students who have GPAs above this, which is
desired answer.
(ii) If applicants with GPAs of 2.50 or below are automatically denied admission, what
proportion of applicants will be in this category?
calculate the z-score for X=2.50. Again, use the standard normal distribution table or calculator
function to find the proportion p for this z score.
The result gives the proportion of students that have GPAs less than 2.5, and this is the
proportion of students who will be denied admission automatically.
(iii) A special honors program is open to all applicants with GPAs of 3.75 or better. What
proportion of applicants are eligible?
calculate the z-score for X=3.75, which is the GPA for a special honors program.
further, use the standard normal distribution table or function to find the proportion of students
who have GPAs less than 3.75.
Subtract this number from 1 to get the proportion of students who have GPAs greater than 3.75
and this number is the proportion of applicants that are eligible for the honors program.
(iv) If the special honors program is limited to students whose GPAs rank in the upper 10
percent, what will Brittany's GPA have to be for admission to this program?
In this problem, the proportion of students is known (0.10 or 10%), and the GPA score needs to
be found.
First, find the corresponding z-score for the proportion 0.10 in the standard normal distribution
table or use a calculator function.
Next, convert this z-score to the GPA score using the z-score formula in reverse: µ𝑋=𝑍𝜎+µ.
Here, Z is the calculated Z-score, σ is standard deviation and µ is the mean.
This gives the GPA a student should have in order to be in the top 10% and hence be eligible
for the special honors program.
Conclusion:
The proportion of students automatically admitted, automatically denied, eligible for honors
program and GPA needed to be in top 10% can be found using the z-score method and using the
standard normal distribution.
The exact numbers may vary based on the values retrieved from the standard normal
distribution table or calculator function.
48
49