0% found this document useful (0 votes)
82 views15 pages

Exercises - (Activity #6)

1. The document contains 4 exercises analyzing statistical data using t-tests and correlations. 2. Exercise 1 finds a significant difference in seriousness levels before and after for college students. 3. Exercise 2 finds no significant difference in pre-test and post-test scores for students. 4. Exercise 3 finds a significant correlation between Apple stock and the S&P 500. 5. Exercise 4 analyzes weight gain data from rats on different diets using a t-test and finds no significant difference between means.

Uploaded by

Mika Molina
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
82 views15 pages

Exercises - (Activity #6)

1. The document contains 4 exercises analyzing statistical data using t-tests and correlations. 2. Exercise 1 finds a significant difference in seriousness levels before and after for college students. 3. Exercise 2 finds no significant difference in pre-test and post-test scores for students. 4. Exercise 3 finds a significant correlation between Apple stock and the S&P 500. 5. Exercise 4 analyzes weight gain data from rats on different diets using a t-test and finds no significant difference between means.

Uploaded by

Mika Molina
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 15

Exercises:(Activity #6)

1. Consider the data below to determine if there exists a significance difference in the
degree of seriousness encountered by the College Freshmen Students. Use 0.01
as the level of significance.

Student Before After


1 3.5 3.6
2 2 2.75
3 2.75 3.5
4 3 3.75
5 2.5 3
6 3.75 4
7 4 4.52

Σ D=3.62 2
Σ D =2.28 η=7
1. H 0 : x1 =x2
H 0 : x1 ≠ x2
2. 0.01
3. (n-1) = (7-1) = 6 ; Critical Value = 16.81
4. If computed t-test is ≥ 2.15 .reject H 0
If computed t-test is ¿ 2.15 accept H 0
3.62
5. t=
√❑
t = 5.24
6. reject H 0
7. There is a significant difference between the before and after in the degree of
seriousness encountered by the College Freshmen Students.

2. A researcher tried to test if there is a significant difference in the knowledge of


students before and after the subject was taken using pre-test and post-test with an
alpha of 0.05.
The following data was obtained.

Student no. Pre-test Post test


1 40 80
2 35 75
3 30 76
4 50 79
5 55 81
6 42 74
7 49 75
8 59 78
9 62 75
10 47 80
11 31 73
12 43 74
13 51 78
14 62 90
15 58 82

Σ D=456 2
Σ D =14966 η=15

1. H 0 : x1 =x2
H 0 : x1 ≠ x2
2. 0.05
3. (n-1) = (15-1) = 14 ; Critical Value = 23.685
4. If computed t-test is ≥ 23.685 .reject H 0
If computed t-test is ¿ 23.685 accept H 0
456
5. t=
√❑
t = 13.26
6. accept H 0
7. There is no significant difference in the knowledge of students before and after
the subject was taken using pre-test and post-test.

3. John is an investor. His portfolio primarily tracks the performance of the S&P 500 and
John wants to add the stock of Apple Inc. Before adding Apple to his portfolio, he wants
to assess the correlation between the stock and the S&P 500 to ensure that adding the
stock won’t increase the systematic risk of his portfolio. To find the coefficient, John
gathers the following prices for the last five years

1. H 0 : x❑ = y
H 1 : x❑ ≠ y
2. 0.05
3. (n-2) = 5-2 = 3 ; Tabular Value=3.18
4. If computed t ≥ 3.18, reject H 0
If computed t < 3.18, accept H 0
5(1149737)−(10224.13)(544.43)
5. 𝑟 =
√❑

𝑟 = 0.95
t=0.95 √ ❑
t = 5.27
6. reject H 0
7. There is a significant correlation between the stocks of Apple and S&P500.
4. A researcher tried to test if there is a significant relationship in the knowledge of
students before and after the subject was taken using pre-test and post-test with an
alpha of 0.05.

The following data was obtained.

Student no. Pre-test Post test


1 40 80
2 35 75
3 30 76
4 50 79
5 55 81
6 42 74
7 49 75
8 59 78
9 62 75
10 47 80
11 31 73
12 43 74
13 51 78
14 62 90
15 58 82

1. H 0 : x1 =x2
H 1 : x1 ≠ x2
2. 0.05
3. (n-1) = (15-1) = 14 ; Critical Value = 23.685
4. If computed t-test is ≥ 23.685 .reject H 0
If computed t-test is ¿ 23.685 accept H 0
456
5. t=
√❑
t = 13.26
6. accept H 0
7. There is no significant difference in the knowledge of students before and after
the subject was taken using pre-test and post-test.

5. Consider the gain in weight of 19 female rats between 28 and 84 days after birth. 12
were fed on a high protein diet and 7 on a low protein diet. Use 95% confidence
level.
 
High protein Low protein
134 70
146 118
104 101
119 85
124 107
161 132
107 94
83  
113  
129  
97  
123  
A)
The sample mean of high protein:

134+146+104 +119+124+161+107+ 83+113+129+97+ 123


12

= 120

The sample mean of low protein:

70+118+101+85+107+132+94
7

= 101

B)

S1 = √ ❑

s 1=√❑

s1= 21.388

s 2=√❑

s2= 20.624

Calculate the pool variance:

2 ( n1−1 ) s 21+ ( n2−1 ) s22❑


S=
p
n1 +n2−2
( 12−1 ) 21.388 2❑ + ( 7−1 ) 20.6242
= 12+7−2

=446.118

C)

H 0 : μ1−μ2=0(There isno significant difference between the two population means)

Against the alternative hypothesis

H 1 : μ 1−μ2 >0(The mean of the first population is less than the second )

The level of significance is α =0.01 .

The degrees of freedom are:

df =df 1 +df 2= ( n1−1 ) + ( n2−1 )

= (12-1) + (7-1)

= 17

Now, calculate the estimated standard error

S M −M = √❑
1 2

¿ √❑

¿ 10.04

To analyze these data in StatsDirect first prepare them in two workbook columns and
label these columns appropriately. Alternatively, open the test workbook using the file
open function of the file menu. Then select the unpaired t test from the parametric
methods section of the analysis menu. Select the columns marked "High protein" and
"Low protein" when prompted for data.

For this example:

Unpaired t test

Mean of High Protein = 120 (n = 12)

Mean of Low Protein = 101 (n = 7)

Assuming equal variances

Combined standard error = 10.045276

df = 17

t = 1.891436

One sided P = 0.0379

Two sided P = 0.0757

95% confidence interval for difference between means = -2.193679 to 40.193679

Power (for 5% significance) = 82.25%

Assuming unequal variances

Combined standard error = 9.943999

df = 13.081702
t(d) = 1.9107

One sided P = 0.0391

Two sided P = 0.0782

95% confidence interval for difference between means = -1.980004 to 39.980004

Power (for 5% significance) = 40.39%

Comparison of variances

Two sided F test is not significant

No need to assume unequal variances

Thus we have a difference that is not quite significant at the 5% level. The most
important information is, however, conveyed by the confidence interval. The 95% CI
includes zero therefore we can not be confident (at the 95% level) that these data show
any difference in weight gain.

As most of the interval is toward weight gain and as the test result is in the grey
"suggestive" 5%-10% zone we have good evidence for repeating this experiment with
larger numbers. Bigger samples will probably shrink the range of uncertainty so that the
confidence interval contracts to a narrower band that excludes zero.

N.B. We did not consider a one sided P value here because we could not be absolutely
certain that the rats would all benefit from a high protein diet in comparison with those
on a low protein diet.
6. Three different milling machines were being considered for purchase by a
manufacturer. Potentially, the company would be purchasing hundreds of these
machines, so it wanted to make sure it made the best decision. Initially, five of each
machine were borrowed, and each was randomly assigned to one of 15 technicians
(all technicians were similar in skill). Each machine was put through a series of
tasks and rated using a standardized test. The higher the score on the test, the
better the performance of the machine. The data are:

Machine Machine Machine


1 2 3
24.5 28.4 26.1
23.5 34.2 28.3
26.4 29.5 24.3
27.1 32.2 26.2
29.9 30.1 27.8

a. Perform an ANOVA on this data using 0.05 level of significance. Remember


that the program expect two fields, a Group field and Observation field. Are
there differences between the means by group?
b. What specific machines, if any, are different? Which machine would you
recommend for purchase?
Group 1 Group 2 Group 3

24.5 28.4 26.1

23.5 34.2 28.3

26.4 29.5 24.3

27.1 32.2 26.2

29.9 30.1 27.8

Sum 131.4 154.4 128.7

Average 26.28 30.88 25.74


∑ ❑ X 2ij
ij
3477.88 4789.3 3325.47

Standard Deviation 2.484 2.315 1.784

SS 24.688 21.428 12.732

n 5 5 5

df total =15−1=14

df betweem=3−1=2

df within=df total −df betweem =14−2=12



∑ ❑ X ❑ij =131.4+154.4+ 128.7=414.5
ij

∑ ❑X 2ij=3477.88+ 4789.3+3325.47=11592.65
i

( )
❑ ❑ 2 ❑
1 414.5
SStotal=¿ ∑ ❑X ij −
2
N
∑ X ij =11592.65−

15
=138.633
ij ij

ss within=Σ ss withingroups =24.688+21.428+12.732=58.848

ssbetween =ss total −ss within =138.633−56.848=79.785

SS between 79.785
MS between = = =39.893
df between 2

SSwithin 58.848
MS within= = =4.904
df within 12

MSbetween 39.893
F= = =8.135
MS within 3.904

1) Null and Alternative Hypotheses

Ho: μ 1=μ2=μ3❑

Ha: Not all means are equal

2) Rejection Region

df 1 =2

df 2 =2

Therefore, the rejection region for this F-test is: R={ F : F>6.927 }

3) Test Statistics
MSbetween 39.893
F= = =8.135
MS within 4.904

4) Null hypothesis

Since from the sample information we get that

F=8.135> F c =6.927

It is then concluded that the null hypothesis is rejected

5) ANOVA

Summary

Groups Count Sum Average Variance

Group 1 5 131.4 26.280 6.172

Group 2 5 154.4 30.880 5.357

Group 3 5 128.7 25.74 3.183

ANOVA

Source of Variation SS df MS F P-value F crit

Between Groups 79.785 2 39.893 8.135 0.006 6.927

Within Groups 58.848 12 4.904

Total 138.633 14

Therefore, the null hypothesis is rejected

7. A firm wishes to compare four programs for training workers to perform a certain
manual task. Twenty new employees are randomly assigned to the training
programs, with 5 in each program. At the end of training period, a test conducted to
see how quickly trainees can perform the task. The number of times the task is
performed per minute is recorded for each trainee, with the following results:

Observation Program1 Program2 Program3 Program4

1 9 10 12 9

2 12 6 14 8

3 14 9 11 11

4 11 9 13 7

5 13 10 11 8

Perform hypothesis testing using 0.05 level of significance.

Observation Progr Program Program Program P12 P22 P32 P42


am 1 2 3 4
1 9 10 12 9 81 100 144 81
2 12 6 14 8 144 36 196 64
3 14 9 11 11 196 81 121 121
4 11 9 13 7 121 81 19 49
5 13 10 11 8 169 100 121 64

Σ P1 – 59 Σ P12 – 711
P2 – 44 P22 – 398
P3 – 61 P32 – 751
P4 – 43 P42 – 279
207 2,239
1. H 0: There is no significant relationship between the programs conducted on how
quickly the trainees can perform the certain manual task.
H 1: There is a significant relationship between the programs conducted on how
quickly the trainees can perform the certain manual task.
2. 0.05
3. (n-1) = (5-1) = 4 ; 2.776
4. If computed t ≥ 2.776 reject H 0
If computed t < 2.776 accept H 0
5. SSt =∑ x −¿ ¿ SSt =2239−¿¿ SSt =96.55
2
2 2 2 2 2
59 44 61 43 207
SSb = + + + − =54.95
5 5 5 5 20
2197.4 – 2142.45 = 54.95
SSw = SSt – SSb = 96.55 – 54.95 = 41.6
ss b 54.95
M sb= = = 18.32
k −1 3
ss w 41.6
M sw = = = ¿ 2.6
h−k 16
F ratio = 7.046
6. reject H 0
7. The time for the completion for the manual task of the trainee has no effect at the
program conducted by the firm.

You might also like