We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 9
Statistics for EMCL students
Assignment 4
Solutions
A
‘A background explanation for this part of the assignment: As you know, standard
deviation is a measure of the spread of the distribution, that is, of how far away from the
‘mean the data points are located. The idea behind it is to average the distances from the
average. In other words, the distance of a data point from the mean is called the deviation
of that data point, and the mean of these deviations should describe the “width”, or the
“spread”, of the distibution, Thus, a possible measure of spread would be:
meh
In words: we first calculate the absolute value of the difference of x; (a data point)
and of the mean x for each é. This difference |x, —3] is the deviation of data point x;..
‘Then, we calculate the mean of the deviations (summing them up, and dividing them by n,
the number of data points), This is exactly the value that you have (should have)
calculated in the assignment: first, you introduced a new variable that was the absolute
value of the deviation, and then you Iet SPSS calculate the mean of this new variable.
Now, a trick is that the absolute value can be replaced by the square root of the square:
bi -aJ=V 6,
So the same measure of spread can also be written thus:
1
suv
Nonetheless, as mentioned earlier in this cours, statisticians decided not 10 use this
formula for mathematical reasons. In order to have all our statistical procedures work, we
need to change two details inthis formula:
Le replace n with n-/3
2. reverse the order of the mean computation and of the square root (First sum up the
saquares and divide it by nse. by ne; and only then take its square 1001).
‘Therefore, the standart deviation has this form (compare it to the previous formula):
The goal of the assignment, beside learning how to introduce new variables by using
variable transformations, was exactly to compare how much these two changes influence
the measure of spread, Here are the results retumed by SPSS (the first table concems
variable MLU, whereas the second table concems the derived variables abwdev):‘Statistics
MLU
N Vaid a
Missing 0
Mean 5.80
‘Sid, Deviation 2,484
‘Statistics
N valid 2
Missing 0
Mean 2,160,
Hence, the correct answers are:
* 1, Copy the mean of ABSDEV to your report.
Mean of absdev: 2.16
* 2, Compare the SD (calculated last week) with the mean of the deviations. For
what two reasons (two differences in the way they are calculated) do they differ?
Std. Dev. of MLU: 2.48.
This value differs from the 2.16 given at question | for two reasons:
1. we used then the absolute values of deviations, instead of squares of deviations;
2. we divided by 7, and not by n-/.
* 3, Create a histogram of LONG_MILU, and copy it to your report.
f
@,
#* 4, Create a histogram including a Normal curve, as well asa boxplot of RDT.
Copy it to your report,
sean meparwa
Please note that the longer label you have specified in Variable View (and not the shorter
variable name) is used as the title of the boxplot,* 5, You can find two outliers among your data, Which are they, and what kind of
explanation(s) could you provide to explain them?
Two (possible) outliers can be observed in the boxplot, as they lie outside of the
“whiskers” (cf. 1.5 X IQR nule). These are case nt. 5 (having value 30), and case
nr, 23 (having value 979), Their case numbers are given next to the star, whereas
all other data points are replaced by the boxplot,
Possible explanations for the outliers:
~ These words are special in some sense, e.g., case 23 is an extremely rare word,
and case 5 is a function word (quickly read). Actually, if this is the case, then the
experiment is badly designed. Moreover, I don’t think that you can read a
function word in 30 msec,
- Errors in the measurement procedure: The subject typed the key by mistake
twice, and therefore did not actually read word 5, Whereas for word 23, the
subject might have paused for a moment (look elsewhere, ask a question, etc.).
+ 6 In case you decide to remove these cases from your data set, do you expect the
mean or the standard deviation to change more? Why?
Each of them has an important influence on the mean, which is not a resistant
measure, Stil, the two outliers are sort of “symmetrically” distributed around the
other data, so they neutralize somehow each other’s effect on the mean,
They both contribute largely to the standard deviation. So removing them will
significantly reduce the standard deviation,
* 7, What can you observe, as compared to your previous results?
Mean changed only slightly, from 484 to 482,
Standard deviation changed drastically, from 161 to 82.
* 8, You know the size of the sample, and you know its standard deviation, What is
the standard error, then? Calculate it both by hand (give details of your calculation
in the report) and let SPSS calculate it for you. Are the two values the same?
‘Standard error: standard dev /V sample size = 82.288/V22 = 17.5439
have used the calculator of Windows),
SPSS retums similarly 17.54,
* 9, Determine the confidence interval for the mean of the sample using the Student
‘t-statistic. Provide details of your calculations in your report.
The degree of freedom is df'= n=l = 22-1 = 21.
For C= 095, Table D gives r* = 2.080 (row with df= 2)‘The confidence interval is: mean + r+ SE
= 481.77 & 2.080 + 17.544 = 481.77 & 36A92
Inother words, the confidence interval is [445.28, 518.26], or simply [445, 519].
(NB: It does not make sense to keep too many digits, so Ihave rounded off the
values. Yet, in the case of a confidence interval, itis always safe to round values,
“away” from the mean, that is, to have a slightly larger interval.)
* 10. What is the meaning of this confidence interval?
We are 95% confident that the population mean of variable RDT is larger than
445 ms and smaller than 519 ms.
‘What does it mean that we are “95% confident”? It means that if we repeat this
statistical procedure many times on different samples drawn from the same
populations, then the real population mean will fall in 95% of the cases within the
confidence interval thus calculated. The confidence interval calculated for each
‘sample will differ, but 19 out of 20 will contain the unknown population mean.
* 11. Why have we used the t-statistic and not the z-statistic?
Because we do not know the standard deviation ¢ of the population. So we rather
use the tstatistic, which employs the standard deviation s of the sample, instead
of the standard deviation c of the population, But therefore the t-statistic follows a
different sampling distribution (a tistribution, and not a Normaldisuibution),
* 12, Suppose we know that the population standard deviation happens to be the
same as the standard deviation of the sample, Determine the confidence interval
using the z-statistic for this case,
Then z* = /,960 (last row of Table D, worth remembering, very close to 2).
We use the following formula:
42° ZL =481.77£1.960-17S44= 481,774 4.9
In
So we obtain the interval [447.38; 516.16], which is slightly narrower than the
interval obtained at question 9. This is so because the fact that we now know the
standard deviation of the population allows us to use the test, whose critical
value is slightly less (1,96, as opposed to 2,08), so we can be more certain about
‘our conclusions, Knowing the standard deviation « of the population helps us, and
so the same confidence level can already be achieved by a narrower interval. Yet,
the difference is not large, because the t-distribution with a degree of freedom of
21 is already very close to the standard Normal distribution,* 13. Now have the confidence interval calculated for you by SPSS. Copy the values
retumed by SPSS to your report. Is it different from your calculations?
‘To get the confidence interval, it is best to set the Test Value to 0. 1 got the following.
table:
‘One-Sample Test
Test Value = 0
95% Confidence Interval
of the Different
Mean
T at | Sa. @rziles) | itference | Lower | Usper
Teadnatmeparword_| 27,461 21 sooo] aen7ra[ aa5z0[ 51826
Thave to copy the last two columns to my report: 445.29 and 518.26, these are the
lower and upper boundaries of the confidence interval. These are the same values as
those I got at question 9, so Tam very happy.
* 14, Add again to your report the higher and lower values between which the
population mean must lie. Why and how is this confidence interval different from
the previously calculated one?
Thave changed the confidence level in the “Options” window. Do not forget to change it
‘back afterwards...
‘One-Sample Test
Test Valie=0
99% Confidence Interval
of the Different
Mean
t at | sig. tate | itference | Lower | Usper
Teadnatmeparword | 27461 21 cool aen7r[ 492,10] 53145
‘The new interval is now [432,10, 531.45], which is larger than the 95% interval. This,
is so, because this time we wanted to be on the safe side: for the same set of
observations, the interval must be larger in order to contain the population mean with,
allarger certainty, a larger “vonfidence”.
* 15, In each of the two cases, what is the null hypothesis exactly, and what is the
alternative hypothesis? (In words/one full sentence, please.)
‘The mean of our sample (482 ms) contradicts both the FRT (“Fast Reading Theor
at most 440 ms) and SRT ¢‘Slow Reading Theory”: at least 505 ms). The hypothesis
‘we are going to test in both cases is that the theory is true and the extreme value of
our sample is only due to chance,FRT: Original theory: the population mean of RT is 440 ms or less.
Null hypothesis in statistical test: the population mean of RT is 440 ms.
Altemative hypothesis: the population mean is more than 440 ms,
SRT: Original theory: the population mean of RT is 505 ms or more.
Null hypothesis in statistical test: the population mean of RT is 505 ms.
Altemative hypothesis: the population mean is /ess than 505 ms.
For both theories we employ the most extreme value (exactly 440 ms/505 ms) still
coherent with the theory (FRT/SRT). If our sample is consistent with this extreme value,
then our simple will be consistent with the theory. (In other words, for example in the
case of SRT, our sample will not refute the theory that RT can be 40 ms; even if the data
may refute a more severe version of SRT that would claim that RT is at most 430 ms,)
On the other hand, if our sample provides sufficient argument to reject the null
hypothesis for 40 ms (505 ms), it will even more so reject the more severe possibilities
allowed for by the theories (RT lower than 440 myfhigher than 505 ms),
* 16, Are you using a one-sided or a two-sided test?
For both theories (SRT, FRT) we use a one-sided (one-tailed) Hest, as is obvious
from the formulations of the alternative hypotheses above, Namely, the other tails of
the distribution are in harmony with the original theories (FRT, SRT).
NB: we use a stest, because we do not know the standard deviation of the population,
* 17. Perform the test for both cases by hand, and describe the steps of your
calculation.
ArHy _X—Hy _ 481.
siVn SE 17544
FRT: First, calculate statistic: ¢ =
SRT: First, calculate pstatistic: = Ho. = 5 Ho
si\n— SE
As the tdistributions are symmetric, we can use the absolute values of the t-values
caloulated (hence, 1.324 in the second case).
Second, we remember that df= ne] = 22-1 = 21,
Third, we remember that we are performing a onesided t-test with a significance
level of 5%, whose critical value is the same as the critical r* for a two-sided test
with = 0.10 or C= 0.90, That is what we have to look up in Table D, We find:
P= 1.721 (for df= 21, and for C=90% or upper tail probability = 0.05).
Fourth, we compare the calculated statistics to the critical t-value: 2.38 > 1,72 >
1.32, We can reject the null hypothesis if and only if t> t*, that is, if our sample has a
statistic that is as extreme as, or more extreme than the critical value,Fifth step: we dravw the conclusions:
ERT: We have sufficient evidence to reject the null hypothesis at the a= 0.05 level.
SRT: We do not have sufficient evidence to reject the null hypothesis at the «= 0.05
significance level.
Sixth step: see question 20.
* 18, Let SPSS calculate the test for you and copy the results.
FRT:
‘One-Sample Test
Test Value = 440
90% Confidence Interval
of the Differeroe
Mean
t at | Sig. tailed) | itference | Lower _|__Upper
Teadina time par word 281 21 wert airs, ise] 7196
Note that the p-value of a one-tailed test is the half of a two-tailed test given by SPSS.
‘Therefore, we conclude that p = 0.014, which is below the 5% significance leve
SRT:
‘One-Sample Test
Test Vabe = 505
90% Confidence Interval
of the Different
Mean
t at | Sa. @rziles) | itference | Lower | _Usper
reading tine par word “12 21 oo] -2a2e7[ 5342 696)
Note that the p-value of a one-tailed test is the half of a two-tailed test given by SPSS.
‘Therefore, we conclude that p = 0.100, which is still above the 5% significance level.
* 19, What is the meaning of the P-value in each of the two cases? (Hint: the
probability of exactly what is it?) Please write one full sentence for each case in your
report,
FRT: The p-value is the probability of drawing a simple random sample (SRS) whose
sample mean is 481.77 ms or higher (whose t-statistic is 2.381 or higher), providing
that the population mean is 440 ms,
SRT: The p-value is the probability of drawing a simple random sample (SRS) whose
sample mean is 481.77 ms or lower (whose t-statistic is -1.324 or lower), providing
that the population mean is 505 ms.* 20, For each case, provide the key sentence summarizing the results of the
statistical analysis, as it is done in scientific papers. That is, either "Based on our
data, we can reject the null-hypothesis at a significance level alpha = 0.05, that is, we
can conclude that [the alternative hypothesis in words] is true (t= w.» df "
or “our data do not provide sufficient evidence to reject the null hypothesis, that is, to
conclude that..
FRT:
Based on our data, we can reject the null hypothesis at a 0.05 significance level that
the RT (reading time) is 440 ms or lower (r= 2.38, df= 21, p = 0.014). We conclude
that the Fast Reading Theory of the phenomenon being examined cannot be tue, and
‘we apply to the Scientific Research Fund to provide us with money to come up with a
better theory,
SRT:
Our experiments do not provide sufficient evidence to reject the Slow Reading
Theory of the phenomenon being examined 22, df= 21, p= 0.10), However,
our data suggest that RT cannot be much higher that 505 ms, and might even be less
than 505 ms; a larger sample could refute the SRT hypothesis. Thus, more
experiments are needed to approximate better the real value of RT, and so we urge the
Scientific Research Found to provide us with more money,