Assignment May 2020 Semester: Subject Code: ESE633 Subject Title: Statistics in Education Level: Master
LEVEL : MASTER
STUDENT’S NAME :
MATRIC NO. :
PROGRAMME :
ACADEMIC :
FACILITATOR
LEARNING CENTRE :
Question 1: [3 marks]
An English test was administered to Class X and Class Y, and the distributions of
the scores are shown below:
[Figure: score distributions for Class X and Class Y]
Answer:
In Class Y, the scores are widely spread out, which means there is high variance or a
larger standard deviation; the scores shown in red cover a wide range. If the mean is 50,
then we can say that about 95% of the students scored between 44 and 56.
In Class X, there is low variance or a small standard deviation, which explains why most
of the scores are clustered ('bunched') around the mean, as the scores shown in red
indicate. If the mean is 50, about 95% of the students scored between 47 and 53.
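The two ranges quoted above follow from the rule of thumb that roughly 95% of normally distributed scores lie within two standard deviations of the mean. A minimal sketch, assuming the scores are approximately normal and that the standard deviations are 3 for Class Y and 1.5 for Class X (values inferred from the stated ranges, not given in the question):

```python
def approx_95_range(mean, sd):
    """Return the (low, high) range that covers roughly 95% of normal scores."""
    return (mean - 2 * sd, mean + 2 * sd)

# The standard deviations below are assumptions inferred from the stated ranges.
class_y = approx_95_range(50, 3.0)   # widely spread scores
class_x = approx_95_range(50, 1.5)   # tightly clustered scores

print("Class Y:", class_y)
print("Class X:", class_x)
```

The smaller the standard deviation, the narrower the range around the same mean.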
Question 2 [3 marks]
Refer to the table above. Interpret the mean and standard deviation.
Answer:
The female scores were 0.3 points higher on average than the male scores: the females had
a mean of 49.1 and the males had a mean of 48.8. The female data had a standard
deviation of 4.46 whereas the male data had a standard deviation of 3.56. The higher standard
deviation tells us that the female scores are more spread out, or dispersed, than the male scores.
Question 3: [5 marks]
A study was conducted on teachers' opinions about students bringing smartphones to
school. A sample of 200 female and 165 male teachers was asked the question ‘Should students be
allowed to bring smartphones to school?’ The results of the study are shown below:
The χ² value is 20.704 while the p-value is 0.0001.
The Chi-Square statistic is used to test whether the distributions of
categorical variables differ from one another. It compares the tallies
or counts of categorical responses between two independent groups, here
male and female teachers.
c) Both male and female teachers agree that students should be allowed to bring smartphones
to school. At α = 0.05, the p-value of 0.0001 is smaller than α, so the data provide
sufficient evidence to conclude that the pattern of responses differs between male and
female teachers. The Chi-Square test compares counts rather than mean scores, so it does
not show that the mean of one group is superior to that of the other.
Question 4: [5 marks]
[Figure: plot with axis labels Reading and Score]
Question 5: [5 marks]
A physical education teacher collected data on students’ height and the span of jump (see Table 1
below).
a) What statistical procedure would you use to predict length of jump using height? Give reasons.
b) Explain clearly two assumptions that must be met in conducting the test that you have identified
in (a). Why is it important that these assumptions are met?
Answer:
a) Simple linear regression is a statistical method used to test the extent to which a
relationship exists between height (X) and span of jump (Y). The explanatory factors are
generally denoted by X and referred to as predictors, while the affected variables are
denoted by Y and called the response. Simple linear regression is also one of the
statistical methods used in production to predict quality and quantity
characteristics.
b) Based on the above data, conduct the linear regression analysis with two assumptions in mind:
I. Test the relationship between height and span of jump. A positive relationship
is expected; in other words, as height increases, span of jump is expected to
increase as well. How do you establish this to be true?
II. Go further than just stating the relationship between height and span of jump.
You want to know whether you can PREDICT values of one variable if you know
or can estimate the other variable. In other words, can you predict performance
in span of jump from what you know about the students' height?
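The prediction step described above can be illustrated with a least-squares line. This is only a sketch: the heights and jump spans below are hypothetical values made up for illustration (the teacher's Table 1 is not reproduced here), and `fit_line` is a helper name introduced for this example.

```python
def fit_line(x, y):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data, NOT the values from the assignment's table.
heights_cm = [150, 160, 170, 180]
jumps_cm = [160, 170, 185, 195]

slope, intercept = fit_line(heights_cm, jumps_cm)
predicted = intercept + slope * 175  # predicted jump span for a 175 cm student

print(f"slope={slope:.2f}, intercept={intercept:.2f}, predicted={predicted:.1f}")
```

A positive slope corresponds to point I (taller students jump further), and the fitted line is what makes the prediction in point II possible.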
Question 6: [4 marks]
Group            Critical Thinking    N
High Income      54.2                 120
Middle Income    60.1                 120
Low Income       48.2                 120
A study was conducted to test the critical thinking abilities of primary school children from different
socioeconomic backgrounds. The results are shown in the table above. Explain why you would use
Oneway ANOVA and not multiple t-tests to determine if the difference between the groups is significant.
Answer:
The t-test is used to compare the means of two groups, such as the outcomes of a control
group and a treatment group in an experimental study. In this case there are three groups,
so several t-tests would be needed, and running multiple t-tests increases the likelihood
of committing a Type 1 error: claiming that two means are not equal when in fact they are
equal, that is, rejecting a null hypothesis when it is TRUE.
On a practical level, using the t-test to compare many means is also a cumbersome process in
terms of the calculations involved.
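The inflation of the Type 1 error rate can be made concrete with a short calculation. A minimal sketch, assuming each t-test is run at α = 0.05 and treating the tests as independent (an approximation, since pairwise tests on the same groups share data):

```python
from math import comb

alpha = 0.05   # significance level of each individual t-test
groups = 3     # High, Middle and Low Income

# Number of pairwise comparisons needed to cover all groups.
n_tests = comb(groups, 2)

# Probability of at least one false rejection across all tests,
# under the simplifying assumption that the tests are independent.
familywise_error = 1 - (1 - alpha) ** n_tests

print(n_tests, round(familywise_error, 4))
```

With three groups the chance of at least one false positive rises from 5% to roughly 14%, whereas a single One-Way ANOVA keeps the overall rate at 5%.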
Question 7: [7 marks]
A researcher conducted a study to measure Reasoning Skills among a group of 16-year-old
students. The sample consisted of 120 male and 126 female subjects. Reasoning Skills
consists of two constructs: Inductive Reasoning and Deductive Reasoning.
b) State the appropriate statistical tests to test the three research questions suggested.
Answer:
a) 1. The extent to which 16-year-old students master the reasoning skills.
2. Whether male students master more reasoning skills than female students.
3. Whether female students master more reasoning skills than male students.
b) I recommend using the t-test to test the three research questions suggested. An
independent-samples t-test compares the means of two separate groups, such as male and female
students. A paired-samples t-test is used when each individual in the sample is measured twice and
both measurements are compared, that is, when the two sets of data come from the same group of
subjects (one sample).
Question 8 (15 marks)
A study was conducted to compare the effectiveness of three teaching techniques in teaching English to
Year 6 students. The three teaching techniques are – Using video clips, Using worksheets and Using flash
cards. Equal number of students were randomly assigned to the three different treatments. The results are
shown in the tables below:
Scores in Statistics
Treatment N Mean Std. Deviation Std. Error
Answer :
Question 8 a:
1. The means show that Using Video Clips has the highest mean (27.02), while Using Flash
Cards has the lowest mean (23.40). Using Worksheets falls in the middle, with a mean of
23.60.
2. The standard deviations for Using Video Clips (3.05) and Using Flash Cards (3.24) are
fairly close, while Using Worksheets has a somewhat larger standard deviation of 3.31.
3. According to the table, the standard error is 0.96 for Using Video Clips, whereas the
standard errors for Using Worksheets and Using Flash Cards are comparatively high, at
1.04 and 1.02.
Question 8 b:
The standard error is a measure of how much the sample mean would vary if you were to take repeated
samples from the same population. The Using Worksheets and Using Flash Cards groups contain
20 students each, and the standard error of the mean for each of these two groups is high: 1.04
for Using Worksheets and 1.02 for Using Flash Cards. The standard error for the Using
Video Clips group is comparatively low, at 0.96. Since equal numbers of students were assigned
to each treatment, the larger standard deviations of the first two groups explain why their
standard errors are larger.
Question 8C:
Levene's test of homogeneity of variance is used for the One-Way ANOVA and is shown in
the table above. The p-value of 0.802 is greater than the alpha of 0.05. Hence, it can be
concluded that the variances are homogeneous, which is reported as Levene (2, 57) = 0.115, p =
0.802.
Question 8 d):
Having concluded that the assumption of homogeneity of variance has been met, the means and
standard deviations of each of the three groups have been computed. The table above indicates
the following:
The ‘Between groups’ row shows that the df is 2 (i.e. k - 1 = 3 – 1 = 2) and the mean
square is 45.733.
The ‘Within groups’ row shows that the df is 57 (N - k = 60 – 3 = 57) and the mean
square is 10.237.
If you divide 45.733 by 10.237 you will get the F value of 4.467, which is significant at
0.031.
Since 0.031 is less than α = 0.05, we can reject the null hypothesis and accept the
alternative hypothesis. We conclude that there is a significant difference in scores
between the three teaching techniques.
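The degrees of freedom and the F value quoted above can be checked with a few lines of arithmetic. A minimal sketch using only the figures stated in the answer (three groups, 60 students, and the two mean squares); the p-value is taken from the output table and is not recomputed here:

```python
k = 3               # number of teaching techniques
n_total = 60        # total number of students

ms_between = 45.733  # mean square, 'Between groups' row
ms_within = 10.237   # mean square, 'Within groups' row

df_between = k - 1          # degrees of freedom between groups
df_within = n_total - k     # degrees of freedom within groups

# The F statistic is the ratio of the two mean squares.
f_value = ms_between / ms_within

print(df_between, df_within, round(f_value, 3))
```

A large F means the variation between the group means is large relative to the variation inside the groups.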
Question 8E:
There is a significant difference ONLY between ‘Using Video Clips’ (Mean = 27.20) and ‘Using
Flash Cards’ (Mean = 23.40) at p = 0.034. The Using Video Clips group scored significantly higher
than the Using Flash Cards group. There are no significant differences between the other groups.
Question 9 [7 marks]
A study was conducted to determine the relationships between spatial reasoning, memory,
metacognition, mathematical ability and verbal reasoning. The sample consisted of 80 male and
80 female 16 year old students. The Table above shows a correlation matrix between the five
variables. Based on the table answer the following questions:
                       Spatial     Memory   Metacognition   Mathematical   Verbal
                       reasoning                            ability        reasoning
Spatial reasoning
Memory                 0.56
Metacognition          0.65        0.67
Mathematical ability   0.43        0.60     0.59
Verbal reasoning       0.34        0.41     0.49            0.70
Answer: a)
i. There is an association between Spatial reasoning and Memory.
ii. There is an association between Spatial reasoning and Verbal reasoning.
iii. There is an association between Spatial reasoning and Metacognition.
Answer b):
i. The results support this hypothesis: there is a significant positive relationship
between Spatial reasoning and Memory. The value of r is 0.56 and the significance value is
less than 0.05 (r = .56, p < .05). This means there is a link between Spatial reasoning and
Memory.
ii. There is a significant positive correlation between Spatial reasoning and Verbal
reasoning. The correlation value of 0.34 is the lowest of the four and indicates a fairly
weak positive relationship, while the significance value, p = 0.012, is smaller than
0.05.
iii. There is a significant positive relationship between Spatial reasoning and Metacognition.
The correlation coefficient is 0.65 and the significance level is less than 0.05
(r = .65, p < .05).
iv. There is a significant positive relationship between Spatial reasoning and Mathematical
ability. The correlation coefficient is 0.43 and the significance level is less than 0.05
(r = .43, p < .05).
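Whether a correlation is significant can be checked by converting r to a t statistic. A minimal sketch, assuming each correlation was computed on the full sample of 160 students (80 male and 80 female) and using an approximate two-tailed critical value of 1.98 for 158 degrees of freedom (a value read from standard t tables, not from the assignment):

```python
from math import sqrt

n = 160            # assumed: correlations use all 80 male + 80 female students
t_critical = 1.98  # approximate two-tailed 5% critical value for df = 158

def t_from_r(r, n):
    """Convert a Pearson correlation r into a t statistic with n - 2 df."""
    return r * sqrt(n - 2) / sqrt(1 - r ** 2)

# Correlations of Spatial reasoning with the other four variables.
correlations = {
    "Memory": 0.56,
    "Metacognition": 0.65,
    "Mathematical ability": 0.43,
    "Verbal reasoning": 0.34,
}

t_values = {name: t_from_r(r, n) for name, r in correlations.items()}
for name, t in t_values.items():
    print(f"{name}: t = {t:.2f}, significant = {t > t_critical}")
```

Under these assumptions even the weakest correlation (0.34) exceeds the critical value, so all four relationships are significant at the 0.05 level.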
Table 1:
                          Kolmogorov-Smirnov(a)         Shapiro-Wilk
               Gender     Statistic   df   Sig.     Statistic   df   Sig.
Spatial Test   Male       0.879       34   0.012    0.971       78   0.016
Scores         Female     0.185       39   0.001    0.079       96   0.008
a. Lilliefors Significance Correction
See Table above and answer the following:
Answer :
a) The purpose of a statistical test is to provide a mechanism for making quantitative
decisions about a process or processes. The intent is to determine whether there is enough
evidence to "reject" a conjecture or hypothesis about the process.
b) The Shapiro-Wilk test is a fairly powerful omnibus test. It is not good with small samples
or discrete data, but it has good power with symmetrical, short-tailed and long-tailed
distributions, and it is good with asymmetry. On the other hand, the Kolmogorov-Smirnov test
tends to have lower power overall: data have to be very non-normal for it to reject H0.
However, it can outperform other tests when using discrete or grouped data.
Answer a):
According to the table, the Posttest has the higher mean (23.86), while the Pretest has
the lower mean (18.50). The standard deviation is 5.33 for the Pretest and 4.75 for the
Posttest, so the Pretest scores are more spread out than the Posttest scores. The standard
error is a measure of how much the sample mean would vary if you were to take repeated
samples from the same population. Both groups contain 30 scores each, and the standard error
of the mean is 0.97 for the Pretest and 0.87 for the Posttest.
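The standard errors quoted above can be reproduced from the standard deviations and the sample size. A minimal sketch using the usual formula, standard error = standard deviation / √n, with the figures stated in the answer:

```python
from math import sqrt

n = 30            # number of scores in each of the pretest and posttest
sd_pretest = 5.33
sd_posttest = 4.75

# Standard error of the mean: SD divided by the square root of the sample size.
se_pretest = sd_pretest / sqrt(n)
se_posttest = sd_posttest / sqrt(n)

print(round(se_pretest, 2), round(se_posttest, 2))
```

Because both sets of scores have the same n, the larger pretest standard deviation is the only reason its standard error is larger.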
b) There is no significant difference between the pretest and posttest Moral Reasoning
Scores (before and after teaching using moral dilemmas).
c) Ha: µ1 ≠ µ2
The alternative hypothesis might be that the mean reasoning scores of the
pretest and the posttest are DIFFERENT.
Ha: µ1 > µ2
The alternative hypothesis might be that the mean reasoning score of the
pretest is HIGHER than the mean score of the posttest.
Ha: µ1 < µ2
The alternative hypothesis might be that the mean reasoning score of the
pretest is LOWER than the mean score of the posttest.
d) The Paired Samples t Test can only compare the means for two related units on a
continuous outcome that is normally distributed. The Paired Samples t Test is not
appropriate for analyses involving the following:
i. unpaired data;
ii. comparisons between more than two units/groups;
iii. a continuous outcome that is not normally distributed; and
iv. an ordinal/ranked outcome.
e) The table above reports the mean values on the variable for the pretest and posttest.
The posttest mean (23.86) is higher than the pretest mean (18.50), indicating improved
performance on the moral reasoning test after the treatment. The standard deviation for
the pretest is 5.33, which is close to the standard deviation for the posttest, 4.75.
The difference between the means is 18.50 – 23.86 = –5.36, which shows that students did
significantly better on the posttest.
Question 12: [5 marks]
The table above shows the results of a study which compared the anxiety and depression levels
of students from families whose parents are divorced and whose parents are not divorced. Briefly
describe the results of the study.
Answer:
The results of the analysis showed that the mean and standard deviation of the Anxiety Test for
students whose parents are not divorced were 21.49 and 2.6 respectively, while the mean and
standard deviation for students whose parents are divorced were 22.70 and 4.3, respectively. The
mean score difference between the groups was -1.21. The result was significant (t = −3.27, p
<0.07). This indicates that anxiety is higher among students whose parents are divorced than among
students whose parents are not divorced. Meanwhile, the mean and standard deviation of the
Depression Test for students whose parents are not divorced were 15.67 and 3.1, respectively, and
the mean and standard deviation for students whose parents are divorced were 17.90 and 4.6,
respectively. The mean score difference between the groups was -2.23. The result was significant
(t = −3.01, p <0.07). This indicates that depression is higher among students whose parents are
divorced than among students whose parents are not divorced.
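The two mean differences reported above are simple subtractions of the group means (not divorced minus divorced). A minimal sketch using only the means stated in the answer:

```python
# Group means as reported in the answer.
anxiety_not_divorced, anxiety_divorced = 21.49, 22.70
depression_not_divorced, depression_divorced = 15.67, 17.90

# Negative values mean the 'divorced' group scored higher.
anxiety_diff = round(anxiety_not_divorced - anxiety_divorced, 2)
depression_diff = round(depression_not_divorced - depression_divorced, 2)

print(anxiety_diff, depression_diff)
```

Both differences are negative, which matches the negative t values: the divorced group has the higher mean on each test.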
Question 13: [7 marks]
b) State the appropriate statistical tests to test the three hypotheses listed in (a)
[2 marks]
c) State the assumptions required for the statistical test(s) used in (b) [2 marks]
Answer:
a) i. There is no significant difference between the Emotional Intelligence and
constructs. (Null Hypothesis)
ii. The Alternative Hypothesis might be that the reasoning scores between
Emotional Intelligence and posttest are Stress Tolerance.
iii. The Alternative Hypothesis might be that the reasoning scores of the
Emotional Intelligence is HIGHER than the mean scores of the posttest
b) The appropriate statistical test for the three hypotheses is the t-test.
c) As a parametric procedure, the paired-sample t-test makes several assumptions.
Although t-tests are quite robust, it is good practice to evaluate the degree of deviation
from these assumptions in order to assess the quality of the results. In a paired-sample
t-test, the observations are defined as the differences between two sets of values, and each
assumption refers to these differences, not the original data values. The paired-sample
t-test has four main assumptions:
i. The dependent variable must be continuous (interval/ratio).
ii. The observations are independent of one another.
iii. The dependent variable should be approximately normally distributed.
iv. The dependent variable should not contain any outliers.
Question 14: [5 marks]
[Figure: plot with axes labelled Weight (Berat) and Height]
Answer:
I. ZERO CORRELATION
If Income (X) and Height (Y) have NO relationship, then the slope
(β1) will be ZERO. In other words, there is NO SYSTEMATIC
RELATIONSHIP between X and Y. Some students with high Income have
low Height, while some students with low Income have high
Height.
A researcher conducted a study skills course over a period of five days among a group of
beginners, intermediate and advanced speakers of English. At the end of the programme he
administered a test to determine which group of students benefited from the programme.
The results of the study were analyzed using One-Way ANOVA and the results are shown in the
tables below:
Table 1
vi) What does the standard deviation tell you about the results?
Answer (a):
I. The null hypothesis (H0) is that there is no difference between the mean test scores of
the beginner, intermediate and advanced groups of English speakers.