Statistics in Education
_________________________________________________________________________
HMEF5113
STATISTICS FOR EDUCATIONAL RESEARCH
JANUARY 2020 SEMESTER
SPECIFIC INSTRUCTION
6. This assignment accounts for 100% of the total marks for the course.
ASSIGNMENT QUESTION
QUESTION 1 (50%)
OBJECTIVES:
The objective of this assignment is to enable you to develop knowledge and skills in the
following areas of competency:
i. Understand SPSS Statistics data file structure for exploratory data analysis
procedures; and
ii. Prepare the SPSS Statistics data file for descriptive data analysis.
INSTRUCTIONS:
Answer all questions in this assignment. Read the HMEF5113 module and the
recommended books on statistical analysis and SPSS Statistics.
You must submit this assignment as an MS Word file via myINSPIRE together with your SPSS
Statistics Data file and SPSS Statistics Viewer file (SPSS output file). Save your assignment
under your name (e.g. Zarina Assignment 1.docx for the MS Word file, Zarina Assignment 1.sav
for the SPSS Data file and Zarina Assignment 1.spv for the SPSS Viewer file). See further
instructions on the submission of your assignment under “Notes” below.
ASSIGNMENT QUESTION:
HMEF5113 Dataset for Assignments 1 and 2 January 2019.sav was extracted from an
experimental research study undertaken to determine the effectiveness of teaching
mathematics to Form 2 students using Learning Sites via the Virtual Learning
Environment (VLE). Students in the experimental group learnt via the VLE while those
in the control group learnt under the conventional “chalk and talk” method. The
study involved 250 students from two schools in Kuala Lumpur.
a. Generate a frequency count and histogram with normal distribution curve for
Pretest mathematics scores (Variable name: Pretest). Describe the frequency count
and normal distribution curve.
[3 marks]
b. Generate a frequency count and bar chart to show the distribution of Family Income
classification. Describe the frequency distribution output.
[3 marks]
c. What is random sampling? Using the HMEF5113 Dataset for Assignments 1 & 2.sav,
create a random sample comprising 100 cases from the 250 students who
participated in this experimental research. Using this random sample of 100 cases,
run the frequency procedure to show the distribution by Gender.
[4 marks]
d. Using the Posttest variable, create an ordinal-type variable with the following
categories: i. 20 marks and below, ii. 21 to 40 marks, iii. 41 to 60 marks, iv. 61 to 80
marks, and v. 81 marks and above. Show a frequency distribution table of the newly
created ordinal-type Posttest category variable. Describe the frequency distribution
table.
[6 marks]
e. Using the Pretest variable of students who participated in the experimental study,
compute the following measures of central tendency: i. Mean, ii. Median, and iii.
Mode; and the following measures of dispersion: i. Range, ii. Variance, and iii.
Standard deviation. Describe the output for the measures of central tendency and
the measures of dispersion.
[5 marks]
f. Run a crosstabulation between the Gender variable and the Posttest Category
variable (new variable created under section d. above). Describe the crosstabulation
findings.
[4 marks]
g. Run an exploratory data analysis of the Pretest and Posttest variables. Describe the
exploratory data analysis output.
[18 marks]
h. Describe how you would address missing values in a dataset when
respondents failed to fill in their responses.
[3 marks]
i. Discuss the differences between parametric statistics and non-parametric statistics.
[4 marks]
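The recoding asked for in item d of Question 1 can be sketched outside SPSS to check the category boundaries. The following is a minimal Python sketch, not the required SPSS procedure. It assumes whole-number marks, and the names `CATEGORIES`, `posttest_category`, `frequency_table` and `example_scores` are hypothetical; the scores are made up for illustration and are not taken from the dataset.

```python
from collections import Counter

# Ordinal categories from item d: (code, label, inclusive upper bound).
CATEGORIES = [
    (1, "20 marks and below", 20),
    (2, "21 to 40 marks", 40),
    (3, "41 to 60 marks", 60),
    (4, "61 to 80 marks", 80),
    (5, "81 marks and above", float("inf")),
]


def posttest_category(score):
    """Return the ordinal code (1 to 5) for a Posttest score."""
    for code, _label, upper in CATEGORIES:
        if score <= upper:
            return code
    # Only reached for values that cannot be compared, such as NaN.
    raise ValueError(f"cannot categorise score: {score!r}")


def frequency_table(scores):
    """Return (label, count, percent) rows in category order."""
    counts = Counter(posttest_category(s) for s in scores)
    total = len(scores)
    return [
        (label, counts[code], round(100 * counts[code] / total, 1))
        for code, label, _upper in CATEGORIES
    ]


# Hypothetical scores for illustration only; not taken from the dataset.
example_scores = [12, 20, 21, 35, 40, 41, 58, 60, 61, 79, 80, 81, 95]

for label, count, percent in frequency_table(example_scores):
    print(f"{label:<20} {count:>3} {percent:>6}%")
```

Using inclusive upper bounds keeps the boundary marks (20, 40, 60 and 80) in the lower category, which matches the wording of the categories.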
ASSIGNMENT FORMAT:
a. Use double spacing and 12-point Times New Roman font.
b. The assignment should contain about 3000 – 5000 words (15 – 20 pages).
c. Provide references using the American Psychological Association (APA) format.
d. References should be recent (year 2009 onwards).
QUESTION 2 (50%)
OBJECTIVE:
The objective of this assignment is to enable you to apply practical knowledge and skills in
statistical analysis using the SPSS Statistics software. This assignment will provide you with
knowledge and skills on statistics, statistical analysis and interpretation of the research
questions and hypotheses you have formulated.
INSTRUCTIONS:
Answer all questions in this assignment. Read the HMEF5113 Statistics for Educational
Research Module and other recommended books on statistical analysis and SPSS Statistics.
ASSIGNMENT QUESTION:
HMEF5113 Dataset for Assignments 1 & 2 May 2017.sav was extracted from an
experimental research study undertaken to determine the effectiveness of teaching
mathematics to Form 2 students using Learning Sites via the Virtual Learning
Environment (VLE). Students in the experimental group learnt via the VLE while those
in the control group learnt under the conventional “chalk and talk” method. The
study involved 250 students from two schools in Kuala Lumpur.
Study the HMEF5113 Dataset for Assignments 1 & 2 May 2017.sav carefully, run your
analysis using SPSS Statistics and answer the following questions:
a) Use the Posttest and Pretest variables to generate “Gain Score”. Compute the mean
and standard deviation of this newly generated variable by Experimental and Control
Group (Variable name: Method). Describe the mean and standard deviation output.
[2 marks]
b) Classify the Gain Scores derived from a) above into i. Low Gain Score and ii. High
Gain Score. “Low Gain Score” is defined as those obtaining gain scores of 10 marks
and below while “High Gain Score” refers to those obtaining gain scores of 11 marks
and above. Run a frequency count of the Low and High Gain Scores. What
conclusions can you make regarding the output based on this classification?
[2 marks]
c) Using the chi-square statistical test, show the relationship between the two
categorical variables, i.e. “Method” and “Low and High Gain Scores”. Discuss the
chi-square statistical test results.
[4 marks]
d) Test the normality of the distribution using Posttest as the dependent variable and
Gender as the independent variable. Make your conclusions based on the output
generated.
[3 marks]
e) Using the Posttest score as the dependent variable and Gender and Family Income as
the independent variables, state the relevant research questions and null and
alternative hypotheses to test whether significant differences occur among students
by Gender and Family Income. State the appropriate statistical tests for each
hypothesis.
[8 marks]
f) Run the appropriate statistical tests to test the hypotheses stated in e) above. You
are to make a decision whether to reject or not to reject the null hypotheses. Explain
your conclusions based on the scientific inquiry approach.
[17 marks]
g) Using the Pearson Product Moment Correlation Coefficient, show the relationship
between the Pretest and Posttest variables. Plot a scatter diagram to depict this
relationship with a fit line. What conclusions can you make based on the correlation
output?
[4 marks]
h) Run a multiple regression analysis using Posttest as the dependent variable and
Pretest, Tuition and Method as the independent variables. Discuss the regression
output.
[10 marks]
[TOTAL: 50 MARKS]
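The Gain Score computation and the Low/High classification described in the first two items of Question 2 can be sketched as follows. This is a minimal Python sketch under stated assumptions, not the required SPSS procedure. It assumes the gain score is Posttest minus Pretest, which the question does not state explicitly. The names `records`, `gain_score`, `gain_category` and `summarise_by_method` are hypothetical, and the records are made up for illustration rather than taken from the dataset.

```python
from collections import Counter
from statistics import mean, stdev

# Hypothetical records for illustration only; not taken from the dataset.
# Each tuple is (method, pretest, posttest).
records = [
    ("Experimental", 40, 62),
    ("Experimental", 55, 70),
    ("Experimental", 48, 55),
    ("Control", 42, 50),
    ("Control", 60, 66),
    ("Control", 51, 63),
]


def gain_score(pretest, posttest):
    """Gain score, assumed here to be Posttest minus Pretest."""
    return posttest - pretest


def gain_category(gain):
    """Low: 10 marks and below. High: 11 marks and above."""
    return "Low Gain Score" if gain <= 10 else "High Gain Score"


def summarise_by_method(rows):
    """Return {method: (mean gain, sample standard deviation of gain)}."""
    gains = {}
    for method, pretest, posttest in rows:
        gains.setdefault(method, []).append(gain_score(pretest, posttest))
    # stdev() uses the n - 1 denominator, so each group needs at least 2 cases.
    return {method: (mean(g), stdev(g)) for method, g in gains.items()}


summary = summarise_by_method(records)
for method, (m, sd) in summary.items():
    print(f"{method:<13} mean gain = {m:.2f}, SD = {sd:.2f}")

# Frequency count of the Low and High Gain Score categories.
category_counts = Counter(
    gain_category(gain_score(pre, post)) for _method, pre, post in records
)
print(dict(category_counts))
```

The threshold is written as `gain <= 10` so that a gain of exactly 10 marks is classified as Low, in line with the definitions in the question.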
ASSIGNMENT FORMAT: