Inggris Mining
Inggris Mining
To cite this article: Milos Jovanovic , Milan Vukicevic , Milos Milovanovic & Miroslav Minovic (2012): Using data mining
on student behavior and cognitive style data for improving e-learning systems: a case study, International Journal of
Computational Intelligence Systems, 5:3, 597-610
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to
anyone is expressly forbidden.
The publisher does not give any warranty express or implied or make any representation that the contents
will be complete or accurate or up to date. The accuracy of any instructions, formulae, and drug doses
should be independently verified with primary sources. The publisher shall not be liable for any loss, actions,
claims, proceedings, demand, or costs or damages whatsoever or howsoever caused arising directly or
indirectly in connection with or arising out of the use of this material.
International Journal of Computational Intelligence Systems, Vol. 5, No. 3 (June, 2012), 597-610
Using data mining on student behavior and cognitive style data for improving e-learning
systems: a case study
Abstract
In this research we applied classification models for prediction of students’ performance, and cluster models for
grouping students based on their cognitive styles in e-learning environment. Classification models described in this
paper should help: teachers, students and business people, for early engaging with students who are likely to
become excellent on a selected topic. Clustering students based on cognitive styles and their overall performance
should enable better adaption of the learning materials with respect to their learning styles. The approach is tested
using well-established data mining algorithms, and evaluated by several evaluation measures. Model building
process included data preprocessing, parameter optimization and attribute selection steps, which enhanced the
overall performance. Additionally we propose a Moodle module that allows automatic extraction of data needed for
educational data mining analysis and deploys models developed in this study.
Keywords: educational data mining, prediction, students, performance, classification, clustering, Moodle.
These goals are achieved with help of data mining often utilized as a significant part of e-learning systems.
techniques such as k-nearest neighbor, naive Bayes,
Romero et al. described how different data mining
decision trees, artificial neural networks, support vector
techniques can be used in that setting to improve the
machines, K-means, hierarchical clustering etc.12 course and the students’ learning.6
This research shows how one can leverage the available One of the most frequent research topics in the area of
data on student behavior, in order to predict success of
EDM (also investigated in this research) is the
students, as well as profile students into groups which
prediction of student performance.6,14,16 The main idea
may help improve existing learning material and behind this research direction is that based on student
collaborative learning. The study involves data from
activity one can predict the future outcome of student
students attending online (distance learning) university
performance. For the purpose of predicting students’
courses as suggested by Romero et al.,6 and extends final outcome on a course, researchers used various
available data with students cognitive styles.
techniques and algorithms. Kotsiantis et al., proposed an
Additionally we propose Moodle module that allows
incremental ensemble of classifiers as a technique for
automatic extraction of data needed for EDM analysis predicting students’ performance in distance
and deploys models evolved in this study.
education.17 Neuro-fuzzy system rules are used for
student knowledge diagnosis through game learning
The paper is structured as follows: Section 2 introduces environment.18 Kotsiantis also proposed a prototype
related work on using e-learning data and applying data
version of decision support system for prediction of
mining models. Architectural design of the decision-
students’ performance based on students’ demographic
support system is given in Section 3, with experimental characteristics and their marks in a small number of
results in using data mining models presented in Section
written assignments.19 Myller et al. used neural
4. Potential ways of using knowledge gained by data
networks (multilayer perceptron),20 and Traynor and
mining models is described in Section 5, and Section 6 Gibson used combination of Artificial Neural Networks
time will be set to zero value since it is in a timestamp student can spend undetermined amount of time reading
format. This is especially the case if the quiz is left open some forum without providing feedback to the system,
by the educator as is in the case of self evaluation tests it is difficult to determine if the user is active in the
that are open during the entire semester. By subtracting forum or not. In this case we used the activity log that
beginning time from end time one would get a negative tracks each click user makes on a link in a system.
time value. Such cases we had to exclude as if they did
not even attempt the quiz. The same issue occurred with Since Moodle provides module name as one of the
the assignments that students did not finish and upload. meta-data regarding that action we were able to track
the students’ movement through a forum. The time
Systems that have many users can suffer from data spent was determined as the addition of the times
redundancy issues. In case of Moodle the most common between two clicks. If a student made a last click in a
case of redundancy is in duplicate courses or user forum context and then was inactive for a prolonged
accounts. In our organization we minimized the period of time, as a referent time we decided to use an
occurrence of redundancy by using the centralized average time between two clicks in a forum context for
approach in generation of courses and user accounts. all users. This is caused by the fact that users often do
System administrator is generating courses and user not properly log out of the system. Usually they just
accounts upon teacher’s requests and it is his duty to close the web browser and move on to other activities.
primarily check if new addition is already entered in the Unfortunately, this does not leave any feedback when
system. This process practically excluded the possibility certain activity ended. For calculation of time spent on
of redundancy that targets this analysis. forums for every student on every course, we designed
specialized application that is integrated into stream
When it comes to the Forum module, extraction of the (Figure 1) and uses extracted data from Forum module
information about time spent is more complicated. Since
and Activity Log. Finally, extracted data is aggregated 3.2. Prediction of students’ success
on the student-course level.
We defined a classification model to predict if a student
would display excellent performance (i.e. highest
Additionally we used the data about students’ cognitive
grades) on a selected course. This problem is interesting
styles that are gathered from a questionnaire that we
since there are many stakeholders interested in
administered through Moodle. We administered self-
recognizing students with excellent
report MBTI questionnaire which is already
performance.45,46,47,48 For the input data for the
successfully used for analysis of student’s profiles.40
prediction, the model would use the data describing
The MBTI form has 95 forced-choice items that forms
student behavior on e-learning resources (e.g. forums,
four bipolar scales: Extraversion-Introversion (EI),
discussions, quizzes, posts, assignments) as described in
Sensing-Intuition (SN), Thinking-Feeling (TF) and
the previous subsection. The dataset contains 260
Judging-Perception (JP). A combination of these
instances. The preparation of data included extracting
dimensions builds 16 different types of cognitive
more features (such as grouping courses in math-
functioning. Introverts who are oriented primarily to the
oriented and social-oriented), normalizing features, and
Downloaded by [Dalhousie University] at 00:42 27 December 2012
negative example. LIFT ratio measures the degree to complex decision boundaries performed better. Overall,
which the predictions of a classification model are better both NeuralNet and AdaBoost gave quite good results,
than randomly-generated predictions. It is defined as the rendering models with quality which allow further use.
ratio of true positives to total positives resulting from Since the results are taken using cross-validation, we
the classification process compared to the fraction of can expect to successfully predict excellence of roughly
true positives in the overall population. We used both of 9 out of 10 students.
these measures to complement the evaluation based on
accuracy. This is important measure since in this After selecting the three most promising classification
research we are dealing with imbalanced data. algorithms for the task at hand, namely AdaBoost,
These measures are important because accuracy often NeuralNet and RandomForests, we tried to improve the
tends to overlook the classifier inability to predict all the performance, measured by AUC, by doing different
classes, when it is concentrating only to detect one preprocessing and parameter optimization steps. The
class. Testing is done in RapidMiner data mining setup for this preprocessing is shown in Figure 2.
platform,38 using default parameters and random seeds.
The results are shown in Table 2.
Looking at the results, several algorithms show good Table 3: Improvement of AUC by preprocessing, for
performance in generating the needed classification selected algorithms
models. These are NeuralNet, AdaBoost and
no handle optimize attribute
RandomForests, and all three are comparable with preprocess missing parameters selection
respect to accuracy. While RandomForests looks most RandForests 0.750 0.858 0.890 0.848
useful by accuracy, AUC evaluation measure does not Adaboost 0.826 0.779 0.839 0.838
prefer this algorithm, since the AUC value is too small. NeuralNet 0.835 0.767 0.853 0.812
Still, since we know the cognitive profile of these who always passed the Psychology course with “good”,
students, we can direct our effort in adapting our course but not with “excellent” results.
materials to fit that target group. For example, analysis
indicates that course on Mathematics is more suitable to Also, there are occasions when we cannot isolate a
Empiric and Judging cognitive styles. This is probably complete cognitive profile of successful students, as in
due to the nature of the subject that gives the upper edge Figure 5. However, partial information could be
to Empirics that are better in deductive thinking and observed, for example, looking at only first two
reasoning. Teacher can try to overcome that gap by cognitive attributes. Here, Empiric, as well as Introvert,
adapting materials to the opposing cognitive styles. are part of only the first cluster of students (first
Intuitives might benefit through learning by doing column), which all turned out excellent by the end of
approach, through simulations or games. Also the course. This links only those attributes (Empiric and
Perceivers could find it more appealing to use Introvert) to the success of students.
interactive multimedia material. Also we could offer
different examination approaches, as, for example,
Downloaded by [Dalhousie University] at 00:42 27 December 2012
for analysis of student behavior, more than has ever 62 (2007) pp. 183–221.
been possible in the traditional learning context. 10. A. C. Romero, and A. S. Ventura, Educational data mining:
A survey from 1995 to 2005, Journal of Expert Systems
Acknowledgements Applications, 33(1) (2007) 135-146.
11. C-H Weng, Mining fuzzy specific rare itemsets for
This research is partially funded by a grant from the education data, Knowledge-Based Systems 24 (5) (2011) pp.
Serbian Ministry of Science and Technological 697-708.
Development, project ID III 47003. 12. C. Romero and S. Ventura, Educational data mining: a
This research is partially funded with support from the review of the state-of-the-art, IEEE Trans. Syst. Man
Cybernet. C Appl. Rev., 40(6) (2011) 601–618.
European Commission through Life Long Learning
13. A. Krueger, A. Merceron and B. Wolf, A Data Model to
Programme project TRAILER No. 519141-LLP-1- Ease Analysis and Mining of Educational Data, in Proc.
2011-1-ES-KA3-KA3MP. Third International Conference on Educational Data
This publication reflects the views only of the author, Mining, (USA, Pittsburgh, 2010) pp. 131-140.
and the Commission cannot be held responsible for any 14. Y-H Wang, H-C Liao, Data mining for adaptive learning in
use which may be made of the information contained a TESL-based e-learning, Expert Systems with Applications
38 (6) (2011), pp. 6480-6485.
therein. 15. V.Ramesh, P.Parkavi, P.Yasodha, Performance Analysis of
Data Mining Techniques for Placement Chance Prediction,
References International Journal of Scientific & Engineering Research
2 (8) (2011).
1. Y-C Lee, N. Terashima, A Distance Instructional System 16. C. Vialardi, J. Chue, J.P. Peche, G. Alvarado, B. Vinatea, J.
with Learning Performance Evaluation Mechanism: Estrella and Á. Ortigosa, A data mining approach to guide
Moodle-Based Educational System Design, Distance
students through the enrollment process based on academic
Education Technologies 10 (2) (2012). doi: performance, User modeling and user-adapted interaction
10.4018/jdet.2012040104 21 (1-2) (2011), pp. 217-248. doi: 10.1007/s11257-011-
2. T. Martin-Blas, A. Serano-Fernandez, The role of new
9098-4.
technologies in the learning process: Moodle as a teaching 17. S. Kotsiantis, K. Patriarcheas and M. Xenos, A
tool in Physics, Computers & Education 52 (2009) pp. 35- combinational incremental ensemble of classifiers as a
44. doi:10.1016/j.compedu.2008.06.005
technique for predicting students’ performance in distance
3. I. Kazanidis, S. Valsamidis, T. Theodosiou and S. education, Knowledge-Based Systems, 23(6) (2010) 529-
Kontogiannis, Proposed framework for data mining in e- 535.
learning: The case of Open e-Class, in Proc. IADIS
18. K. Kuk, P. Spalevic, S. Ilic, M. Caric, Z. Trajcevski, A
International Conference of Applied Computing, (Rome, Model for Student Knowledge Diagnosis through Game
Italy, 2009), pp. 254–258. Learning Environment, Technics Technologies Education
4. F. J. García-Peñalvo, M. Á.Conde, M. Alier, María J.
Management – TTEM, 7 (1) (2012) 103-110.
Casany, Opening Learning Management Systems to 19. S. Kotsiantis, Use of machine learning techniques for
Personal Learning Environments, Journal of Universal educational proposes: a decision support system for
Computer Science 17(9)(2011), pp. 1222-1240.
forecasting students’ grades, Artificial Intelligence Review,
(Online First) (2011) 1-14.
20. N. Myller, J. Suhonen and E. Sutinen, Using Data Mining attributional strategies: a person oriented approach, High
for Improving Web-Based Course Design, in Proc. Educ 61 (2011), pp. 513–529. doi: 10.1007/s10734-010-
International Conference on Computers in Education, 9346-2
(USA, Washington, 2002) pp. 959- 964. 33. D. Perera, J. Kay, I. Koprinska, K. Yacef and O. R. Zaïane,
21. D. Traynor and J.P. Gibson, Synthesis and Analysis of Clustering and Sequential Pattern Mining of Online
Automatic Assessment Methods in CS1, in Proc. The 36th Collaborative Learning Data, IEEE Transaction on
SIGCSE Technical Symposium on Computer Science Knowledge and Data Engineering, 21 (6) (2009), pp. 759-
Education SIGCSE’05, (ACM Press., Louis Missouri, USA 772.
, 2005) pp. 495-499. 34. S.Y. Chen and X. Liu, Mining students' learning patterns
22. B. Minaei-bidgoli, D. A. Kashy, G. Kortmeyer and W. F. and performance in Web-based instruction: a cognitive style
Punch, Predicting student performance: an application of approach, Interactive Learning Environments 19 (2) (2011).
data mining methods with an educational Web-based doi:10.1080/10494820802667256
system, in Proc. 33rd International Conference on Frontiers 35. J.M. Adán-Coello C.M. Tobar E.S.J. de Faria, W.S de
in Education, (Colorado, Westminister, 2003) pp. 13-18. Menezes, R.L. de Freitas, Forming Groups for Collaborative
23. M. Delgado, E. Gibaja, M.C. Pegalajar and O. Pérez, Learning of Introductory Computer Programming Based on
(2006). Predicting Students' Marks from. Moodle Logs Students’ Programming Skills and Learning Styles,
using Neural Network Models, in Proc. International International Journal of Information and Communication
Downloaded by [Dalhousie University] at 00:42 27 December 2012
Capital Management: The case of Meta4 PeopleNet, Data & Knowledge Engineering (2012). doi:
International Journal of Information Management 31(4) 10.1016/j.datak.2012.03.005
(2011), pp. 395-399. 59. B. Chen and T. Bryer, Investigating Instructional Strategies
47. A. García-Crespo, R. Colomo-Palacios, J.M Gómez-Berbís, for Using Social Media in Formal and Informal Learning,
& M. Mencke, M,. BMR: Benchmarking Metrics The International Review of Research in Open and Distance
Recommender for Personnel issues in Software Learning, ISSN: 1492-3831, 13 (1) (2012).
Development Projects. International Journal of
Computational Intelligence Systems 2(3) (2009), pp. 257-
267.
48. S. Westlund, Leading Techies: Assessing Project
Leadership Styles Most Significantly Related to Software
Developer Job Satisfaction. International Journal of Human
Capital and Information Technology Professionals 2(2)
(2011), pp. 1-15. doi:10.4018/jhcitp.2011040101
49. O.C.S. Tzeng, S.L. Ware, J-M. Chen, Measurement and
Utility of Continuous Unipolar Ratings for the Myer-Briggs
Type Indicator, Journal of Personality Assessment, 53(4)
Downloaded by [Dalhousie University] at 00:42 27 December 2012
(1989) 727-738.
50. C. Romero, S. Ventura, P. G. Espejo and C. Hervs, Data
Mining Algorithms to Classify Students, in Proc. 1st
International Conference on Educational Data Mining
(EDM’08), (Montreal, Canada, 2008) pp. 8–17.
51. P. Lingras, M. Joshi, Experimental Comparison of Iterative
Versus Evolutionary Crisp and Rough Clustering,
International Journal of Computational Intelligence
Systems, 4(1)(2011), pp.12-28.
52. Y.-C. Lin, T.-K. Wu, S.-C. Huang, Y.-R. Meng, W.-Y.
Liang, Rough Sets as a Knowledge Discovery and
Classification Tool for the Diagnosis of Students with
Learning Disabilities, International Journal of
Computational Intelligence Systems, 4(1) (2011), pp.29-43.
53. M. Matijaš, M. Vukićević, S. Krajcar, Supplier Short Term
Load Forecasting Using Support Vector Regression and
Exogenous Input, Journal of Electrical Engineering
62(5)(2011) pp. 280-285. doi:10.2478/v10187-011-0044-9
54. B. Delibašić, M. Jovanović, M. Vukićević, M. Suknović, Z.
Obradović, Component-based decision trees for
classification, Intelligent Data Analysis 15 (5) (2011) pp.
671-693. doi: 10.3233/IDA-2011-0489
55. M. Suknovic, B. Delibasic, M. Jovanovic, M. Vukicevic,
D. Becajski-Vujaklija and Z. Obradovic, Reusable
components in decision trees induction algorithms,
Computational Statistics (2012). doi:10.1007/s00180-011-
0242-8.
56. B. Delibasic, K. Kirchner, J. Ruhland, M. Jovanovic, M.
Vukicevic, Reusable components for partitioning clustering
algorithms. Artificial Intelligence Review 32 (1-4) (2009)
pp. 59-75. doi: 10.1007/s10462-009-9133-6
57. M. Vukicevic, M. Jovanovic, B. Delibasic, S. Isljamovic,
M. Suknovic, Reusable component-based architecture for
decision tree algorithm design, International Journal on
Artificial Intelligence Tools (2012). doi:
10.1142/S0218213012500224
58. B. Delibasic, M. Vukicevic, M. Jovanovic, K. Kirchner, J.
Ruhland, M. Suknovic, An architecture for component-
based design of representative-based clustering algorithms,