Latika Project
2.SYSTEM STUDY
2.1 REVIEW OF LITERATURE
* Safira Nury Safitri et al. [2022] “Educational data mining using cluster
analysis methods and decision tree based on log mining”, Journal RESTI,
Vol. 1-6.
6) P. Ratnapala et al. [2014] used Educational Data Mining (EDM)
techniques to conduct a quantitative analysis of students’ interaction
with an e-learning system in instructor-led non-graded (NG) and graded (G)
courses. The exercise is useful for establishing guidelines for a series
of online short courses for these students. The access behaviour of a
group of 412 students in an e-learning system was analysed, and the
students were grouped into clusters with the K-Means clustering method
according to their course access log records. The results showed that
more than 40% of the student group were passive online learners in both
graded and non-graded learning environments, and that a difference in the
learning environment could change the online access behaviour of a
student group. Clustering divided the student population into five access
groups based on their course access behaviour. Among these groups, the
least-access group (NG: 41%, G: 42%) and the highest-access group
(NG: 9%, G: 5%) could be identified very clearly because their access
patterns differed markedly from those of the other groups.
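The clustering step described above can be illustrated with a minimal sketch. It assumes that each student is summarised by a single feature, the number of course accesses found in the log. The sample values, the function name `kmeans_1d`, the initialisation scheme, and the choice of three clusters are all hypothetical and are not taken from the study, which reported five access groups.

```python
def kmeans_1d(values, k, iterations=100):
    """Cluster one-dimensional values with plain K-Means (Lloyd's algorithm)."""
    ordered = sorted(values)
    # Deterministic start (an assumption): spread centroids evenly over the sorted data.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    labels = []
    for _ in range(iterations):
        # Assignment step: each value joins its nearest centroid.
        labels = [min(range(k), key=lambda c: abs(v - centroids[c])) for v in values]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            new_centroids.append(sum(members) / len(members) if members else centroids[c])
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    return labels, centroids


# Hypothetical access counts for ten students (not data from the study).
access_counts = [2, 3, 1, 4, 25, 30, 28, 90, 95, 100]
labels, centroids = kmeans_1d(access_counts, k=3)
# Share of students in each cluster, comparable to the group percentages above.
shares = {c: labels.count(c) / len(labels) for c in range(3)}
```

With real log data, the share of students in the lowest-access cluster would correspond to the "passive learner" percentage reported in the study.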
9) Chunxia Wang [2021] explored the application of data mining
techniques to analyze student behavior in online English education. Wang
proposes a method that combines the Apriori algorithm for association
rule mining with fuzzy neural networks to process and analyze large
volumes of student learning data. The research addresses limitations of
traditional methods, such as low processing efficiency and high memory
requirements. The proposed approach involves collecting student behavior
data, establishing a learning behavior model, and applying data mining
techniques for preparation, statistics, and analysis. The author reports
that this method achieves better data processing efficiency, lower memory
usage, and lower prediction errors than conventional approaches. This
research contributes to the growing field of educational data mining and
offers potential insights for enhancing online English education systems
in the context of increasing global economic integration and the growing
importance of the English language.
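The Apriori part of this approach can be sketched as follows. This is only an illustration of frequent-itemset mining and rule confidence. It does not reproduce Wang's method, and the fuzzy neural network stage is omitted. The session data, the activity names, and the 0.4 support threshold are hypothetical.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset meeting min_support."""
    n = len(transactions)
    transactions = [frozenset(t) for t in transactions]

    def support(itemset):
        # Fraction of transactions that contain every item of the itemset.
        return sum(itemset <= t for t in transactions) / n

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items if support(frozenset([i])) >= min_support}
    frequent = {}
    size = 1
    while current:
        frequent.update({s: support(s) for s in current})
        size += 1
        # Join step: merge frequent itemsets into candidates one item larger.
        candidates = {a | b for a in current for b in current if len(a | b) == size}
        # Prune step: every (size - 1)-subset of a candidate must itself be frequent.
        current = {
            c for c in candidates
            if all(frozenset(sub) in frequent for sub in combinations(c, size - 1))
            and support(c) >= min_support
        }
    return frequent


def confidence(frequent, antecedent, consequent):
    """Confidence of the rule antecedent -> consequent."""
    a = frozenset(antecedent)
    return frequent[a | frozenset(consequent)] / frequent[a]


# Hypothetical learning sessions: the set of activities a student performed.
sessions = [
    {"video", "quiz"},
    {"video", "quiz", "forum"},
    {"video", "forum"},
    {"quiz"},
    {"video", "quiz"},
]
frequent = apriori(sessions, min_support=0.4)
rule_conf = confidence(frequent, {"video"}, {"quiz"})
```

The prune step is what keeps the candidate set small: an itemset is only counted if all of its smaller subsets were already frequent.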
11) Anduela Lile [2011] observed that Educational Data Mining has
recently become an emerging research field used to extract knowledge and
discover patterns from e-learning systems. The educational system in
Albania is currently facing a number of issues, such as identifying
students’ needs, personalizing training, and predicting the quality of
student interactions. Educational Data Mining provides a set of
techniques that can help the educational system overcome these issues.
The objective of this research is to introduce Educational Data Mining by
describing a step-by-step process that uses a variety of techniques:
Attribute Weighting (Weighting by Information Gain, Relief, Chi-Squared,
Uncertainty), Clustering (K-Means), Classification (Tree Induction), and
Association Mining (Apriori, FPGrowth, Create Association Rule, GSP), with
the goal of discovering useful knowledge from the Moodle LMS. Analyzing
the mining results enables educational institutions to allocate resources
better and organize the learning process so as to improve the learning
experience of students as well as increase their profits. The
experimental results showed that the data mining model presented in this
research was able to obtain comprehensible and logical feedback from the
LMS data describing students’ learning behavior patterns. For this work,
the RapidMiner (v5.0) and Weka (v3.6.2) data mining tools were used to
mine data from the Moodle system used in the “C Programming - CEN112”
course taken by Computer Engineering students at Epoka University during
the Spring Semester 2009-2010.
* Anduela Lile [2011] “Analyzing E-Learning Systems Using Educational
Data Mining Techniques”, Mediterranean Journal of Social Sciences,
Vol. 2 (1-17).
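Of the attribute weighting techniques listed in the review of Lile's work, weighting by information gain is the simplest to illustrate. The sketch below computes it from first principles rather than through RapidMiner or Weka. The attribute names (`forum_posts`, `device`), the target column `passed`, and all values are hypothetical.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Entropy of the target minus its expected entropy after splitting on attribute."""
    base = entropy([r[target] for r in rows])
    remainder = 0.0
    for value in {r[attribute] for r in rows}:
        subset = [r[target] for r in rows if r[attribute] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return base - remainder


# Hypothetical LMS records (not data from the reviewed study).
rows = [
    {"forum_posts": "high", "device": "laptop", "passed": "yes"},
    {"forum_posts": "high", "device": "phone", "passed": "yes"},
    {"forum_posts": "low", "device": "laptop", "passed": "no"},
    {"forum_posts": "low", "device": "phone", "passed": "no"},
]
weights = {a: information_gain(rows, a, "passed") for a in ("forum_posts", "device")}
```

In this toy data, `forum_posts` separates the two outcomes perfectly and so receives the full weight, while `device` carries no information about the outcome.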
2.2.1 DRAWBACKS
2.3.1 FEATURES
3.DATASET DESCRIPTION
This dataset captures various personal, academic, and social factors that
could influence student behaviour and performance in online education
environments. The columns in the dataset include demographic
information, study habits, and interaction patterns in online learning.
Below is a detailed description of each column:
1. Gender: Categorical variable indicating the gender of the student
(e.g., Male, Female, Other).
Dataset Name:
This dataset is useful for analyzing the effects of various factors (such as
economic status, access to technology, social interactions, and study
habits) on student engagement, satisfaction, and academic performance
in an online learning environment. The data can be used for generating
insights into how different demographic and behavioral aspects influence
learning outcomes during online education.
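As a minimal sketch of the kind of analysis described above, the following code averages an outcome within each category of a factor. The sample rows, their values, and the `Performance` column name are assumptions made for illustration only. The actual dataset file and its outcome columns are not specified here.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows; the values and the "Performance" column are assumptions.
SAMPLE_CSV = """Gender,Home Location,Economic Status,Performance
Female,Urban,Middle,80
Male,Rural,Low,60
Female,Rural,Middle,70
Male,Urban,Low,50
"""


def mean_by_factor(rows, factor, outcome):
    """Average a numeric outcome column within each category of a factor column."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[factor]].append(float(row[outcome]))
    return {category: sum(vals) / len(vals) for category, vals in groups.items()}


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
by_status = mean_by_factor(rows, "Economic Status", "Performance")
by_location = mean_by_factor(rows, "Home Location", "Performance")
```

Comparing the per-category means across factors gives a first indication of which demographic or behavioural attributes are worth examining more closely.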
1. Gender
2. Home Location
3. Level of Education
4. Age (Years)
5. Number of Subjects
7. Economic Status
8. Family Size
7. Prediction Module
9. Deployment Module
4.DATA ANALYSIS