0% found this document useful (0 votes)

11 views6 pages

Chapter 04

Uploaded by

RameshPrasadBhatta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views6 pages

Chapter 04

Uploaded by

RameshPrasadBhatta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

RESEARCH METHODOLOGY FOLLOWED

Chapter 4
Research Methodology Followed

4.1 Research Methodology for Predicting Academic Trends for Effective

Decision Making Using Data Mining Techniques

The proposed methodology has been performed in phased manner comprising of following: a)
study of related literature, b) study of functional requirement of education and training followed
in academic institute, c) synthesizing and analyzing of educational dataset, d) algorithmic study
of data mining techniques used, e) implementation of mining techniques on dataset, extracting
and evaluating results generated and f) predicting the academic trends obtained using different
data mining techniques.

Study of Related Literature: For the proposed study, we have studied the literature which
includes Knowledge discovery in databases (KDD) processes, data mining methods, and
techniques. Apart from this, literature related to software and tools like WEKA, Rapid Miner,
SPSS, Hadoop and MatLab etc. have been studied.

Study of Functional Requirements of Education and Training Followed in the Academic

Institute: In this phase, fundamental requirements for education and training followed in
institutes are analyzed. There are following pointers which have been analyzed are mentioned
as follows: a) problems being faced with the current educational model, b) difference in
traditional and modern approach of teaching, c) need of smart learning environments, d)
policies and practices adopted in education, e) challenges exist in the educational data
classification, f) role of data mining in education, g) necessity of educational data mining, i)
how mining techniques have solved the problems, j) the sample size of educational dataset, k)
composition of faculty members in institute, l) courses which are currently being taught, m)
number of students enrolled in the courses, n) teaching methodology followed, o) success rate
of students in placement drives, p) inclusion of skill-oriented courses into curricula, q)
feedback analysis.

49 | P a g e
RESEARCH METHODOLOGY FOLLOWED

Synthesize Educational Dataset and Predictive Analytics of Educational Trends Using

Data Mining Techniques: In this phase, the educational datasets have been synthesized,
semi-synthesized and obtained from educational institutions. After data collection, data has
been pre-processed, cleaned and then the algorithmic approach of data mining techniques
have been applied over the dataset. The approach has been followed based on certain
parameters which are mainly followed in academic institutions for analysis are categorized as
follows: a) Attendance, b) GPA, c) reasoning skills, d) quantitative skills, e) communication
skills, f) technical skills, g) assignment work, h) practical work, i) presentations etc. which
have not been limited to these parameters only. The sample size of the dataset has been
increased and extended to test for its suitability and prediction accuracy.

Implementation and Evaluation of Results: Following the above approach, educational

datasets synthesized for experimentation purpose have been implemented into mining models
using classification and clustering techniques.

1) The quality assurance of academic institutions has been predicted taking into
consideration the parameters which are categorized as: a) teaching skills, b) course
content and c) infrastructure. This quality assessment of an institution has been done
using Regression techniques of statistics which is the first approach of data mining
followed.

2) Then, the classification of the educational dataset is performed using Decision tree
classifiers. A decision tree classifier has been implemented in order to obtain the
following: a) prediction of performance of students in a particular class, b)
identification of students whose attendance is short and have performed poorly in
sessional, and d) calculating information gain, which is a metric that shows how well
one attribute classifies the training data.

3) Further, in this direction, mining of educational dataset using K-Means clustering

technique has been performed. The goal of the K-Means algorithm is to minimize the
total distance between the cluster and its corresponding centroid. Using K-Means,
students have been clustered based on their performance in sessional, attendance and
overall performance in class. The centroid values have been calculated from the

50 | P a g e
RESEARCH METHODOLOGY FOLLOWED

educational dataset taking K-clusters, which helps instructors to achieve the following
results: a) identification of students who are short of attendance, b) performed poorly in
sessional, and c) cluster students who need special attention. Apart from it on, it is also
concluded, that on increasing the value of K, the accuracy becomes better and K-Means
find the better grouping of the data.

4) Educational data analysis related to clustering of students using neural network based
classification and clustering techniques have been performed. Neural networks are
basically a group of interconnected neurons which uses computational or mathematical
models to process information. A self-organizing map is a type of ANN (Artificial
Neural Network) which consists of neurons and each neuron is associated with a weight
vector of the same dimension as the input data vectors. It is an unsupervised neural
network algorithm which projects high-dimensional data onto a two-dimensional map.
In this technique, similar data items are mapped to nearby locations which help in
pattern recognition. Neural network based pattern recognition does the following: a)
classify inputs into a set of target categories, b) helps to select data, c) create and train a
network, d) evaluate its performance using cross-entropy and confusion matrices.

5) Another data mining technique which has been followed in research methodology is
Association rule mining. The knowledge has been extracted from a semi-synthesized
dataset specially created for this purpose for the students of engineering background.
Using ARM technique, preferable courses have been extracted from the dataset for
students to undergo industrial training. ARM is used to find associations between
frequently occurring variables. Association rules are generated based on the frequent
variables in datasets. Apriori is the algorithm used for mining of frequent patterns from
the transaction database. Through this methodology, rules have been discovered using
Apriori algorithm which helps instructors a) to find interest of students towards industry
oriented courses in an e-learning environment, b) to enhance the effectiveness of
academic planning/decision-making, c) to extract knowledge rules related to industry
demanding courses which needs to be introduced into syllabi.

6) Support Vector Machines are one of the supervised learning methods which have been
used in our proposed methodology for both regression and classification. Using this data

51 | P a g e
RESEARCH METHODOLOGY FOLLOWED

mining technique on another educational dataset, SVM classifiers have predicted the
placement of students based on parameters which are as follows: a) attendance, b) GPA,
c) reasoning skills, d) quantitative skills, e) communication skills, f) technical skills. It
is also concluded that, in many cases, students focus only on their regular curriculum
besides attaining those skills which are also necessary for the overall development of
student and their placements.

7) Further, in this direction, the Naive Bayes data mining technique has been used to

students. The same dataset and attributes have been used for experimentation purpose
which is being used for support vector machines. The knowledge extracted using this
technique has helped to obtain the following results: a) helps management authorities of
the institute to improve student placements, b) helps instructors to guide students to
focus on improving skills like aptitude, reasoning, and communication etc. apart from
technical skills to get placed.

8) In the proposed methodology, the next data mining technique, using which academic
data has been analyzed is K-Nearest Neighbor. Using this technique, the nearest
neighbor classes have been predicted for the attribute i.e. class performance. In K-
Nearest Neighbors technique, using K value, the nearest class for the upcoming group
of fresh students is determined which helps in: a) identifying group of those students
who are having good practical as well as good overall performance in the class, b)
strengthens the decision-making approach of instructors to monitor the capabilities of
the group, c) helps management of institute to adopt some new pedagogies to improve
student skills and placements, d) identifying those learners who are showing meager
performance in class, e) improving quality education. To get more accurate results, the
centroid value is increased in K-Means technique and followed nearest neighbor search
using distance metrics i.e. Minkowski, Chebychev, Euclidean Distance Vector etc. On
increasing the value of K-Nearest Neighbors, more accuracy in the prediction of each
class is obtained. The majority of the K nearest neighbors decides the class of any point.

9) The final research methodology approaches followed to perform educational data

mining are using Hadoop and Python machine learning language. A methodology using

52 | P a g e
RESEARCH METHODOLOGY FOLLOWED

MapReduce framework has been proposed. Hadoop distributed file system is used to
hold a large amount of data. The files are stored in a redundant fashion across multiple
machines which ensure their endurance to failure and parallel applications. Here, using
HDFS, tasks run over Map Reduce and output is obtained after aggregation of results.
The knowledge extracted using this technique has been implemented in order to obtain
the following results: a) guiding the students to choose and to focus on the right
course(s) based on their personal preferences, b) blending the concepts of data mining
and classification with those of big data, c) deriving right blend of courses for students
to pursue appropriate courses/trainings and to enhance their career prospects.

Machine learning is the need of the hour, as it is a fastest growing and revolutionary
part of the IT industry. In Machine learning, data analytics is done in a way that equips
coherent prototype building. Machine learning languages have inbuilt packages and
algorithms which emphasize, imbibe and train from data to find unknown observation
and meaningful information. The Python programming language is popular language of
machine learning because of following reasons: a) It is having a supportive multiplicity
and performance trade-off, b) is more perceptive than other languages, c) it consists of a
pattern of schema, has inbuilt libraries and packages which are very helpful in working
with machine learning systems, d) solves the complex set of machine learning tasks. But
in spite of all these powerful features, python programming language and its
contribution for educational data mining, analytics of educational data are still not
explored and utilized for improving the educational sector, learning analytics. In
proposed work, using Python, classification of educational dataset synthesized for
experimentation purpose has been performed by different classifiers and for that, a
validation dataset has been created, algorithms have been used to build the model and
finally, evaluation of data has been performed using these models. The best model
results are obtained and compared on the basis of their accuracy measures for
classifying the data. Apart from it, the results obtained have made the predictions as
follows: a) students overall performance in class, b) aptitude skills of class, c) students
attendance in class for a particular course.

53 | P a g e
RESEARCH METHODOLOGY FOLLOWED

Web-Based Data Mining Tools for Performing Feedback Analysis and Association Rule
Mining: As a part of the proposed methodology, web-based tools have been developed using
Asp.Net and php. Using Asp.Net, web-enabled association rule mining technique based tool
has been proposed which uses a SQL query mechanism for querying the discovered
knowledge in the form of association rules. The proposed web-based tool is helpful for
universities/institutions in providing students the appropriate guidance to opt for the right
course among the elective courses. This tool can be utilized a) to generate the combination of
elective courses mostly opted on the basis of feedback of students, b) to generate the
combination of elective courses best recommended on the basis of feedback from industry
experts, c) to help university/institute to adopt courses which are considered to be both
interesting and beneficial for students. Another tool has been developed in php with MySQL
for feedback analysis. The parameters of feedback are categorized as: a) teaching skills, b)
course content and c) infrastructure quality. The feedback is gathered from students/corporate
employees and through the proposed tool, results have been generated. The tool helps
management to obtain the following results: a) improving in-house training skills, b)
improving course content designed for trainings, c) improving pedagogies, d) improving
infrastructure quality.

4.2 Conclusion

This chapter contains the research methodology followed. The research methodology divided
into five phases has been described and discussion about these phases has been presented.

54 | P a g e

Extending The Student's Performance Via K Means and Blended Learning
No ratings yet
Extending The Student's Performance Via K Means and Blended Learning
4 pages
Ejsr 43 1 03
No ratings yet
Ejsr 43 1 03
6 pages
Analysis of Students'Critical Thinking Skills Using Data Mining Approaches (Survey Based Research)
No ratings yet
Analysis of Students'Critical Thinking Skills Using Data Mining Approaches (Survey Based Research)
5 pages
Paper 31-Educational Data Mining Students Performance Prediction
No ratings yet
Paper 31-Educational Data Mining Students Performance Prediction
9 pages
Management-Mining Students Data To Predict Student
No ratings yet
Management-Mining Students Data To Predict Student
6 pages
A Survey On Educational Data Mining Techniques
No ratings yet
A Survey On Educational Data Mining Techniques
5 pages
Discerning Learner's Erudition Using Data Mining Techniques
No ratings yet
Discerning Learner's Erudition Using Data Mining Techniques
6 pages
Predicting Students Performance Using Data Mining Technique With Rough Set Theory Concepts
No ratings yet
Predicting Students Performance Using Data Mining Technique With Rough Set Theory Concepts
7 pages
Student's Placement Eligibility Prediction Using Fuzzy Approach
No ratings yet
Student's Placement Eligibility Prediction Using Fuzzy Approach
5 pages
Chem 111 Course Outline
No ratings yet
Chem 111 Course Outline
2 pages
Data Mining On Educational Domain: Nikhil Rajadhyax Prof. Rudresh Shirwaikar
No ratings yet
Data Mining On Educational Domain: Nikhil Rajadhyax Prof. Rudresh Shirwaikar
6 pages
How To Optimize An Expert Advisor Using MetaTrader 4 Strategy Tester
100% (2)
How To Optimize An Expert Advisor Using MetaTrader 4 Strategy Tester
10 pages
Top 10 Data Mining Papers
No ratings yet
Top 10 Data Mining Papers
126 pages
Role of Data Mining in Education For Improving Students Performance For Social Change
No ratings yet
Role of Data Mining in Education For Improving Students Performance For Social Change
2 pages
Synopsis New
No ratings yet
Synopsis New
5 pages
Data Mining Applications: A Comparative Study For Predicting Student's Performance
No ratings yet
Data Mining Applications: A Comparative Study For Predicting Student's Performance
7 pages
Surveyof Data Miningin Elearning Managment System 2
No ratings yet
Surveyof Data Miningin Elearning Managment System 2
16 pages
Running Head:: Data Mining 1
No ratings yet
Running Head:: Data Mining 1
7 pages
Feature Selection Techniques and Classification Al
No ratings yet
Feature Selection Techniques and Classification Al
14 pages
Student Cluster Analysis Based On Moodle Data and Academic Performance Indicators
No ratings yet
Student Cluster Analysis Based On Moodle Data and Academic Performance Indicators
4 pages
InTech-Mining Enrollment Data Using Descriptive and Predictive Approaches
No ratings yet
InTech-Mining Enrollment Data Using Descriptive and Predictive Approaches
21 pages
Salah Hashim 2020 IOP Conf. Ser. Mater. Sci. Eng. 928 032019
No ratings yet
Salah Hashim 2020 IOP Conf. Ser. Mater. Sci. Eng. 928 032019
19 pages
1.student Performance Prediction Techniques
No ratings yet
1.student Performance Prediction Techniques
5 pages
Student Performance Prediction by Using Data Mining Classification Algorithms
No ratings yet
Student Performance Prediction by Using Data Mining Classification Algorithms
6 pages
Regression Analysis of Student Academic Performance Using Deep Learning
No ratings yet
Regression Analysis of Student Academic Performance Using Deep Learning
16 pages
Unit-I Python Notes
No ratings yet
Unit-I Python Notes
62 pages
Yash 21BSDS12 Perdictive Analysis Report
No ratings yet
Yash 21BSDS12 Perdictive Analysis Report
20 pages
Data Mining Review1
No ratings yet
Data Mining Review1
5 pages
PM Web 18058
No ratings yet
PM Web 18058
18 pages
V3i12 0295
No ratings yet
V3i12 0295
9 pages
Badr 2016
No ratings yet
Badr 2016
10 pages
Chapter Two
No ratings yet
Chapter Two
7 pages
Student Performance Analysis Using Educa
No ratings yet
Student Performance Analysis Using Educa
8 pages
ICSMB2016-C Anuradha
No ratings yet
ICSMB2016-C Anuradha
7 pages
Prediction of Student Academic Performance by An Application of K-Means Clustering Algorithm
No ratings yet
Prediction of Student Academic Performance by An Application of K-Means Clustering Algorithm
3 pages
Analysis of Student Academic Performance Using Clustering Techniques
No ratings yet
Analysis of Student Academic Performance Using Clustering Techniques
21 pages
Development of Student's Academic Performance Prediction Model
No ratings yet
Development of Student's Academic Performance Prediction Model
16 pages
Student Performance Prediction by Using Data Mining Classification Algorithms
No ratings yet
Student Performance Prediction by Using Data Mining Classification Algorithms
5 pages
Predicting Student Academic Success DDA
No ratings yet
Predicting Student Academic Success DDA
26 pages
Review On Prediction Algorithms in Educational Data Mining
No ratings yet
Review On Prediction Algorithms in Educational Data Mining
2 pages
(Cybernetics and Information Technologies) Predicting Student Performance by Using Data Mining Methods For Classification
No ratings yet
(Cybernetics and Information Technologies) Predicting Student Performance by Using Data Mining Methods For Classification
12 pages
Chapter 7 PDF
No ratings yet
Chapter 7 PDF
2 pages
Prediction Clustering
No ratings yet
Prediction Clustering
16 pages
Data Mining: A Prediction of Performer or Underperformer Using Classification
No ratings yet
Data Mining: A Prediction of Performer or Underperformer Using Classification
5 pages
Classification Model of Prediction For Placement of Students
No ratings yet
Classification Model of Prediction For Placement of Students
9 pages
Student Performance Evaluation in Educat
No ratings yet
Student Performance Evaluation in Educat
3 pages
Article 6
No ratings yet
Article 6
6 pages
A Survey On Educational Data Mining Techniques in Predicting Student's Academic Performance
No ratings yet
A Survey On Educational Data Mining Techniques in Predicting Student's Academic Performance
3 pages
The Journal of Engineering - 2019 - Li - Educational Data Mining For Students Performance Based On Fuzzy C Means
No ratings yet
The Journal of Engineering - 2019 - Li - Educational Data Mining For Students Performance Based On Fuzzy C Means
6 pages
Krishnan N. Machine Learning For Materials Discovery. Numerical Recipes... 2024
No ratings yet
Krishnan N. Machine Learning For Materials Discovery. Numerical Recipes... 2024
287 pages
Data Mining For Small Student Data Set - Knowledge Management System For Higher Education Teachers
No ratings yet
Data Mining For Small Student Data Set - Knowledge Management System For Higher Education Teachers
11 pages
Prediction of Students' Educational Status Using CART Algorithm, Neural Network, and Increase in Prediction Precision Using Combinational Model
No ratings yet
Prediction of Students' Educational Status Using CART Algorithm, Neural Network, and Increase in Prediction Precision Using Combinational Model
5 pages
Educational Data Mining Techniques Approach To Predict Student's Performance
No ratings yet
Educational Data Mining Techniques Approach To Predict Student's Performance
4 pages
Literature Review
No ratings yet
Literature Review
11 pages
Mining Students Data To Analyze Learning Behavior: A Case Study
No ratings yet
Mining Students Data To Analyze Learning Behavior: A Case Study
4 pages
20122
No ratings yet
20122
22 pages
(Fa) Fianl Research Paper Data Mining..
No ratings yet
(Fa) Fianl Research Paper Data Mining..
59 pages
Handling Missing Value in Decision Tree Algorithm PDF
No ratings yet
Handling Missing Value in Decision Tree Algorithm PDF
6 pages
Evaluation of Student Academic Performan
No ratings yet
Evaluation of Student Academic Performan
7 pages
Analysis of Educational
No ratings yet
Analysis of Educational
5 pages
Review On Prediction Algorithms in Educational Data Mining: A.Dinesh Kumar, R.Pandi Selvam, K.Sathesh Kumar
No ratings yet
Review On Prediction Algorithms in Educational Data Mining: A.Dinesh Kumar, R.Pandi Selvam, K.Sathesh Kumar
8 pages
Final Survey Paper 17-9-13
No ratings yet
Final Survey Paper 17-9-13
5 pages
Paper Dinesh Clustering Techniques
No ratings yet
Paper Dinesh Clustering Techniques
5 pages
(Nonlinear (6-31) : Structures GTU-Sem. 3-Comp/T) Binary Tree
No ratings yet
(Nonlinear (6-31) : Structures GTU-Sem. 3-Comp/T) Binary Tree
25 pages
Muskingum Routing - Example
No ratings yet
Muskingum Routing - Example
12 pages
23 Ex 5G Absolute Maximum and Minimum
No ratings yet
23 Ex 5G Absolute Maximum and Minimum
8 pages
Asymptotic Notations
No ratings yet
Asymptotic Notations
18 pages
Timeline of Probability and Statistics
100% (1)
Timeline of Probability and Statistics
3 pages
Summer-Math-Entering-Gr - 11-Honors
No ratings yet
Summer-Math-Entering-Gr - 11-Honors
19 pages
Cs3452 - Toc - QB New
No ratings yet
Cs3452 - Toc - QB New
10 pages
Earley Parsing PDF
No ratings yet
Earley Parsing PDF
27 pages
Programming Preliminaries (Chapter 3)
No ratings yet
Programming Preliminaries (Chapter 3)
7 pages
Lecture1 AML
No ratings yet
Lecture1 AML
16 pages
DS Unit 2
No ratings yet
DS Unit 2
34 pages
24: 12.07.05 Flory-Huggins Theory: Today
No ratings yet
24: 12.07.05 Flory-Huggins Theory: Today
4 pages
Voiced/Unvoiced Decision For Speech Signals Based On Zero-Crossing Rate and Energy
No ratings yet
Voiced/Unvoiced Decision For Speech Signals Based On Zero-Crossing Rate and Energy
5 pages
Information Theory, Pattern Recognition and Neural Networks: Part III Physics, January 2007
No ratings yet
Information Theory, Pattern Recognition and Neural Networks: Part III Physics, January 2007
2 pages
Historical Development (Chapter 1)
No ratings yet
Historical Development (Chapter 1)
7 pages
Control Statements
No ratings yet
Control Statements
44 pages
Chapter 02
No ratings yet
Chapter 02
12 pages
Cbse
No ratings yet
Cbse
64 pages
Module 5 - Probability Assignment DS
No ratings yet
Module 5 - Probability Assignment DS
2 pages
Goertzel's Algorithm
No ratings yet
Goertzel's Algorithm
4 pages
Pointers
No ratings yet
Pointers
7 pages
C Chap06
No ratings yet
C Chap06
30 pages
Chapter 6
No ratings yet
Chapter 6
13 pages
Icesc48915.2020.9155615
No ratings yet
Icesc48915.2020.9155615
6 pages
Numerical Analysis With Optimiz PDF
No ratings yet
Numerical Analysis With Optimiz PDF
102 pages
Query Optimization
No ratings yet
Query Optimization
7 pages
Gov Decisions
No ratings yet
Gov Decisions
6 pages
Kristian Perriu Audio Classification
No ratings yet
Kristian Perriu Audio Classification
5 pages
PCA and Sparse PCA Principal Component Analysis
No ratings yet
PCA and Sparse PCA Principal Component Analysis
2 pages
Journal December 21
No ratings yet
Journal December 21
181 pages
Unit 5
No ratings yet
Unit 5
11 pages
Strings
No ratings yet
Strings
5 pages
FPGA
No ratings yet
FPGA
20 pages
18CS54 - ATCI - MODULE 4 - TURING MACHINES - Part 2
No ratings yet
18CS54 - ATCI - MODULE 4 - TURING MACHINES - Part 2
19 pages
A Package Is A Collection of Similar Types of Classes
No ratings yet
A Package Is A Collection of Similar Types of Classes
6 pages
Answer Key Quizactivity - Mansci
No ratings yet
Answer Key Quizactivity - Mansci
10 pages
Math 365 Project
No ratings yet
Math 365 Project
2 pages
Assignment 9 July 2022 Solution
No ratings yet
Assignment 9 July 2022 Solution
4 pages
Arrays
No ratings yet
Arrays
3 pages
Lab 2
No ratings yet
Lab 2
2 pages
EDUCATION DATA MINING FOR PREDICTING STUDENTS’ PERFORMANCE
From Everand
EDUCATION DATA MINING FOR PREDICTING STUDENTS’ PERFORMANCE
Dr. GEETHA N DATA SCIENTIST, BENGALURU
No ratings yet
Mixed Methods Research: Applying AI Tools for Effective Writing and Publishing
From Everand
Mixed Methods Research: Applying AI Tools for Effective Writing and Publishing
Krishna Bista
No ratings yet

Chapter 04

Uploaded by

Chapter 04

Uploaded by

RESEARCH METHODOLOGY FOLLOWED

4.1 Research Methodology for Predicting Academic Trends for Effective

Study of Functional Requirements of Education and Training Followed in the Academic

Synthesize Educational Dataset and Predictive Analytics of Educational Trends Using

Implementation and Evaluation of Results: Following the above approach, educational

3) Further, in this direction, mining of educational dataset using K-Means clustering

9) The final research methodology approaches followed to perform educational data

You might also like