0% found this document useful (0 votes)

106 views16 pages

Prediction Clustering

The document summarizes research on analyzing student academic performance using clustering techniques. It compares the performance of 4 clustering algorithms (k-means, k-medoids, fuzzy c-means, expectation maximization) on student academic data from private colleges. The algorithms are evaluated based on purity, normalized mutual information, and time taken to form clusters. Related work applying data mining techniques like clustering and classification to predict student performance and identify patterns is also reviewed.

Uploaded by

isaac

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

106 views16 pages

Prediction Clustering

Uploaded by

isaac

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

International Journal of Pure and Applied Mathematics

Volume 119 No. 15 2018, 309-323

ISSN: 1314-3395 (on-line version)
url: http://www.acadpubl.eu/hub/
Special Issue
http://www.acadpubl.eu/hub/

ANALYSIS OF STUDENT ACADEMIC PERFORMANCE USING

CLUSTERING TECHNIQUES
K. Govindasamy1, T.Velmurugan2
1
Research Scholar, VELS University, Chennai, India.
2
Associate Professor, PG and Research Department of Computer Science,
D. G. Vaishnav College, Chennai, India.
E-Mail: [email protected], [email protected]

Abstract: Student‟s performance is an essential part in higher learning institutions. Predicting

student‟s performance becomes more challenging due to the large volume of data in educational
databases. Clustering is one of the method in data mining to analyze the massive volume of data. It
categorizes data into clusters such that objects are grouped in the same cluster when they are similar
according to specific metrics. This paper is designed to study and compare four clustering
algorithms. The algorithms used for the research is k-Means, k-Medoids, Fuzzy C Means (FCM)
and Expectation Maximization (EM). The main advantage of clustering is that interesting patterns
and structures can be found directly from very large data sets with little or none of the background
knowledge. The performance of the clustering algorithms is compared based on the factors: Purity,
Normalized mutual information(NMI) and time taken to form cluster.

Keywords: Educational Data Mining, k-Means Algorithm, k-Medoids Algorithm, Fuzzy C Means
Algorithm, Expectation Maximization Algorithm.

1. Introduction
Data mining is a process of extracting previously unknown, valid, potential useful and
hidden patterns from large data sets. As the amount of data stored in educational data bases is in
increasing rapidly. In order to get required benefits from such large data and to find hidden
relationships between variables using different data mining techniques developed and used.
Clustering is most widely used techniques in data mining. The aim of clustering is to partition
students in to homogeneous groups according to their characteristics and abilities [1].
Usually educational organizations used to collect huge amount of data which would be
relevant to faculty members, students, etc. But the importance of data that is collected is unknown.
The data that are used in generating simple queries or traditional reports may be in significant,
which will not contribute to the process of inference/decision making in the educational
organizations. The collected data may also contain such insignificant data. Also the volume and
complexity of the collected data may be very high such that it is not easy to handle. If that is the
case then the collected data may not be used and memory is occupied unnecessarily. The available
data can be made usable if and only if it is converted into useful information by exploiting
potentiality of the collected data. A wide range of data mining algorithms is used to extract useful
information from potential data gathered in various educational organizations.

309
International Journal of Pure and Applied Mathematics Special Issue

There are increasing research interests in education field using data mining. Application of
Data mining techniques concerns to develop the methods that discover knowledge from data and
used to uncover hidden information. The discovered knowledge can be used to better understand
students‟ behavior, to assist instructors, to improve teaching, to evaluate and improve e-learning
system, to improve student academic performance; to improve curriculums and many others
benefits [2].
This study investigates the educational domain of data mining. This paper performs a
comparative analysis of four clustering algorithms namely k-means algorithm, k-Mediods
algorithm, Fuzzy C Means algorithm and Expectation Maximization algorithm. The performance of
these clustering algorithms is compared in terms of purity, normalized mutual information and time
taken to form a cluster. The student data was collected from different private Arts and Science
colleges. The collected academic data was grouped according to their similar characteristics,
forming clusters.
The rest of the article is organized as follows. Section 2 discusses about various research
articles related to data mining techniques for predict clustering students‟ performance Section 3
explores the basic concepts of k-Means algorithm, k-Medoids algorithm, FCM algorithm and EM
algorithm in detail. The clustering results of each algorithm were examined in detail and compared
with each other to evaluate the performance of the algorithms in section 4. Finally, concludes the
research work.

2. Related Work
In educational data mining various research have been done in predicting students‟
performance using different data mining techniques such as clustering, classification, neural
networks, etc. Some of the methodologies from different research articles were discussed in this
section. Educational Data Mining (EDM) is the field of study concerned with mining educational
data to find out interesting patterns and knowledge in educational organizations. In [3], the study
explores multiple factors theoretically assumed to affect students‟ performance in higher education,
and finds a qualitative model which best classifies and predicts the students‟ performance based on
related personal and social factors. In [3] four decision tree algorithms was used on the collected
student‟s data, namely, C4.5 decision tree, ID3 decision tree, CART decision Tree, and CHAID.
Durairaj et al., [4] propose Educational Data mining for Prediction of Student Performance
Using Clustering Algorithms. They predicting the students‟ performance, used weka data mining
through clustering, which paved way to strategic management tool. In [5] Prashant et al., examined
the clustering analysis in data mining that analyzes the use of k-means algorithm in improving
students academic performance in higher education and presents k-means clustering algorithm as a
simple and efficient tool to monitor the progression of students performance.
Shiwani and Roopali [6], had proposed a work to evaluate the performance of students of
Digital Electronics of university institute of engineering and technology. The researcher had applied
unsupervised learning algorithms such as K-means and Hierarchical clustering using WEKA tool as
an open source tool. The paper [7] focuses on the study of data mining techniques applied to small
data sets concerning higher education institutions, concludes that the use of these techniques in real-

310
International Journal of Pure and Applied Mathematics Special Issue

life situations is useful and promising, and can provide administrators with precious tools for
decision. Clustering is used in[8] for analyzing data concerning the evaluation of courses taken by
students, linked to their results in the corresponding exams. The work presented in[9]reviews
different clustering algorithms applied to educational data mining context while [10] is an
interesting review of recent educational data mining development whose contents are in turn
analyzed by a data mining approach.
Sarala et al., discussed [11] the applications of data mining in educational institution to
extract useful information from the huge data sets and providing analytical tool to view and use this
information for decision making processes by taking real life examples. The paper in [12] focuses
on set up a clustering algorithm which is most suitable for predicting students performance in
educational data mining. The objective of this research work is to gain an insight into how
clustering analysis can be done in educational domain and to highlight the potential characteristics
of the clustering algorithms within the educational data set. In [13] a new model was used to predict
the student performance using a neural network. The model helps to accurately predict students at
risk of dropping and reduce dropout rates. This comparison between planned and actual
performance indicates that the model works in the estimation of student performance. A Research
work done by Veeramuthu et al. [14], had designed a model to present as a guideline for higher
educational system to improve their decision making processes. The authors aim to analyze how
different factor affect a student learning behavior and performance using K-means clustering
algorithm. A work done by Sivaram et al., [15] had surveyed the applicability of clustering and
classification algorithms for recruitment data mining techniques that fit the problems which are
determined. A study has been made by applying K-means, fuzzy C-means clustering and decision
tree classification algorithms to the recruitment data of an industry.

3. Clustering Algorithms
Clustering is process, grouping a set of physical or abstract objects into classes of similar
objects. A cluster is a collection of data objects that are similar to one another with in the same
cluster and are dissimilar to the objects in other clusters. Data clustering is alternatively referred to
an unsupervised learning and statistical data analysis. Cluster analysis is an important human
activity. Cluster analysis has been widely used in numerous applications including pattern
recognition, data analysis, image processing and market research. Clustering is a descriptive task
that seeks to identify homogenous group objects based on the values of their attributes. Clustering
has many requirements like scalability, dealing with different types of attributes, discovery of
clusters with arbitrary shape, minimal requirements for domain knowledge to determine input
parameters, ability to deal with noisy data, high dimensionality, interpretability and usability.
Clustering techniques can be broadly classified into many categories; partitioning, hierarchical,
density-based, grid-based, model-based algorithms.

3.1 The k-Means Algorithm

k-Means is one of the simplest unsupervised learning algorithms used for clustering. Given
D, a data set of n objects, and k, the number of clusters to form, a partitioning algorithm organizes

311
International Journal of Pure and Applied Mathematics Special Issue

the objects into k partitions (k ≤n), where each partition represents a cluster. The clusters are
formed to optimize an objective partitioning criterion, such as a dissimilarity function based on
distance. The algorithm is composed of the following steps:
Step 1:Place k points into the space represented by the objects that are being clustered. These point
are present initial group centroids.
Step 2:Assign each object to the group that has the closest centroid.
Step 3:When all objects have been assigned, recalculate the positions of the k centroids.
Step 4:Repeat steps 2 and 3 until the centroids no longer move.
This produces a separation of the objects into groups from which the metric to be minimized
can be calculated. The k-means simple clustering algorithm that has been improved to several
problem domains.

3.2 The k-Medoids Algorithm

The k-Medoids algorithm is related to the k-Means algorithm and the medoid shift
algorithm. Both the k-Means and k-Medoids algorithms are partition (breaking the dataset up into
groups). k-Means attempts to minimize the total squared error, while k-medoids minimizes the sum
of dissimilarities between points labeled to be in a cluster and a point designated as the center of
that cluster. In contrast to the k-Means algorithm, k-Medoids chooses data points as centers
(medoids or exemplars).k-Medoids is also a partitioning technique of clustering that clusters the
data set of n objects into k clusters with k known a priori [16].The algorithm is composed of the
following steps:
Step1: Using Euclidean distance as a dissimilarity measure, compute the distance between every
pair of all objects as follows
p
2
dij Xia Xja (1)
a 1

i=1,…,n; j=1,..,n
Step 2: Calculate Pij to make an initial guess at the centers of the clusters.
dij
Pij = n (2)
dij
i 1
i=1,…,n;j=1,…n
n
Step 3: Calculate Pij( j i...n) at each objects and sort them in ascending order. Select k objects
i 1
having the minimum value as initial group medoids.
Step 4: Assign each object to the nearest medoid.
Step 5: Calculate the current optimal value, the sum of distance from all objects to their medoids.
Step 6: Replace the current medoid in each cluster by the object which minimizes the total distance
to other objects in its cluster.
Step 7: Assign each object to the nearest new medoid.

312
International Journal of Pure and Applied Mathematics Special Issue

Step 8: Calculate new optimal value, the sum of distance from all objects to their new medoids. If
the optimal value is equal to the previous one, then stop the algorithm. Otherwise, go back
to the Step 6.

3.3 The FCM Algorithm

Fuzzy C-Means (FCM) is a method of clustering which allows one piece of data to belong
to two or more clusters. This method is frequently used in pattern recognition. FCM algorithm
works by assigning membership to each data point corresponding to each cluster center on the basis
of distance between the cluster center and the data point. More data is near to the cluster center and
its membership towards the particular cluster center. Clearly, summation of membership of each
data point should be equal to one [16]. After each iteration membership and cluster centers are
updated according to the formula:

1
ij
c 2
dij m 1
dik
k 1 (3)
n
m
ij xi
i 1
vj n
, j 1,2.....c (4)
m
ij
i 1

where,
'n' is the number of data points.
'vj' represents the jth cluster center.
'm' is the fuzziness index m € [1, ∞].
'c' represents the number of cluster center.
'µij' represents the membership of ith data to jth cluster center.
'dij' represents the Euclidean distance between ith data and jth cluster center.
Main objective of fuzzy c-means algorithm is to minimize:

n c
m
J U ,V || xi vj || 2 (5)
i 1 j 1

where,
'||xi – vj||' is the Euclidean distance between ith data and jth cluster center.

Steps for Fuzzy c-means clustering

Let X = {x1, x2, x3 ..., xn} be the set of data points and V = {v1, v2, v3 ..., vc} be the set of centers.
Step 1: Randomly select ‘c’ cluster centers.

313
International Journal of Pure and Applied Mathematics Special Issue

Step 2: Calculate the fuzzy membership 'µij' using:

1
ijj
c 2 (6)
dij m 1

k 1
dik
Step 3: Compute the fuzzy centers 'vj' using:
n
m
ij xi
i 1
vj n
, j 1,2.....c (7)
m
ij
i 1

Step 4: Repeat step 2 and 3 until the minimum 'J' value is achieved or ||U(k+1) - U(k)|| < β.
Where,
„k‟ is the iteration step.
„ ‟ is the termination criterion between [0,1]
„U = (µij)n*c‟ is the fuzzy membership matrix.
„J‟ is the objective function.

3.4 The EM Algorithm

The EM algorithm is an efficient iterative procedure to compute the Maximum Likelihood
(ML) estimate in the presence of missing or hidden data. In ML estimation, wish to estimate the
model parameter(s) for which the observed data are the most likely. The iteration of the EM
algorithm consists of two processes. They are E-step and M-step. In the expectation, or E-step, the
missing data are estimated given the observed data and current estimate of the model parameters.
This is achieved using the conditional expectation, explaining the choice of terminology. In the M-
step, the likelihood function is maximized under the assumption that the missing data are known.
The estimate of the missing data from the E-step is used in lieu of the actual missing data.
Convergence is assured since the algorithm is guaranteed to increase the likelihood at each iteration.
The algorithm is composed of following steps.

Step 1: Initialization
Step 2:E-Step: This step is responsible to estimate the probability of each element belong to each
cluster.
Step 3: M-Step: This step is responsible to estimate the parameters of the probability distribution of
each class for the next step.
Step 4: Convergence Test: After each iteration is performed a convergence test which verifies if
the difference of the attributes vector of iteration to the previous iteration is smaller than an
acceptable, given by parameter.

4. Experimental Results

314
International Journal of Pure and Applied Mathematics Special Issue

This section explains the performance evaluation of proposed approach. The soil nutrients is
implemented using Java (version 1.7), and the experiments are performed on a Intel(R) Pentium
machine with a speed 2.13 GHz and 2.0 GB RAM using Windows 7 32-bit Operating System.

4.1 Data Set Description

Various departments of student data is collected from private Arts and Science Colleges.
More than 1531 student‟s details are collected with their performance in Seminar and Assignments.
The data is mainly used for evaluating the performance of various clustering algorithms to predict
the academic performance of the students in their end of the semester exanimations. Table 1 shows
the data set attribute description.

Table 1: Description of Student Data set

Attribute Description
`S.N Student Serial Number
Name Name of the Student
Sex Gender Male/Female
Branch B.A., (Eng,& Tam), BBA, BCA, B.Com, B.Sc., (CS & Maths)
SSLC Mark 10th total marks
HSS Mark 12th total marks
Medium Studies in school- English/Tamil
Location Student Native City
FS Family Size
FT Family Type
FAI Family Annual Income
FQ Father Qualification
MQ Mother Qualification
LT Location Type (Village, Town)
PSG Previous Semester Grade (Average, Good, Excellent)
SemP Seminar Performance (for 3 years)
Att Student Attendance
ESG End Semester (Average, Good, Excellent)

Figure 1: Sample Data Set

315
International Journal of Pure and Applied Mathematics Special Issue

4.2 Metrics of Cluster Evaluation

A clustering algorithm is evaluated using (i) some internal evaluation measure like cohesion,
separation, or the silhouette-coefficient (addressing both, cohesion and separation), (ii) some
external evaluation measure like accuracy, precision, or recall with respect to some given class-
structure of the data. In some cases, where evaluation based on class labels does not seem viable,
(iii) careful (manual) inspection of clusters shows them to be a somehow meaningful collection of
apparently somehow related objects.
The proposed clustering algorithms are evaluated using Purity, Normalized mutual
information (NMI) and time taken to form cluster. Purity is a simple and transparent evaluation
measure. Normalized mutual information can be information-theoretically interpreted. To compute
purity, each cluster is assigned to the class which most frequent in the cluster and then accuracy of
this assignment is measured by counting the number of correctly assigned documents and dividing
by N.
1
purity( , C ) max wk  cj (8)
N k
s Where ={w1,w2,…,wk} is the set of clusters and C={c1,c2,…cj} is the set of classes.
Bad clustering have purity values close to 0, a perfect clustering has a purity of 1.
Normalized Mutual Information or NMI is computed as follows:
I ( ,C)
NMI ( , C ) (9)
H ( ) H (C ) 2
Where I is the mutual information

316
International Journal of Pure and Applied Mathematics Special Issue

P wk cj
I ( , C) P wk cj log (10)
K J p( wk ) p(cj )
wk cj N N wk cj
= log (11)
K J N wk cj
Where P(wk) , P(cj) and P(wk cj) are the probabilities of a document being in cluster wk class cj
and in the intersection of wkand cj.
H is entropy,
H P wk log P wk
k

wk wk
log
k N N (10)
NMI is always a number between 0 and 1.

Figure 2 The Results of K-Means Algorithm

4.3 Performance of Clustering Algorithm

The cluster quality is evaluated using the number of clusters, execution time, purity, and
NMI. Distribution of requirements data set among the clusters: The total number of clusters is
three (Average, Good, Excellent). Table 2 shows the total number of requirements that are
distributed when k Means, k-Medoids, FCM and EM algorithm are applied.

Figure 3 Clustering Algorithm Comparison

317
International Journal of Pure and Applied Mathematics Special Issue

Table 2 Distribution of requirements in clusters

Clustering Algorithm Average Good Excellent Total
k-Means 287 608 636 1531
k-Medoids 613 615 303 1531
FCM 709 126 696 1531
EM 231 684 616 1531

800

700

600

500 Average
400 Good
300 Excellent

200

100

0
k-Means k-Medoids FCM EM

Figure 4 Cluster Distribution Comparisons

Figure 1 shows the distribution of cluster comparison. The distribution shows that the data points in
cluster-1 uniformly distributed except k-Means algorithm.

5. Results and Discussion

318
International Journal of Pure and Applied Mathematics Special Issue

Table 3 Execution Time comparison

Clustering Algorithm Execution Time in ms
k-Means 128
k-Medoids 110
FCM 250
EM 560

Table 3 shows the execution time of clustering algorithms. The time consumption of FCM is less
compared to the EM. The lowest execution time is in K-Medoids. In figure 2, the x axis represents
the clustering algorithm and y-axis represent the time in milliseconds.
Execution Time in ms
600

500

400
Time in ms

300

200

100

0
K-Means K-Medoids FCM EM
Clustering Algorithm

Figure 5 Comparison of Execution time

Table 4 and Figure 3 show the comparison of purity and NMI values.

Table 4 Comparison of Purity and NMI Values

Algorithm Purity NMI
k-Means 0.375 0.264
k-Medoids 0.374 0.199
FCM 0.624 0.071
EM 0.664 0.032

319
International Journal of Pure and Applied Mathematics Special Issue

Clustering Comparison
0.7
0.6
0.5
Value

0.4
0.3 Purity
0.2 NMI
0.1
0
K-Means K-Medoids FCM EM
Clustering Algorithm

Figure 6 Purity and NMI Comparison for Clustering Algorithm

From the comparison the purity value of EM and FCM is more compare to the k-Means and k-
Medoids algorithms. The NMI value of EM and FCM is less compared to the k-Means and k-
Medoids algorithms. From the comparison the clustering algorithm FCM and EM is better
compared to k-Means and k-Medoids in terms of distribution purity, and NMI but thee algorithms
take more execution time.

6. Conclusion
The research work has put an effort to reveal that the clustering techniques serve as
powerful tool in educational data mining. Here various clustering algorithms are discussed and by
using these algorithms, student‟s performance is evaluated. In this research work, clustering
algorithms k-Means, k-Medoids, FCM and EM were examined and compared based on the
performance of the algorithms using student data set. The taken parameters of students data set are
evaluated and the results are analysed. The parameters purity, NMI and etc are analysed in this
work.The clustering algorithms are evaluated using execution time, purity and NMI. The result
shows that FCM and EM algorithm performs well compared with other two clustering algorithms.

Reference

[1] Sreenivasarao, Vuda, and Capt Genetu Yohannes. "Improving academic performance of
students of defence university based on data warehousing and data mining" Global Journal of
Computer Science and Technology, 2012, Vol. 12(2), pp 29-36.

320
International Journal of Pure and Applied Mathematics Special Issue

[2] Romero, Cristobal, and Sebastian Ventura. "Educational data mining: A survey from 1995 to
2005.”, Expert systems with applications, 2007, Vol. 33, pp. 135-146.
[3] Saa, Amjad Abu. "Educational Data Mining & Students‟ Performance Prediction.",
International Journal of Advanced Computer Science and Applications, 2016, Vol. 7(5), pp.
212-220.
[4] Durairaj, M., and C. Vijitha., "Educational Data mining for Prediction of Student Performance
Using Clustering Algorithms." , International Journal of Computer Science and Information
Technologies , 2014, Vol. 5(4), pp. 5987-5991.
[5] Saxena, Prashant Sahai, and M. C. Govil., "Prediction of Student‟s Academic Performance
using Clustering.", Special Conference Issue: National Conference on Cloud Computing &
Big Data., 2014,
[6] Rana, Shiwani, and Roopali Garg., "Evaluation of student‟s performance of an institute using
clustering algorithms.", International Journal of Applied Engineering Research, 2016,
Vol.11(5), pp. 3605-3609.
[7] Natek, Srečko, and Moti Zwilling., "Student data mining solution–knowledge management
system related to higher education institutions.", Expert systems with applications,
2014, Vol. 41(14) pp. 6400-6407.
[8] Campagni, Renza, Donatella Merlini, and M. Cecilia Verri. "Finding Regularities in Courses
Evaluation with K-means Clustering.", CSEDU - 6th International Conference on Computer
Supported Education, 2014, Vol. 2, pp. 26-33.
[9] A. Dutt, S. Aghabozrgi, M.A.B. Ismail, H. Mahroeian, "Clustering algorithms applied in
educational data mining", International Journal of Information and Electronics Engineering,
2015, Vol. 5, pp. 280-291.
[10] Peña-Ayala, Alejandro. "Educational data mining: A survey and a data mining-based analysis
of recent works." Expert systems with applications, 2014, Vol.41 (4), pp.1432-1462.
[11] Sarala, V., and J. Krishnaiah. "Empirical Study Of Data Mining Techniques In Education
System.", International Journal of Advances in Computer Science and Technology (IJACST),
2015, pp. 15-21.
[12] C.Anuradha, T.Velmurugan, R. Anandavally, "Clustering algorithms in educational data
mining: a review ", International Journal of Power Control and Computation(IJPCSC) Vol 7.
No.1 – 2015 pp.47-52
[13] Shirodkar, Jateen Shet, and Viren Pereira., "Determining Students Performance Using the
Tool of Artificial Neural Network.", International Journal of Innovative Research and
Development, 2016, Vol. 5 No. 2, pp. 314-318.
[14] Veeramuthu, P., Dr R. Periyasamy, and V. Sugasini., "Analysis of Student Result Using
Clustering Techniques." IJCSIT), International Journal of Computer Science and Information
Technologies, 2014, Vol. 5, No. 4, pp. 5092-5094.
[15] Sivaram, N., and K. Ramar., "Applicability of clustering and classification algorithms for
recruitment data mining." , International Journal of Computer Applications, 2010, Vol. 4,
No. 5, pp. 23-28.

321
International Journal of Pure and Applied Mathematics Special Issue

[16] Velmurugan. T and T. Santhanam, “Computational Complexity between K-means and K-

medoids clustering algorithms for normal and uniform distributions of data points”, Journal of
Computer Science, Vol. 6, Issue 3, 2010, pp.363-368.
www.thescipub.org/fulltext/jcs/jcs63363-368.pdf

322
323
324

Analyzing Undergraduate Students' Performance Using Educational Data Mining
No ratings yet
Analyzing Undergraduate Students' Performance Using Educational Data Mining
18 pages
Data Mining MCQ
78% (147)
Data Mining MCQ
34 pages
Analysis of Student Academic Performance Using Clustering Techniques
No ratings yet
Analysis of Student Academic Performance Using Clustering Techniques
21 pages
A Survey On Educational Data Mining Techniques in Predicting Student's Academic Performance
No ratings yet
A Survey On Educational Data Mining Techniques in Predicting Student's Academic Performance
3 pages
Ejsr 43 1 03
No ratings yet
Ejsr 43 1 03
6 pages
Educational Data Mining Techniques Approach To Predict Student's Performance
No ratings yet
Educational Data Mining Techniques Approach To Predict Student's Performance
4 pages
Final Survey Paper 17-9-13
No ratings yet
Final Survey Paper 17-9-13
5 pages
Ukwuoma 2019
No ratings yet
Ukwuoma 2019
5 pages
Extending The Student's Performance Via K Means and Blended Learning
No ratings yet
Extending The Student's Performance Via K Means and Blended Learning
4 pages
Predicting Student Academic Success DDA
No ratings yet
Predicting Student Academic Success DDA
26 pages
Data Mining Review1
No ratings yet
Data Mining Review1
5 pages
Student Performance Evaluation in Educat
No ratings yet
Student Performance Evaluation in Educat
3 pages
Paper 31-Educational Data Mining Students Performance Prediction
No ratings yet
Paper 31-Educational Data Mining Students Performance Prediction
9 pages
Preprocessing and Analyzing Educational Data Set Using X-API For Improving Student's Performance
No ratings yet
Preprocessing and Analyzing Educational Data Set Using X-API For Improving Student's Performance
5 pages
Badr 2016
No ratings yet
Badr 2016
10 pages
Charitopoulos 2017
No ratings yet
Charitopoulos 2017
9 pages
(Cybernetics and Information Technologies) Predicting Student Performance by Using Data Mining Methods For Classification
No ratings yet
(Cybernetics and Information Technologies) Predicting Student Performance by Using Data Mining Methods For Classification
12 pages
A Survey On Educational Data Mining Techniques
No ratings yet
A Survey On Educational Data Mining Techniques
5 pages
Edu Data Mining
100% (1)
Edu Data Mining
6 pages
Data Mining Applications: A Comparative Study For Predicting Student's Performance
No ratings yet
Data Mining Applications: A Comparative Study For Predicting Student's Performance
7 pages
Tegegne 2018
No ratings yet
Tegegne 2018
15 pages
Handling Missing Value in Decision Tree Algorithm PDF
No ratings yet
Handling Missing Value in Decision Tree Algorithm PDF
6 pages
Evaluating Students Performance Using K Means Clustering IJERTV6IS050070
No ratings yet
Evaluating Students Performance Using K Means Clustering IJERTV6IS050070
3 pages
BIA Assignment
No ratings yet
BIA Assignment
7 pages
Case Study 3
No ratings yet
Case Study 3
3 pages
Yash 21BSDS12 Perdictive Analysis Report
No ratings yet
Yash 21BSDS12 Perdictive Analysis Report
20 pages
10.1007@978 981 13 6861 548
No ratings yet
10.1007@978 981 13 6861 548
15 pages
Prediction of Student Academic Performance by An Application of K-Means Clustering Algorithm
No ratings yet
Prediction of Student Academic Performance by An Application of K-Means Clustering Algorithm
3 pages
Student Performance Prediction Using Machine Learn
No ratings yet
Student Performance Prediction Using Machine Learn
8 pages
Chapter 04
No ratings yet
Chapter 04
6 pages
Dake 2019 Ijca 919320
No ratings yet
Dake 2019 Ijca 919320
6 pages
Student Performance Analysis Using Educa
No ratings yet
Student Performance Analysis Using Educa
8 pages
The Journal of Engineering - 2019 - Li - Educational Data Mining For Students Performance Based On Fuzzy C Means
No ratings yet
The Journal of Engineering - 2019 - Li - Educational Data Mining For Students Performance Based On Fuzzy C Means
6 pages
Regression Analysis of Student Academic Performance Using Deep Learning
No ratings yet
Regression Analysis of Student Academic Performance Using Deep Learning
16 pages
Student Performance Prediction by Using Data Mining Classification Algorithms
No ratings yet
Student Performance Prediction by Using Data Mining Classification Algorithms
6 pages
ICSMB2016-C Anuradha
No ratings yet
ICSMB2016-C Anuradha
7 pages
Student Cluster Analysis Based On Moodle Data and Academic Performance Indicators
No ratings yet
Student Cluster Analysis Based On Moodle Data and Academic Performance Indicators
4 pages
Educational Data Mining: A Review and Analysis of Student's Academic Performance
No ratings yet
Educational Data Mining: A Review and Analysis of Student's Academic Performance
15 pages
1 s2.0 S1877050915019018 Main
No ratings yet
1 s2.0 S1877050915019018 Main
9 pages
Studentperformancepredictionbyusingdataminingclassificationalgorithms_IJCSMR_2012
No ratings yet
Studentperformancepredictionbyusingdataminingclassificationalgorithms_IJCSMR_2012
5 pages
CHAPTER TWO
No ratings yet
CHAPTER TWO
7 pages
Hari Ganesh 2015
No ratings yet
Hari Ganesh 2015
6 pages
Pattern
No ratings yet
Pattern
14 pages
Mining Students Data To Analyze Learning Behavior: A Case Study
No ratings yet
Mining Students Data To Analyze Learning Behavior: A Case Study
4 pages
Role Of Data Mining in Education for Improving Students Performance for Social Change
No ratings yet
Role Of Data Mining in Education for Improving Students Performance for Social Change
2 pages
V3i12 0295
No ratings yet
V3i12 0295
9 pages
Sashin - 2012 - A Survey and Future Vision of Data Mining in Educational Field
No ratings yet
Sashin - 2012 - A Survey and Future Vision of Data Mining in Educational Field
5 pages
A Systematic Review On Educational Data Mining
No ratings yet
A Systematic Review On Educational Data Mining
15 pages
E-Learning Using Data Mining: Shimaa Abd Elkader Abd Elaal
No ratings yet
E-Learning Using Data Mining: Shimaa Abd Elkader Abd Elaal
17 pages
(fa) fianl research paper Data mining..
No ratings yet
(fa) fianl research paper Data mining..
59 pages
CID 0548 Synopsis
No ratings yet
CID 0548 Synopsis
1 page
1.Student Performance Prediction techniques
No ratings yet
1.Student Performance Prediction techniques
5 pages
Educational Data Mining: Student Performance Prediction in Academic
No ratings yet
Educational Data Mining: Student Performance Prediction in Academic
7 pages
Paper Dinesh Clustering Techniques
No ratings yet
Paper Dinesh Clustering Techniques
5 pages
Top 10 Data Mining Papers
No ratings yet
Top 10 Data Mining Papers
126 pages
PM Web 18058
No ratings yet
PM Web 18058
18 pages
Data Mining Approach To Predict Academic Performance of Students
No ratings yet
Data Mining Approach To Predict Academic Performance of Students
11 pages
Review On Prediction Algorithms in Educational Data Mining: A.Dinesh Kumar, R.Pandi Selvam, K.Sathesh Kumar
No ratings yet
Review On Prediction Algorithms in Educational Data Mining: A.Dinesh Kumar, R.Pandi Selvam, K.Sathesh Kumar
8 pages
AbuSaa2019 Article FactorsAffectingStudentsPerfor
No ratings yet
AbuSaa2019 Article FactorsAffectingStudentsPerfor
32 pages
ICT Project Management: Framework for ICT-based Pedagogy System: Development, Operation, and Management
From Everand
ICT Project Management: Framework for ICT-based Pedagogy System: Development, Operation, and Management
Suman Ahmmed
No ratings yet
AI and ML Applications for Decision-Making in Education Sector
From Everand
AI and ML Applications for Decision-Making in Education Sector
Zemelak Goraga
No ratings yet
Integration of Data Mining Clustering Approach in The Personalized E-Learning System
No ratings yet
Integration of Data Mining Clustering Approach in The Personalized E-Learning System
11 pages
Coloured Petri Nets: Chapter 1: Modelling and Validation
No ratings yet
Coloured Petri Nets: Chapter 1: Modelling and Validation
47 pages
Test of Discrete Event Systems - 12.11.2013
No ratings yet
Test of Discrete Event Systems - 12.11.2013
9 pages
Test of Discrete Event Systems - 22.10.2013: These Values Are Not Realistic
No ratings yet
Test of Discrete Event Systems - 22.10.2013: These Values Are Not Realistic
4 pages
Future Generation Computer Systems: Pierre Matri María S. Pérez Alexandru Costan Luc Bougé Gabriel Antoniu
No ratings yet
Future Generation Computer Systems: Pierre Matri María S. Pérez Alexandru Costan Luc Bougé Gabriel Antoniu
13 pages
Design and Evaluation of A DIY Construction System For Educational Robot Kits
No ratings yet
Design and Evaluation of A DIY Construction System For Educational Robot Kits
20 pages
The Art and Science of C
No ratings yet
The Art and Science of C
596 pages
4.3 K-Medoids
No ratings yet
4.3 K-Medoids
31 pages
ML - UNIT 5 - Material - SVCK - CSE
No ratings yet
ML - UNIT 5 - Material - SVCK - CSE
22 pages
Data Mining Modul 3 Notes
No ratings yet
Data Mining Modul 3 Notes
3 pages
3205-Article Text-23308-1-10-20240703
No ratings yet
3205-Article Text-23308-1-10-20240703
7 pages
DWM UNIT-VI (2)
No ratings yet
DWM UNIT-VI (2)
30 pages
A Study On Weather Forecast Using Data Streams
No ratings yet
A Study On Weather Forecast Using Data Streams
11 pages
ML Exp 10
No ratings yet
ML Exp 10
5 pages
Comparative Study On KMeans and PAM Algorithm
No ratings yet
Comparative Study On KMeans and PAM Algorithm
5 pages
Data Mining: Concepts and Techniques: Cluster Analysis
No ratings yet
Data Mining: Concepts and Techniques: Cluster Analysis
97 pages
Stock Price Prediction Using K-Medoids Clustering With Indexing Dynamic Time Warping
No ratings yet
Stock Price Prediction Using K-Medoids Clustering With Indexing Dynamic Time Warping
7 pages
Data Mining - Clustering
No ratings yet
Data Mining - Clustering
90 pages
13 Clustering Techniques
No ratings yet
13 Clustering Techniques
47 pages
10clustering - Han and Kamber
No ratings yet
10clustering - Han and Kamber
93 pages
What Is Cluster Analysis?: - Cluster: A Collection of Data Objects
No ratings yet
What Is Cluster Analysis?: - Cluster: A Collection of Data Objects
9 pages
CSD6011 - Machine Learning For Cyber Security
No ratings yet
CSD6011 - Machine Learning For Cyber Security
3 pages
Data Mining Mid Syllabus
No ratings yet
Data Mining Mid Syllabus
162 pages
DM GTU Study Material Presentations Unit-5 21052021124400PM
No ratings yet
DM GTU Study Material Presentations Unit-5 21052021124400PM
63 pages
ML Unsupervised Notes
No ratings yet
ML Unsupervised Notes
26 pages
02 Data Mining-Partitioning Method
No ratings yet
02 Data Mining-Partitioning Method
8 pages
Chapter 7. Cluster Analysis
No ratings yet
Chapter 7. Cluster Analysis
120 pages
Chapter 3: Cluster Analysis: 3.1 Basic Concepts of Clustering
No ratings yet
Chapter 3: Cluster Analysis: 3.1 Basic Concepts of Clustering
33 pages
Ecography - 2020 - Testolin - Global Distribution and Bioclimatic Characterization of Alpine Biomes
No ratings yet
Ecography - 2020 - Testolin - Global Distribution and Bioclimatic Characterization of Alpine Biomes
10 pages
Ml Unit 5 Material Svck Cse
No ratings yet
Ml Unit 5 Material Svck Cse
22 pages
Chapter 5 Clustering
No ratings yet
Chapter 5 Clustering
40 pages
Big Data
No ratings yet
Big Data
7 pages
R For Data Science Sample Chapter
100% (1)
R For Data Science Sample Chapter
39 pages
L18 K Means
No ratings yet
L18 K Means
27 pages
Jalali@mshdiua - Ac.ir Jalali - Mshdiau.ac - Ir: Data Mining
No ratings yet
Jalali@mshdiua - Ac.ir Jalali - Mshdiau.ac - Ir: Data Mining
53 pages
2022 IJIE Template and Article Guide For Author V.17.08.10.22
No ratings yet
2022 IJIE Template and Article Guide For Author V.17.08.10.22
12 pages

Prediction Clustering

Uploaded by

Prediction Clustering

Uploaded by

International Journal of Pure and Applied Mathematics

Volume 119 No. 15 2018, 309-323

ANALYSIS OF STUDENT ACADEMIC PERFORMANCE USING

Abstract: Student‟s performance is an essential part in higher learning institutions. Predicting

3.1 The k-Means Algorithm

3.2 The k-Medoids Algorithm

3.3 The FCM Algorithm

Steps for Fuzzy c-means clustering

Step 2: Calculate the fuzzy membership 'µij' using:

3.4 The EM Algorithm

4.1 Data Set Description

Table 1: Description of Student Data set

Figure 1: Sample Data Set

4.2 Metrics of Cluster Evaluation

Figure 2 The Results of K-Means Algorithm

4.3 Performance of Clustering Algorithm

Figure 3 Clustering Algorithm Comparison

Table 2 Distribution of requirements in clusters

Figure 4 Cluster Distribution Comparisons

5. Results and Discussion

Table 3 Execution Time comparison

Figure 5 Comparison of Execution time

Table 4 Comparison of Purity and NMI Values

Figure 6 Purity and NMI Comparison for Clustering Algorithm

[16] Velmurugan. T and T. Santhanam, “Computational Complexity between K-means and K-

You might also like