0% found this document useful (0 votes)

72 views7 pages

BIA Assignment

This document discusses the application of data mining in education. It begins by defining educational data mining as applying data mining techniques to data from educational settings to extract meaningful insights about learning. Some key data mining tasks in education include classification to profile students, predictive modeling to predict outcomes like course passing or graduation, and clustering to group similar students or courses. The document then provides examples of how these techniques can be applied, such as predicting student performance, analyzing enrollment data, and recommending improvements. Finally, it lists several primary applications of educational data mining like analyzing and visualizing data, providing feedback to instructors, making recommendations for students, and detecting undesirable student behaviors.

Uploaded by

Aman kr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views7 pages

BIA Assignment

Uploaded by

Aman kr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

BIA ASSIGNMENT

BATCH: 2018-2020

SUBMITTED BY: SUBMITTED TO:

AMAN KUMAR PROF. SUBHRANSHU
MOHANTY

ARMY INSTITUTE OF MANAGEMENT & TECHNOLOGY,

GREATER NOIDA (UP) – 201306
APPLICATION OF DATA MINING IN EDUCATION
INTRODUCTION
Educational Data Mining (EDM) describes a research field concerned with the application of
data mining, machine learning and statistics to information generated from educational
settings (e.g., universities and intelligent tutoring systems). EDM refers to techniques and
tools designed for automatically extracting meaning from large repositories of data generated
by peoples learning activities in educational settings At a high level, the field seeks to
develop and improve methods for exploring this data, which often has multiple levels of
meaningful hierarchy, in order to discover new insights about how people learn in the context
of such settings In doing so, EDM has contributed to theories of learning investigated by
researchers in educational psychology and the learning sciences. The field is closely tied to
that of learning analytics, and the two have been compared and contrasted. Quite often, this
data is extensive, fine-grained, and precise. The main objective of applying Data Mining to
educational data is to analyse educational Data contents, models, to summarize/analyse the
learner’s discussions, etc. Education Data Mining concentrates on the computing process
models which focus on Education context. In educational system, a student’s performance is
determined by the term work, attendance and end semester examination. The term work is
carried out by the teacher based upon student's performance in educational activities such as
class test, assignments, attendance. The end semester examination is one that is scored by the
student in semester examination. Student must get minimum marks to pass a semester in
internal as well as end semester examination.
Calders & Pechenizkiy (2011) associate basic EDM tasks to traditional data mining
problems, i.e.:
Classic DM Educational Example
Problems
Classification Categorizing and profiling students, determining their learning styles
and preferences.
Predictive Inducing models that can predict whether (and when) a student will
Modelling pass a course or not or will eventually graduate or drop out.
Clustering Grouping Similar students (based on behaviour, performance, etc) or
grouping similar courses, assignments etc together, exploring
collaborative learning patterns.
Bi-Clustering Finding which questions (tasks, courses etc) are difficult/easy for
which students.
Frequent Pattern Finding (elective) courses often taken together or popular paths in
Mining study programs or actions in LMS
Emerging Pattern Finding patterns that capture significant differences in behaviour of
Mining students who graduated vs. those students who did not or that explain
the changes in behaviour of student generations over different years.
Collaborative Recommending suitable learning objects, based on the analysis of the
filtering and performance of other learners, recommending remedial classes for the
recommendations students.
Visual Analytics Facilitating reasoning about the educational processes or learning
results via interactive data/model visualization, e.g. Visualizing
collaborations of students.
APPLICATIONS OF DATA MINING IN HIGHER EDUCATION
List of the primary applications of EDM is provided by Cristobal Romero and Sebastian
Ventura. In their taxonomy, the areas of EDM application are:

 Analysis and visualization of data

 Providing feedback for supporting instructors
 Recommendations for students
 Predicting student performance
 Student modelling
 Detecting undesirable student behaviours
 Grouping students
 Social network analysis
 Developing concept maps
 Constructing courseware
 Planning and scheduling
There are many application areas of data mining like customer analytics, Agriculture,
banking, Security Applications, Educational data mining, Mass surveillance, Privacy
preserving etc. The main concerned area is about data mining applications in educational
systems. Educational Data Mining (EDM) is an emerging discipline, concerned with
developing methods for exploring the unique types of data that come from educational
settings, and using those methods to better understand students, and the settings which they
learn in. A key area of EDM is mining student’s performance. Another key area is mining
enrolment data. Key uses of EDM include predicting student performance and studying
learning in order to recommend improvements to current educational practice. EDM can be
considered one of the learning sciences, as well as an area of data mining. The main
applications of EDM are listed as follows:

A. Analysis and Visualization of Data

It is used to highlight useful information and support decision making. In the
educational environment, for example, it can help educators and course administrators
to analyse the students’ course activities and usage information to get a general view
of a student’s learning. Statistics and visualization information are the two main
techniques that have been most widely used for this task. Statistics is a mathematical
science concerning the collection, analysis, interpretation or explanation, and
presentation of data. It is relatively easy to get basic descriptive statistics from
statistical software, such as SPSS. Statistical analysis of educational data (logs
files/databases) can tell us things such as where students enter and exit, the most
popular pages students browse, number of downloads of e-learning resources, number
of different pages browsed and total time for browsing different pages. It also
provides knowledge about usage summaries and reports on weekly and monthly user
trends, amount of material students might go through and the order in which students
study topics, patterns of studying activity, timing and sequencing of events, and the
content analysis of students notes and summaries. Statistical analysis is also very
useful to obtain reports assessing how many minutes student worked, number of
problems here solved and his correct percentage along with our prediction about his
score and performance level. Visualization uses graphic techniques to help people to
understand and analyse data. There are several studies oriented toward visualizing
different educational data such as patterns of annual, seasonal, daily and hourly user
behaviour on online forums. Some of such investigations are statistical graphs to
analyse assignments complement, questions admitted, exam score, student tracking
data to analyse student’s attendance, results on assignments and quizzes, weekly
information regarding students and group’s activities.

B. Predicting Student Performance

In this case, we estimate the unknown value of a variable that describes the student. In
education, the values normally predicted are student’s performance, their knowledge,
score, or marks. This value can be numerical/continuous (regression task) or
categorical/discrete (classification task). Regression analysis is used to find relation
between a dependent variable and one or more independent variables. Classification is
used to group individual items based upon quantitative characteristics inherent in the
items or on training set of previously labelled items. Prediction of a student’s
performance is the most popular applications of DM in education. Different
techniques and models are applied like neural networks, Bayesian networks, rule-
based systems, regression, and correlation analysis to analyse educational data. This
analysis helps us to predict student’s performance i.e. to predict about his success in a
course and to predict about his final grade based on features extracted from logged
data. Different types of rule-based systems have been applied to predict student’s
performance (mark prediction) in an eLearning environment (using fuzzy-association
rules).Several regression techniques are used to predict student’s marks like linear
regression for predicting student’s academic performance, stepwise linear regression
for predicting time to be spent on a learning page, multiple linear regression for
identifying variables that could predict success in colleges courses and for predicting
exam results in distance education courses.

C. Grouping Students
In this case groups of students are created according to their customized features,
personal characteristics, etc. These clusters/groups of students can be used by the
instructor/developer to build a personalized learning system which can promote
effective group learning. The DM techniques used in this task are classification and
clustering. Different clustering algorithms that are used to group students are
hierarchical agglomerative clustering, K-means and model-based clustering. A
clustering algorithm is based on large generalized sequences which help to find
groups of students with similar learning characteristics like hierarchical clustering
algorithm which are used in intelligent e-learning systems to group students according
to their individual learning style preferences.
D. Enrolment Management
This term is frequently used in higher education to describe well-planned strategies
and tactics to shape the enrolment of an institution and meet established goals.
Enrolment management is an organizational concept and a systematic set of activities
designed to enable educational institutions to exert more influence over their student
enrolments. Such practices often include marketing, admission policies, retention
programs, and financial aid awarding. Strategies and tactics are informed by
collection, analysis, and use of data to project successful outcomes. Activities that
produce measurable improvements in yields are continued and/or expanded, while
those activities that do not are discontinued or restructured. Competitive efforts to
recruit students are a common emphasis of enrolment managers. The numbers of
universities and colleges instituting offices of "enrolment management" have
increased in recent years. These offices serve to provide direction and coordination of
efforts of multiple offices such as admissions, financial aid, registration, and other
student services. Often these offices are part of an enrolment management division.
Some of the typical aims of enrolment management include
 Improving yields at inquiry, application, and enrolment stages.
 Increasing net revenue, usually by improving the proportion of entering
students capable of paying most or all unsubsidized tuition.
 Increasing demographic diversity.
 Improving retention rates.
 Increasing applicant pools.

TECHNIQUES OF EDUCATIONAL DATA MINING

A. Clustering Algorithm
Clustering is a division of data into groups of similar objects. Clustering plays an
outstanding role in data mining applications such as information retrieval and text
mining, scientific data exploration, web analysis, spatial database applications,
medical diagnostics, marketing and many more.
Data Clustering is unsupervised and statistical data analysis technique. It is used to
classify the same data into a homogeneous group of primary school students it is used
to operate on a large dataset to discover hidden pattern and relationship helps to make
decision quickly and efficiently. Cluster analysis is used to break down a large set of
data into subsets called clusters. Each cluster is a collection of data objects that are
like one another. They are placed within the same cluster but are dissimilar to objects
in other clusters. Following algorithms are used in education mining in Clustering.

B. K-Mean Clustering Algorithm

The K-means is one of the best clustering algorithms in data mining. K-Means is a
non-hierarchical clustering method that seeks to partition the data into the form of one
or more clusters .This method partitions the data into clusters so that the data having
the same characteristics are grouped into one cluster and the data that have different
characteristics grouped into another cluster.
K-Means Clustering (KMC) proposes to partition n objects into k clusters in which
each object belongs to the cluster with the nearest mean. Exactly k different clusters
have been produced by this method with greatest possible characteristic. Initially best
number of clusters k leading to the greatest separation (distance) is not known and
must be computed from the data. K-Means clustering’s objective is to minimize the
squared error function or total intracluster variance.
Let X = {x1,x2,x3,……..,xn} be the set of data points and V = {v1,v2,…….,vc} be
the set of centres.
1) Randomly select ‘c’ cluster centres.
2) Calculate the distance between each data point and cluster centres.
3) Assign the data point to the cluster centre whose distance from the cluster centre is
minimum of all the cluster centres.
4) Recalculate the new cluster centre using:

Where, ‘ci’ represents the number of data points in ith cluster.

5) Recalculate the distance between each data point and new obtained cluster centres.
6) If no data point was reassigned then stop, otherwise repeat from step 3).

C. Classification
Classification is the form of data analysis that extracts models describing important
data classes. This approach frequently uses decision tree classification algorithms.
The data classification process includes learning and classification. In learning
method, the training data sets are analysed by the classification algorithm. In
classification test data sets are used to find the accuracy of the classification rules. If
the accuracy is acceptable the rules can be applied to the new data tuples.

D. ID3 Algorithm
Terminologies used in ID3 Algorithm:
 Establish Classification Attribute.
 Compute Classification Entropy.
 Calculate Information Gain using classification attribute.
 Select Attribute with the highest gain to be the next Node in the tree (starting
from the Root node).
 Remove Node Attribute, creating reduced table.

E. Dimensionality reduction techniques

The dimensionality reduction is the utmost vital method to eliminate redundant
attributes and noise which can be further classified into feature extraction and
assortment method. The student clusters are generated from the well-known
COBWEB algorithm. The student clusters are formed based on the credits obtained
from the given semester. As stated before, the grouping utility is defined as the
weighted distance utility of the attribute (i.e. credits).To mine the students’
performance data, the data mining classification techniques such as – Decision tree
Random Tree and J48 classification models were built with 10 cross validation fold
using WEKA.

Literature Review On Educational Data Mining
100% (2)
Literature Review On Educational Data Mining
5 pages
Peter Lorange - Innovations in Shipping (2020) PDF
100% (1)
Peter Lorange - Innovations in Shipping (2020) PDF
434 pages
Edu Data Mining
100% (1)
Edu Data Mining
6 pages
Appearance Release: Complete Only For Hazardous Activity
No ratings yet
Appearance Release: Complete Only For Hazardous Activity
1 page
The Recent State of Educational Data Mining: A Survey and Future Visions
No ratings yet
The Recent State of Educational Data Mining: A Survey and Future Visions
6 pages
Extending The Student's Performance Via K Means and Blended Learning
No ratings yet
Extending The Student's Performance Via K Means and Blended Learning
4 pages
A Survey On Research Work in Educational Data Mining
No ratings yet
A Survey On Research Work in Educational Data Mining
7 pages
"Educational Data Mining A Review of Satate of Art
No ratings yet
"Educational Data Mining A Review of Satate of Art
18 pages
Paper 31-Educational Data Mining Students Performance Prediction
No ratings yet
Paper 31-Educational Data Mining Students Performance Prediction
9 pages
A Survey On Educational Data Mining Techniques
No ratings yet
A Survey On Educational Data Mining Techniques
5 pages
Comparison of Applications For Educational Data Mining in Engineering Education Buenaño-Fernandez Diego
No ratings yet
Comparison of Applications For Educational Data Mining in Engineering Education Buenaño-Fernandez Diego
5 pages
Hexcel HBS Analysis
50% (2)
Hexcel HBS Analysis
3 pages
Techniques For Examining Student Data For Indicators of Future Success - A Survey and Analysis
No ratings yet
Techniques For Examining Student Data For Indicators of Future Success - A Survey and Analysis
8 pages
Educational Data Mining: A State-Of-The-Art Survey On Tools and Techniques Used in EDM
No ratings yet
Educational Data Mining: A State-Of-The-Art Survey On Tools and Techniques Used in EDM
7 pages
Educational Data Mining and Its Role in Determining Factors Affecting Students Academic Performance A Systematic Review
No ratings yet
Educational Data Mining and Its Role in Determining Factors Affecting Students Academic Performance A Systematic Review
7 pages
Role of Data Mining in Education For Improving Students Performance For Social Change
No ratings yet
Role of Data Mining in Education For Improving Students Performance For Social Change
2 pages
Daud 2017
No ratings yet
Daud 2017
7 pages
Regression Analysis of Student Academic Performance Using Deep Learning
No ratings yet
Regression Analysis of Student Academic Performance Using Deep Learning
16 pages
Abstract Educational Data Mining
No ratings yet
Abstract Educational Data Mining
2 pages
Chapter One 1.1 Background of The Study
No ratings yet
Chapter One 1.1 Background of The Study
220 pages
1 s2.0 S1877050915019018 Main
No ratings yet
1 s2.0 S1877050915019018 Main
9 pages
A Survey On Educational Data Mining in Field of Education: Dr. P. Nithya, B. Umamaheswari, A. Umadevi
No ratings yet
A Survey On Educational Data Mining in Field of Education: Dr. P. Nithya, B. Umamaheswari, A. Umadevi
10 pages
1.student Performance Prediction Techniques
No ratings yet
1.student Performance Prediction Techniques
5 pages
Predicting Academic Outcomes - A Survey From 2007 Till 2018
No ratings yet
Predicting Academic Outcomes - A Survey From 2007 Till 2018
33 pages
A Survey On Educational Data Mining and
No ratings yet
A Survey On Educational Data Mining and
21 pages
Yash 21BSDS12 Perdictive Analysis Report
No ratings yet
Yash 21BSDS12 Perdictive Analysis Report
20 pages
Feature Selection Algorithms For Predicting Students Academic Performance Using Data Mining Techniques
No ratings yet
Feature Selection Algorithms For Predicting Students Academic Performance Using Data Mining Techniques
5 pages
Salah Hashim 2020 IOP Conf. Ser. Mater. Sci. Eng. 928 032019
No ratings yet
Salah Hashim 2020 IOP Conf. Ser. Mater. Sci. Eng. 928 032019
19 pages
ICSMB2016-C Anuradha
No ratings yet
ICSMB2016-C Anuradha
7 pages
AbuSaa2019 Article FactorsAffectingStudentsPerfor
No ratings yet
AbuSaa2019 Article FactorsAffectingStudentsPerfor
32 pages
Badr 2016
No ratings yet
Badr 2016
10 pages
Chapter Two
No ratings yet
Chapter Two
7 pages
Student Performance Prediction by Using Data Mining Classification Algorithms
No ratings yet
Student Performance Prediction by Using Data Mining Classification Algorithms
6 pages
2950-Article Text-5557-1-10-20210418
No ratings yet
2950-Article Text-5557-1-10-20210418
6 pages
Factors Affecting Students Performance I
No ratings yet
Factors Affecting Students Performance I
32 pages
Educational Data Mining - A Survey and A Data Mining-Based Analysis of Recent Works
No ratings yet
Educational Data Mining - A Survey and A Data Mining-Based Analysis of Recent Works
31 pages
Educational Data Mining
No ratings yet
Educational Data Mining
2 pages
Pad Project Research Paper
No ratings yet
Pad Project Research Paper
15 pages
Educational Data Mining: A Review and Analysis of Student's Academic Performance
No ratings yet
Educational Data Mining: A Review and Analysis of Student's Academic Performance
15 pages
Development of Student's Academic Performance Prediction Model
No ratings yet
Development of Student's Academic Performance Prediction Model
16 pages
Student Performance Analysis Using Educa
No ratings yet
Student Performance Analysis Using Educa
8 pages
Educational Data Mining
No ratings yet
Educational Data Mining
9 pages
Review On Prediction Algorithms in Educational Data Mining
No ratings yet
Review On Prediction Algorithms in Educational Data Mining
2 pages
Evolutionary Algorithm Based Rule(s) Generation For Personalized Courseware Construction in Educational Data Mining
No ratings yet
Evolutionary Algorithm Based Rule(s) Generation For Personalized Courseware Construction in Educational Data Mining
7 pages
Prediction Clustering
No ratings yet
Prediction Clustering
16 pages
Reviewed
No ratings yet
Reviewed
19 pages
COFIMCO Installation and Operation Manual
100% (2)
COFIMCO Installation and Operation Manual
11 pages
Version Final Enviada
No ratings yet
Version Final Enviada
20 pages
A Survey On Educational Data Mining Techniques in Predicting Student's Academic Performance
No ratings yet
A Survey On Educational Data Mining Techniques in Predicting Student's Academic Performance
3 pages
(Fa) Fianl Research Paper Data Mining..
No ratings yet
(Fa) Fianl Research Paper Data Mining..
59 pages
E-Learning Using Data Mining: Shimaa Abd Elkader Abd Elaal
No ratings yet
E-Learning Using Data Mining: Shimaa Abd Elkader Abd Elaal
17 pages
Hari Ganesh 2015
No ratings yet
Hari Ganesh 2015
6 pages
Educational Data Mining: A Review of The State of The Art
No ratings yet
Educational Data Mining: A Review of The State of The Art
18 pages
20122
No ratings yet
20122
22 pages
Handling Missing Value in Decision Tree Algorithm PDF
No ratings yet
Handling Missing Value in Decision Tree Algorithm PDF
6 pages
Analysis of Data Mining Techniques Applied To LMS For Personalized Education
No ratings yet
Analysis of Data Mining Techniques Applied To LMS For Personalized Education
5 pages
Study On Educational Data Mining
No ratings yet
Study On Educational Data Mining
9 pages
Review On Prediction Algorithms in Educational Data Mining: A.Dinesh Kumar, R.Pandi Selvam, K.Sathesh Kumar
No ratings yet
Review On Prediction Algorithms in Educational Data Mining: A.Dinesh Kumar, R.Pandi Selvam, K.Sathesh Kumar
8 pages
Sashin - 2012 - A Survey and Future Vision of Data Mining in Educational Field
No ratings yet
Sashin - 2012 - A Survey and Future Vision of Data Mining in Educational Field
5 pages
Final Survey Paper 17-9-13
No ratings yet
Final Survey Paper 17-9-13
5 pages
Educational Data Mining: A Literature Review
No ratings yet
Educational Data Mining: A Literature Review
9 pages
Experiment 8 Fuentes Mark
No ratings yet
Experiment 8 Fuentes Mark
29 pages
Data Mining in Education Sector
No ratings yet
Data Mining in Education Sector
7 pages
2,000 Most Common Italian Words
No ratings yet
2,000 Most Common Italian Words
30 pages
Time and Decision Economic and Psychological Perspectives of Intertemporal Choice George Loewenstein
No ratings yet
Time and Decision Economic and Psychological Perspectives of Intertemporal Choice George Loewenstein
82 pages
Fortec PT Brochure July 2020 Web
No ratings yet
Fortec PT Brochure July 2020 Web
30 pages
Redox
No ratings yet
Redox
2 pages
Admit Card
No ratings yet
Admit Card
3 pages
BSD Junction Blok A No 3, JL Pahlawan Seribu, BSD City, Tangerang Selatan PH: (021) 3032 1716 / 081 689 5500 / Cs@royalgardenspa - Co.id
No ratings yet
BSD Junction Blok A No 3, JL Pahlawan Seribu, BSD City, Tangerang Selatan PH: (021) 3032 1716 / 081 689 5500 / Cs@royalgardenspa - Co.id
26 pages
Optima Super Secure Brochure
No ratings yet
Optima Super Secure Brochure
20 pages
Clarion IDE Users Guide
No ratings yet
Clarion IDE Users Guide
302 pages
Package Desire': R Topics Documented
No ratings yet
Package Desire': R Topics Documented
22 pages
Ultrasonic Sensors: USA Series US-T50/R25 US-S25AN US-S300 Series US-1AH
No ratings yet
Ultrasonic Sensors: USA Series US-T50/R25 US-S25AN US-S300 Series US-1AH
19 pages
JSR-211 - Devx
No ratings yet
JSR-211 - Devx
6 pages
Sikorsky v. City of Newburgh, No. 23-1171 (2d Cir. May 2, 2025)
No ratings yet
Sikorsky v. City of Newburgh, No. 23-1171 (2d Cir. May 2, 2025)
13 pages
2.3.11.a Calculating Property Drainage
No ratings yet
2.3.11.a Calculating Property Drainage
6 pages
Elgamatic 100
No ratings yet
Elgamatic 100
1 page
Robin Austin Resume
No ratings yet
Robin Austin Resume
4 pages
Au Bon Pain
No ratings yet
Au Bon Pain
6 pages
Web Wonders
No ratings yet
Web Wonders
2 pages
Chapter 2 Architectural Models
No ratings yet
Chapter 2 Architectural Models
44 pages
An Open Ended Contract
No ratings yet
An Open Ended Contract
5 pages
IA Carpentry
No ratings yet
IA Carpentry
103 pages
Final Sudeshna Resume
No ratings yet
Final Sudeshna Resume
1 page
Michael's Resume 2024
No ratings yet
Michael's Resume 2024
3 pages
Adora Seedless One Sheet 2015
No ratings yet
Adora Seedless One Sheet 2015
1 page
6 Internship Contract Agreement f2f
No ratings yet
6 Internship Contract Agreement f2f
2 pages
Debit+Notes DN0006 60006782970 60006783462 474581000000381079 1662203079064
No ratings yet
Debit+Notes DN0006 60006782970 60006783462 474581000000381079 1662203079064
1 page
AI and ML Applications for Decision-Making in Education Sector
From Everand
AI and ML Applications for Decision-Making in Education Sector
Zemelak Goraga
No ratings yet
Teaching and Learning in STEM With Computation, Modeling, and Simulation Practices: A Guide for Practitioners and Researchers
From Everand
Teaching and Learning in STEM With Computation, Modeling, and Simulation Practices: A Guide for Practitioners and Researchers
Alejandra J. Magana
No ratings yet

BIA Assignment

Uploaded by

BIA Assignment

Uploaded by

BIA ASSIGNMENT

SUBMITTED BY: SUBMITTED TO:

ARMY INSTITUTE OF MANAGEMENT & TECHNOLOGY,

 Analysis and visualization of data

A. Analysis and Visualization of Data

B. Predicting Student Performance

TECHNIQUES OF EDUCATIONAL DATA MINING

B. K-Mean Clustering Algorithm

Where, ‘ci’ represents the number of data points in ith cluster.

E. Dimensionality reduction techniques

You might also like