0% found this document useful (0 votes)
173 views6 pages

Academic Performance Prediction Based On Multisource, Multi Feature Behavioral Data

The document proposes a model called Augmented Education (AugmentED) to predict students' academic performance using multi-source behavioral data. It aggregates data from various campus sources, extracts relevant features, and uses machine learning algorithms to classify performance. Feedback is then provided to students, especially those at risk, to help optimize their university engagement. The model is tested on a real dataset of 156 students and shown to accurately predict academic outcomes.

Uploaded by

bits computers
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
173 views6 pages

Academic Performance Prediction Based On Multisource, Multi Feature Behavioral Data

The document proposes a model called Augmented Education (AugmentED) to predict students' academic performance using multi-source behavioral data. It aggregates data from various campus sources, extracts relevant features, and uses machine learning algorithms to classify performance. Feedback is then provided to students, especially those at risk, to help optimize their university engagement. The model is tested on a real dataset of 156 students and shown to accurately predict academic outcomes.

Uploaded by

bits computers
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 6

Academic Performance Prediction Based on

Multisource, Multi feature Behavioral Data


ABSTRACT

Digital data trails from disparate sources covering different aspects of student life
are stored daily in most modern university campuses. However, it remains
challenging to (i) combine these data to obtain a holistic view of a student, (ii) use
these data to accurately predict academic performance, and (iii) use such predictions
to promote positive student engagement with the university. To initially alleviate this
problem, in this paper, a model named Augmented Education (AugmentED) is
proposed. In our study, (1) first, an experiment is conducted based on a real-world
campus dataset of college students (N = 156) that aggregates multisource behavioral
data covering not only online and offline learning but also behaviors inside and
outside of the classroom. Specifically, to gain in-depth insight into the features
leading to excellent or poor performance, metrics measuring the linear and nonlinear
behavioral changes (e.g., regularity and stability) of campus lifestyles are estimated;
furthermore, features representing dynamic changes in temporal lifestyle patterns are
extracted by the means of long short-term memory (LSTM). (2) Second, machine
learning-based classification algorithms are developed to predict academic
performance. (3) Finally, visualized feedback enabling students (especially at-risk
students) to potentially optimize their interactions with the university and achieve a
study-life balance is designed. The experiments show that the AugmentED model
can predict students’ academic performance with high accuracy.
EXISTING SYSTEM

 Regarding the students’ behavioral time series, nonlinear metrics have been
used to discover nonlinear behavioral patterns. We consider entropy an
example. In [22], entropy is proposed to quantify the regularity/orderliness
of students’ behaviors, and it was demonstrated that a small entropy value
generally leads to high regularity and high academic performance. Another
example is entropy calculated based on a Hidden Markov Model (HMM)
analysis [44], which is called HMM-based entropy for simplicity in our
study. HMM-based entropy is proposed to quantify the
uncertainty/diversity of students’ behaviors, e.g., the uncertainty between
the transition of different behaviors and the various activities that a
behavior exhibits. In [44], HMM-based entropy is evaluated by the
following two steps: (i) extracting the hidden states of a behavioral time
series by HMM [45,46]; and (ii) subsequently calculating the HMM-based
entropy of the extracted hidden states.

 To further recognize students’ activities and discover their nonlinear


behavioral patterns, the following three new metrics, which have not been
applied in students’ behavioral time series analysis previously, are also
worth to be studied.  Lyapunov Exponent (LyE) [47-51] is a measure of
the stability of a time series. For example, in [47], LyE is used to quantify
the stability of a gait time series, and the results demonstrate that a time
series with a large LyE value is less stable than a series with a small LyE
value, i.e., generally, a large LyE value indicates high instability.
Therefore, in gait analyses, LyE is considered a stability risk indicator for
falls [47] that can distinguish healthy subjects from those at a high risk of
falling.  Hurst Exponent (HurstE) [52-54] is a measure of predictability
(in some studies, it is also called long-term memory) of a time series. For
example, in [53], HurstE is applied to quantify the predictability of a
financial time series, and the results demonstrate that a time series with a
large HurstE value can be predicted more accurately than a series with a
HurstE value close to 0.5.

 Detrended Fluctuation Analysis (DFA) [54-57] is a measure of the long-


range correlation (also called statistical self-affinity or long-range
dependence) of a time series [56]. For example, in [56], DFA is used to
quantify the long-range correlation of a heart rate time series, and it is
demonstrated that a time series with a small DFA value indicates less long-
range correlation behavior than a series with a large DFA value. Therefore,
in heart rate analyses, DFA is considered a long-range correlation indicator
that can distinguish healthy subjects from those with severe heart disease
[56].

Disadvantages
o In the existing work, the system does not read Smart Card data.
o This system is less performance due to lack of PREDICTION
ALGORITHMS.
PROPOSED SYSTEM

 In the proposed system,initially alleviate the challenges mentioned above, a


model named Augmented Education (AugmentED) is proposed in this paper.
In the proposed system, this model mainly consists of the following three
modules:
 Data Module in which multisource data on campus covering a large variety
of data trails are aggregated and fused, and the characteristics/features that can
represent students’ behavioral change from three different perspectives are
evaluated;
 Prediction Module in which academic performance prediction is considered a
classification problem that is solved by machine learning (ML)-based
algorithms; and
 Feedback Module in which visualized feedback is delivered individually
based on the predictions made and feature analysis. Finally, AugmentED is
examined using a real-world dataset of 156 college students.

Advantages

(i) capturing a sufficiently rich profile of a student and integrating these data to
obtain a holistic view;
(ii) (ii) exploring the factors affecting students’ academic performance and using
this information to develop a robust prediction model with high accuracy;
and
(iii) taking advantage of the prediction model to deliver personalize services that
potentially enable students to drive behavioral change and optimize their
study-life balance.
SYSTEM REQUIREMENTS

➢ H/W System Configuration:-

➢ Processor - Pentium –IV


➢ RAM - 4 GB (min)
➢ Hard Disk - 20 GB
➢ Key Board - Standard Windows Keyboard
➢ Mouse - Two or Three Button Mouse
➢ Monitor - SVGA

SOFTWARE REQUIREMENTS:

 Operating system : Windows 7 Ultimate.

 Coding Language : Python.

 Front-End : Python.

 Back-End : Django-ORM

 Designing : Html, css, javascript.

 Data Base : MySQL (WAMP Server).

You might also like