Towards Robust Knowledge Tracing Models via k-Sparse Attention

Huang, Shuyan; Liu, Zitao; Zhao, Xiangyu; Luo, Weiqi; Weng, Jian

Computer Science > Machine Learning

arXiv:2407.17097 (cs)

[Submitted on 24 Jul 2024]

Title:Towards Robust Knowledge Tracing Models via k-Sparse Attention

Authors:Shuyan Huang, Zitao Liu, Xiangyu Zhao, Weiqi Luo, Jian Weng

View PDF HTML (experimental)

Abstract:Knowledge tracing (KT) is the problem of predicting students' future performance based on their historical interaction sequences. With the advanced capability of capturing contextual long-term dependency, attention mechanism becomes one of the essential components in many deep learning based KT (DLKT) models. In spite of the impressive performance achieved by these attentional DLKT models, many of them are often vulnerable to run the risk of overfitting, especially on small-scale educational datasets. Therefore, in this paper, we propose \textsc{sparseKT}, a simple yet effective framework to improve the robustness and generalization of the attention based DLKT approaches. Specifically, we incorporate a k-selection module to only pick items with the highest attention scores. We propose two sparsification heuristics : (1) soft-thresholding sparse attention and (2) top-$K$ sparse attention. We show that our \textsc{sparseKT} is able to help attentional KT models get rid of irrelevant student interactions and have comparable predictive performance when compared to 11 state-of-the-art KT models on three publicly available real-world educational datasets. To encourage reproducible research, we make our data and code publicly available at \url{this https URL}\footnote{We merged our model to the \textsc{pyKT} benchmark at \url{this https URL}.}.

Comments:	Accepted at SIGIR'2023 (revised version with additional results)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2407.17097 [cs.LG]
	(or arXiv:2407.17097v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.17097

Submission history

From: Zitao Liu [view email]
[v1] Wed, 24 Jul 2024 08:49:18 UTC (660 KB)

Computer Science > Machine Learning

Title:Towards Robust Knowledge Tracing Models via k-Sparse Attention

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Robust Knowledge Tracing Models via k-Sparse Attention

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators