Avoiding Forgetting and Allowing Forward Transfer in Continual Learning via Sparse Networks

Sokar, Ghada; Mocanu, Decebal Constantin; Pechenizkiy, Mykola

Computer Science > Machine Learning

arXiv:2110.05329 (cs)

[Submitted on 11 Oct 2021 (v1), last revised 6 Jul 2022 (this version, v3)]

Title:Avoiding Forgetting and Allowing Forward Transfer in Continual Learning via Sparse Networks

Authors:Ghada Sokar, Decebal Constantin Mocanu, Mykola Pechenizkiy

View PDF

Abstract:Using task-specific components within a neural network in continual learning (CL) is a compelling strategy to address the stability-plasticity dilemma in fixed-capacity models without access to past data. Current methods focus only on selecting a sub-network for a new task that reduces forgetting of past tasks. However, this selection could limit the forward transfer of relevant past knowledge that helps in future learning. Our study reveals that satisfying both objectives jointly is more challenging when a unified classifier is used for all classes of seen tasks-class-Incremental Learning (class-IL)-as it is prone to ambiguities between classes across tasks. Moreover, the challenge increases when the semantic similarity of classes across tasks increases. To address this challenge, we propose a new CL method, named AFAF, that aims to Avoid Forgetting and Allow Forward transfer in class-IL using fix-capacity models. AFAF allocates a sub-network that enables selective transfer of relevant knowledge to a new task while preserving past knowledge, reusing some of the previously allocated components to utilize the fixed-capacity, and addressing class-ambiguities when similarities exist. The experiments show the effectiveness of AFAF in providing models with multiple CL desirable properties, while outperforming state-of-the-art methods on various challenging benchmarks with different semantic similarities.

Comments:	Accepted at European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2022)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2110.05329 [cs.LG]
	(or arXiv:2110.05329v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2110.05329

Submission history

From: Ghada Sokar [view email]
[v1] Mon, 11 Oct 2021 14:51:56 UTC (1,778 KB)
[v2] Mon, 31 Jan 2022 22:40:38 UTC (1,200 KB)
[v3] Wed, 6 Jul 2022 08:39:42 UTC (1,207 KB)

Computer Science > Machine Learning

Title:Avoiding Forgetting and Allowing Forward Transfer in Continual Learning via Sparse Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Avoiding Forgetting and Allowing Forward Transfer in Continual Learning via Sparse Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators