Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning

Deng, Danruo; Chen, Guangyong; Hao, Jianye; Wang, Qiong; Heng, Pheng-Ann

Abstract:The backpropagation networks are notably susceptible to catastrophic forgetting, where networks tend to forget previously learned skills upon learning new ones. To address such the 'sensitivity-stability' dilemma, most previous efforts have been contributed to minimizing the empirical risk with different parameter regularization terms and episodic memory, but rarely exploring the usages of the weight loss landscape. In this paper, we investigate the relationship between the weight loss landscape and sensitivity-stability in the continual learning scenario, based on which, we propose a novel method, Flattening Sharpness for Dynamic Gradient Projection Memory (FS-DGPM). In particular, we introduce a soft weight to represent the importance of each basis representing past tasks in GPM, which can be adaptively learned during the learning process, so that less important bases can be dynamically released to improve the sensitivity of new skill learning. We further introduce Flattening Sharpness (FS) to reduce the generalization gap by explicitly regulating the flatness of the weight loss landscape of all seen tasks. As demonstrated empirically, our proposed method consistently outperforms baselines with the superior ability to learn new skills while alleviating forgetting effectively.

Comments:	NeurIPS2021
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2110.04593 [cs.LG]
	(or arXiv:2110.04593v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2110.04593

Computer Science > Machine Learning

Title:Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators