Class Gradient Projection For Continual Learning

Chen, Cheng; Zhang, Ji; Song, Jingkuan; Gao, Lianli

doi:10.1145/3503161.3548054

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.14905 (cs)

[Submitted on 25 Nov 2023]

Title:Class Gradient Projection For Continual Learning

Authors:Cheng Chen, Ji Zhang, Jingkuan Song, Lianli Gao

View PDF

Abstract:Catastrophic forgetting is one of the most critical challenges in Continual Learning (CL). Recent approaches tackle this problem by projecting the gradient update orthogonal to the gradient subspace of existing tasks. While the results are remarkable, those approaches ignore the fact that these calculated gradients are not guaranteed to be orthogonal to the gradient subspace of each class due to the class deviation in tasks, e.g., distinguishing "Man" from "Sea" v.s. differentiating "Boy" from "Girl". Therefore, this strategy may still cause catastrophic forgetting for some classes. In this paper, we propose Class Gradient Projection (CGP), which calculates the gradient subspace from individual classes rather than tasks. Gradient update orthogonal to the gradient subspace of existing classes can be effectively utilized to minimize interference from other classes. To improve the generalization and efficiency, we further design a Base Refining (BR) algorithm to combine similar classes and refine class bases dynamically. Moreover, we leverage a contrastive learning method to improve the model's ability to handle unseen tasks. Extensive experiments on benchmark datasets demonstrate the effectiveness of our proposed approach. It improves the previous methods by 2.0% on the CIFAR-100 dataset.

Comments:	MM '22: Proceedings of the 30th ACM International Conference on Multimedia
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.14905 [cs.CV]
	(or arXiv:2311.14905v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.14905
Related DOI:	https://doi.org/10.1145/3503161.3548054

Submission history

From: Cheng Chen [view email]
[v1] Sat, 25 Nov 2023 02:45:56 UTC (1,282 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Class Gradient Projection For Continual Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Class Gradient Projection For Continual Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators