Incremental Object Detection with CLIP

Huang, Ziyue; He, Yupeng; Liu, Qingjie; Wang, Yunhong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2310.08815 (cs)

[Submitted on 13 Oct 2023 (v1), last revised 9 Jul 2024 (this version, v3)]

Title:Incremental Object Detection with CLIP

Authors:Ziyue Huang, Yupeng He, Qingjie Liu, Yunhong Wang

View PDF HTML (experimental)

Abstract:In contrast to the incremental classification task, the incremental detection task is characterized by the presence of data ambiguity, as an image may have differently labeled bounding boxes across multiple continuous learning stages. This phenomenon often impairs the model's ability to effectively learn new classes. However, existing research has paid less attention to the forward compatibility of the model, which limits its suitability for incremental learning. To overcome this obstacle, we propose leveraging a visual-language model such as CLIP to generate text feature embeddings for different class sets, which enhances the feature space globally. We then employ super-classes to replace the unavailable novel classes in the early learning stage to simulate the incremental scenario. Finally, we utilize the CLIP image encoder to accurately identify potential objects. We incorporate the finely recognized detection boxes as pseudo-annotations into the training process, thereby further improving the detection performance. We evaluate our approach on various incremental learning settings using the PASCAL VOC 2007 dataset, and our approach outperforms state-of-the-art methods, particularly for recognizing the new classes.

Comments:	5 pages, 1 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2310.08815 [cs.CV]
	(or arXiv:2310.08815v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.08815

Submission history

From: Ziyue Huang [view email]
[v1] Fri, 13 Oct 2023 01:59:39 UTC (567 KB)
[v2] Wed, 29 May 2024 16:11:10 UTC (1,254 KB)
[v3] Tue, 9 Jul 2024 06:55:00 UTC (567 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Incremental Object Detection with CLIP

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Incremental Object Detection with CLIP

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators