Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey

Xin, Yi; Luo, Siqi; Zhou, Haodi; Du, Junlong; Liu, Xiaohong; Fan, Yue; Li, Qing; Du, Yuntao

arXiv:2402.02242v1 (cs)

[Submitted on 3 Feb 2024 (this version), latest version 8 Feb 2024 (v2)]

Abstract: Large-scale pre-trained vision models (PVMs) have shown great potential for adaptability across various downstream vision tasks. However, with state-of-the-art PVMs growing to billions or even trillions of parameters, the standard full fine-tuning paradigm is becoming unsustainable due to high computational and storage demands. In response, researchers are exploring parameter-efficient fine-tuning (PEFT), which seeks to exceed the performance of full fine-tuning with minimal parameter modifications. This survey provides a comprehensive overview and future directions for visual PEFT, offering a systematic review of the latest advancements. First, we provide a formal definition of PEFT and discuss model pre-training methods. We then categorize existing methods into three categories: addition-based, partial-based, and unified-based. Finally, we introduce the commonly used datasets and applications and suggest potential future research challenges. A comprehensive collection of resources is available at this https URL.

Comments:	Submitted to IJCAI 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2402.02242 [cs.CV]
	(or arXiv:2402.02242v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.02242

Submission history

From: Yi Xin
[v1] Sat, 3 Feb 2024 19:12:20 UTC (99 KB)
[v2] Thu, 8 Feb 2024 08:17:57 UTC (99 KB)
