SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

He, Xilin; Luo, Cheng; Xian, Xiaole; Li, Bing; Song, Siyang; Khan, Muhammad Haris; Xie, Weicheng; Shen, Linlin; Ge, Zongyuan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.09865 (cs)

[Submitted on 13 Oct 2024 (v1), last revised 20 Nov 2024 (this version, v2)]

Title:SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Authors:Xilin He, Cheng Luo, Xiaole Xian, Bing Li, Siyang Song, Muhammad Haris Khan, Weicheng Xie, Linlin Shen, Zongyuan Ge

View PDF HTML (experimental)

Abstract:Facial expression datasets remain limited in scale due to privacy concerns, the subjectivity of annotations, and the labor-intensive nature of data collection. This limitation poses a significant challenge for developing modern deep learning-based facial expression analysis models, particularly foundation models, that rely on large-scale data for optimal performance. To tackle the overarching and complex challenge, we introduce SynFER (Synthesis of Facial Expressions with Refined Control), a novel framework for synthesizing facial expression image data based on high-level textual descriptions as well as more fine-grained and precise control through facial action units. To ensure the quality and reliability of the synthetic data, we propose a semantic guidance technique to steer the generation process and a pseudo-label generator to help rectify the facial expression labels for the synthetic images. To demonstrate the generation fidelity and the effectiveness of the synthetic data from SynFER, we conduct extensive experiments on representation learning using both synthetic data and real-world data. Experiment results validate the efficacy of the proposed approach and the synthetic data. Notably, our approach achieves a 67.23% classification accuracy on AffectNet when training solely with synthetic data equivalent to the AffectNet training set size, which increases to 69.84% when scaling up to five times the original size. Our code will be made publicly available.

Comments:	Updated Results
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2410.09865 [cs.CV]
	(or arXiv:2410.09865v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.09865

Submission history

From: Xilin He [view email]
[v1] Sun, 13 Oct 2024 14:58:21 UTC (10,767 KB)
[v2] Wed, 20 Nov 2024 07:38:20 UTC (16,051 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators