DiFaReli++: Diffusion Face Relighting with Consistent Cast Shadows

Ponglertnapakorn, Puntawat; Tritrong, Nontawat; Suwajanakorn, Supasorn

Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.09479 (cs)

[Submitted on 19 Apr 2023 (v1), last revised 25 Jan 2025 (this version, v4)]

Title:DiFaReli++: Diffusion Face Relighting with Consistent Cast Shadows

Authors:Puntawat Ponglertnapakorn, Nontawat Tritrong, Supasorn Suwajanakorn

View PDF

Abstract:We introduce a novel approach to single-view face relighting in the wild, addressing challenges such as global illumination and cast shadows. A common scheme in recent methods involves intrinsically decomposing an input image into 3D shape, albedo, and lighting, then recomposing it with the target lighting. However, estimating these components is error-prone and requires many training examples with ground-truth lighting to generalize well. Our work bypasses the need for accurate intrinsic estimation and can be trained solely on 2D images without any light stage data, relit pairs, multi-view images, or lighting ground truth. Our key idea is to leverage a conditional diffusion implicit model (DDIM) for decoding a disentangled light encoding along with other encodings related to 3D shape and facial identity inferred from off-the-shelf estimators. We propose a novel conditioning technique that simplifies modeling the complex interaction between light and geometry. It uses a rendered shading reference along with a shadow map, inferred using a simple and effective technique, to spatially modulate the DDIM. Moreover, we propose a single-shot relighting framework that requires just one network pass, given pre-processed data, and even outperforms the teacher model across all metrics. Our method realistically relights in-the-wild images with temporally consistent cast shadows under varying lighting conditions. We achieve state-of-the-art performance on the standard benchmark Multi-PIE and rank highest in user studies.

Comments:	DiFaReli++ extends our previous work DiFaReli (ICCV 2023)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
Cite as:	arXiv:2304.09479 [cs.CV]
	(or arXiv:2304.09479v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.09479

Submission history

From: Puntawat Ponglertnapakorn [view email]
[v1] Wed, 19 Apr 2023 08:03:20 UTC (22,168 KB)
[v2] Fri, 21 Apr 2023 07:09:55 UTC (22,168 KB)
[v3] Thu, 7 Sep 2023 09:08:01 UTC (25,206 KB)
[v4] Sat, 25 Jan 2025 18:24:20 UTC (38,858 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DiFaReli++: Diffusion Face Relighting with Consistent Cast Shadows

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DiFaReli++: Diffusion Face Relighting with Consistent Cast Shadows

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators