Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models

Dam, Phuong; Jeong, Jihoon; Tran, Anh; Kim, Daeyoung

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.07371 (cs)

[Submitted on 12 Mar 2024 (v1), last revised 17 Jul 2024 (this version, v3)]

Title:Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models

Authors:Phuong Dam, Jihoon Jeong, Anh Tran, Daeyoung Kim

View PDF HTML (experimental)

Abstract:This study discusses the critical issues of Virtual Try-On in contemporary e-commerce and the prospective metaverse, emphasizing the challenges of preserving intricate texture details and distinctive features of the target person and the clothes in various scenarios, such as clothing texture and identity characteristics like tattoos or accessories. In addition to the fidelity of the synthesized images, the efficiency of the synthesis process presents a significant hurdle. Various existing approaches are explored, highlighting the limitations and unresolved aspects, e.g., identity information omission, uncontrollable artifacts, and low synthesis speed. It then proposes a novel diffusion-based solution that addresses garment texture preservation and user identity retention during virtual try-on. The proposed network comprises two primary modules - a warping module aligning clothing with individual features and a try-on module refining the attire and generating missing parts integrated with a mask-aware post-processing technique ensuring the integrity of the individual's identity. It demonstrates impressive results, surpassing the state-of-the-art in speed by nearly 20 times during inference, with superior fidelity in qualitative assessments. Quantitative evaluations confirm comparable performance with the recent SOTA method on the VITON-HD and Dresscode datasets. We named our model Fast and Identity Preservation Virtual TryON (FIP-VITON).

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.07371 [cs.CV]
	(or arXiv:2403.07371v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.07371

Submission history

From: Phuong Dam Hoang [view email]
[v1] Tue, 12 Mar 2024 07:15:29 UTC (22,392 KB)
[v2] Mon, 25 Mar 2024 05:48:28 UTC (22,392 KB)
[v3] Wed, 17 Jul 2024 06:50:47 UTC (13,085 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators