DualLip: A System for Joint Lip Reading and Generation

Chen, Weicong; Tan, Xu; Xia, Yingce; Qin, Tao; Wang, Yu; Liu, Tie-Yan

doi:10.1145/3394171.3413623

arXiv:2009.05784 (cs)

[Submitted on 12 Sep 2020]

Abstract: Lip reading aims to recognize text from video of talking lips, while lip generation aims to synthesize video of talking lips from text. Lip generation is a key component of talking face generation and is the dual task of lip reading. In this paper, we develop DualLip, a system that jointly improves lip reading and lip generation by leveraging this task duality and using unlabeled text and lip video data. The key ideas of DualLip are: 1) generate lip video from unlabeled text with a lip generation model, and use the resulting pseudo pairs to improve lip reading; 2) generate text from unlabeled lip video with a lip reading model, and use the resulting pseudo pairs to improve lip generation. We further extend DualLip to talking face generation by introducing two additional components: lip-to-face generation and text-to-speech generation. Experiments on GRID and TCD-TIMIT demonstrate that DualLip improves lip reading, lip generation, and talking face generation by utilizing unlabeled data. Specifically, the lip generation model in our DualLip system, trained with only 10% of the paired data, surpasses the performance of a model trained with all of the paired data. On the GRID lip reading benchmark, we achieve a 1.16% character error rate and a 2.71% word error rate, outperforming state-of-the-art models that use the same amount of paired data.
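
The two key ideas in the abstract can be read as a pair of pseudo-labeling steps, one in each direction. The following is a minimal Python sketch of that reading, not the authors' implementation: the function name build_pseudo_pairs, the toy models, and the representation of a lip video as a list of integers are all assumptions made for illustration. The abstract does not specify how the pseudo pairs are weighted, filtered, or mixed with real pairs during training.

```python
from typing import Callable, List, Tuple

Text = str
Video = List[int]            # hypothetical stand-in for a sequence of lip frames
Pair = Tuple[Video, Text]    # (lip video, transcript)


def build_pseudo_pairs(
    unlabeled_text: List[Text],
    unlabeled_video: List[Video],
    lip_generator: Callable[[Text], Video],
    lip_reader: Callable[[Video], Text],
) -> Tuple[List[Pair], List[Pair]]:
    """Return (pseudo pairs for the lip reader, pseudo pairs for the generator)."""
    # Idea 1: unlabeled text -> synthetic video. The real text serves as the
    # target when these pairs are used to train the lip reading model.
    for_reader = [(lip_generator(t), t) for t in unlabeled_text]
    # Idea 2: unlabeled video -> predicted text. The real video serves as the
    # target when these pairs are used to train the lip generation model.
    for_generator = [(v, lip_reader(v)) for v in unlabeled_video]
    return for_reader, for_generator


# Toy models (assumptions): each "frame" is just a character code, so the
# two toy functions are exact inverses of each other.
def toy_generator(text: Text) -> Video:
    return [ord(c) for c in text]


def toy_reader(video: Video) -> Text:
    return "".join(chr(f) for f in video)


paired: List[Pair] = [([104, 105], "hi")]  # small labeled set
reader_extra, generator_extra = build_pseudo_pairs(
    unlabeled_text=["bin blue"],
    unlabeled_video=[[115, 101, 116]],
    lip_generator=toy_generator,
    lip_reader=toy_reader,
)

# Each model is then trained on the real pairs plus its own pseudo pairs.
reader_train = paired + reader_extra
generator_train = paired + generator_extra
```

In this sketch the real side of every pseudo pair is always the training target, so each model learns from genuine text or genuine video even when its input was synthesized by the other model.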

Comments:	Accepted by ACM Multimedia 2020
Subjects:	Multimedia (cs.MM)
Cite as:	arXiv:2009.05784 [cs.MM]
	(or arXiv:2009.05784v1 [cs.MM] for this version)
	https://doi.org/10.48550/arXiv.2009.05784
Related DOI:	https://doi.org/10.1145/3394171.3413623

Submission history

From: Weicong Chen
[v1] Sat, 12 Sep 2020 13:13:55 UTC (3,419 KB)
