Multi-task Learning For Detecting and Segmenting Manipulated Facial Images and Videos

Nguyen, Huy H.; Fang, Fuming; Yamagishi, Junichi; Echizen, Isao

arXiv:1906.06876 (cs)

[Submitted on 17 Jun 2019]

Abstract: Detecting manipulated images and videos is an important topic in digital media forensics. Most detection methods use binary classification to determine the probability of a query being manipulated. Another important topic is locating manipulated regions (i.e., performing segmentation), which are mostly created by three commonly used attacks: removal, copy-move, and splicing. We have designed a convolutional neural network that uses a multi-task learning approach to simultaneously detect manipulated images and videos and locate the manipulated regions for each query. Information gained by performing one task is shared with the other task and thereby enhances the performance of both tasks. A semi-supervised learning approach is used to improve the network's generalizability. The network includes an encoder and a Y-shaped decoder. Activation of the encoded features is used for the binary classification. The output of one branch of the decoder is used for segmenting the manipulated regions, while that of the other branch is used for reconstructing the input, which helps improve overall performance. Experiments using the FaceForensics and FaceForensics++ databases demonstrated the network's effectiveness against facial reenactment attacks and face swapping attacks as well as its ability to deal with the mismatch condition for previously seen attacks. Moreover, fine-tuning using just a small amount of data enables the network to deal with unseen attacks.
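The abstract names three outputs (a binary classification, a segmentation mask, and a reconstruction of the input) but does not state how their training objectives are combined. The following is a minimal sketch of one plausible way to combine them, not the authors' implementation. It assumes binary cross-entropy for the classification and per-pixel segmentation terms, mean squared error for the reconstruction term, and equal weights; the function names and the parameters `w_cls`, `w_seg`, and `w_rec` are hypothetical.

```python
import math
from typing import Sequence


def binary_cross_entropy(p: float, label: int, eps: float = 1e-7) -> float:
    """Cross-entropy between a predicted probability and a 0/1 label."""
    p = min(max(p, eps), 1.0 - eps)  # clip to avoid log(0)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))


def mean_bce(pred_mask: Sequence[float], true_mask: Sequence[int]) -> float:
    """Average per-pixel cross-entropy over a flattened segmentation mask."""
    losses = [binary_cross_entropy(p, t) for p, t in zip(pred_mask, true_mask)]
    return sum(losses) / len(losses)


def mean_squared_error(recon: Sequence[float], original: Sequence[float]) -> float:
    """Average squared difference between reconstruction and input pixels."""
    return sum((r - x) ** 2 for r, x in zip(recon, original)) / len(original)


def multi_task_loss(
    p_fake: float,               # classifier output: probability of manipulation
    label: int,                  # 1 = manipulated, 0 = real
    pred_mask: Sequence[float],  # segmentation branch output (flattened)
    true_mask: Sequence[int],    # ground-truth manipulated-region mask
    recon: Sequence[float],      # reconstruction branch output (flattened)
    original: Sequence[float],   # input pixels (flattened)
    w_cls: float = 1.0,          # assumed weights, not taken from the paper
    w_seg: float = 1.0,
    w_rec: float = 1.0,
) -> float:
    """Weighted sum of the three task losses (an illustrative assumption)."""
    cls = binary_cross_entropy(p_fake, label)
    seg = mean_bce(pred_mask, true_mask)
    rec = mean_squared_error(recon, original)
    return w_cls * cls + w_seg * seg + w_rec * rec


# Toy example: uncertain classifier, perfect mask and reconstruction.
example_loss = multi_task_loss(
    p_fake=0.5, label=1,
    pred_mask=[1.0, 0.0], true_mask=[1, 0],
    recon=[0.2, 0.4], original=[0.2, 0.4],
)
print(example_loss)
```

Because all three terms are summed into one objective, gradients from the segmentation and reconstruction branches reach the shared encoder together with the classification gradient, which is the kind of information sharing between tasks that the abstract describes.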

Comments:	Accepted to be Published in Proceedings of the IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS) 2019, Florida, USA
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1906.06876 [cs.CV]
	(or arXiv:1906.06876v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1906.06876

Submission history

From: Hong Huy Nguyen
[v1] Mon, 17 Jun 2019 07:27:54 UTC (963 KB)
