Deepfake Detection

The document compares the performance of pretrained EfficientNet models for deepfake detection using the DFDC dataset. It analyzes the top solutions from the DFDC challenge that used EfficientNets and hypothesizes that larger models may not achieve better performance for this task unlike other tasks the models were originally trained on. Experiments are conducted using models from the highest performing DFDC entry to evaluate performance versus size.

Uploaded by

vibgyor500

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

Deepfake Detection

Uploaded by

vibgyor500

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

EfficientNets for DeepFake Detection: Comparison of

Pretrained Models
2021 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (ElConRus) | 978-1-6654-0476-1/20/$31.00 ©2021 IEEE | DOI: 10.1109/ElConRus51938.2021.9396092

Artem A. Pokroy Alexey D. Egorov

Department of Computer Systems and Technologies Department of Computer Systems and Technologies
National Research Nuclear University MEPhI National Research Nuclear University MEPhI
(Moscow Engineering Physics Institute) (Moscow Engineering Physics Institute)
Moscow, Russia Federation Moscow, Russia Federation
[email protected] [email protected]

Abstract—Rapid advances in media generation techniques [10], which were finetuned on videos containing facial
have made the creation of AI-generated fake face videos more manipulations.
accessible than ever before. In order to accelerate the development
of new ways to expose forged videos, Facebook created Deep Fake EfficientNet family of models consists of a single baseline
Detection Challenge (DFDC), which demonstrated multiple neural network scaled up to various sizes. For all these models
approaches to solve this problem. Analysis of top-performing there is a dependency between the number of model parameters
solutions revealed that all winners used pre-trained EfficientNet and its performance. Larger models achieve higher accuracy.
networks, which was finetuned on videos containing face But this dependency does not preserve several similar transfer-
manipulations. Because of this observation, we decide to compare learning tasks (e.g. CIFAR-10, CIFAR-100, Oxford-IIIT Pets).
the performance of EfficientNets models within the task of However, even in these tasks, an overall increase in accuracy
detecting fake videos. For comparison, we use models, based on was still observed [10]. Figure 1 illustrates the break of
the highest-performing entrant of DFDC, entered by Selim dependency with CIFAR-100. In case of deepfake detection, the
Seferbekov, and the DFDC dataset as training data. Our data model works with is very different from the data it was
experiments show that there is no strong correlation between originally trained on. Therefore, we hypothesize that the
model performance and its size. The best accuracy was achieved tendency to improve accuracy of the model with an increased
by B4 and B5 models. number of its parameters may not be observed within the
deepfake detection problem. If this hypothesis is correct, then
Keywords——deepfake videos; deep learning; digital media while transferring pre-trained EfficientNets to the deepfake
forensics; detection techniques detection problem, models with fewer parameters may perform
I. INTRODUCTION better results.
Digital image and video manipulation technologies have The analysis of the DFDC winners’ solutions and the
been developing rapidly for several decades, and one of these proposed hypothesis prompted us to compare the performance
technologies, the transformation of human faces on videos, has of pre-trained EfficientNets transferred to deepfake detection
achieved tremendous results over the last years. Currently, there task. This comparison will allow us to find out which model
are several publicly available solutions [1, 2], that allow any user achieves the highest performance.
to perform various facial manipulations on high-quality videos,
without requiring a profound knowledge of computer science. In
the age of information technologies, when social networks are
actively used as a source of information, media data modified in
this way can spread rapidly and have a serious social impact [3,
4]. This phenomenon is publicly referred to as a deepfake.
Active development of various deepfake creation methods
led to the emergence of research in the field of deepfake
detection. These studies gave rise to many deepfake detection
methods. Some of these methods search for defects related to
human behavior [5, 6], others search for specific artifacts that
occur in the process of digital image transformation [7], but most
of these methods are data-driven, they are based on sophisticated
machine learning models and do not search for specific defects.
So as to accelerate the development of new ways to detect
deepfakes, Facebook created Deepfake Detection Challenge
(DFDC) [8]. This competition revealed a variety of different
approaches to solving this problem. We analyzed the top
demonstrated solutions and it appeared that all the winners’
solutions used pre-trained models of the EfficientNet family

Authorized licensed use limited to: UNIVERSITY OF BATH. Downloaded on June 14,2021 at 15:38:31 UTC from IEEE Xplore. Restrictions apply.
II. RESEARCH MATERIALS AND METHODS
While comparing models, we use a full DFDC dataset
consisting of real videos and videos modified by various
deepfake creation methods. The task within which we compare
EfficientNets is to predict the class of these videos, fake or real.
In order to solve this task, we created a baseline classification
algorithm inspired by the solution of the winner of the DFDC
competition, Selim Seferbekov. The most important step of this
algorithm is the use of one model from the EfficientNets family,
and since each model has the same output data format, we can
simply replace one model with another and compare the
algorithm performance without changing other parts of it.
Fig. 1. The tendency of increasing accuracy. Generally, as the number of
parameters increases, the accuracy increases. But on the CIFAR-100 dataset,
models B2 and B6 show worse results than B1 and B5, respectively.

Fig. 2. Video classification algorithm.

The video classification algorithm we use can be divided into independently classify each frame of the video as fake or
five steps: real. Figure 3 illustrates the architecture of the classifier
we use.
• STEP 1: we lower the video frequency by 30 times, then
extract an image fragment containing a human face from • STEP 5: we average all probabilities in sequence and
each frame. After that, we save each video as a sequence define the resulting number as the probability that the
of images of people's faces. We use Multi-task Cascaded input video is fake.
Convolutional Networks (MTCNN) [9] to detect people's
faces.
• STEP 2: for each image in the sequence, we apply a
random combination of the following data augmentation
methods: rotation by a random angle, flips, blackout
random part of the image, Gaussian noise, compression,
grayscale, and isotropic resize. During the augmentation
process, we do not scale the images and use the highest
resolution possible.
• STEP 3: we use the chosen EfficientNet to transform
each image in sequence into a feature matrix. In this case,
EfficientNets are used as encoders. Each EfficientNet has
a strictly defined size of input data, so we pre-scale each
image to the required size.
Fig. 3. Classifier architecture.
• STEP 4: by using a simple binary classifier, we convert
each feature matrix in sequence to a single number. This For each pre-trained model from the EfficientNet family we
number represents the probability that the frame compare (B0-B7), we train our classifier paired with the chosen
originally belongs to a fake video. In fact, we model. For that, we do 20 epochs of fitting on a train set of

599

Authorized licensed use limited to: UNIVERSITY OF BATH. Downloaded on June 14,2021 at 15:38:31 UTC from IEEE Xplore. Restrictions apply.
videos, computing loss function by each frame separately. Then The highest accuracy is achieved by using the B5 model. Figure
we test each model by applying our classification algorithm with 4 demonstrates the dependency between the accuracy of the
finetuned EfficientNet and trained classifier to a test set of model and the number of its parameters.
videos. We use produced predictions to compare EfficientNets
using binary classification metrics. Our train and test sets consist IV. DISCUSSION AND CONCLUSIONS
of 10000 and 5000 videos from the full DFDC dataset, At present, the problem of deepfake detection is still
respectively. relevant, but there are already many solutions that allow us to
Our work was performed using NRNU MEPhI high- find deepfakes with different accuracy degree. Among these
performance computing center, even so, we had to use only a solutions, high results are achieved by methods that use models
part of the full DFDC dataset and low frame rate of videos, due of the EfficientNet family for feature extraction.
to the high computational complexity of EfficientNets B6 and According to our results, the use of pre-trained EfficientNets
B7. However, we used the full DFDS dataset instead of the with a larger number of parameters does not always lead to
preview DFDS dataset, since the full release contains videos that increase in accuracy. The accuracy of our solution retains similar
have been modified by a larger number of different methods. results when using B0-B3 models. The use of B4 and B5 models
it increases and reaches the peak value, but the use of B6 and B7
III. RESULTS models leads to a significant decrease in accuracy, despite the
Using the computed predictions, we compare the models by great advantage in the number of parameters.
two metrics: accuracy and AUC-ROC. The results are presented
in Table 1. A decrease in accuracy of our solution with the use of B6 and
B7 models may be related to the fact that convolutional neural
TABLE 1. EFFICIENTNET PERFORMANCE RESULTS ON DFDC networks of this size begin to work with more complex patterns
that are much more difficult to transfer to a different task. In this
Model Accuracy AUC-ROC Params case, models with a larger number of parameters can potentially
EfficientNet-B0 70.5 0.785 5.3M achieve better results, but this will require much longer training.
7.8M
EfficientNet-B1 70.1 0.779 REFERENCES
EfficientNet-B2 70.1 0.769 9.2M [1] I. Perov et al., “DeepFaceLab: A simple, flexible and extensible face
swapping framework,” arXiv, May 2020, Accessed: Nov. 30, 2020.
EfficientNet-B3 69.9 0.785 12M [Online]. Available: http://arxiv.org/abs/2005.05535.
19M [2] “deepfakes/faceswap: Deepfakes Software For All.”
EfficientNet-B4 72.7 0.828
https://github.com/deepfakes/faceswap (accessed Nov. 30, 2020).
EfficientNet-B5 74.4 0.829 30M [3] “Tech - Disinfo and 2020 Election — NYU Stern Center for Business and
Human Rights.” https://bhr.stern.nyu.edu/tech-disinfo-and-2020-election
EfficientNet-B6 72.6 0.807 43M
(accessed Nov. 30, 2020).
EfficientNet-B7 70.4 0.789 66M [4] R. Chesney and D. K. Citron, “Deep Fakes: A Looming Challenge for
Privacy, Democracy, and National Security,” SSRN Electron. J., Aug.
2018, doi: 10.2139/ssrn.3213954.
[5] Y. Li, M. C. Chang, and S. Lyu, “In Ictu Oculi: Exposing AI created fake
videos by detecting eye blinking,” Jan. 2019, doi:
10.1109/WIFS.2018.8630787.
[6] X. Yang, Y. Li, and S. Lyu, “Exposing Deep Fakes Using Inconsistent
Head Poses,” in ICASSP, IEEE International Conference on Acoustics,
Speech and Signal Processing - Proceedings, May 2019, vol. 2019-May,
pp. 8261–8265, doi: 10.1109/ICASSP.2019.8683164.
[7] F. Matern, C. Riess, and M. Stamminger, “Exploiting visual artifacts to
expose deepfakes and face manipulations,” in Proceedings - 2019 IEEE
Winter Conference on Applications of Computer Vision Workshops,
WACVW 2019, Feb. 2019, pp. 83–92, doi:
10.1109/WACVW.2019.00020.
[8] “Deepfake Detection Challenge | Kaggle.”
https://www.kaggle.com/c/deepfake-detection-challenge (accessed Nov.
30, 2020).
[9] K. Zhang, Z. Zhang, Z. Li, and Y. Qiao, “Joint Face Detection and
Alignment Using Multitask Cascaded Convolutional Networks,” IEEE
Signal Process. Lett., vol. 23, no. 10, pp. 1499–1503, Oct. 2016, doi:
Fig. 4. Model parameters vs. accuracy. 10.1109/LSP.2016.2603342.
[10] M. Tan and Q. V. Le, “EfficientNet: Rethinking Model Scaling for
These results confirm the hypothesis we propose. A Convolutional Neural Networks,” 36th Int. Conf. Mach. Learn. ICML
tendency to increase the accuracy of the model with an increase 2019, vol. 2019-June, pp. 10691–10700, May 2019, Accessed: Nov. 30,
2020. [Online]. Available: http://arxiv.org/abs/1905.11946.
in the number of its parameters is not observed in our results.

600

Authorized licensed use limited to: UNIVERSITY OF BATH. Downloaded on June 14,2021 at 15:38:31 UTC from IEEE Xplore. Restrictions apply.

Short Form AIGP Study Outline
No ratings yet
Short Form AIGP Study Outline
28 pages
Optimization of DeepFake Video Detection Using Image Preprocessing
No ratings yet
Optimization of DeepFake Video Detection Using Image Preprocessing
5 pages
publi-7030
No ratings yet
publi-7030
5 pages
Deeo
No ratings yet
Deeo
11 pages
Deep Fake Detection - Finalized
No ratings yet
Deep Fake Detection - Finalized
8 pages
A Performance Enhancement of Deepfake Video
No ratings yet
A Performance Enhancement of Deepfake Video
10 pages
IJRPR7765
No ratings yet
IJRPR7765
5 pages
Deep Fake Detection Using CNN: Project Course On Neural Network
No ratings yet
Deep Fake Detection Using CNN: Project Course On Neural Network
4 pages
Deepfake Detection of Images
No ratings yet
Deepfake Detection of Images
9 pages
Deepfake 1
No ratings yet
Deepfake 1
6 pages
Deepfake Detection Humans vs. Machines
No ratings yet
Deepfake Detection Humans vs. Machines
6 pages
Deepfake Synopsis-1
No ratings yet
Deepfake Synopsis-1
2 pages
Pnas 2110013119
No ratings yet
Pnas 2110013119
11 pages
Synopsis Report
No ratings yet
Synopsis Report
8 pages
Digital Forensics and Analysis of Deepfa
No ratings yet
Digital Forensics and Analysis of Deepfa
6 pages
Robust Deepfake Detection Leveraging EfficientNet-B3 Backbone With Binary Classification Techniques
No ratings yet
Robust Deepfake Detection Leveraging EfficientNet-B3 Backbone With Binary Classification Techniques
9 pages
Deepfake Detection Using CNN and DCGANS to Drop-Out Fake Multimedia Content a Hybrid Approach
No ratings yet
Deepfake Detection Using CNN and DCGANS to Drop-Out Fake Multimedia Content a Hybrid Approach
6 pages
Deep Fake
No ratings yet
Deep Fake
7 pages
Detectx: A Deepfake Detection System
No ratings yet
Detectx: A Deepfake Detection System
17 pages
Project Report (1)_removed (1)_removed
No ratings yet
Project Report (1)_removed (1)_removed
51 pages
Celeb-Df: A Large-Scale Challenging Dataset For Deepfake Forensics
No ratings yet
Celeb-Df: A Large-Scale Challenging Dataset For Deepfake Forensics
10 pages
Deepfakestack-Kavya R Shetty
No ratings yet
Deepfakestack-Kavya R Shetty
16 pages
Group 85 Survey Paper[1]-1
No ratings yet
Group 85 Survey Paper[1]-1
5 pages
Group 4
No ratings yet
Group 4
11 pages
An Efficient Deepfake Detection Using Robust Deep Learning Approch
No ratings yet
An Efficient Deepfake Detection Using Robust Deep Learning Approch
24 pages
Nccds 3
No ratings yet
Nccds 3
3 pages
Innovative Project
No ratings yet
Innovative Project
7 pages
GTA-Net_A_Robust_Method_for_Deepfake_Face_Image_Detection
No ratings yet
GTA-Net_A_Robust_Method_for_Deepfake_Face_Image_Detection
6 pages
Seminar
No ratings yet
Seminar
18 pages
Deepfake Video Detection Using Convolutional Visio
No ratings yet
Deepfake Video Detection Using Convolutional Visio
9 pages
Deepfake Video Detection System Using Deep Neural Networks
No ratings yet
Deepfake Video Detection System Using Deep Neural Networks
6 pages
DEEPFAKE-5
No ratings yet
DEEPFAKE-5
7 pages
K024 K006 DWM ResearchPaper
No ratings yet
K024 K006 DWM ResearchPaper
16 pages
Project Report (1)_removed
No ratings yet
Project Report (1)_removed
53 pages
Deepfake Video Detection Using Convolutional Vision Transformer
No ratings yet
Deepfake Video Detection Using Convolutional Vision Transformer
9 pages
Ijset v11 Issue6 571
No ratings yet
Ijset v11 Issue6 571
5 pages
DeepFake Video Detection
No ratings yet
DeepFake Video Detection
22 pages
It_Wasnt_Me_Irregular_Identity_in_Deepfake_Videos
No ratings yet
It_Wasnt_Me_Irregular_Identity_in_Deepfake_Videos
5 pages
Deepfake Image and Video Detection Using Deep Learning Algorithms
No ratings yet
Deepfake Image and Video Detection Using Deep Learning Algorithms
5 pages
Phase 1 Review 1
No ratings yet
Phase 1 Review 1
14 pages
DeepFake Detection For Human Face Images and Videos A Survey
No ratings yet
DeepFake Detection For Human Face Images and Videos A Survey
19 pages
Deep Fake Today
No ratings yet
Deep Fake Today
7 pages
A Machine Learning Based Approach For Deepfake Detection in Social Media Through Key Video Frame Extraction
No ratings yet
A Machine Learning Based Approach For Deepfake Detection in Social Media Through Key Video Frame Extraction
18 pages
Video Deepfake Detection Using Particle Swarm Optimization Improved Deep Neural Networks
No ratings yet
Video Deepfake Detection Using Particle Swarm Optimization Improved Deep Neural Networks
37 pages
Iee Paper Final
No ratings yet
Iee Paper Final
14 pages
Celeb-DF: A New Dataset For DeepFake Forensics
No ratings yet
Celeb-DF: A New Dataset For DeepFake Forensics
6 pages
Paper (Related Project-3)
No ratings yet
Paper (Related Project-3)
9 pages
A12 - Deepfake Detection Implementation
No ratings yet
A12 - Deepfake Detection Implementation
4 pages
In-The-Wild_Deepfake_Detection_using_Adaptable_CNN_Models_with_Visual_Class_Activation_Mapping_for_Improved_Accuracy
No ratings yet
In-The-Wild_Deepfake_Detection_using_Adaptable_CNN_Models_with_Visual_Class_Activation_Mapping_for_Improved_Accuracy
6 pages
deep_fake_detection
No ratings yet
deep_fake_detection
8 pages
G24_R3
No ratings yet
G24_R3
14 pages
Illuminating Deepfakes: - An Anti-Deepfake Technology
No ratings yet
Illuminating Deepfakes: - An Anti-Deepfake Technology
20 pages
Survey Paper
No ratings yet
Survey Paper
5 pages
G24_R1
No ratings yet
G24_R1
19 pages
Mover: Mask and Recovery Based Facial Part Consistency Aware Method For Deepfake Video Detection
No ratings yet
Mover: Mask and Recovery Based Facial Part Consistency Aware Method For Deepfake Video Detection
10 pages
TSP_IASC_29653
No ratings yet
TSP_IASC_29653
19 pages
806_combining Computer Vision and Intraframe Noise Methods to Detect a Deepfake
No ratings yet
806_combining Computer Vision and Intraframe Noise Methods to Detect a Deepfake
8 pages
Deepfake Detection For Human Face Images and Videos: A Survey
No ratings yet
Deepfake Detection For Human Face Images and Videos: A Survey
19 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Visual Sensor Network: Exploring the Power of Visual Sensor Networks in Computer Vision
From Everand
Visual Sensor Network: Exploring the Power of Visual Sensor Networks in Computer Vision
Fouad Sabry
No ratings yet
AI for Everyone: An Intermediate Guide to Artificial Intelligence
From Everand
AI for Everyone: An Intermediate Guide to Artificial Intelligence
Nova Clarke
No ratings yet
Buy ebook Supply Chain Analytics: Concepts, Techniques and Applications 1st Edition Kurt Y. Liu cheap price
100% (1)
Buy ebook Supply Chain Analytics: Concepts, Techniques and Applications 1st Edition Kurt Y. Liu cheap price
47 pages
ML LAB Viva Questions with Answers
No ratings yet
ML LAB Viva Questions with Answers
10 pages
Information Sciences: M. Zarinbal, M.H. Fazel Zarandi, I.B. Turksen
No ratings yet
Information Sciences: M. Zarinbal, M.H. Fazel Zarandi, I.B. Turksen
24 pages
A User-Centric Machine Learning
No ratings yet
A User-Centric Machine Learning
11 pages
All Question Sets and Mid Term Questions
No ratings yet
All Question Sets and Mid Term Questions
59 pages
AI Documents
No ratings yet
AI Documents
25 pages
01 - Introduction To Datamining
No ratings yet
01 - Introduction To Datamining
19 pages
Devoir 1
No ratings yet
Devoir 1
6 pages
1 s2.0 S0747563221003861 Main
No ratings yet
1 s2.0 S0747563221003861 Main
10 pages
GR No-01-Project-Report
No ratings yet
GR No-01-Project-Report
51 pages
PA
No ratings yet
PA
8 pages
Unit 4
No ratings yet
Unit 4
23 pages
Experiential Study of Kernel Functions To Design An Optimized Multi-Class SVM
No ratings yet
Experiential Study of Kernel Functions To Design An Optimized Multi-Class SVM
6 pages
Prediction of Software Effort Using Artificial NeuralNetwork and Support Vector Machine
No ratings yet
Prediction of Software Effort Using Artificial NeuralNetwork and Support Vector Machine
7 pages
CSE 4237 SoftCom Solutions
No ratings yet
CSE 4237 SoftCom Solutions
115 pages
UNIT-2 ML notes
No ratings yet
UNIT-2 ML notes
15 pages
Using Machine Learning For Land Suitability Classification
No ratings yet
Using Machine Learning For Land Suitability Classification
12 pages
DSML Practical
No ratings yet
DSML Practical
3 pages
DS Manual
No ratings yet
DS Manual
29 pages
4 - Data Analytics Using DM and ML Algorithms - 1
No ratings yet
4 - Data Analytics Using DM and ML Algorithms - 1
71 pages
BCS602 Module 1
No ratings yet
BCS602 Module 1
35 pages
P.31 ICAIBDA Paper Halaman 179-185
No ratings yet
P.31 ICAIBDA Paper Halaman 179-185
302 pages
Agriculture Crop Recommendation System Using
No ratings yet
Agriculture Crop Recommendation System Using
57 pages
Machine Learning Approach For Malignant Melanoma Classification
No ratings yet
Machine Learning Approach For Malignant Melanoma Classification
7 pages
Tanagra
No ratings yet
Tanagra
8 pages
gradient_exploding_vanishing_problem_v2
No ratings yet
gradient_exploding_vanishing_problem_v2
3 pages
Sample Questions
No ratings yet
Sample Questions
51 pages
ML UNIT-IV Notes
100% (1)
ML UNIT-IV Notes
23 pages
Perbandingan Algoritma Naïve Bayes Dan KNN Dalam Analisis Sentimen Masyarakat Terhadap Pelaksanaan PPPK Guru
No ratings yet
Perbandingan Algoritma Naïve Bayes Dan KNN Dalam Analisis Sentimen Masyarakat Terhadap Pelaksanaan PPPK Guru
7 pages

Deepfake Detection

Uploaded by

Deepfake Detection

Uploaded by

EfficientNets for DeepFake Detection: Comparison of

Artem A. Pokroy Alexey D. Egorov

598 978-1-6654-0476-1/21/$31.00 ©2021 IEEE

Fig. 2. Video classification algorithm.

You might also like