
Project Report: Deep Fake Detection

Problem Statement and Background:


Deepfakes can distort our perception of the truth, so we need strategies to improve their
detection. Deepfakes are increasingly detrimental to privacy, social security, and democracy,
and our goal is to achieve better accuracy in classifying real and fake videos.
For instance, a recent video on social media appeared to show a high-ranking U.S. legislator
declaring support for an enormous tax increase. People might react accordingly, because the
video looks and sounds exactly like the real person. In this way, deepfake content can be
used to manipulate people's opinions. Deepfake detection therefore plays a prominent role in
identifying fake content on social media and other forms of media.

Relevant work:
● Blink detection network using CNN and LSTM - https://arxiv.org/pdf/1806.02877.pdf
● Recurrent Convolutional Strategies for Face Manipulation Detection in Videos -
https://arxiv.org/pdf/1905.00582.pdf
● Deep Learning Based Computer Generated Face Identification Using Convolutional
Neural Network (CGFace) - https://www.mdpi.com/2076-3417/8/12/2610/htm
● MesoNet: a Compact Facial Video Forgery Detection Network -
https://hal-upec-upem.archives-ouvertes.fr/hal-01867298/document

Methods:
Dataset:
We detect fake videos using the Kaggle "Deepfake Detection Challenge" dataset.
Dataset: https://www.kaggle.com/c/deepfake-detection-challenge/data
The full dataset contains 470 GB of video files (training and testing) and a metadata file for
each video. We use 100 videos with ground truth, split into 70% training and 20% test, and
evaluate the models on this subset. We aim to build a model that generalizes well.

Columns in metadata file:


filename - the filename of the video.
label - whether the video is real or fake.
original - in the case that a train set video is fake, the original video is listed here.
split - this is always equal to "train".
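
As a minimal sketch, this metadata can be loaded with pandas, assuming the metadata.json
layout shipped with the Kaggle sample data (the path below is illustrative):

    import pandas as pd

    # metadata.json maps each video filename to its label, split, and
    # (for fakes) the original video. The path is an assumption.
    meta = pd.read_json("train_sample_videos/metadata.json").T
    print(meta["label"].value_counts())          # REAL vs. FAKE counts
    fakes = meta[meta["label"] == "FAKE"].index  # filenames of fake videos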

Preprocessing:
Videos-to-frames conversion - We captured frames from each video using the
cv2.VideoCapture class of the OpenCV library.
Individual video length (8 seconds) → 300 frames
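
A minimal sketch of this step, assuming OpenCV is installed (function and path names are
illustrative):

    import cv2

    def extract_frames(video_path):
        """Read all frames from a video via cv2.VideoCapture."""
        cap = cv2.VideoCapture(video_path)
        frames = []
        while True:
            ok, frame = cap.read()   # ok is False once the video ends
            if not ok:
                break
            frames.append(frame)     # frame is a BGR numpy array
        cap.release()
        return frames

    frames = extract_frames("sample.mp4")  # ~300 frames per video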

● Frames to faces - We explored dlib and Facenet to detect faces in frames and saved
the face images, resized to 86*86, in both RGB and grayscale formats (see the sketch
after this list). We expect faces to carry the most important features for distinguishing
fake and real images.
● To leverage discrepancies across frames, we saved each video's frame images
sequentially inside a single folder throughout the preprocessing pipeline, so that we
could use them for an LSTM if needed. For CNNs and GANs the ordering doesn't
really matter.
● We resized images to different sizes - 256*256 with the entire frame, and 128*128,
64*64, and 86*86 with the face only - and trained on each to pick the best
configuration.
● We also explored training on RGB versus grayscale images. The GAN generated
high-quality grayscale images in fewer epochs, which is intuitive: RGB takes longer
to learn complex features since the input dimensionality triples. We nevertheless
trained the models on RGB, as it captures more information and is more applicable
in practice.
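
A hedged sketch of the frames-to-faces step, using dlib's default HOG frontal-face
detector (we also tried Facenet; the helper name and crop logic are illustrative):

    import cv2
    import dlib

    detector = dlib.get_frontal_face_detector()

    def crop_face(frame, size=86):
        """Detect the first face in a BGR frame and return it resized."""
        rects = detector(cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY))
        if not rects:
            return None
        r = rects[0]
        # Clamp the detected box to the frame bounds before cropping.
        top, bottom = max(r.top(), 0), min(r.bottom(), frame.shape[0])
        left, right = max(r.left(), 0), min(r.right(), frame.shape[1])
        return cv2.resize(frame[top:bottom, left:right], (size, size))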

Baseline model:

We use a single neuron with a sigmoid activation function as the baseline model to classify
images.
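
In Keras terms this baseline amounts to logistic regression on raw pixels; a minimal sketch,
assuming 86*86 RGB face crops as input:

    import tensorflow as tf

    # One sigmoid neuron over the flattened image.
    baseline = tf.keras.Sequential([
        tf.keras.layers.Flatten(input_shape=(86, 86, 3)),
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    baseline.compile(optimizer="adam",
                     loss="binary_crossentropy",
                     metrics=["accuracy"])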

CGFace model

CGFace is a CNN built for the computer-generated face detection task; by customizing the
number of convolutional layers, it performs well at detecting computer-generated face
images. In addition, an imbalanced framework (IF-CGFace) is created by altering CGFace's
layer structure to cope with imbalanced data: features are extracted from CGFace's layers
and used to train AdaBoost.

Batch normalization: One batch normalization layer was added before the fully connected
layers. The goal was to improve optimization by introducing some noise into the network,
which regularizes the model alongside the dropout layers.

Optimization algorithm: Adam; learning rate: 0.001; batch size: 32; epochs: 50.

Model building:
We modified the above architecture to accept 84*84 RGB input images. We used 32 kernels
instead of 5 in the first two convolution layers to learn denser features early and then encode
them along the way. Increasing the number of kernels slowed the optimization process but
greatly increased accuracy.
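
A sketch of the modified classifier; the 84*84 RGB input, the 32 kernels in the first two
convolution layers, the batch normalization before the dense layers, and the Adam settings
come from the text above, while the remaining depth and kernel sizes are illustrative
assumptions:

    import tensorflow as tf
    from tensorflow.keras import layers

    model = tf.keras.Sequential([
        layers.Conv2D(32, 3, activation="relu", input_shape=(84, 84, 3)),
        layers.MaxPooling2D(),
        layers.Conv2D(32, 3, activation="relu"),
        layers.MaxPooling2D(),
        layers.Conv2D(64, 3, activation="relu"),
        layers.MaxPooling2D(),
        layers.BatchNormalization(),  # noise/regularization before dense layers
        layers.Flatten(),
        layers.Dropout(0.5),
        layers.Dense(128, activation="relu"),
        layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(1e-3),
                  loss="binary_crossentropy", metrics=["accuracy"])
    # model.fit(x_train, y_train, batch_size=32, epochs=50)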

DCGAN model

Methods for Deep Convolutional GANs

● Replace any pooling layers with strided convolutions (discriminator) and fractional-
strided convolutions (generator).
● Use batchnorm in both the generator and the discriminator.
● Remove fully connected hidden layers for deeper architectures.
● Use ReLU activation in the generator for all layers except for the output, which uses
Tanh.
● Use LeakyReLU activation in the discriminator for all layers.

We modified the architecture with different kernel sizes and numbers of kernels to process
our 84*84*3 face images.
Generator: We fed in a 100-dimensional noise vector, following other papers that have
successfully implemented GANs and their variants. For the number of kernels and the filter
sizes, we worked backward from the last layer (the 84*84 image), tried many parameter
settings, and ran a few hundred epochs for each to find the better ones. Tuning
hyperparameters for the GAN was challenging because each run took a long time.

Discriminator: We built it to accept the 84*84*3 image with two convolution layers and a
fully connected layer, activated by LeakyReLU, to make predictions.
(Figure: generator and discriminator architectures)
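
A hedged Keras sketch of the two networks; the 100-d noise input, the 84*84*3 output,
ReLU/tanh in the generator, and the two LeakyReLU convolution layers in the discriminator
follow the text, while kernel counts and sizes are illustrative:

    import tensorflow as tf
    from tensorflow.keras import layers

    # Generator: 100-d noise -> 84x84x3 image (84 = 21 * 2 * 2).
    generator = tf.keras.Sequential([
        layers.Dense(21 * 21 * 128, input_shape=(100,)),
        layers.BatchNormalization(),
        layers.ReLU(),
        layers.Reshape((21, 21, 128)),
        layers.Conv2DTranspose(64, 5, strides=2, padding="same"),   # 42x42
        layers.BatchNormalization(),
        layers.ReLU(),
        layers.Conv2DTranspose(3, 5, strides=2, padding="same",
                               activation="tanh"),                  # 84x84x3
    ])

    # Discriminator: two strided convolutions, then a dense head.
    discriminator = tf.keras.Sequential([
        layers.Conv2D(64, 5, strides=2, padding="same",
                      input_shape=(84, 84, 3)),
        layers.LeakyReLU(0.2),
        layers.Conv2D(128, 5, strides=2, padding="same"),
        layers.LeakyReLU(0.2),
        layers.Flatten(),
        layers.Dense(1, activation="sigmoid"),
    ])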

MAML-based CNN classifier:

We referred to the MAML paper listed in the references and implemented a CNN classifier
using a smaller dataset (300-shot, 2-way).
We computed losses and gradients and updated the weights as specified in the algorithm,
using TensorFlow.
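
A simplified sketch of one meta-update, using the first-order approximation of MAML (the
full algorithm differentiates through the inner step); the function name, the single inner
step, and the learning rate are assumptions:

    import tensorflow as tf

    loss_fn = tf.keras.losses.BinaryCrossentropy()

    def maml_meta_step(model, meta_opt, tasks, inner_lr=0.01):
        """tasks: list of ((x_support, y_support), (x_query, y_query))."""
        theta = [v.numpy() for v in model.trainable_variables]
        meta_grads = [tf.zeros_like(v) for v in model.trainable_variables]
        for (x_s, y_s), (x_q, y_q) in tasks:
            # Inner loop: one SGD step on the support set -> fast weights.
            with tf.GradientTape() as tape:
                loss_s = loss_fn(y_s, model(x_s, training=True))
            for v, g in zip(model.trainable_variables,
                            tape.gradient(loss_s, model.trainable_variables)):
                v.assign_sub(inner_lr * g)
            # Outer loss: evaluate the adapted weights on the query set.
            with tf.GradientTape() as tape:
                loss_q = loss_fn(y_q, model(x_q, training=True))
            meta_grads = [m + g for m, g in
                          zip(meta_grads,
                              tape.gradient(loss_q, model.trainable_variables))]
            # Restore the meta-weights before the next task.
            for v, w in zip(model.trainable_variables, theta):
                v.assign(w)
        meta_opt.apply_gradients(zip(meta_grads, model.trainable_variables))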
Meta-training tasks: As our task involves facial features, we added gender classification,
emotion recognition, and human-vs-horse classification as meta-training tasks, but we saw
no improvement in the accuracy and loss patterns.
We thought this could be due to the small number of sample tasks, so we tried using the
entire ImageNet task set for meta-training. However, the RAM usage was very high and the
available compute was not enough; because of this, the Google VM instance got terminated
and we lost the GAN results. So, we decided to give up on the MAML-based CNN classifier
and leave it as a possible future project for the summer.
Results:
Our primary goal was to achieve better accuracy on the real-vs-fake prediction task. We
initially planned to use MAML, so that the model would generalize well to unseen samples
and could be trained online with few samples. As that did not work, we chose convolution-
based classifiers and a GAN as our primary solutions.
Baseline solution vs. primary solution: The baseline solution does not encode the pixels or
learn image features; the primary solution does, using convolution layers. So, the primary
solution has a better chance of generalizing to new, unseen samples.
CGFace - Took 70 to 80 seconds per epoch on a CPU machine (i7, 8 GB RAM). We ran it
for a total of 50 epochs.
GAN - Took about 15 to 20 seconds per epoch on a Google deep learning VM instance
(13 GB RAM, 2 vCPUs, 1x NVIDIA Tesla K80). We ran it for a total of 1500 epochs.
Visualization:
(Figure: base model - accuracy and loss vs. epoch)
(Figure: CGFace - accuracy and loss vs. epoch)
(Figure: DCGAN - generated images at the 1000th epoch)

We understand that equilibrium between the discriminator and the generator has not been
reached yet, and it might take many more epochs. We lost the model weights from 2500+
epochs partway through, so we had to restart and then cut the training short in the time we
had. But we believe important facial features are being learnt and the training process is on
the right track. If the model learns colour and the other complex shape features, we hope it
will be able to predict real and fake images with reasonable accuracy. Currently, the
DCGAN predicts all images as real.

Model            Training Accuracy (%)    Testing Accuracy (%)
Baseline model   82.022                   62.9333
CGFace Model     94.822                   68.2777
GAN Model        NA                       50

Tools:
Python - Programming language
dlib, Facenet, MTCNN - Face detection
OpenCV (cv2) - Image and video processing
TensorFlow - Deep learning library
Keras - Deep learning library
Machine configuration for training: Google deep learning VM instance with 13 GB RAM,
500 GB storage, 2 vCPUs, and 1x NVIDIA Tesla K80.

Lessons learned:
A CNN classifier learns better than a naive classifier or other fully connected networks
because of its ability to learn different image features with different kernel settings. It also
reduces the dimensionality greatly in a meaningful way, which speeds up the optimization
process.
We hoped that meta-learning would require less parameter tuning and that simple models
would perform well. That assumption turned out to be wrong: for meta-training to go well,
we might have to tune the parameters carefully, possibly even dynamically, to achieve the
best performance. We learnt that the MAML++ approach overcomes this limitation to a
certain extent.
For GANs to perform well with high-dimensional images, and to train directly on videos
using Conv3D layers, a lot of computing resources are needed; for videos, a single epoch
could take up to days. It is always best practice to save weights and create checkpoints
during training - we learned this the hard way when we lost our GAN weights on the last
day!
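
A minimal sketch of such checkpointing for the GAN, assuming the generator and
discriminator objects from the DCGAN sketch above (directory and interval are
illustrative):

    import tensorflow as tf

    # Keep the last few snapshots so a preempted VM costs minutes, not days.
    ckpt = tf.train.Checkpoint(generator=generator,
                               discriminator=discriminator)
    manager = tf.train.CheckpointManager(ckpt, "./gan_ckpts", max_to_keep=3)

    for epoch in range(1500):
        # ... one epoch of GAN training ...
        if epoch % 50 == 0:
            manager.save(checkpoint_number=epoch)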

Team Contribution:
We coordinated and shared the tasks equally from day 1, when we started exploring, so we
would say each of us contributed equal effort. We had team meetings on a daily basis once
the class went remote, and held working sessions.
Aditi: Main focus on the MAML implementation. Also worked on the other two models,
preprocessing, and visualization.
Selva: Main focus on the GAN. Also worked on the other two models, preprocessing, and
visualization.
Swayanshu: Main focus on CGFace. Also worked on the other two models, preprocessing,
and visualization.

References:

1. CGFace - https://www.mdpi.com/2076-3417/8/12/2610/htm
2. DCGAN - https://arxiv.org/pdf/1511.06434.pdf
3. MAML - https://arxiv.org/pdf/1703.03400.pdf
4. Blink detection network using CNN and LSTM - https://arxiv.org/pdf/1806.02877.pdf
5. Recurrent Convolutional Strategies for Face Manipulation Detection in Videos -
https://arxiv.org/pdf/1905.00582.pdf
6. Deep Learning Based Computer Generated Face Identification Using Convolutional
Neural Network (CGFace) - https://www.mdpi.com/2076-3417/8/12/2610/htm
7. MesoNet: a Compact Facial Video Forgery Detection Network -
https://hal-upec-upem.archives-ouvertes.fr/hal-01867298/document
