The document outlines the implementation of a CNN-BiLSTM model for detecting deep fake audio, emphasizing its importance for security and media integrity. It utilizes the ASVspoof 2019 dataset, which includes over 121,000 audio samples of both genuine and spoofed speech, and details the preprocessing steps and model architecture. The approach combines CNNs for feature extraction and BiLSTM for capturing temporal dependencies, ultimately classifying audio as real or fake.
CNN-BILSTM IMPLEMENTATION FOR DEEP FAKE AUDIO DETECTION
By Koukuntla Pranav, MT24MCS022

INTRODUCTION
Detecting deep fake audio using a CNN-BiLSTM model. Deep fake audio detection is crucial for security, media integrity, and fraud prevention. The approach is a hybrid CNN-BiLSTM model: CNN layers for feature extraction and BiLSTM layers for sequence modeling.

DATASET
ASVspoof 2019 Dataset
- Contains both genuine and spoofed (fake) speech samples.
- Two classes: Real (genuine) and Fake (spoofed).
- File format: WAV; clip durations vary, up to 7 seconds.
- Three subsets: training (~25,380 clips), development (~24,844 clips), and evaluation (~71,311 clips), for over 121,000 audio samples in total.
- Spoofing techniques include TTS (text-to-speech), VC (voice conversion), and replay attacks.
- Metadata includes speaker ID and attack type.

PREPROCESSING
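Before feature extraction, each clip's label and attack type come from the dataset's protocol files. A minimal parsing sketch, assuming the five-column whitespace-separated format of the ASVspoof 2019 LA protocol files (speaker ID, utterance ID, two system/attack fields, and a bonafide/spoof key):

```python
def parse_protocol_line(line):
    """Parse one protocol entry into a labeled record.

    Assumed format: 'SPEAKER UTTERANCE - ATTACK_ID KEY', where '-' marks
    an empty field and KEY is 'bonafide' or 'spoof'.
    """
    speaker, utt, _, attack, key = line.split()
    return {
        "speaker": speaker,
        "utt": utt,
        "attack": None if attack == "-" else attack,  # e.g. A01..A19 for spoofed clips
        "label": 0 if key == "bonafide" else 1,       # 0 = real, 1 = fake
    }

rec = parse_protocol_line("LA_0079 LA_T_1138215 - A01 spoof")
print(rec)  # {'speaker': 'LA_0079', 'utt': 'LA_T_1138215', 'attack': 'A01', 'label': 1}
```

The numeric label (0/1) feeds directly into the binary classification target used later with BCE loss.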
- Using Librosa to load audio files.
- Standardizing all audio to a 16 kHz sample rate.
- Applying noise-reduction filters to clean the audio.
- Mel-spectrogram extraction: converting raw waveforms into spectrograms for CNN processing.
- Scaling data between 0 and 1 for stable training.

PREPROCESSING
The Mel-spectrogram mimics how humans perceive sound frequencies. The Mel-spectrogram is converted to a (Channels, Time, Frequency) tensor. Visualization: real vs. fake spectrograms show clear differences in frequency patterns.

CNN-BILSTM MODEL ARCHITECTURE
- Convolutional Neural Networks (CNNs) extract spatial features from spectrograms.
- Bidirectional LSTM (BiLSTM) captures temporal dependencies in speech.
- Fully connected layers with sigmoid activation perform binary classification (real vs. fake).

IMPLEMENTATION
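One way to realize this CNN-to-BiLSTM-to-sigmoid pipeline is the PyTorch sketch below. The layer sizes (16/32 conv channels, hidden size 128) are illustrative assumptions, not the configuration actually used:

```python
import torch
import torch.nn as nn

class CNNBiLSTM(nn.Module):
    """CNN feature extractor -> BiLSTM -> fully connected + sigmoid."""

    def __init__(self, n_mels=64, hidden=128):
        super().__init__()
        # CNN: extracts local time-frequency features from the spectrogram
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        # BiLSTM: models temporal dependencies across the time axis
        self.lstm = nn.LSTM(input_size=32 * (n_mels // 4), hidden_size=hidden,
                            batch_first=True, bidirectional=True)
        # FC + sigmoid: binary real/fake score
        self.fc = nn.Linear(2 * hidden, 1)

    def forward(self, x):                     # x: (batch, 1, n_mels, time)
        f = self.cnn(x)                       # (batch, 32, n_mels/4, time/4)
        f = f.permute(0, 3, 1, 2).flatten(2)  # (batch, time/4, features)
        out, _ = self.lstm(f)
        return torch.sigmoid(self.fc(out[:, -1]))  # score from last timestep

model = CNNBiLSTM()
score = model(torch.randn(2, 1, 64, 200))  # two spectrograms, 200 time frames
print(score.shape)  # torch.Size([2, 1])
```

Training would pair these sigmoid outputs with `nn.BCELoss()` against the 0/1 labels, matching the BCE loss described in the implementation steps.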
- Apply CNN layers for feature extraction.
- Pass extracted features to BiLSTM layers.
- Classify using fully connected layers with a sigmoid activation.
- Use Binary Cross Entropy (BCE) loss for training.

RESULT

REFERENCES
Todisco, M., et al. (2019). ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. Interspeech 2019. Link
Wani, T. M., Qadri, S. A. A., Comminiello, D., & Amerini, I. (2024). Detecting Audio Deepfakes: Integrating CNN and BiLSTM with Multi-Feature Concatenation. Proceedings of the 2024 ACM Workshop on Information Hiding and Multimedia Security, 271–276. Link

THANK YOU