Deep Audio Classification
Deep Audio Classification
SESSION NO: 1
DATE: 27/08/2022
REVIEW NO: ZERO
BATCH: 15
DEEP AUDIO
CLASSIFICATION
Using Tensorflow and Convolutional
neural networks
TEAM MEMBERS
• SHREE VARSHAN B
• YASHWANT B
• PUNEETH RAM P
• VINAY NARAYAN
PROBLEM STATEMENT
• This is one of the challenges proposed by HP unlocked and it's about deep
audio analysis.
• The Challenge is to build a Machine Learning model and code to count the
number of Capuchin bird calls within a given clip.
INTRODUCTION
• The dataset comprises of bird calls in the form of audio clips out of which
the density of Capuchin bird calls has to be measured using Convolutional
neural networks and tensorflow.
• Python is used to build the tensorflow CNN model and to identify whether
the clip contains Capuchin bird call.
SOLUTION
• The training data comprises of audio clips of bird calls of both capuchin
bird calls and non-capuchin bird calls which is passed through the Cnn
model after being converted to spectrogram form from the provided audio
waveforms.
• Then the trained Tensorflow neural network is slid across each of these
windows to find the capuchin bird calls.
• Finally the consecutive capuchin bird calls are grouped together as one as
the belong to the same bird and the density of capuchin bird call in the
particular recording is calculated.
ARCHITECTURE DIAGRAM
Binary
output
Output
0 1 1 0 1 1