Gender Detection by Voice Using Deep Learning
Abstract:- The recognition of gender from voice is an important part of identifying particular voices. To distinguish gender from sound signals, sound-processing techniques define the gender-relevant (male or female) features of these sound signals. In this study, we used various models to improve accuracy, one of which is deep learning with a DNN-based voice gender method. Noise reduction uses feature extraction with the Mel Frequency Cepstral Coefficients (MFCC), and the sound classification then uses an SVM, with a separation ratio of 80% training data to 20% testing data. The results showed that using a DNN for voice recognition performed better, and pairing it with the SVM algorithm obtained an accuracy of 0.97.

Keywords:- Voice Recognition, Deep Neural Network, Deep Learning, MFCC, SVM.

I. INTRODUCTION
Voice recognition is an important research area that is currently used in a wide variety of applications, such as security systems, authentication, and so on. Voice recognition must have high performance; one way to improve speech recognition performance is to add a gender classification procedure. With this gender classification, the problem space in speech recognition can be limited to a predetermined gender[1].
Voice data are divided into training data and testing data, with gender classified into two categories, namely male and female. Male and female voices have distinct characteristics due to different resonances in the throat[2]. By processing the sound signals, these characteristics are obtained in a form that a computer can recognize, and with these characteristics the computer can identify gender through sound signals. Therefore, we need a learning algorithm that can help humans detect voices based on gender. A deep learning algorithm can support this detection because it can predict more accurately and quickly for speech recognition.
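As a concrete illustration of this division, the sketch below performs the 80/20 split stated in the abstract using scikit-learn; the feature matrix is a random placeholder standing in for extracted voice features, sized to the 350 speakers (190 men, 160 women) of the dataset described later.

import numpy as np
from sklearn.model_selection import train_test_split

# Placeholder features: one row per recording; 13 columns stand in for
# e.g. averaged MFCC coefficients. Labels: 0 = male, 1 = female.
X = np.random.rand(350, 13)
y = np.array([0] * 190 + [1] * 160)

# 80% training data, 20% testing data, stratified to keep the male/female
# proportions the same in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.20, stratify=y, random_state=0)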
Deep Learning provides excellent models for detection tasks such as image recognition, emotion recognition, and speech recognition. The deep learning model commonly used for speech recognition is the deep neural network (DNN); therefore, deep learning with the DNN model will be used here to perform speech recognition. The deep learning algorithm used for this study relies on several existing feature extractions. MFCC was chosen for feature extraction because it is a fairly good feature extraction method for noise reduction and requires fast, easy, and complete processing. Meanwhile, the voices are classified by gender using a support vector machine (SVM)[3].

II. RELATED WORK

Several studies discuss detecting voices based on gender. Previous research conducted by Martin and Joensuu developed speech recognition in which the detected GMM and FFT features gave the best results at the classification level[4][5].

S. L. Yuan[6] developed a voice recognition system that detected gender based on voice using deep learning with the Deep Neural Network (DNN) algorithm, resulting in a Word Error Rate (WER) in speech recognition that showed less than optimal results.

Also, Lee and Kwak[7] used a DNN and two classifiers to detect voices based on gender: an SVM and a decision tree (DT). In their research, MFCC feature extraction was used to identify gender from voices and resulted in fairly good accuracy.

III. DEEP LEARNING

Deep learning is a method that is often used in the field of machine learning, based on the Artificial Neural Network (ANN) model. Deep learning can solve problems with large datasets, such as image recognition, text detection, speech recognition, and audio processing, because it provides techniques for learning feature extraction from the training data, especially for speech recognition. Building on the Artificial Neural Network, a method that adds hidden layers, deep learning starts with the input layer (the voice recording), which is then processed as a signal passed between interconnected nodes and ultimately through the output, from which the accuracy is obtained.

One of the deep learning algorithms is called the Deep Neural Network (DNN). The DNN is one of the developments of the Artificial Neural Network. The DNN method is capable of performing voice recognition with good results because it can determine the feature extraction in each layer.
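Since the paper describes the DNN only at this high level, the sketch below shows one plausible setup: a small fully connected network in Keras with a two-class (male/female) output, alongside the scikit-learn SVM classifier that the abstract pairs it with. The layer sizes, the 13-dimensional input, and the RBF kernel are illustrative assumptions, not the authors' reported configuration.

from sklearn.svm import SVC
from tensorflow import keras

n_features = 13  # assumed input size, e.g. 13 averaged MFCC coefficients

# Input layer (voice features), hidden layers of interconnected nodes, and a
# sigmoid output giving the probability of one gender class.
dnn = keras.Sequential([
    keras.Input(shape=(n_features,)),
    keras.layers.Dense(64, activation="relu"),
    keras.layers.Dense(32, activation="relu"),
    keras.layers.Dense(1, activation="sigmoid"),
])
dnn.compile(optimizer="adam", loss="binary_crossentropy",
            metrics=["accuracy"])

# The companion SVM classifier used for the gender decision.
svm = SVC(kernel="rbf")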
B. Dataset
This study uses a dataset[8] of male and female voices. The data consist of 350 speakers in total, 190 men and 160 women, who were involved in the voice recordings. The corresponding audio is saved as mono, 16-bit, 32 kHz WAV files.
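A minimal sketch of reading one such recording with librosa; the directory layout and file name are hypothetical, since the paper identifies the dataset only by its reference [8].

import librosa

# The recordings are mono, 16-bit, 32 kHz WAV files, so we keep the native
# 32 kHz sampling rate rather than librosa's 22.05 kHz default.
signal, sample_rate = librosa.load("dataset/male/speaker_001.wav",
                                   sr=32000, mono=True)
print(signal.shape, sample_rate)  # 1-D float array of samples, 32000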
C. Extraction Feature
The voice recordings processed previously are then put through several feature-extraction methods, including the spectral centroid, spectral bandwidth, spectral rolloff, MFCC (Mel Frequency Cepstral Coefficients), zero-crossing rate, and chroma features. Here, however, we focus on feature extraction for speech recognition using MFCC.
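As a sketch of this feature set, the snippet below computes all six listed features per frame with librosa; the file path is the hypothetical one used above, and averaging over time into one fixed-length vector per recording is a common convention rather than something the paper specifies.

import numpy as np
import librosa

signal, sr = librosa.load("dataset/male/speaker_001.wav", sr=32000)

features = {
    "spectral_centroid":  librosa.feature.spectral_centroid(y=signal, sr=sr),
    "spectral_bandwidth": librosa.feature.spectral_bandwidth(y=signal, sr=sr),
    "spectral_rolloff":   librosa.feature.spectral_rolloff(y=signal, sr=sr),
    "mfcc":               librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=13),
    "zero_crossing_rate": librosa.feature.zero_crossing_rate(signal),
    "chroma":             librosa.feature.chroma_stft(y=signal, sr=sr),
}

# Each value is a (coefficients x frames) array; the mean over frames gives
# one fixed-length feature vector describing the whole recording.
vector = np.concatenate([f.mean(axis=1) for f in features.values()])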
1) Spectral Centroid
The spectral centroid characterizes the energy of the frequency spectrum by indicating where the center of mass of the sound is located (Fig. 2).
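As a sketch of this definition, the centroid of each frame can be computed directly as the magnitude-weighted mean frequency of the spectrum, which matches what librosa.feature.spectral_centroid computes; the file path is hypothetical.

import numpy as np
import librosa

signal, sr = librosa.load("dataset/male/speaker_001.wav", sr=32000)

S = np.abs(librosa.stft(signal))        # magnitude spectrogram (bins x frames)
freqs = librosa.fft_frequencies(sr=sr)  # center frequency of each FFT bin

# Center of mass of the spectrum in every frame: sum(f * |S|) / sum(|S|).
centroid = (freqs[:, None] * S).sum(axis=0) / S.sum(axis=0)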
3) Spectral Rolloff
The spectral rolloff represents the frequency at which the high-frequency content of the spectrum drops toward zero; it is defined here as the point in the power spectrum below which 85% of the power lies (Fig. 4).

Fig 4:- Rolloff Spectral Features
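In librosa this threshold appears as the roll_percent argument; the sketch below uses 0.85 to match the 85% figure above, with the same hypothetical file path.

import librosa

signal, sr = librosa.load("dataset/male/speaker_001.wav", sr=32000)

# Frequency per frame below which 85% of the spectral energy is contained.
rolloff = librosa.feature.spectral_rolloff(y=signal, sr=sr, roll_percent=0.85)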
4) Zero-Crossing Rate
The zero-crossing rate measures the smoothness of a signal by counting the number of zero crossings within a signal segment. A sound signal that oscillates slowly has a low rate; for example, a 100 Hz signal crosses zero about 200 times per second, twice per cycle (Fig. 5).
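A brief sketch of both views of this measure, counting the sign changes directly and computing the per-frame rate with librosa; the file path is hypothetical.

import numpy as np
import librosa

signal, sr = librosa.load("dataset/male/speaker_001.wav", sr=32000)

# A zero crossing is a sign change between adjacent samples.
total_crossings = np.sum(librosa.zero_crossings(signal))

# Fraction of sign changes within each short frame, as used in feature vectors.
zcr = librosa.feature.zero_crossing_rate(signal)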
Fig 6:- Chromatogram or Spectrogram Feature

6) MFCC
Mel Frequency Cepstral Coefficients (MFCC) is a method that is widely used in the field of speech technology, for both speech recognition and voice recognition. Each frame of the signal is first weighted with a Hamming window:
W[n] = 0.54 − 0.46 cos[2πn / (N − 1)]        (1)

The inputs to the mel filterbank of 2595 and 700 are fixed, predefined values that are widely used in the MFCC method[13][14]. The last process is the Discrete Cosine Transform (DCT), whose output is called the Mel Frequency Cepstral Coefficients (MFCC) (Fig. 8).
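A sketch tying this section together: Eq. (1) is the standard Hamming window (the same definition NumPy uses), the constants 2595 and 700 define the hertz-to-mel conversion behind the filterbank, and librosa chains the windowing, mel filterbank, logarithm, and DCT inside one MFCC call. The frame length N, the helper hz_to_mel, and the file path are illustrative assumptions.

import numpy as np
import librosa

# Eq. (1): Hamming window W[n] = 0.54 - 0.46 cos(2*pi*n / (N - 1)).
N = 400  # illustrative frame length in samples
n = np.arange(N)
window = 0.54 - 0.46 * np.cos(2 * np.pi * n / (N - 1))
assert np.allclose(window, np.hamming(N))  # NumPy's definition matches Eq. (1)

# Hertz-to-mel conversion built from the constants 2595 and 700 quoted above.
def hz_to_mel(f_hz):
    return 2595.0 * np.log10(1.0 + f_hz / 700.0)

signal, sr = librosa.load("dataset/male/speaker_001.wav", sr=32000)
# Windowing, mel filterbank, log, and the DCT are applied internally.
mfcc = librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=13)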
VI. CONCLUSION