Voice (Speaker) Recognition Using Neural Networks: Synopsis

The document describes a project to implement voice recognition using neural networks. It involves recording voice samples, extracting features using mel frequency cepstral coefficients (MFCCs), and using those features as input to a neural network. The neural network is trained using backpropagation and can perform speaker verification by comparing voice features to a stored database, or speaker identification by determining which stored voice a sample matches most closely.

Uploaded by

JyotiiBubnaRungta


SYNOPSIS

VOICE (SPEAKER) RECOGNITION
USING NEURAL NETWORKS
AIM: To implement neural networks and use them for voice recognition
(identifying a speaker from his/her voice sample).

DESCRIPTION:

Voice, or speaker, recognition is a biometric modality that uses an individual’s voice for
recognition purposes. (It is a different technology from “speech recognition”, which
recognizes words as they are articulated and is not a biometric.) The speaker
recognition process relies on features influenced by both the physical structure of an
individual’s vocal tract and the behavioral characteristics of the individual.

The speech signal conveys information about the identity of the speaker. Speaker
recognition is concerned with extracting that identity from an utterance: two voice
samples are compared to test whether they come from the same speaker or from
different speakers. The decision is made by the neural network.

PARADIGMS (Features) OF VOICE RECOGNITION:

1. Speaker verification - verifies that a speaker is who he/she claims to be.
The system prompts the user who claims to be the speaker to provide an ID. It verifies
the user by comparing the codebook of the given speech utterance with the codebook
stored for the claimed identity. If the match meets the set threshold, the identity
claim is accepted; otherwise it is rejected.
2. Speaker identification - detects a particular speaker from a known population.
The system prompts the user to provide a speech utterance. It identifies the user by
comparing the codebook of the utterance with those stored in the database and lists
the speakers most likely to have given that utterance.
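
The two paradigms differ mainly in the final decision rule, which can be sketched as
follows. This is a minimal Python illustration and not part of the original synopsis:
the function names verify and identify, the speaker names, the scores, and the 0.8
threshold are all hypothetical, and the match scores are assumed to have been computed
already (for example by the neural network).

```python
def verify(score, threshold):
    """Speaker verification: accept the identity claim if the score meets the threshold."""
    return score >= threshold


def identify(scores, top_n=1):
    """Speaker identification: rank the enrolled speakers by match score, best first."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [name for name, _ in ranked[:top_n]]


# Hypothetical match scores between one utterance and each enrolled speaker.
scores = {"alice": 0.91, "bob": 0.34, "carol": 0.58}

# Verification: the user claims to be "alice", so only that score is checked.
claim_accepted = verify(scores["alice"], threshold=0.8)

# Identification: no claim is made, so every enrolled speaker is ranked.
best_two = identify(scores, top_n=2)
```

Verification is a one-to-one comparison against a threshold, while identification is a
one-to-many ranking over the whole database.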

In our project, we aim to study the speaker-specific information contained in sound
waves by feeding that information to the inputs of a neural network, which identifies
patterns (pattern matching) in it. The network thereby uses Artificial Intelligence (AI)
to spot differences in sound waves and either decides whether the speaker is really who
he/she claims to be (verification) or determines who the speaker is by comparing
his/her sound waves with those stored in a database (identification).

IMPLEMENTATION DETAILS:

Steps:

1. Record Speech

• This involves recording the person’s voice

2. Filtering

• It involves removing unwanted information, such as silence frames and disturbances,
from the voice samples.

3. Feature extraction

• It is a process of studying and deriving useful information from the filtered input
patterns.

4. Decision

• The mel frequency cepstral coefficients (MFCCs) obtained during feature extraction
would be applied as input to the neural network. The neural network used would be a
multilayer feed-forward neural network.

The neural network would be self-learning: it would identify the common patterns
between samples of the same voice and either return whether the person is really who
he/she claims to be or identify the person by comparing the voice with the database.
The accuracy of the neural network is expected to improve with the amount of
training it receives.
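
The synopsis does not say how the filtering step is carried out. One simple possibility,
shown here only as an assumption, is to cut the recording into frames and drop the
frames whose energy is far below that of the loudest frame. The function names, the
frame sizes, and the 0.1 energy ratio are hypothetical choices for this sketch.

```python
def split_frames(samples, frame_len, hop):
    """Cut a list of samples into frames of frame_len samples, hop samples apart."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def frame_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def drop_silence(frames, ratio=0.1):
    """Keep frames whose energy is at least `ratio` times the loudest frame's energy."""
    if not frames:
        return []
    energies = [frame_energy(f) for f in frames]
    cutoff = ratio * max(energies)
    return [f for f, e in zip(frames, energies) if e >= cutoff]


# Toy signal (assumed data): 8 near-silent samples followed by 8 louder ones.
signal = [0.01, -0.01] * 4 + [0.5, -0.5] * 4
frames = split_frames(signal, frame_len=4, hop=4)
voiced = drop_silence(frames)  # only the louder frames remain
```

A relative cutoff is used so that the sketch does not depend on the recording level;
a real system would likely need a more robust voice activity detector.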

Calculating MFCCs

1. Take the Fourier transform of (a windowed excerpt of) a signal (cleaned for
silence frames and disturbances using some filters).

2. Map the powers of the spectrum obtained above to the mel scale.

3. Take the logs of the powers at each of the mel frequencies.

4. Take the discrete cosine transform of the list of mel log powers, as if it were a
signal.

5. The MFCCs are the amplitudes of the resulting spectrum.
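
The five steps above can be sketched for a single frame in plain Python. This is an
illustrative sketch under stated assumptions, not the project's implementation: the
Hamming window, the 2595/700 mel formula, the triangular filters, the filter and
coefficient counts, and all function names are choices made here, and a naive DFT is
used instead of an FFT to keep the example dependency-free.

```python
import cmath
import math


def hz_to_mel(f):
    """One common Hz-to-mel formula (an assumption; other variants exist)."""
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def power_spectrum(frame):
    """Step 1: Hamming-window the frame and take the squared DFT magnitudes."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(windowed))) ** 2
            for k in range(n // 2 + 1)]


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose centres are evenly spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2.0)
    mel_points = [top * i / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * mel_to_hz(m) / sample_rate))
            for m in mel_points]
    bank = []
    for j in range(1, n_filters + 1):
        left, centre, right = bins[j - 1], bins[j], bins[j + 1]
        filt = [0.0] * (n_fft // 2 + 1)
        for k in range(left, centre):      # rising edge of the triangle
            filt[k] = (k - left) / (centre - left)
        for k in range(centre, right):     # falling edge of the triangle
            filt[k] = (right - k) / (right - centre)
        bank.append(filt)
    return bank


def mfcc(frame, sample_rate, n_filters=10, n_coeffs=5):
    power = power_spectrum(frame)                                      # step 1
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    mel_power = [sum(w * p for w, p in zip(f, power)) for f in bank]   # step 2
    log_mel = [math.log(e + 1e-10) for e in mel_power]                 # step 3
    # Step 4: unnormalised DCT-II of the log mel powers.
    # Step 5: the first n_coeffs outputs are kept as the MFCCs.
    return [sum(v * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, v in enumerate(log_mel))
            for c in range(n_coeffs)]


# Toy input (assumed): one 64-sample frame of a 440 Hz tone sampled at 8 kHz.
sample_rate = 8000
frame = [math.sin(2 * math.pi * 440 * t / sample_rate) for t in range(64)]
coeffs = mfcc(frame, sample_rate)
```

The small constant added before the logarithm only guards against taking the log of
zero for an empty filter; it is not part of the definition of MFCCs.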

Neural network to be used: Multi-layer feed-forward network.
Training algorithm to be used: Back-propagation algorithm.
Activation function to be used: Squashing (sigmoid) function 1/(1 + e^(-x)).
Coefficients to be used: MFCCs (mel frequency cepstral coefficients).

Language to be used: MATLAB, C++, or Java (whichever is more comfortable and faster
for you).
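
To make the combination of a multi-layer feed-forward network, back-propagation, and
the sigmoid squashing function concrete, here is a minimal sketch. It is written in
Python for brevity rather than in one of the languages listed above, and it is not the
project's implementation: the class name TinyMLP, the layer sizes, the learning rate,
the squared-error loss, and the two-dimensional toy vectors standing in for MFCCs are
all assumptions made for illustration.

```python
import math
import random


def sigmoid(x):
    """Squashing function 1 / (1 + e^(-x))."""
    return 1.0 / (1.0 + math.exp(-x))


class TinyMLP:
    """One hidden layer and one sigmoid output, trained by online back-propagation."""

    def __init__(self, n_in, n_hidden, seed=0):
        rng = random.Random(seed)
        # Each row holds the weights of one hidden unit plus a trailing bias.
        self.w_hidden = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
                         for _ in range(n_hidden)]
        self.w_out = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]

    def forward(self, x):
        hidden = [sigmoid(sum(w * v for w, v in zip(row, x)) + row[-1])
                  for row in self.w_hidden]
        out = sigmoid(sum(w * h for w, h in zip(self.w_out, hidden))
                      + self.w_out[-1])
        return hidden, out

    def train_step(self, x, target, lr):
        hidden, out = self.forward(x)
        # Output error term; the sigmoid derivative is out * (1 - out).
        delta_out = (out - target) * out * (1.0 - out)
        # Propagate the error back through the (not yet updated) output weights.
        delta_hidden = [delta_out * self.w_out[j] * h * (1.0 - h)
                        for j, h in enumerate(hidden)]
        for j, h in enumerate(hidden):
            self.w_out[j] -= lr * delta_out * h
        self.w_out[-1] -= lr * delta_out
        for j, row in enumerate(self.w_hidden):
            for i, v in enumerate(x):
                row[i] -= lr * delta_hidden[j] * v
            row[-1] -= lr * delta_hidden[j]
        return 0.5 * (out - target) ** 2


# Hypothetical feature vectors: target 1.0 = speaker A, target 0.0 = speaker B.
data = [([0.9, 0.8], 1.0), ([0.8, 0.9], 1.0),
        ([0.1, 0.2], 0.0), ([0.2, 0.1], 0.0)]

net = TinyMLP(n_in=2, n_hidden=3)
first_loss = sum(net.train_step(x, t, lr=0.5) for x, t in data)
for _ in range(2000):
    last_loss = sum(net.train_step(x, t, lr=0.5) for x, t in data)

score_a = net.forward([0.9, 0.8])[1]  # output for a speaker-A vector
score_b = net.forward([0.1, 0.2])[1]  # output for a speaker-B vector
```

In a real system the inputs would be MFCC vectors and the output score would feed the
verification threshold or the identification ranking.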
