
International Journal on Recent and Innovation Trends in Computing and Communication
Volume: 2 Issue: 8                                                                ISSN: 2321-8169
2363 - 2367

_______________________________________________________________________________________________

Study of Speaker Verification Methods


Sneha M. Powar
Electronics & Telecommunication Department
Dr. J. J. Magdum College of Engineering, Jaysingpur
Kolhapur, India
Email: [email protected]

Dr. V. V. Patil
H.O.D., Electronics Department
Dr. J. J. Magdum College of Engineering
Jaysingpur, India
Email: [email protected]

Abstract- Speaker verification is a process to accept or reject the identity claim of a speaker by comparing a set of measurements of the speaker's utterances with a reference set of measurements of the utterance of the person whose identity is claimed. In speaker verification, a person makes an identity claim. There are two main stages in this technique: feature extraction and feature matching. Feature extraction is the process in which we extract some useful data that can later be used to represent the speaker. Feature matching involves identification of the unknown speaker by comparing the features extracted from the voice with the enrolled voices of known speakers.
Keywords- speaker verification, text-dependent, text-independent

__________________________________________________*****_________________________________________________
I. INTRODUCTION

In our everyday lives there are many forms of communication, for instance body language, textual language, pictorial language and speech. However, amongst these forms, speech is generally regarded as the most powerful form of communication. From the signal processing point of view, speech can be characterized in terms of a signal carrying message information. The waveform is one possible representation of speech, and this kind of signal has been the most useful in practical applications. From the speech signal we can extract three main kinds of information: the speech text, the language, and the speaker identity. Speech recognition refers to the ability of a machine or program to recognize or identify spoken words and carry out voice commands. The spoken words are digitized into a sequence of numbers and matched against coded dictionaries so as to identify the words. Speaker recognition may be defined as the process of recognizing a person automatically using the information extracted from that person's speech signal. This technique uses the voice of the speaker to verify their identity in order to access several services, such as accessing a computer or server from a remote place, voice dialing, accessing security services, mobile banking, etc., where security is the primary concern. [6]
II. SPEAKER INFORMATION IN SPEECH SIGNAL

Speech is human beings' primary means of communication. It conveys essentially the meaning of information from a speaker to a hearer, individual information representing the speaker's identity and gender, and sometimes the emotions as well. In speech production, the properties of both the articulators, which produce the sound, and the auditory organs, which perceive the sound, are involved. Speech production can be divided into three principal components: excitation production, vocal tract articulation, and lips' and/or nostrils' radiation. Excitation powers the speech production process. It is produced by the airflow from the lungs and then carried by the trachea through the vocal folds: during inspiration air is drawn into the lungs, and during expiration the energy is spontaneously released. The trachea conveys the resulting air stream to the larynx, which acts as an energy provider serving as input to the vocal tract, and the volume of air determines the amplitude of the sound. The vocal tract works as a filter that shapes the excitation sources. The uniqueness of a speaker's voice depends not only on the physical features of the vocal tract, but also on the speaker's ability to control the muscles of the organs in the vocal tract. It is not easy for a speaker to change these physical features intentionally; however, they may change with ageing. Speaker characteristics in the speech signal are often difficult to extract. Segmenting, labeling, and
measuring specific segmental speech events that characterize
speakers, such as nasalized speech sounds, is difficult because
of variable speech behavior and variable and distorted
recording and transmission conditions. Overall qualities, such
as breathiness, are difficult to correlate with specific speech
signal measurements and are subject to variability in the same
way as segmental speech events. The most important analysis
tool is short-time spectral analysis. It is no coincidence that
short-time spectral analysis also forms the basis for most
speech recognition systems. Short-time spectral analysis not
only resolves the characteristics that differentiate one speech
sound from another, but also many of the characteristics
already mentioned that differentiate one speaker from another.
There are two principal modes of short-time spectral analysis:
filter bank analysis and linear predictive coding (LPC)
analysis. In filter bank analysis, the speech signal is passed
through a bank of band pass filters covering the available
range of frequencies associated with the signal. Typically, this
range is 200 to 3,000 Hz for telephone band speech and 50 to
8,000 Hz for wide band speech. A typical filter bank for wide
band speech contains 16 band pass filters spaced uniformly 500 Hz apart. The output of each filter is usually implemented as a windowed, short-time Fourier transform [using fast Fourier transform (FFT) techniques] at the center frequency of the filter. LPC-based spectral analysis is widely used for speech and speaker recognition. The LPC model of the speech signal specifies that a speech sample at time t, s(t), can be represented as a linear sum of the p previous samples plus an excitation term, as follows:
s(t) = a_1 s(t-1) + a_2 s(t-2) + ... + a_p s(t-p) + G u(t)
The LPC coefficients a_i are computed by solving a set of linear equations resulting from the minimization of the mean-squared error between the signal at time t and the linearly predicted estimate of the signal. Two generally used methods for solving these equations are the autocorrelation method and the covariance method. [4]
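To make the autocorrelation method concrete, the following Python sketch (an illustration, not code from the paper) estimates the predictor coefficients a_1..a_p of a single Hamming-windowed frame with the Levinson-Durbin recursion; the sampling rate, frame length, model order and test signal are arbitrary placeholder values.

```python
import numpy as np

def lpc_autocorr(frame, p=10):
    """Estimate LPC coefficients a_1..a_p of one speech frame with the
    autocorrelation method (Levinson-Durbin recursion), using the
    convention s(t) ~ a_1*s(t-1) + ... + a_p*s(t-p)."""
    w = frame * np.hamming(len(frame))                  # short-time window
    full = np.correlate(w, w, mode="full")
    r = full[len(w) - 1: len(w) + p]                    # autocorrelation r[0]..r[p]
    a = np.zeros(p + 1)
    a[0] = 1.0                                          # A(z) = 1 + a[1]z^-1 + ...
    err = r[0]
    for i in range(1, p + 1):
        prev = a[:i].copy()
        acc = r[i] + np.dot(prev[1:], r[i - 1:0:-1])
        k = -acc / err                                  # reflection coefficient
        a[1:i + 1] = np.concatenate((prev[1:], [0.0])) + k * prev[::-1]
        err *= (1.0 - k * k)                            # residual prediction error
    return -a[1:]                                       # a_1..a_p in the sign convention above

# Toy usage on a synthetic 25 ms frame at 8 kHz (placeholder values).
fs = 8000
t = np.arange(int(0.025 * fs)) / fs
frame = np.sin(2 * np.pi * 200 * t) + 0.3 * np.sin(2 * np.pi * 900 * t)
print(lpc_autocorr(frame, p=10))
```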
III. SPEAKER RECOGNITION

A. Speaker Identification
Speaker identification (SI) is the process of finding the identity of an unknown speaker by comparing his/her voice with the voices of registered speakers in the database. It is a one-to-many comparison. The basic structure of an SI system (SIS) is shown in Fig. 1. The core components of an SIS are the same as in a speaker verification system (SVS): M speaker models are scored in parallel and the most likely one is reported. The decision is then the ID of one of the speakers in the database, or "none of the above" if and only if the best matching score is below some threshold, which is the case for an open-set SIS.

Fig. 1 Basic structure of Speaker Identification
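As a minimal sketch of this one-to-many decision rule (not the authors' implementation), assume each registered speaker is represented by some scoring function; the M scores are compared and, in the open-set case, the claim falls through to "none of the above" when even the best score lies below a threshold.

```python
from typing import Callable, Dict, Optional, Sequence

def identify_speaker(
    features: Sequence[float],
    speaker_models: Dict[str, Callable[[Sequence[float]], float]],
    open_set_threshold: Optional[float] = None,
) -> Optional[str]:
    """Score the utterance against all M registered speaker models and return
    the most likely speaker ID; in an open-set SIS, return None when the best
    matching score is below the threshold ("none of the above")."""
    scores = {spk: model(features) for spk, model in speaker_models.items()}
    best_id = max(scores, key=scores.get)
    if open_set_threshold is not None and scores[best_id] < open_set_threshold:
        return None
    return best_id
```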

B. Speaker Verification
Speaker verification (SV) is the process of determining whether a speaker is who he or she claims to be. Terms with the same meaning as SV can be found in the literature, such as voice verification, voice authentication, speaker/talker authentication, and talker verification. SV performs a one-to-one comparison (also called a binary decision) between the features of an input voice and those of the claimed voice registered in the system. The basic structure of an SV system is shown in Fig. 2. Three main components are involved: Front-end Processing, Speaker Modeling, and Pattern Matching. Front-end processing highlights the relevant features of the incoming voice and removes the irrelevant ones, yielding the feature vectors of the speech signal. Pattern matching between the claimed speaker model registered in the database and an impostor model is then performed; depending on the models used, match scores are calculated, and if the score is above a certain threshold the identity claim is verified. With a high threshold the system gains safety and prevents impostors from being accepted, but at the same time it risks rejecting the genuine person, and vice versa. [6]

Fig. 2 Basic Structure of Speaker Verification
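The accept/reject rule itself can be sketched as a simple thresholded comparison. The snippet below assumes, purely for illustration, that the claimed speaker's model and a background/impostor model each expose a log-likelihood `score` method; a log-likelihood-ratio test is one common way to realize the binary decision described above.

```python
def verify_claim(features, claimed_model, impostor_model, threshold=0.0) -> bool:
    """Accept the identity claim when the claimed-speaker score exceeds the
    impostor (background) score by more than `threshold`. A higher threshold
    admits fewer impostors but rejects more genuine speakers, and vice versa."""
    llr = claimed_model.score(features) - impostor_model.score(features)
    return llr > threshold
```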
IV. METHODS OF SPEAKER VERIFICATION

Speaker verification systems typically operate in one of two input modes: text-dependent or text-independent. In the text-dependent mode, speakers must provide utterances of the same text for both training and recognition trials. In the text-independent mode, speakers are not constrained to provide specific texts in recognition trials. Since the text-dependent mode can directly exploit the voice individuality associated with each phoneme or syllable, it generally achieves higher recognition performance than the text-independent mode.

A. Text-Dependent (Fixed Passwords)

The structure of a system using fixed passwords is rather simple; input speech is time-aligned with reference templates or models created by using training utterances for the passwords. If the fixed passwords are different from speaker to
speaker, the difference can also be used as additional
individual information. This helps to increase performance.
The most common approach to automatic speaker recognition
in the text-dependent mode uses representations that preserve
temporal characteristics. Each speaker is represented by a
sequence of feature vectors (generally, short-term spectral
feature vectors), analyzed for each test word or phrase. This
approach is usually based on template matching techniques in
which the time axes of an input speech sample and each
reference template of registered speakers are aligned, and the
similarity between them accumulated from the beginning to
the end of the utterance is calculated. Trial-to-trial timing
variations of utterances of the same talker, both local and
overall, can be normalized by aligning the analyzed feature
vector sequence of a test utterance to the template feature
vector sequence using a dynamic programming (DP) time
warping algorithm or DTW. Since the sequence of phonetic
events is the same for training and testing, there is an overall
similarity among these sequences of feature vectors. Ideally
the intra-speaker differences are significantly smaller than the
inter-speaker differences.
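A minimal sketch of the DP time-warping alignment (illustrative only): the accumulated, length-normalized distance between a test feature-vector sequence and a reference template is computed with the classic DTW recursion; frame-level Euclidean distance is an assumption of the example.

```python
import numpy as np

def dtw_distance(test: np.ndarray, template: np.ndarray) -> float:
    """DTW distance between two feature-vector sequences of shape
    (n_frames, n_features); smaller means more similar."""
    n, m = len(test), len(template)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(test[i - 1] - template[j - 1])      # local frame distance
            cost[i, j] = d + min(cost[i - 1, j],                   # stretch test
                                 cost[i, j - 1],                   # stretch template
                                 cost[i - 1, j - 1])               # diagonal match
    return float(cost[n, m]) / (n + m)                             # length-normalized
```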
B. Text Independent (No Specified Passwords)
There are several applications in which predetermined
passwords cannot be used. In addition, human beings can
recognize speakers irrespective of the content of the utterance.
Therefore, text-independent methods have recently been
actively investigated. Another advantage of text-independent
recognition is that it can be done sequentially, until a desired
significance level is reached, without the annoyance of having
to repeat passwords again and again. In a text-independent
system, the words or phrases used in recognition trials
generally cannot be predicted. Therefore, it is impossible to
model or match speech events at the level of words or phrases.
Classical text-independent speaker recognition techniques are
based on measurements for which the time dimension is
collapsed. Recently text-independent speaker verification
techniques based on short duration speech events have been
studied. The new approaches extract and measure salient
acoustic and phonetic events. The bases for these approaches
lie in statistical techniques for extracting and modeling
reduced sets of optimally representative feature vectors or
feature vector sequences or segments. These techniques fall
under the related categories of vector quantization (VQ),
matrix and segment quantization, probabilistic mixture
models, and HMM.
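One simple way to picture a measurement "for which the time dimension is collapsed" is a long-term average spectrum: the short-time log spectra of all frames are averaged into a single vector per utterance. The sketch below is a hypothetical illustration with placeholder frame and hop sizes, not a method prescribed by the paper.

```python
import numpy as np

def long_term_average_spectrum(signal: np.ndarray, frame_len: int = 256,
                               hop: int = 128) -> np.ndarray:
    """Collapse the time dimension: average the short-time log-magnitude
    spectra of all frames into one feature vector for the utterance."""
    window = np.hamming(frame_len)
    frames = [signal[i:i + frame_len] * window
              for i in range(0, len(signal) - frame_len + 1, hop)]
    spectra = np.abs(np.fft.rfft(np.stack(frames), axis=1))
    return np.log(spectra + 1e-10).mean(axis=0)
```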
A set of short-term training feature vectors of a speaker can be
used directly to represent the essential characteristics of that
speaker. However, such a direct representation is impractical
when the number of training vectors is large, since the
memory and amount of computation required become
prohibitively large. Therefore, efficient ways of compressing

the training data have been tried using VQ techniques. In this method, VQ codebooks consisting of a small number of representative feature vectors are used as an efficient means of characterizing speaker-specific features. A speaker-specific
codebook is generated by clustering the training feature
vectors of each speaker. In the recognition stage, an input
utterance is vector-quantized using the codebook of each
reference speaker, and the VQ distortion accumulated over the
entire input utterance is used in making the recognition
decision.
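A minimal sketch of the VQ method just described, assuming k-means clustering (here via scikit-learn) to build the speaker-specific codebook and accumulating quantization distortion over the test utterance; the codebook size and library are illustrative choices.

```python
import numpy as np
from sklearn.cluster import KMeans

def train_codebook(train_features: np.ndarray, codebook_size: int = 64) -> np.ndarray:
    """Cluster a speaker's training feature vectors (n_frames, n_features)
    into a small codebook of representative vectors."""
    km = KMeans(n_clusters=codebook_size, n_init=10, random_state=0)
    return km.fit(train_features).cluster_centers_

def vq_distortion(test_features: np.ndarray, codebook: np.ndarray) -> float:
    """Quantize each test frame to its nearest codeword and average the
    distortion over the whole utterance (lower = better match)."""
    dists = np.linalg.norm(test_features[:, None, :] - codebook[None, :, :], axis=-1)
    return float(dists.min(axis=1).mean())
```

In the recognition stage the distortion would be computed against every reference speaker's codebook and used as the decision criterion, as described above.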
A five-state ergodic linear predictive HMM is used for broad
phonetic categorization. After identifying frames belonging to
particular phonetic categories, feature selection is performed.
In the training phase, reference templates are generated and
verification thresholds are computed for each phonetic
category. In the verification phase, after the phonetic
categorization, a comparison with the reference template for
each particular category provides a verification score for that
category. The final verification score is a weighted linear
combination of the scores for each category. The weights are
chosen to reflect the effectiveness of particular categories of
phonemes in discriminating between speakers and are adjusted
to maximize the verification performance. The performances
of speaker recognition based on a VQ-based method and that
using discrete/continuous ergodic HMM-based methods have
been compared, in particular from the viewpoint of robustness
against utterance variations. It was shown that a continuous
ergodic HMM method is far superior to a discrete ergodic
HMM method, and that a continuous ergodic HMM method is
as robust as a VQ-based method when enough training data is
available. However, when little data is available, the VQ-based
method is more robust than a continuous HMM method.[6]
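The final verification score described above, a weighted linear combination of per-phonetic-category scores, can be written in a few lines; the category names and weights below are hypothetical placeholders.

```python
def combined_verification_score(category_scores: dict, category_weights: dict) -> float:
    """Final score = weighted linear combination of the per-category scores;
    the weights reflect how well each phonetic category discriminates speakers."""
    return sum(category_weights[c] * s for c, s in category_scores.items())

# Hypothetical example: three broad phonetic categories.
print(combined_verification_score(
    {"vowels": 0.82, "nasals": 0.64, "fricatives": 0.71},
    {"vowels": 0.5, "nasals": 0.3, "fricatives": 0.2},
))
```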
V. APPLICATION

A. Secure Access via Telephone
The most straightforward way to employ SV is when one has to gain access to some secure place via telephone. Voice is completely compatible with the existing transmission protocols of telephone channels; therefore no special adaptations of the system (besides the installation of an SV system) are necessary.
B. Home Banking
Home banking is another application where SV can be applied. For the time being such a service is restricted to operations within the accounts maintained by a single individual. One can, for example, check the status of their account, transfer money between their own savings accounts, etc. The security level is rather low in these cases: the users are verified only by saying their PIN, and FR almost never occurs (after all, who wants to play "robber" with his own savings!). Still, however, it is being researched how secure it could be to use SV for
transactions including a second and third party (i.e. the so-called high-risk bank transactions). It is always noted that the security measures should be proportional to the value that could be obtained by this service.

C. Home Shopping
Home shopping (see, e.g., http://www.hsn.com) is the service that is least interesting to an impostor. SV is being employed here, though backed up by a human operator. In this service people call to order products that are later shipped to their home addresses. When all lines are busy, a customer can always choose to use the automatic service. They just have to speak their telephone number, and if their identity is successfully verified they can start ordering products. If they are rejected, they are redirected to a human operator. Even if their identity is mistaken for someone else's and some products are sent to another customer, there is no harm, because these products cannot go to an unauthorized party (i.e. a criminal).


D. Forensics and Surveillance
Detection of speakers in forensic cases boils down, in most situations, to deciding whether a given recording is really from a suspect or not. This is exactly a case of hypothesis testing. Leaving legal issues aside, SV can also help the police discover how many different individuals are involved in a conversation on a tape.
VI. DISCUSSION

The challenges in implementing practical and uniformly reliable systems for speaker verification are rooted in problems associated with variability and insufficient data. As
described earlier, variability is associated with trial-to-trial
variations in recording and transmission conditions and
speaking behavior. The most serious variations occur between
enrollment sessions and subsequent test sessions resulting in
models that are mismatched to test conditions. Most
applications require reliable system operation under a variety
of environmental and channel conditions and require that
variations in speaking behavior will be tolerated. Insufficient
data refers to the unavailability of sufficient amounts of data to
provide representative models and accurate decision
thresholds.
REFERENCES
[1] Matsui, T. and Furui, S., "Concatenated phoneme models for text-variable speaker recognition," Proc. IEEE Intl. Conf. Acoust., Speech, Signal Processing, II, 391-394, 1992.
[2] Matsui, T. and Furui, S., "Speaker adaptation of tied-mixture-based phoneme models for text-prompted speaker recognition," Proc. IEEE Intl. Conf. Acoust., Speech, Signal Processing, I, 125-128, 1994.
[3] Higgins, A.L., Bahler, L. and Porter, J., "Speaker verification using randomized phrase prompting," Digital Signal Processing, 1, 89-106, 1991.
[4] Rabiner, L.R. and Juang, B.-H., Fundamentals of Speech Recognition, Prentice-Hall, Englewood Cliffs, NJ, 1993.
[5] Rosenberg, A.E., Lee, C.-H. and Gokcen, S., "Connected word talker verification using whole word hidden Markov models," Proc. IEEE Intl. Conf. Acoust., Speech, Signal Processing, Toronto, 381-384, 1991.
[6] Ling Feng, Master's thesis, Informatics and Mathematical Modelling, Technical University of Denmark, IMM-THESIS, ISSN 1601-233X.
[7] R. A. Cole and colleagues, Survey of the State of the Art in Human Language Technology, National Science Foundation / European Commission.

BIOGRAPHY

Miss S. M. Powar received the BE degree in Electronics in 2011 from Bharti Vidyapeeth's College of Engineering, Kolhapur, and is pursuing the ME in Electronics & Communication at Dr. J. J. Magdum College of Engineering, Jaysingpur.

Dr. Mrs. V. V. Patil received the BE degree in 1994 and the ME degree in 2004 from the Electronics Engineering Department of Walchand College of Engineering, Sangli, and the PhD degree from the Electrical Engineering Department of IIT Bombay in 2014. She is currently Professor and Head of the Electronics Engineering Department at Dr. J. J. Magdum College of Engineering, Jaysingpur. Her research interests are in the area of speech and signal processing applications.