Performance Evaluation of MLP for Speech Recognition in Noisy Environments Using MFCC & Wavelets
P. Phani Kumar¹, K. S. N. Vardhan² & K. Sri Rama Krishna³
¹Assistant Professor, Dept. of ECE, V R Siddhartha Engineering College, Vijayawada
²PG Student, Dept. of ECE, V R Siddhartha Engineering College, Vijayawada
³Professor and Head, Dept. of ECE, V R Siddhartha Engineering College, Vijayawada
E-mail: [email protected], [email protected], [email protected]
ABSTRACT
Speech is a very powerful and fast tool for communication, which is why the problem of automatic speech recognition has long fascinated computer scientists. Two robust speech recognition systems for noisy environments have been implemented in this work: one based on Mel-Frequency Cepstrum Coefficients and the other on a wavelet packet filter bank. A major drawback of all Automatic Speech Recognition (ASR) systems is poor performance in noisy environments. Hence, a speech segregation stage using time-frequency masking is employed as the front end of the systems implemented here; the other two stages are the feature extraction stage and the recognition stage, as in conventional ASR systems. The accuracy of the entire system depends mainly on the performance of the segregation and feature extraction stages. By employing a wavelet packet filter bank approximately matched to the human cochlea, improved recognition accuracy is obtained for noisy speech. A Multilayer Perceptron (MLP) neural network is used for the training and testing procedures.

Keywords: Multilayer Perceptron (MLP) Neural Network, Discrete Wavelet Transform (DWT), Mel Scale Frequency Filter
1. INTRODUCTION
Speech is one of the most important tools for communication between humans and their environment; therefore, building Automatic Speech Recognition (ASR) systems has always been desirable. Automatic Speech Recognition is a technology that allows a computer to identify the words that a person speaks into a microphone. ASR technology has made enormous advances in the last 20 years and delivers sufficiently good performance to be usefully employable in a variety of tasks, but it does not exhibit robustness to environmental noise. Real-world applications require that speech recognition systems be robust to interfering noise. Unfortunately, the performance of a speech recognition system drops dramatically when there is a mismatch between training and testing conditions. Many parameters affect the accuracy of a recognition system: a noisy environment, incompatibility between training and testing conditions, different expressions of one word by two different speakers, and different pronunciations of one word by the same person at different times all prevent the system from achieving complete recognition, so resolving each of these problems is a step toward that aim. A speech recognition algorithm consists of several stages, of which the most significant are feature extraction and pattern recognition. In the feature extraction category, the best available algorithms are Mel Frequency Cepstrum Coefficients (MFCC) and an auditory filter bank employing wavelet packets.

The performance gap between Automatic Speech Recognition and Human Speech Recognition (HSR) remains large in the presence of noise, although the performance of ASR systems is almost similar to HSR performance in the absence of noise. The main reason for this gap is that most developed ASR systems perform speech segregation independently of recognition. A typical speech recognition system consists of a front end and a back end: the front end performs feature extraction on the input acoustic signal, and the back end performs recognition, taking the feature vectors as input. The input signal is noisy speech; a speech segregation unit separates the dominant speech from the noisy speech, and significant features are then extracted from it for recognition. In the present work, two methods are used for feature extraction: Mel Frequency Cepstrum Coefficients (MFCC) and an auditory filter bank employing wavelet packets. The performance of these two systems is compared. The purpose of speech feature extraction is to convert the speech waveform to some type of parametric representation for further analysis and processing.
Segregation of speech signals: when a representation of the sources exists such that the sources have disjoint support in that representation, it is possible to partition the support of the mixtures and obtain the original sources. One solution to the demixing problem is thus to determine an appropriate disjoint representation of the sources and then determine the partitions in this representation which demix. In this paper, we use the discrete short-time (windowed) Fourier transform, which is a good representation for demixing speech mixtures. Determining the partition blindly from one mixture is an open problem, but, given a second mixture, a method has been described in the literature for partitioning the time-frequency lattice so as to separate the sources. Our speech recognition process contains four main stages:
a. Acoustic processing, whose main task is to filter white noise from the speech signal; it consists of three parts: Fast Fourier Transform, Mel-scale band-pass filtering, and cepstral analysis.
b. Speech segregation based on W-Disjoint Orthogonality.
c. Feature extraction from the MFCC and wavelet transform coefficients.
d. Recognition using the MLP neural network.
The digitized speech signal contains relevant information (the data) as well as irrelevant information, such as white noise, and therefore requires a lot of storage space. Most of the frequency content of speech lies below 5 kHz; the upper range consists largely of white noise, which directly degrades system performance and training speed. Speech data must therefore be preprocessed.

Fast Fourier Transform (FFT)
The Fast Fourier Transform converts each frame of N samples from the time domain into the frequency domain. The FFT is a fast algorithm for implementing the Discrete Fourier Transform (DFT), which is defined on the set of N samples {x_k} as follows:
$$X_n = \sum_{k=0}^{N-1} x_k \, e^{-2\pi j k n / N}, \qquad n = 0, 1, 2, \ldots, N-1.$$
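As an illustration (not code from the paper), the framing and per-frame DFT described above can be sketched with NumPy; the 16 kHz sampling rate, 400-sample (25 ms) frames, 160-sample (10 ms) hop, and Hamming window are assumed values:

```python
import numpy as np

def frame_signal(x, frame_len=400, hop=160):
    """Split a 1-D signal into overlapping frames (assumed 25 ms / 10 ms at 16 kHz)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def frame_spectra(x, n_fft=512):
    """Window each frame and compute its DFT (the X_n of the equation above)."""
    frames = frame_signal(x) * np.hamming(400)  # taper frame edges (assumed window)
    return np.fft.rfft(frames, n=n_fft, axis=1)

# Example: spectra of one second of noise at 16 kHz
X = frame_spectra(np.random.randn(16000))  # shape: (n_frames, 257)
```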
Mel Frequency Cepstral Coefficients System
The speech signal is a slowly time-varying (quasi-stationary) signal: when examined over a sufficiently short period of time, between 5 and 100 ms, its characteristics are fairly stationary. Over longer periods, on the order of 0.2 s or more, the signal characteristics change to reflect the different speech sounds being spoken. Therefore, short-time spectral analysis is the most common way to characterize the speech signal. A wide range of possibilities exists for parametrically representing the speech signal for the speech recognition task, such as Linear Prediction Coding (LPC), Mel-Frequency Cepstrum Coefficients (MFCC), and others. MFCC is perhaps the best known and most popular, and is used in one of the systems implemented in this work. To simplify the subsequent processing of the signal, useful features must be extracted and the data should be compressed. The power spectrum of the speech signal is the most frequently used encoding.
Mel Frequency Cepstral Analysis is used to encode the speech signal. Mel-scale frequencies are distributed linearly in the low range but logarithmically in the high range, which corresponds to the physiological characteristics of the human ear. Cepstral analysis calculates the inverse Fourier transform of the logarithm of the power spectrum of the speech signal. So first, the speech signal is transformed to the frequency domain by the Fast Fourier Transform (FFT). Then the set of Mel-scale filter banks shown in Fig. 4 is applied to it, and the energy values of the upper frequencies are attenuated. The sub-band outputs are combined and the Inverse Fast Fourier Transform (IFFT) is performed.
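A minimal sketch of the MFCC computation just described, assuming triangular filters spaced uniformly on the mel scale, 24 bands (the critical-band count used later for the wavelet system), and a 512-point FFT; the inverse transform of the log spectrum is realized here with a DCT, as is common practice:

```python
import numpy as np
from scipy.fft import dct

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)  # common analytic mel mapping

def mel_filterbank(n_filters=24, n_fft=512, sr=16000):
    """Triangular filters spaced uniformly on the mel scale (assumed design)."""
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    hz_pts = 700.0 * (10.0 ** (mel_pts / 2595.0) - 1.0)
    bins = np.floor((n_fft + 1) * hz_pts / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for m in range(1, n_filters + 1):
        l, c, r = bins[m - 1], bins[m], bins[m + 1]
        fb[m - 1, l:c] = (np.arange(l, c) - l) / max(c - l, 1)  # rising edge
        fb[m - 1, c:r] = (r - np.arange(c, r)) / max(r - c, 1)  # falling edge
    return fb

def mfcc(frame_spectrum, n_coeffs=13):
    """MFCCs of one frame's rfft spectrum (length n_fft // 2 + 1 = 257)."""
    power = np.abs(frame_spectrum) ** 2
    energies = mel_filterbank() @ power  # mel filter bank energies
    # log + DCT realizes the inverse transform of the log power spectrum
    return dct(np.log(energies + 1e-10), norm='ortho')[:n_coeffs]
```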
Fig. 4: Mel Frequency Filter Bank

Fig. 6: Mel Frequency Cepstral Coefficients
The mel-frequency scale has linear frequency spacing below 1000 Hz and logarithmic spacing above 1000 Hz. As a reference point, the pitch of a 1 kHz tone, 40 dB above the perceptual hearing threshold, is defined as 1000 mels.
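The paper gives only this reference point; a commonly used analytic approximation consistent with it (standard in the MFCC literature, not taken from the paper) is

$$\text{mel}(f) = 2595 \log_{10}\!\left(1 + \frac{f}{700}\right),$$

which maps 1000 Hz to approximately 1000 mels.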
W-Disjoint Orthogonality (W-DO)
Given a mixture $x_1(t) = \sum_{j=1}^{N} s_j(t)$ of sources $s_j(t)$, $j = 1, \ldots, N$, the goal is to recover the original sources. In order to accomplish this, we assume the sources are pairwise W-disjoint orthogonal. We call two functions $s_1$ and $s_2$ W-disjoint orthogonal (W-DO) if, for a given window function $W$, the supports of the windowed Fourier transforms of $s_1$ and $s_2$ are disjoint. The windowed Fourier transform of $s_j$ is defined as

$$F^W[s_j](\tau, \omega) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} W(t - \tau)\, s_j(t)\, e^{-i\omega t}\, dt,$$
which we will refer to as $\hat{s}_j(\tau, \omega)$ where appropriate. Speech is sparse in the sense that a small percentage of the time-frequency coefficients in the STFT expansion of speech capture a large percentage of the overall power; in other words, the magnitude of the time-frequency representation of speech is often small. We measure approximate W-disjoint orthogonality through the demixing performance of time-frequency masks created using knowledge of the instantaneous source and interference time-frequency powers of speech mixtures. Experiments on speech mixtures reveal that speech is approximately W-DO. In order to measure W-disjoint orthogonality for a given mask, we combine two important performance criteria: how well the mask preserves the source of interest, and how well the mask suppresses the interfering sources.
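As a sketch of this oracle masking procedure (illustrative, not the paper's exact implementation), a binary time-frequency mask can be built from known source and interference spectrograms and applied to the mixture; SciPy's STFT is used and the window parameters are assumed:

```python
import numpy as np
from scipy.signal import stft, istft

def ideal_binary_mask_demix(source, interference, sr=16000, nperseg=512):
    """Oracle T-F mask: keep cells where the source dominates the interference."""
    _, _, S = stft(source, fs=sr, nperseg=nperseg)
    _, _, I = stft(interference, fs=sr, nperseg=nperseg)
    mask = (np.abs(S) > np.abs(I)).astype(float)
    # Apply the mask to the mixture spectrogram and resynthesize the estimate.
    _, _, X = stft(source + interference, fs=sr, nperseg=nperseg)
    _, estimate = istft(mask * X, fs=sr, nperseg=nperseg)
    return estimate, mask
```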
Discrete Wavelet Transform
Wavelet analysis is a relatively recent development in applied mathematics. Fourier analysis is ideal for studying stationary data, but is not well suited to data with transient events that cannot be statistically predicted from the data's past. Wavelets were designed with such nonstationary data in mind and, with their generality and strong results, have quickly become useful to a number of disciplines. The wavelet transform can be viewed as the projection of a signal onto a set of basis functions named wavelets. Such basis functions offer localization in the frequency domain. Compared to the STFT, which has equally spaced time-frequency localization, the wavelet transform provides high frequency resolution at low frequencies and high time resolution at high frequencies. The discrete wavelet transform (DWT) of a signal x[n] is defined in terms of approximation coefficients, $w_{j_0,k}$, and detail coefficients, $w_{j,k}$, as follows:

$$w_{j_0,k} = \frac{1}{\sqrt{M}} \sum_{n} x[n]\, \varphi_{j_0,k}[n], \qquad w_{j,k} = \frac{1}{\sqrt{M}} \sum_{n} x[n]\, \psi_{j,k}[n] \quad \text{for } j \ge j_0, \qquad (1)$$

where $n = 0, 1, 2, \ldots, M-1$, $j = 0, 1, 2, \ldots, J-1$, $k = 0, 1, 2, \ldots, 2^j - 1$, and M denotes the number of samples to be transformed. The basis functions $\varphi_{j,k}[n]$ and $\psi_{j,k}[n]$ are defined as:

$$\varphi_{j,k}[n] = 2^{j/2}\, \varphi[2^j n - k], \qquad \psi_{j,k}[n] = 2^{j/2}\, \psi[2^j n - k], \qquad (2)$$

where $\varphi[n]$ is called the scaling function and $\psi[n]$ the wavelet function. For the implementation of the DWT, a filter bank structure is often used. The approximation coefficients at a higher level are passed through a high-pass and a low-pass filter, each followed by downsampling by two, to compute the detail and approximation coefficients at the next lower level. This tree structure is repeated for a multi-level decomposition. A wavelet packet filter bank matched to the critical bands of the human cochlea is implemented for feature extraction; the human cochlea model consists of 24 critical bands with different bandwidths.
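For illustration, one level of the analysis filter bank described above can be written directly in NumPy; Haar filters are assumed for simplicity (the paper does not restate its wavelet choice):

```python
import numpy as np

def dwt_level(approx, lo, hi):
    """One DWT level: filter the current approximation and downsample by two."""
    a = np.convolve(approx, lo)[1::2]  # low-pass  -> next approximation coeffs
    d = np.convolve(approx, hi)[1::2]  # high-pass -> detail coefficients
    return a, d

def dwt(x, levels=3):
    lo = np.array([1.0, 1.0]) / np.sqrt(2.0)   # Haar scaling (low-pass) filter
    hi = np.array([1.0, -1.0]) / np.sqrt(2.0)  # Haar wavelet (high-pass) filter
    a, details = np.asarray(x, dtype=float), []
    for _ in range(levels):  # repeat the tree structure for each level
        a, d = dwt_level(a, lo, hi)
        details.append(d)
    return a, details  # final approximation + detail coefficients (finest first)
```

A wavelet packet tree decomposes the detail branches as well as the approximation branches, which is how a filter bank approximating the 24 critical bands of the cochlea can be assembled.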
MLP Neural Network
A Multilayer Perceptron (MLP) network consists of an input layer, one or more hidden layers, and an output layer. Each layer consists of multiple neurons. An artificial neuron is the smallest unit of an artificial neural network; the actual computation and processing of the network happens inside the neurons. In this work, we use a feed-forward MLP architecture with the back-propagation training algorithm (FFBP). In this type of network, the input is presented to the network and moves through the weights and nonlinear activation functions toward the output layer, and the error is corrected in a backward direction using the well-known error back-propagation algorithm. The number of neurons in each hidden layer has a direct impact on the performance of the network during training as well as during operation. Having more neurons than a problem needs leads to overfitting, a situation in which the network memorizes the training examples; networks that overfit perform well on training examples and poorly on unseen examples. Having fewer neurons than needed causes underfitting, which occurs when the network architecture cannot cope with the complexity of the problem at hand; underfitting results in inadequate modeling and therefore poor performance. Unfortunately, the right number of hidden layers and of neurons per hidden layer can only be found by trial and error, and many experiments were conducted to obtain the optimum values.
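For illustration (the paper used MATLAB's Neural Network Toolbox), a minimal NumPy sketch of a one-hidden-layer feed-forward network trained with error back-propagation; the tanh activations and learning rate are assumed, and biases are omitted for brevity:

```python
import numpy as np

class MLP:
    """One-hidden-layer feed-forward network trained with back-propagation (FFBP)."""
    def __init__(self, n_in, n_hidden, n_out, lr=0.1, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.1, (n_in, n_hidden))   # input -> hidden weights
        self.W2 = rng.normal(0.0, 0.1, (n_hidden, n_out))  # hidden -> output weights
        self.lr = lr

    def forward(self, X):
        self.h = np.tanh(X @ self.W1)     # hidden-layer activations
        return np.tanh(self.h @ self.W2)  # output-layer activations

    def train_step(self, X, y):
        out = self.forward(X)
        err = out - y
        # propagate the error backward through the tanh nonlinearities
        g2 = err * (1.0 - out ** 2)
        g1 = (g2 @ self.W2.T) * (1.0 - self.h ** 2)
        # gradient-descent weight updates, averaged over the batch
        self.W2 -= self.lr * (self.h.T @ g2) / len(X)
        self.W1 -= self.lr * (X.T @ g1) / len(X)
        return float(np.mean(err ** 2))
```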
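A toy usage of the sketch above with synthetic data (the paper trains and tests with 1000 samples each; the 13-dimensional features and 10 word classes here are assumptions):

```python
# Continuing the MLP sketch above: 13-dim MFCC-like vectors, 10 word classes.
rng = np.random.default_rng(1)
X_train = rng.normal(size=(1000, 13))                 # 1000 training samples
y_train = np.eye(10)[rng.integers(0, 10, size=1000)]  # one-hot class targets
net = MLP(n_in=13, n_hidden=32, n_out=10)
for epoch in range(200):                              # simple training loop
    mse = net.train_step(X_train, y_train)
print("final training MSE:", mse)
```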
Fig. 8: Perceptron (Train and Test)

Fig. 9: Hyper Plane of Hidden Neurons

2. RESULT ANALYSIS
An MLP neural network was developed in MATLAB and selected for our network implementation. The feed-forward MLP was trained and tested with 1000 samples each.

3. CONCLUSION
In this paper, two robust speech recognition systems for noisy environments have been implemented. The first system uses MFCC, and the second uses the output of a wavelet packet filter bank closely matched to the human cochlea as features for the recognition stage. The performance of the wavelet packet filter bank based system is found to be better than that of the MFCC based system. The performance of the speech segregation stage is also found to be better than that of related schemes. The efficiency of the segregation stage depends on the degree to which the speech signals approximate W-disjoint orthogonality.

REFERENCES
[1] Abdul Ahad, Ahsan Fayyaz, Tariq Mehmood, "Speech Recognition using Multilayer Perceptron", IEEE, p. 103, 2002.
[2] Song Yang, Meng Joo Er, and Yang Gao, "A High Performance Neural-Networks-Based Speech Recognition System", IEEE, p. 1527, 2001.
[3] Ben Milner, Xu Shao, "Clean Speech Reconstruction from MFCC Vectors and Fundamental Frequency Using an Integrated Front End", Speech Communication, 48 (2006), pp. 697-715.
[4] I. Gavat, O. Dumitru, C. Iancu, Gostache, "Learning Strategies in Speech Recognition", Proc. ELMAR 2005, pp. 237-240, June 2005, Zadar, Croatia.
[5] Bahlmann, Haasdonk, Burkhardt, "Speech and Audio Recognition", IEEE Trans., 11, May 2003.
[6] J. Tebelskis, "Speech Recognition Using Neural Networks", Ph.D. Dissertation, School of Computer Science, Carnegie Mellon University, 1995.
[7] J. Tchorz, B. Kollmeier, "A Psychoacoustical Model of the Auditory Periphery as Front-End for ASR", ASA/EAA/DEGA Joint Meeting on Acoustics, Berlin, March 1999.
[8] R. P. Lippmann, "An Introduction to Computing with Neural Nets", IEEE ASSP Magazine, 4, April 1987.
[9] MathWorks, Neural Network Toolbox User's Guide, 2004.
[10] S. M. Peeling, R. K. Moore and R. J. Tomlinson, "The Multi-Layer Perceptron as a Tool for Speech Pattern Processing Research", in Proc. IoA Autumn Conf. Speech and Hearing, 1986.
[11] L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, Prentice-Hall, Englewood Cliffs, N.J., 1993.