Speech Recognition Using Correlation Tec
Abstract— Developments in wireless communication and mobile devices have driven improvements in
speech recognition systems. Two terms are central to any speech recognition system: pattern
matching and feature extraction. This paper describes and implements a simple MATLAB algorithm
that recognizes speech by matching patterns with the cross-correlation technique. Correlation is a
statistical measure that compares two or more signals to determine how similar they are. Speech
recognition, which is a part of biometrics, has become a major means of securing devices and
applications. In speech recognition, the spoken words are extracted and matched against samples
provided previously.
Keywords— MATLAB Programming, Speech Recognition, Biometrics, Isolated Word Recognition,
Mel frequency cepstral coefficients (MFCC), Correlation
I. INTRODUCTION
Speech recognition is the process of capturing spoken words with a device and converting them into
a digitally stored set of words. It is used in many security applications in which the user speaks a
password to a computer, and it is also used for automation. There is a steadily growing need to
verify and recognize the voices of individuals automatically. For every individual, protecting
personal details from theft is a high priority. This paper uses Mel frequency cepstral coefficients
(MFCCs) as the features of the recorded speech [1].
Speech recognition is widely used to secure applications. Security has become a major concern for
users of smart devices. Speech recognition is one part of biometrics. Biometrics, the physical and
behavioral characteristics that make each of us unique, are a natural choice for identity
verification. Biometrics is an emerging technology that promises an effective solution to our
security needs. We can use a biometric to access our home or our account, or to invoke a customized
setting for any secure area or application. This section examines the different kinds of biometric
verification systems and their deployment potential.
1.1. Biometrics
The term biometrics is now generally understood as "the art of measuring physical characteristics to
verify a person's identity", and is derived from the Greek words bio (life) and metric (to measure).
Biometrics includes speech recognition, iris and face scans, and fingerprint recognition.
Biometric characteristics can be further divided into two main classes:
Physiological: These biometrics are used for identification or verification purposes.
Identification refers to determining who a person is. This method is commonly used in criminal
investigations.
Behavioral: These biometrics are used for verification purposes. Verification is determining
whether a person is who they say they are. This method looks at patterns in how a person performs
certain activities.
1.2. History
In 1994, IBM was among the first companies to introduce and commercialize a dictation feature based
on speech recognition. Since then, speech recognition has been introduced in many different
applications, including telephony applications, embedded systems (telephone voice dialing systems,
car kits, and PDAs), and multimedia applications such as language learning tools.
In the 1960s and '70s, signature biometric concepts were developed, but the biometric field remained
largely static until military and security agencies investigated and developed new biometric
fingerprinting technology.
Gunnar Fant developed the source-filter model of speech production and published it in 1960; it
proved to be a useful and powerful model of speech production. Unfortunately, funding at Bell Labs
dried up for several years when, in 1969, the influential John Pierce wrote an open letter that was
highly critical of speech recognition research. Pierce defunded speech recognition research at Bell
Labs, where no research on recognition was done until Pierce retired and James L. Flanagan took
over.
Later, Raj Reddy was the first person to take on continuous speech recognition, as a graduate
student at Stanford University in the late 1960s. Reddy's system was designed to issue spoken
commands for a game of chess played at the university.
These extracted features are vector quantized using the Vector Quantization algorithm. Vector
Quantization (VQ) is used for feature extraction in both the training and testing phases. It is an
extremely efficient representation of the spectral information in the speech signal, mapping the
vectors from a large vector space to a finite number of regions in that space, called clusters [3].
After feature extraction, feature matching is the actual procedure for identifying the unknown
speaker: the extracted features are compared with the database using the DISTMIN algorithm.
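The quantize-then-match idea described above can be sketched in a few lines. The Python code below is an illustration only, not the authors' MATLAB program and not the DISTMIN algorithm itself: it assumes each speaker is represented by a small codebook of cluster centres and picks the speaker whose codebook gives the smallest average Euclidean distance to the unknown feature vectors. The function names, the two-dimensional vectors, and the speaker labels are all hypothetical.

```python
import math

def nearest_codeword(vector, codebook):
    """Return (index, distance) of the codeword closest to `vector`."""
    best_index, best_dist = -1, math.inf
    for i, codeword in enumerate(codebook):
        dist = math.dist(vector, codeword)  # Euclidean distance
        if dist < best_dist:
            best_index, best_dist = i, dist
    return best_index, best_dist

def average_distortion(features, codebook):
    """Mean distance from each feature vector to its nearest codeword."""
    total = sum(nearest_codeword(v, codebook)[1] for v in features)
    return total / len(features)

def identify_speaker(features, codebooks):
    """Pick the speaker whose codebook yields the smallest average distortion."""
    return min(codebooks, key=lambda name: average_distortion(features, codebooks[name]))

# Hypothetical 2-D codebooks; real MFCC vectors have more dimensions.
codebooks = {
    "speaker_a": [(0.0, 0.0), (1.0, 1.0)],
    "speaker_b": [(5.0, 5.0), (6.0, 6.0)],
}
unknown = [(0.1, -0.1), (0.9, 1.2), (0.2, 0.1)]
print(identify_speaker(unknown, codebooks))  # speaker_a
```

In practice the codebooks would be trained from each speaker's feature vectors (for example with a clustering method); here they are written out by hand to keep the sketch short.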
more. If you have ever paid a bill by telephone using an automated system, you have probably
benefited from speech recognition software.
Figure 3. Speech Recognition Process (blocks shown: Feature Extraction, Probability Estimation,
Decoding, Language Models, Recognized Sentences)
Figure 4. Success Results (panel: Test.wav vs fifth.wav)
Now consider the file denied.wav, which does not match any of the given samples. When
speechrecognition('denied.wav') is entered at the MATLAB command prompt, the comparison starts and
the program reports "denied", which means the file does not match any of the sample files.
The graphs are shown below:
(Graph panel: Denied.wav vs fifth.wav)
In the success result, the second sample is the successful match: the words of the two audio files
coincide at coordinates (0, 0), which the graph shows in frequency format.
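The behaviour described in this section, a match that peaks at the origin and a "denied" result when nothing matches, can be illustrated with a short sketch. The Python code below is not the paper's MATLAB program. It assumes a normalized cross-correlation whose peak value is compared against a threshold; the function names, the toy signals, and the 0.8 threshold are hypothetical choices made for illustration.

```python
import math

def normalized_xcorr(x, y):
    """Return {lag: r}, the cross-correlation of x and y at each lag,
    scaled by the signal energies so that |r| <= 1."""
    norm = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    result = {}
    for lag in range(-(len(y) - 1), len(x)):
        total = 0.0
        for n, b in enumerate(y):
            k = n + lag
            if 0 <= k < len(x):
                total += x[k] * b
        result[lag] = total / norm if norm else 0.0
    return result

def peak(x, y):
    """Return (lag, value) of the largest cross-correlation value."""
    corr = normalized_xcorr(x, y)
    lag = max(corr, key=corr.get)
    return lag, corr[lag]

def recognize(test, samples, threshold=0.8):
    """Return the best-matching sample name, or "denied" if the best
    peak stays below the (assumed) threshold."""
    best_name, best_score = None, 0.0
    for name, sample in samples.items():
        _, score = peak(test, sample)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else "denied"

# Toy signals standing in for the recorded .wav data.
word = [0.0, 1.0, 0.5, -0.5, -1.0, 0.0]
samples = {
    "first": [1.0, -1.0, 1.0, -1.0, 1.0, -1.0],
    "second": list(word),
}
print(peak(word, samples["second"]))  # (0, 1.0)
print(recognize(word, samples))       # second
print(recognize([1.0] * 6, samples))  # denied
```

An identical recording gives a peak of 1.0 at lag 0, which corresponds to the match at the origin seen in the success plot; an unrelated signal never reaches the threshold and is reported as "denied".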
V. CONCLUSION
This paper has described various features, behaviors, and characteristics of speech signals and has
dealt with the concept of cross-correlation. An algorithm was created in MATLAB that takes speech
input signals in .wav format and compares them with the test sound file using the correlation
technique. The paper concludes that, to remove the limitation on audio formats, further study of
other speech signal formats is required; these could then be used for communication with machines
that include actual hardware rather than only a simulator.
REFERENCES
[1] Rajorshee Raha and Amab Pramanik, "Automatic Speech Recognition using correlation analysis".
[2] Suma Shankaranand, Manasa S, Mani Sharma, Nithya A.S, Roopa K.S., and K.V. Ramakrishnan, "An Enhanced Speech
Recognition System", International Journal of Recent Development in Engineering and Technology, Volume 2,
Issue 3, March 2014.
[3] Mahdi Shaneh and Azizollah Taheri, "Voice Command Recognition System based on MFCC and VQ Algorithms",
World Academy of Science, Engineering and Technology Journal, 2009.
[4] Nikolai Shokhirev, "Hidden Markov Models", 2010.
[5] Aseem Saxena, Amit Kumar Sinha, Shashank Chakrawarti, and Surabhi Charu, "Speech Recognition Using MATLAB",
International Journal of Advances in Computer Science and Cloud Computing, ISSN: 2321-4058, Volume 1,
Issue 2, Nov. 2013.