Ann LA2 Project
Ann LA2 Project
• Each neuron has a very simple structure, but an army of such elements constitutes a
tremendous processing power.
• Feedforward network is the first and the simplest form of ANN. In this network,
the information flows only in one i.e. forward direction from input node via
hidden nodes to the output node. Learning is the adaptation of free
parameters of neural network through a continuous process of stimulation by
the embedded environment. The back-propagation algorithm has emerged
to design the new class of layered feedforward network called as Multi-Layer
Perceptrons (MLP). It generally contains at least two layers of perceptrons. It
has one input layer, one or more hidden layers and output layers. The hidden
layer plays very important role and acts as a feature extractor.
Struture of Speech Recognition
The structure of a standard speech recognition system is illustrated in Figure. The elements
are as follows:
• Raw speech - Speech is typically sampled at a high frequency, e.g., 16 KHz over a
microphone or 8 KHz over a telephone. This yields a sequence of amplitude values over
time.
• Signal analysis - Raw speech should be initially transformed and compressed, in order to
simplify subsequent processing. Many signal analysis techniques are available which can
extract useful features and compress the data by a factor of ten,without losing any
important information. Among the most popular:
• Fourier analysis (FFT)- yields discrete frequencies over time, which can be
interpreted visually. Frequencies are often distributed using a Mel scale,
which is linear in the low range but logarithmic in the high range,
corresponding to physiological characteristics of the human ear.
• Perceptual Linear Prediction (PLP)- is also physiologically motivated, but
yields coefficients that cannot be interpreted visually.
• Linear Predictive Coding (LPC)- yields coefficients of a linear equation
that approximate the recent history of the raw speech values.
FLOWCHART OF THE SYSTEM
• The general structure of the speech
recognition program is shown in figure
bellow. The input of the system is the Close-talking microphone
speech signal. The preprocessing includes Microphone array
Word spotting
Utterance verification
Patiyuth Pramkeaw 2012 Improving MFCCbased FIR Filter Shows the improvement
speech in recognition rates of
classification spoken words
with FIR filter
Shivanker Dev 2013 Isolated Speech Dynamic Time It shows that the DTW
Dhingra Recognition using Warping(DTW) is the best non linear
MFCC and DTW feature
matching technique in
speech
identification,
with minimal
error rates and fast
computing speed
CONCLUSION
• For SR ANN is a effective and efficient way as it has multi layer network.
Speech Recognition is also used in smart phones. In smart phones
speech/spoken words are given as an input and SR s/w gives appropriate
search or information that user wants as a output.Neural networks, with their
remarkable ability to derive meaning from complicated or imprecise data,
can be used to extract patterns and detect trends that are too complex to
be noticed by either humans or other computer techniques. A trained neural
network can be thought of as an "expert" in the category of information it
has been given to analyse.
• ANN has,
1.Adaptive learning: An ability to learn how to do tasks based on the data
given for training or initial experience.
2.Self-Organisation: An ANN can create its own organisation or representation
of the information it receives during learning time.
3.Real Time Operation: ANN computations may be carried out in parallel, and
special hardware devices are being designed and manufactured which take
advantage of this capability.
4.Fault Tolerance via Redundant Information Coding: Partial destruction of a
network leads to the corresponding degradation of performance. However,
some network capabilities may be retained even with major network
damage. Thus for speech recognition artificial neural network is efficient and
effective algorithm among all algorithms.
BIBLIOGRAPHY
• http://en.wikipedia.org/wiki/Speech_recognition.
• http://en.wikipedia.org/wiki/Artificial_neural_network
• http://www.researchgate.net/
• Youtube
• Introduction to Various Algorithms of Speech Recognition: Hidden Markov
Model, Dynamic Time Warping and ArtificialNeural NetworksPahini A. Trived
• Automatic Speech Recognition System Prof. Pisal Ranjeet1 , Thite Prakash2 ,
Satpute Amruta3 & Shingade Monali4
• VOICE RECOGNITION SYSTEM: SPEECH-TO-TEXT Prerana Das, Kakali Acharjee,
Pranab Das and Vijay Prasad*