Report 1
Report 1
SUBMITTED BY:-
As a part of the continuation of the main project that we have started in the seventh semester,
we have decided to have a project flow that contain:-
We have analysed each block in the schematic diagram of VQ/HMM isolated word
recognition system as shown in the figure below.
As the input, audio signal is given to the PC. The PC takes this audio signal as wmv
file. This is given to feature extraction block where features of speech is extracted on the
basis of MEL frequency and pitch.
On this day, we studied about each blocks in circuit level designing. The details are given
below.
From the block level designing, the datas labelled and saved in the labelling block are
vector quantised, where the input speech is quantised to different levels and given to the data
base. Thus the first input speech is saved in the data base MODEL 1, second to MODEL 2,
etc upto L models and hence calculated the probability of each models and is saved in it.
Thus a data base of L models is created. Now, when we have to recognise a speech, an input
is given to the feature extraction. Then the extracted speech is labelled and sent to data base
where each data is compared with the models. If the input signal is matched with any of the
models, then it is given to the output as the recognised word.
An artificial neural network is designed from the data base where the probability is
calculated. Next we train the network by using some toolkits in Matlab. The tool we used is
simulink. One of the portion of HMM has been implemented as hardware part and is placed
in the neural network and thus the circuit level designing is complete.
By using the toolkits of Matlab, in the command window we have written the
program and assigned the values for x inputs and y outputs. Next we have to design a neural
network. From the toolkit we get the neural network start where we have four toolkits. They
are:-
Fitting tool
Clustering tool
Next we assigned the values of x and y. Then we went for the network architecture
where the number of hidden networks have to be assigned. Then we trained the network and
thus we have plotted the graph from which we could get the best fit. From the toolkit of the
matlab, we have given an input to an audio device from that to a time scope. Thus we got the
output and plotted in a graph. We gave different values for x and corresponding values
where plotted in the graph.
From the matlab simulink toolkit, to the audio device by using microphone, sound
was given as input and is given to a time scope and thus analysed the signal and the
corresponding plot was obtained.