Malayalam Speech Recognition
ISSN No:-2456-2165
Aishwariya D P
Dept. of Computer Science and Engineering
Sahrdaya College of Engineering and Technology
Abstract:- This project develops a state-of-the-art large vocabulary continuous speech recognition (LVCSR) system for the Malayalam language. Existing speech recognition systems suffer from low accuracy and misinterpretation, high time cost and reduced productivity, difficulty with accents, and background noise interference. Artificial intelligence (AI) is the simulation of human intelligence in computers; it includes Machine Learning, Natural Language Processing, Computer Vision and Robotics. Audio and video files that are large and several minutes long often contain a variety of sounds. In this project, a transfer learning technique is used. The aim of the proposed speech recognition system is therefore to collect thousands of data samples for each category, from speakers of any gender and any age group, and to train on them according to their native sequence so as to increase the accuracy level.

Keywords:- Artificial Intelligence; Machine Learning; Large Vocabulary Continuous Speech Recognition; Support Vector Machine; TensorFlow.

I. INTRODUCTION

Speech is a simple and natural means of communication between humans, but today people communicate not only with one another but also with the machines in their lives, the most important of which is the computer. Speech is therefore often used between computers and humans as well. This interaction takes place through interfaces, and the area that studies it is called Human-Computer Interaction (HCI). Computers have already replaced a large number of humans in many creative professions. Speech recognition can be performed by a computer. Our project focuses on the development of a state-of-the-art large vocabulary continuous speech recognition (LVCSR) system for the Malayalam language. We aim to pick out the desired sound from a large file. Here machine learning is used to classify the speech. Machine learning enables computers to learn from problems and solve them on their own. In our project the model is created using TensorFlow, an open-source machine learning framework. It has a comprehensive, flexible ecosystem of tools, libraries and community resources that lets researchers push the state of the art in machine learning and lets developers easily build and deploy machine-learning-powered applications. We have also used a transfer learning approach. A transfer learning method is used to transfer two types of knowledge to different datasets.

A. Motivation
The state of Kerala has 14 revenue districts. Each district has its own way of speaking Malayalam, which is the mother tongue of the state. The state is also popularly called the land of backwaters, which makes it a popular spot for tourists. Even though Malayalam is spoken all over the state, it is occasionally hard for people to recognise the language as it is spoken in a given district. Hence, with our software one could recognise the slang being used.

B. Proposed system
Malayalam is spoken in different slangs, and people from each part of Kerala use a different one. For a common word, people from each region either pronounce it differently or use a completely different word. Google Assistant, which can be invoked by saying "OK Google", is used to translate words and sentences from different languages into the language requested. It cannot understand or differentiate the different slangs or the words used by people throughout these regions, and it often misunderstands the words people use. Its accuracy is hence low. The problems of existing speech recognition are lack of accuracy and misinterpretation, time cost and productivity, difficulty with accents, and background noise interference. Here the transfer learning approach is used.