Pvaresearch
Pvaresearch
Sudhir Yadav
APEX Inst. Of Technology
Chandigarh University
Chandigarh,India
[email protected]
Abstract:-In this paper, Voice intelligent Assistance tool is and may not always provide a satisfying user
used for searching purposes, summary extraction, setting experience. Additionally, voice assistants collect
reminders just by using voice commands. Voice recognition large amounts of data, making them vulnerable to
technology allows us to access any document or file we cyberattacks. Personalization is also a challenge, as
desire. voice assistants may not always be able to
If the user spells out the word it automatically types in the personalize responses to individual users. Overall,
required field. It recognizes the speech and searches the personal voice assistants facenumerous challenges in
appropriate content in the database and retrieves it. The audio their early stages of development.
from the microphone will be collected by this voice assistant,
which will subsequently translate it into text. To ensure that II. LITERATURE SURVEY
the virtual assistant can comprehend them, the user should
choose the proper language. If any wrong or invalid Voice assistant has a long history with several waves
communication happens it invokes some messages in dialog of major innovations. Voice assistant for dictation,
box. It is a software application which performs tasks and search, and voice commands has become a standard
events based on commands. Voice-Command and speech feature on smartphones and wearable devices. The
synthesis are enhancing the level of user- interaction in study stems from an overlooking literature review in
applications. This intelligent personal assistant (IPA) can order to present generic knowledge (theory and
interact with the user by opening a report, providing a brief concepts) about voice control, virtual assistants,
summary via speech-to-text, and outlining the most crucial fields of use and more. When looking at a number of
details in the appropriate context. Here, an attempt is made to currently available intelligent programs with natural
build an intelligent voice personal assistant using Python, language processing capabilities, many examples can
which offers the ability to control voice- activated devices be found in everyday life filling a variety of roles.
and speech- activated smart devices for information The first speech recognition system, named Audrey,
extraction. was created by Bell Laboratories in 1952. Audrey
NLP (Natural Language Processing) helps the virtual was rather rudimental and limited technology wise,
assistant to understand and respond to human speech and understanding only ten digits - spoken by particular
based on the voice commands the tasks are performed. BERT people (Pieraccini, 2012). About 10 years later, IBM
is designed for the computers to understandthe meaning of developed and demonstrated their Shoebox Machine.
ambiguous language. Keywords:- Virtual Assistant, Speech The device recognized and responded to 16 different
Recognition, Extracti spoken words, including all ten digits “0” to “9” as
Keywords— Virtual Assistant, Speech Recognition, well as calculating commands such as “plus” or
Extractive summarization, BERT “minus” (IBM, 2018).Shoebox Machine recognized
and responded to 16 spoken words, including the ten
I. INTRODUCTION digits from “0” through “9”, only in English by a
designated speaker. These limitations later proved to
be problematic, increasing the scepticism opposing
The aim of this project is to develop a highly
voice recognition. Mid 1970’s came the Hidden
functional and user- friendly personal voice assistant
Markov Model (HMM) (Rabiner,1989). The HMM
that can effectively understand, process, and respond
considerably altered the development of a feasible
to user voice commands. The voice assistant should
speech recognition software. With the help of HMM
be capable of performing a wide range of tasks, from
speech recognition started using a statistical method
providing information and reminders to executing
measuring the probability of unknown sounds being
complex actions, all while maintaining a natural and
words. Now, the potential to recognize an unlimited
engaging interaction with the user. Personal voice
number of words became imminent due to the
assistants face challenges such as accuracy, limited
method allowing the number of understandable
functionality, privacy concerns, security risks, and
words go up to a few thousands. These choices of
user acceptance. They struggle with understanding
human speech, particularly in noisy environments,