0% found this document useful (0 votes)
118 views3 pages

Desktop Voice Assistant Research Paper

The document discusses the development of a desktop voice assistant using Python. It analyzes previous voice assistant systems and discusses using speech recognition, natural language processing and text-to-speech to create a voice assistant that can perform tasks like browsing, scheduling and information retrieval through voice commands.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
118 views3 pages

Desktop Voice Assistant Research Paper

The document discusses the development of a desktop voice assistant using Python. It analyzes previous voice assistant systems and discusses using speech recognition, natural language processing and text-to-speech to create a voice assistant that can perform tasks like browsing, scheduling and information retrieval through voice commands.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

Desktop Voice Assistant

Monalisha Aggarwal1, Deepali Kumari2, Ayush Kant3, Ayush Kumar Gupta4


1, 2, 3, 4
Department of Information Technology, Noida Institute of Engineering and Technology

Abstract: Voice assistants have improved accessibility and convenience across a range of
devices in recent years, becoming indispensable components of everyday life. The goal of this
project is to create a Desktop Voice Assistant system that will enable smooth voice-activated
desktop computer interaction. To comprehend customer inquiries and complete tasks quickly,
the Desktop Voice Assistant system combines speech recognition, natural language processing
(NLP), and text-to-speech (TTS) algorithms. In order to help users with tasks like online
browsing, scheduling, reminders, and information retrieval, the project's main goals are to
build an intuitive user interface, implement strong speech recognition capabilities, and
integrate a varied variety of functionality. The Desktop Voice Assistant system uses cutting-edge
natural language processing (NLP) models to accurately understand user commands and
provide timely, pertinent information or actions in response. By using Visual Studio Code for
development with modules like Wikipedia, PyAudio, and pyttsx3, the project shows how Python
can be used to create complex voice assistant systems that are both feasible and adaptable.
Voice assistants are positioned to become commonplace companions as technology develops,
streamlining activities and improving human-computer connection. The potential for voice
assistants to completely transform daily life is growing thanks to continuous developments in
artificial intelligence and speech recognition technologies, which present fresh chances for
creativity and effectiveness.
Keywords: Voice Assistant, Speech Recognition, Python, PyAudio, Natural Processing
Language(NLP).

I. INTRODUCTION
Voice assistants have become revolutionary tools in the modern era of rapid technology
advancement, changing the nature of human-computer connection. These systems demonstrate
capacities similar to human cognition through the integration of machine learning algorithms and
artificial intelligence (AI), completely changing the way consumers interact with technology. This
study investigates the design and applications of a customized voice assistant system that uses
voice commands to perform activities with ease. The Python programming language is utilized in
the system's construction. The need for simple and effective human-machine interactions has led
to an exponential increase in voice assistant technology in recent years. Voice-activated platforms,
like Alexa, Siri, and Google Assistant, are now commonplace and provide users with the ease of
doing everything from sending emails to managing smart home appliances. With the use of APIs
such as Speech Recognition in Python, speech recognition technology has spread widely, making
it easier for people to create voice assistant apps and enabling them to personalize their own
virtual assistants. The purpose of this study is to examine the many implications and features of
voice assistant systems that are written in Python. Through the utilization of Python's vast
libraries and adaptability, developers can customize voice assistants to match individual user
requirements, resulting in a streamlined and effective computing experience. The use of modules
like Wikipedia, PyAudio, and pyttsx3 allows for a multitude of features, such as voice commands
to commence online browsing, email management, and information retrieval. The purpose of this
study is to examine the many implications and features of voice assistant systems that are written
in Python. Through the utilization of Python's vast libraries and adaptability, developers can
customize voice assistants to match individual user requirements, resulting in a streamlined and
effective computing experience. The use of modules like Wikipedia, PyAudio, and pyttsx3 allows
for a multitude of features, such as voice commands to commence online browsing, email
management, and information retrieval. The most well-known iPhone app is called "SIRI," which
lets users speak commands into their phones and have them answer accordingly. The similar
"Google Assistant" app was also developed by Google and is available for Android cellphones.
That application does, however, rely significantly on Internet connections. The suggested system,
however, may function both with and without Internet connectivity. It processes user input—such
as speech or text—and returns the results in a variety of formats, including an action that has to be
taken. Voice-activated home automation systems have the potential to improve living conditions
and simplify daily chores for users. For those with impairments, voice control in energy-efficient
buildings is especially helpful as it enables them to lead lifestyles that were before unreachable. A
voice assistant appears to be a computerized program agent that uses voice control technology to
carry out activities or offer services to a person. There are numerous virtual voice assistants on the
market right now, including Google Assistant, Apple's Siri, Samsung Bixby, Microsoft's Cortana,
Amazon's Alexa, and many more.

II. LITERATURE REVIEW


Diksha Goutam Presented the idea of a voice assistant, which enables users to operate systems
using voice commands. Individuals with disabilities who want to utilize computers or laptops
have been introduced to the idea of a desktop virtual assistant. This voice recognizer can be used
both online and offline, and it can carry out different tasks based on what the user needs. AI and
Python programming are used in the development of voice assistants. The operation of a voice
assistant involves a variety of processes, including context extraction, system calls, API calls,
text-to-speech, voice recognition, and natural language processing [1].
Harshil Asodariya, Keval Vachhani, Eishan Ghori, and others suggest creating a desktop voice
assistant that enables users to communicate with their computers via voice commands. The
assistant makes use of natural language processing and artificial intelligence to comprehend and
react to voice input on a desktop computer. User feedback has indicated that this technology is a
convenient and effective tool that satisfies the needs of our target audience [2].
As L. R. Rabiner and B. S. Atal have pointed out, speech analysis is frequently carried out in
conjunction with pitched analyses. The study created a pattern recognition technique to determine
whether a particular voice signal segment should be categorized as an audio signal, unvoiced
speech, or silence based on the signal parameters. The main drawback of the technique is that it
needs to be run on a certain range of dimensions and with specific recording parameters [3].
In 2020, Gaurav Agrawal, Prof. Ronak Jain, Divyanshu Jain, Harsh Gupta, and Chinmay
Jain:suggested a system that is designed using the Python programming language. Python 3.8.
Various libraries, including those for Text to Speech, Speech Recognition, and Short Mail
Transfer Protocols (SMTP), were used. The user must be familiar with the fundamentals of the
English language. The purpose of this document is to give users a quick and simple way to get
their queries addressed. However, there are several shortcomings, such as the absence of support
for system calls [4].
Vishal Kumar Dhanraj, Semal Mahajan, and Lokeshkriplani designed a Personal Virtual Assistant
for all Windows versions was suggested. For this project, they employed artificial intelligence
technologies with Python. When a user gives their personal virtual assistant instructions, the audio
impulses are translated from natural language to digital signals. Virtual assistants are capable of
doing several tasks. Some Python installer packages, such as those for speech recognition, Python
backend, API calls, content extraction, system calls, Google Text-to-Speech, JSON, Wikipedia,
datetime, and web browser, were used in the creation of the virtual assistant [5].

You might also like