Minor Project Report
© 2024, IRJET | Impact Factor value: 8.226 | ISO 9001:2008 Certified Journal | Page 2057
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 11 Issue: 04 | Apr 2024 www.irjet.net p-ISSN: 2395-0072
Our research aims to pave the way for more efficient and
natural human-computer interactions by integrating voice
assistant technology into desktop computing
environments.
3. PROPOSED DESIGN
The project provides a working understanding of an intelligent assistant that can comprehend user commands. Our voice assistant recognizes commands given by the user through speech, complies with them, and responds as required. We develop a prototype of an automated personal assistant capable of understanding user commands and performing tasks across various domains, including scheduling, information retrieval, and task automation. The proposed desktop voice assistant will be built as a cross-platform desktop application with web technologies, using the Electron JS framework. With HTML, CSS, and JavaScript, Electron offers a single development environment that makes it easy to integrate with existing web-based technologies while developing desktop apps. Python will be used to implement the application's backend features, which include speech recognition and natural language processing. Python provides strong libraries and tools for these tasks, such as pyttsx3, SpeechRecognition, and PyAudio.

Our study aims to achieve several goals, including:

- To investigate the design principles and technical architecture of desktop voice assistants.
- To examine the difficulties and development process involved in creating a voice-activated desktop interface.
- To investigate the factors related to user experience and usability when using a desktop voice assistant.
- To test and assess the voice assistant application to obtain user feedback and insights.
- To determine future directions and possible improvements for desktop voice assistant technology.

Figure 1: Use-Case Diagram

Figure 1 shows the use-case diagram for the interaction between the user and the voice assistant. The user initiates the conversation by speaking to the assistant, which then recognizes the speech, interprets the user's intent, carries out the relevant tasks, and generates a fitting response. The assistant must either furnish the requested information or perform the necessary action.

4. METHODOLOGY

When a user opens the Voice Assistant, they are presented with a desktop application through which they interact with the platform. The backend functionality of the assistant was built using Python, while web technologies such as HTML, CSS, and JavaScript were used to create the frontend interface through Electron JS. FastAPI was used to enable communication between the frontend and backend components. Various modules were developed and integrated using specific technologies and frameworks to support the Voice Assistant's basic operations, AI, voice recognition, face recognition, emotion identification, and system application control. To give the program a system voice, we used SAPI5 and pyttsx3. The latter is a Python text-to-speech conversion module that operates offline. Additionally, we incorporated the SpeechRecognition library, which provides numerous built-in functions. Together, these libraries allow users to send voice commands to the assistant and receive spoken responses in return. Upon recording, the assistant's underlying algorithms convert the user's voice command to text.
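As a rough illustration of the speech input and output described above, the following Python sketch pairs pyttsx3 with the SpeechRecognition library. It is not the project's actual code: the function names speak and listen, the one-second pause threshold, and the choice of Google's web recognizer are assumptions made for this example, since the paper does not name a recognition engine. Both third-party libraries are imported lazily so that the module loads even where they are not installed.

```python
def speak(text, engine=None, driver_name=None):
    """Speak `text` aloud using pyttsx3, which works offline."""
    if engine is None:
        import pyttsx3  # third-party: pip install pyttsx3
        # driver_name="sapi5" selects the Windows speech API; None lets
        # pyttsx3 pick the default driver for the current platform.
        engine = pyttsx3.init(driverName=driver_name)
    engine.say(text)
    engine.runAndWait()  # block until the queued speech has been spoken


def listen(pause_threshold=1.0):
    """Record one utterance from the microphone and return it as text.

    Returns None when the audio cannot be transcribed or the
    recognition service cannot be reached.
    """
    import speech_recognition as sr  # third-party: SpeechRecognition + PyAudio

    recognizer = sr.Recognizer()
    # Seconds of silence that mark the end of a spoken phrase.
    recognizer.pause_threshold = pause_threshold
    with sr.Microphone() as source:
        audio = recognizer.listen(source)
    try:
        # Assumption: Google's web recognizer; the paper does not name one.
        return recognizer.recognize_google(audio).lower()
    except (sr.UnknownValueError, sr.RequestError):
        return None
```

On Windows, calling speak("Hello", driver_name="sapi5") would route speech through SAPI5. Passing a ready-made engine lets a long-running assistant reuse one engine instead of creating a new one for every response.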
When a user gives a command such as "Open Notepad" or
"Play a Song on YouTube", the voice assistant responds by
executing the command. The assistant waits for a pause to
indicate that the user has finished speaking, then searches
its database for the user's command and executes it accordingly.

- The user's request is split into separate commands to aid understanding by the voice assistant.
- Once the command list is accessed, the assistant searches it and compares the request with the stored commands.
- The list of matching commands is then sent back to the voice assistant.
- After receiving the commands, the voice assistant knows what action to take next.
- If a request is unclear, the voice assistant asks for clarification to ensure it understands the user's desired outcome.
- If the voice assistant understands the user's request, it performs the task.

In the final phase, distribution, the Voice Assistant was deployed to desktop environments while adhering to operating system standards and ensuring compatibility with various platforms.

and mapped to an action. The assistant executes the action and generates a response, which is spoken to the user.

5. IMPLEMENTATION & RESULTS

"Sifra," the voice assistant program for desktop use, is designed with an intuitive user interface. When the application is launched, users are presented with a login page that uses facial recognition for authentication. This system is crucial for identifying the user and securing access to the Assistant interface. Authenticating the user limits access to authorized users, who alone can interact with the Assistant and execute commands. Ultimately, this feature safeguards user privacy and provides a trusted and secure interface.
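The command-handling steps listed in the methodology (split the request, compare each part with the stored commands, then either act or ask for clarification) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not Sifra's implementation: the COMMANDS table, the action names, and the rule of splitting on commas, "and", and "then" are all hypothetical.

```python
import re

# Hypothetical command list: trigger phrase -> action name.
COMMANDS = {
    "open notepad": "open_notepad",
    "take screenshot": "take_screenshot",
    "send mail": "send_mail",
}


def split_request(request):
    """Split one spoken request into separate lowercase commands."""
    parts = re.split(r"\s*(?:,|\band then\b|\bthen\b|\band\b)\s*", request.lower())
    return [part.strip() for part in parts if part.strip()]


def match_command(command):
    """Return the action whose trigger phrase occurs in `command`, else None."""
    for phrase, action in COMMANDS.items():
        if phrase in command:
            return action
    return None


def handle_request(request):
    """Return (command, action) pairs; action is None when unclear."""
    return [(command, match_command(command)) for command in split_request(request)]


def respond(command, action):
    """Build the reply: confirm a matched action or ask for clarification."""
    if action is None:
        return f'Sorry, I did not understand "{command}". Could you rephrase?'
    return f"Running {action}."
```

For the request "Open Notepad and then dance", handle_request pairs "open notepad" with open_notepad and "dance" with None, so respond would confirm the first command and ask the user to rephrase the second.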
assistant searches Google for the requested content and opens it in the browser.

- Tells the weather for the location the user asks about. When asked for "current weather in" a location, the assistant reports the maximum and minimum temperatures for that location in degrees Celsius.
- Takes a screenshot of the display. When asked to "capture", "capture my screen", "my screen", "screenshot", or "take screenshot", the assistant captures the display the user is using and stores it in the specified path.
- Sends an email to the recipient specified by the user. When told "send mail", the assistant asks to whom the mail should be sent and sends it accordingly.
- Can shut down or restart the system on the user's command, etc.

Assigning a Task to the Assistant:

Figure 5: Asking the assistant to open Notepad
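The trigger phrases listed above suggest a simple keyword-based mapping from recognized text to a task. The Python sketch below shows one possible way to do this, including extraction of the location from a "current weather in" request. The function name parse_command, the intent labels, and the matching order are assumptions for illustration; the paper does not describe how Sifra matches phrases internally.

```python
import re

# Screenshot trigger phrases quoted in the feature list above.
SCREENSHOT_PHRASES = (
    "capture my screen",
    "take screenshot",
    "screenshot",
    "my screen",
    "capture",
)


def parse_command(text):
    """Map recognized text to a hypothetical (intent, argument) pair."""
    text = text.lower().strip()

    # "current weather in <location>" carries the location as its argument.
    weather = re.search(r"current weather in\s+(.+)", text)
    if weather:
        return ("weather", weather.group(1).strip(" ?.!"))

    if any(phrase in text for phrase in SCREENSHOT_PHRASES):
        return ("screenshot", None)
    if "send mail" in text:
        return ("send_mail", None)
    if "shut down" in text or "shutdown" in text:
        return ("shutdown", None)
    if "restart" in text:
        return ("restart", None)
    return ("unknown", None)
```

Because the text is lowercased before matching, the extracted location is lowercase as well. A caller would pass it to whichever weather service is in use and fall back to a clarification prompt for the "unknown" intent.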
6. CONCLUSION
understand language better, do more things, and work with new tech. Overall, the Voice Assistant project is a big step forward in how people and computers work together, making computers easier and more personal to use.

ACKNOWLEDGEMENT

We would like to sincerely thank Dr. Mahendra Pawar, our project guide, and the rest of the computer department's teaching staff for their valuable suggestions, encouragement, and support during this research. Their knowledge and insights have been crucial in determining the direction and scope of this work. We also thank the Department Head, Dr. Rais Mulla, for his unwavering encouragement and assistance. His leadership has been an inspiration to us. We would also like to thank our peers and colleagues for their contributions, as they have offered insightful criticism and recommendations. Their advice has substantially improved our work.