0% found this document useful (0 votes)
5 views

Voice Assistant Using Python 2

Voice Assistant
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views

Voice Assistant Using Python 2

Voice Assistant
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 20

Voice

Assistant
using
Python
S.M.D.R. Government Polytechnic ,Dhule

Submitted By:-
• 323,Tejas Dilip Gokhale ,2200170123
• 321,Vishakha Sharad
Gangurde ,2200170120
• 307,Nandini Rakesh Patil,2200170102
• 318,Diksha Rajendra
Borse,2200170114
• 317,Jidnyesh Bhausaheb
Borse ,2200170113
Contents
• Introduction
• How does an Voice assistant work?
• Dataflow diagrams
• Automatic Speech Recognition
System Model
• Use case diagram
• Requirements
• Conclusion
Power of
communicat
ion
Introduction
• A Virtual assistant is a technology based on artificial
intelligence.The software user a device’s microphone
to receive voice requests while the voice output
takes place at the speaker.But the most exciting
thing happens between these two actions.
• It is a combination of several different
technologies;voice recognition,voice analysis and
language processing.
• It is completely developed using one of the most
powerful language python.
Workflow Model of Voice
Assistant
How does an intelligent
personal Voice assistant
work?
• User asks a personal assistant to perform a task.
• The natural language audio signal is converted into
digital that can be analyzed by the software.
• Compared with a database of the software using
an innovative algorithm to find suitable answer.
• This database is located on distributed servers in
cloud networks.For this reason,it must have a
reliable internet connection.
Study of Existing
System

 Mycroft: An open-source voice assistant


built on Python, which allows customization
and integration with a variety of platforms
and devices.
 Jarvis: A Python-based voice assistant. It
uses libraries for voice recognition, speech
synthesis, and natural language processing.
 Google Assistant & Amazon
Alexa:Python is commonly used to create
skills or routines for Alexa, or integrate
Google Assistant's API with Python
applications.
Limitations of Existing System
• Speech Recognition in Noisy Environments
• Limited Contextual Understanding
• Dependency on Cloud Services
• Personalization Challenges

Problem Identification
• Accurate Speech Recognition in Noisy
Environments
• Improved Contextual Understanding
• Privacy-Focused and Offline Functionality
• Enhanced Personalization
Problem
Definition
Voice Input

Speech-to-Text
Conversion

Search Request
Processing

Text-to-Speech
Conversion

Voice Output
Click icon to add picture

Dataflow Diagrams
DFD-0

Voice Personal
uUser Speech Action Computer
Voice
to Text
assistant

Responce

Voice Output
Text to speech
DFD-1

Voice Perform
comman
d
Personal action

User Voice Computer


Assitant
DFD-2

Microphone Google voice


Voice Perform API
comman action
This will
User
d

convert voice
This will convert
data into text
voice into binary
form.
Automatic Speech Recognition System
Model

Pronunciati
on
Languag
Acoustic Dictionary
e
Models
Models

Decod
Speech signal Feature er Recognized words
Extraction
Feature
Extraction
• Feature Extraction is a common to extract a set of features
from speech signal.
• Classification of feature extraction is carried out on the set
of features instead of the speech signals themselves.
• The Feature extraction stage seeks to provide a compact
representation of the speech waveform. This forms should
minimize the loss of information that discriminates
between words,and provide a good match with the
distributional assumption made by the acoustic models.
ACOUSTIC MODELS
• Acoustic model is a relationship between audio signal and phoneme
• Phoneme means one of the smallest unit of speech that makes one word
different from another word
PRONUNCIATION DICTIONARY
• The act or result of producing the sounds of speech,including
articulation,stress,and intonation.
• A phonetic transcription of a given word,sound,etc.
• An accepted standard of the sound and stress patterns of
word,phrase,etc.
LANGUAGE MODELS
• The language model provides context to distinguish between words and
phrases that sound similar.
for example,In American English the phrases “recognize speech” and
“wreek a nice beach”sound similar,but mean different things.
Requireme
nts
Software requirements
• Python 3.x, along with libraries like
SpeechRecognition,
• pyttsx3
• nltk
• spaCy
• Wikipedia
• Pyaudio
Requireme
nts
 Hardware requirements
• Device with microphone and speakers for input and output
 Operational Systems
• Cross-platform compatibility
 Additional
• Access to APIs or custom Database
• Internet
Voice Controlled Personal Assistant
Systems will use the Natural language
processing and can be integrated with
artificial intelligence techniques to
achieve a smart assistant that can
control the computer and applications
and even solve user queries using
web searches.It can be designed to
minimize the human efforts to
interact with many other subsystems,
which would otherwise have to be
performed manually.By achieving
this,the system will make human life
comfortable

Conclusion
Thank you

You might also like