Voice Assistant Using Python 2
S.M.D.R. Government Polytechnic, Dhule
Submitted By:
• 323, Tejas Dilip Gokhale, 2200170123
• 321, Vishakha Sharad Gangurde, 2200170120
• 307, Nandini Rakesh Patil, 2200170102
• 318, Diksha Rajendra Borse, 2200170114
• 317, Jidnyesh Bhausaheb Borse, 2200170113
Contents
• Introduction
• How does a voice assistant work?
• Dataflow diagrams
• Automatic Speech Recognition System Model
• Use case diagram
• Requirements
• Conclusion
Power of communication
Introduction
• A virtual assistant is a technology based on artificial intelligence. The software uses a device’s microphone to receive voice requests, and the voice output is played through the speaker. The most interesting part happens between these two actions.
• It combines several different technologies: voice recognition, voice analysis, and language processing.
• It is developed entirely in Python, a powerful general-purpose language.
Workflow Model of Voice Assistant
How does an intelligent personal voice assistant work?
• The user asks the personal assistant to perform a task.
• The natural-language audio signal is converted into digital data that the software can analyze.
• This data is compared with the software's database, using a matching algorithm, to find a suitable answer.
• The database is located on distributed servers in cloud networks. For this reason, the assistant needs a reliable internet connection.
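The lookup step above can be sketched as follows. This is a minimal illustration, assuming the request has already been converted to text. The `RESPONSES` table, the `find_answer` function, and the 0.6 cutoff are hypothetical stand-ins for the server-side database and matching algorithm, which the slides do not specify.

```python
import difflib

# Hypothetical stand-in for the assistant's database of known requests.
RESPONSES = {
    "what is your name": "I am your voice assistant.",
    "open the browser": "Opening the browser.",
    "what time is it": "Let me check the clock.",
}

def find_answer(request, cutoff=0.6):
    """Return the answer for the closest known request, or None if nothing is close."""
    normalized = request.lower().strip()
    # get_close_matches ranks the known requests by string similarity.
    matches = difflib.get_close_matches(normalized, list(RESPONSES), n=1, cutoff=cutoff)
    return RESPONSES[matches[0]] if matches else None
```

Fuzzy matching is used here because recognized text rarely matches a stored request character for character; a real assistant would likely use a more robust intent-matching method.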
Study of Existing System
Problem Identification
• Accurate Speech Recognition in Noisy Environments
• Improved Contextual Understanding
• Privacy-Focused and Offline Functionality
• Enhanced Personalization
Problem Definition
• Voice Input
• Speech-to-Text Conversion
• Search Request Processing
• Text-to-Speech Conversion
• Voice Output
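The five stages above might be wired together as in the sketch below. It assumes the SpeechRecognition and pyttsx3 libraries from the requirements list are installed; the `process` handler and the `run_once` function are hypothetical names introduced for illustration, not part of the project's actual code.

```python
def listen():
    """Voice input and speech-to-text: capture one utterance and return it as text."""
    import speech_recognition as sr  # third-party: SpeechRecognition (needs PyAudio)
    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        audio = recognizer.listen(source)
    # recognize_google sends the audio to a web API, so it needs internet access.
    return recognizer.recognize_google(audio)

def process(request):
    """Hypothetical request handler; a real assistant would search or act here."""
    if "hello" in request.lower():
        return "Hello, how can I help you?"
    return "Sorry, I did not understand: " + request

def speak(text):
    """Text-to-speech and voice output: read the text aloud."""
    import pyttsx3  # third-party text-to-speech engine
    engine = pyttsx3.init()
    engine.say(text)
    engine.runAndWait()

def run_once(listen_fn=listen, process_fn=process, speak_fn=speak):
    """Run one pass: voice input -> text -> processing -> speech output."""
    request = listen_fn()
    reply = process_fn(request)
    speak_fn(reply)
    return reply
```

Passing the stages in as parameters lets the processing logic be tried without a microphone, for example `run_once(listen_fn=lambda: "hello", speak_fn=print)`.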
Dataflow Diagrams
DFD-0
[Diagram labels: User, Speech, Voice to Text, Personal Voice Assistant, Action, Computer, Response, Text to Speech, Voice Output]
DFD-1
[Diagram labels: Voice command, Convert voice data into text, Personal assistant, Perform action. Note on the diagram: this will convert voice into binary form.]
Automatic Speech Recognition System Model
[Diagram: Speech signal → Feature Extraction → Decoder → Recognized words; the decoder draws on the Acoustic Models, the Pronunciation Dictionary, and the Language Models]
Feature Extraction
• Feature extraction is a common step that derives a set of features from the speech signal.
• Classification is carried out on this set of features rather than on the speech signals themselves.
• The feature extraction stage seeks to provide a compact representation of the speech waveform. This form should minimize the loss of information that discriminates between words, and provide a good match with the distributional assumptions made by the acoustic models.
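To make the idea of a compact representation concrete, the sketch below splits a waveform into short overlapping frames and reduces each frame to two numbers. Short-time energy and zero-crossing rate are used here only as simple stand-ins; practical recognizers typically use richer features such as MFCCs. The function names, frame size, and hop size are assumptions chosen for illustration.

```python
import math

def frame_signal(samples, frame_size, hop):
    """Split a list of samples into overlapping frames of equal length."""
    return [samples[i:i + frame_size]
            for i in range(0, len(samples) - frame_size + 1, hop)]

def frame_features(frame):
    """Return (mean energy, zero-crossing rate) for one frame."""
    energy = sum(s * s for s in frame) / len(frame)
    # A crossing is counted whenever neighboring samples differ in sign.
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return energy, crossings / (len(frame) - 1)

def extract_features(samples, frame_size=400, hop=160):
    """Turn a waveform into one small feature vector per frame."""
    return [frame_features(f) for f in frame_signal(samples, frame_size, hop)]

# Synthetic test signal: one second of a 440 Hz tone sampled at 16 kHz.
RATE = 16000
tone = [math.sin(2 * math.pi * 440 * n / RATE) for n in range(RATE)]
features = extract_features(tone)
```

The 16,000 raw samples are reduced to a much shorter list of two-value vectors, which is the kind of compression the feature extraction stage aims for.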
ACOUSTIC MODELS
• An acoustic model represents the relationship between an audio signal and phonemes.
• A phoneme is one of the smallest units of speech that distinguishes one word from another.
PRONUNCIATION DICTIONARY
• Pronunciation is the act or result of producing the sounds of speech, including articulation, stress, and intonation.
• A phonetic transcription of a given word, sound, etc.
• An accepted standard of the sound and stress patterns of a word, phrase, etc.
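In a recognizer, a pronunciation dictionary typically maps each word to its phoneme sequence. The sketch below shows the idea with a handful of hand-written, ARPAbet-style entries (stress marks omitted); the entries and the function names are illustrative assumptions, not data from the project.

```python
import difflib

# Hand-written, ARPAbet-style entries for illustration only (stress marks omitted).
PRONUNCIATIONS = {
    "recognize": ["R", "EH", "K", "AH", "G", "N", "AY", "Z"],
    "speech": ["S", "P", "IY", "CH"],
    "wreck": ["R", "EH", "K"],
    "a": ["AH"],
    "nice": ["N", "AY", "S"],
    "beach": ["B", "IY", "CH"],
}

def to_phonemes(phrase):
    """Look up each word and concatenate the phoneme sequences."""
    phonemes = []
    for word in phrase.lower().split():
        phonemes.extend(PRONUNCIATIONS[word])
    return phonemes

def phonetic_similarity(phrase_a, phrase_b):
    """Similarity (0 to 1) between the phoneme sequences of two phrases."""
    matcher = difflib.SequenceMatcher(None, to_phonemes(phrase_a), to_phonemes(phrase_b))
    return matcher.ratio()
```

Comparing phoneme sequences rather than spellings shows why two phrases with very different text can still be confused by a recognizer.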
LANGUAGE MODELS
• The language model provides context to distinguish between words and phrases that sound similar.
For example, in American English the phrases “recognize speech” and “wreck a nice beach” sound similar but mean different things.
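One simple way a language model can supply this context is by scoring word sequences with bigram counts. The sketch below is a toy version under stated assumptions: the four-sentence `CORPUS` is invented for illustration, add-one smoothing is one of several possible choices, and the function names are hypothetical.

```python
import math
from collections import Counter

# Tiny hypothetical training corpus; a real model is trained on far more text.
CORPUS = [
    "we recognize speech with a computer",
    "computers recognize speech every day",
    "it is hard to recognize speech in noise",
    "we had a nice day at the beach",
]

def train_bigrams(sentences):
    """Count unigrams and bigrams, with <s> marking the sentence start."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        words = ["<s>"] + sentence.split()
        unigrams.update(words)
        bigrams.update(zip(words, words[1:]))
    return unigrams, bigrams

def log_probability(phrase, unigrams, bigrams):
    """Add-one smoothed bigram log-probability of a phrase."""
    vocab_size = len(unigrams)
    words = ["<s>"] + phrase.split()
    total = 0.0
    for prev, word in zip(words, words[1:]):
        total += math.log((bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size))
    return total

UNIGRAMS, BIGRAMS = train_bigrams(CORPUS)

def pick_best(candidates):
    """Choose the candidate phrase the language model finds most likely."""
    return max(candidates, key=lambda p: log_probability(p, UNIGRAMS, BIGRAMS))
```

Calling `pick_best(["recognize speech", "wreck a nice beach"])` prefers the phrase whose word pairs appear in the corpus. Note that this unnormalized score also favors shorter phrases, which a production decoder would account for.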
Requirements
Software requirements
• Python 3.x, along with libraries such as:
• SpeechRecognition
• pyttsx3
• nltk
• spaCy
• wikipedia
• PyAudio
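Before running the assistant, it can help to confirm that these libraries are importable. The sketch below assumes the usual import names for the packages listed above (for example, SpeechRecognition is imported as `speech_recognition`); the `missing_modules` helper is a hypothetical name introduced here.

```python
import importlib.util

# Import names for the libraries listed above; some differ from the package names.
REQUIRED_MODULES = ["speech_recognition", "pyttsx3", "nltk", "spacy", "wikipedia", "pyaudio"]

def missing_modules(names=REQUIRED_MODULES):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

An empty list from `missing_modules()` means every listed library was found; any names returned still need to be installed.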
Hardware requirements
• Device with microphone and speakers for input and output
Operating Systems
• Cross-platform compatibility
Additional
• Access to APIs or custom Database
• Internet
Conclusion
Voice-controlled personal assistant systems use natural language processing and can be integrated with artificial intelligence techniques to build a smart assistant that controls the computer and its applications and even answers user queries through web searches. Such a system can be designed to minimize the human effort needed to interact with many other subsystems, which would otherwise have to be operated manually. By achieving this, the system makes everyday life more comfortable.
Thank you