0% found this document useful (0 votes)

108 views7 pages

Voice Assistant Using Python and AI

Uploaded by

atharv.choughule

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

108 views7 pages

Voice Assistant Using Python and AI

Uploaded by

atharv.choughule

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056

Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072

Voice Assistant Using Python and AI

Divisha Pandey1, Afra Ali2, Shweta Dubey3, Muskan Srivastava4, Shyam Dwivedi5, Md. Saif Raza6

1, 2,3,4, Student of B. Tech fourth year, Department of Computer Science and Engineering, Rameshwaram Institute of
Technology & Management, Lucknow, India
5Assistant Professor and Head of Department CSE, Rameshwaram Institute of Technology and Management,

Lucknow, India
6
Assistant Professor, Department of CSE, Rameshwaram Institute of Technology and Management, Lucknow, India
------------------------------------------------------------------------***------------------------------------------------------------------------------
Abstract – Today’s era is the era of digitalization. Having smart phones and desktops is no less than having the world on our
fingertips. Our lifestyle is involving being busy day by day. That busy, that people even find it a load to even type something to
perform a task. So here comes virtual assistant at rescue. Just speak to it and the task is done. From sending a hello on
WhatsApp to your friend to sending a full fleshed email to your boss virtual assistant will do it all for you. With time voice
search is dominating over text searching. But what are virtual assistants? A software program that helps us perform our daily
task just by speaking to it is a virtual assistant. A waking word is necessary to activate the software. This system can be used
efficiently on desktops. The premise behind starting this project was that the data present on the web is sufficient and is
openly available that can be used to build a virtual assistant that can make and perform intelligent decision for the user.

Index Terms – Python, Artificial Intelligence, Natural Language Processing, Speech Recognition.

1. INTRODUCTION

We are living in the era of technology where the era is replacing human beings by machines. Lifestyle and productivity are
the main reason behind this performance change and will also evolve with coming time. We need machine that think like
humans and perform the task given to them by human beings, and to do so we are training them. And as a result of one of
these training came the concept of virtual assistant.
A virtual assistant is self-employed software who is specialized in offering administrative services to clients from
remote location, usually a home office. Scheduling appointments, making phone calls, booking tickets, sending messages
and what not a virtual assistant can perform them all. It uses voice recognition features and language processing
algorithms to perform a task by recognizing the voice command of users. Filtering out irrelevant noise and background
disturbances are ignored by the assistant itself and give out relevant information as per the user requirement. This is a
software-based technology but companies nowadays are creating special devices integrated with this system that perform
tasks. Amazon Alexa is one such example.

Fig -1: Backend Working of Virtual Assistant

Day by day drastic changes are forming out in technologies. These changes are making it necessary to train our machines
with advancement. Deep learning, machine learning and neural network are some of the current technologies that involve
in the training of machines for their advancement. Voice assistant have made possible human and machine conversation.

© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 832
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072

Basically, we can say that these assistants are next level of advancement in development. The main privileged parts of the
society who are benefiting from these assistants are old age, blind, physically challenged, and children. Blind people who
cannot see can even interact with the machine with their voice only. Following are few tasks that can be performed by
virtual assistant:-
1. Reading out newspaper 5. Playing YouTube video 9. Run any application
2. Sending emails 6. Making notes 10. Checking stock price
3. Searching among web 7. Setting up alarm 11. Playing game
4. Playing music 8. Giving weather updates

These listed examples are only few task of the assistant. It can perform many more task as per the demand of the user.
The voice assistant developed by us is for the Windows user. This voice based module is desktop based which is built using
python modules and libraries. It is a basic version that can perform the entire basic day to day task assigned to them by the
user operating it. Few of the tasks to be performed by our assistant is listed above. The current technology is good in many
aspects but still can be improved by merging it with Machine Learning and Internet of Things (IoT). Python modules and
libraries have been used by us along with artificial intelligence and machine learning for training our model. Some
windows command has also been used by us in our model for making it to run smoothly on window operating system.
Basically, there are three working modes of our model:-

1. Supervised Learning 2. Unsupervised Learning 3. Reinforcement Learning

It can be used according to the requirement of the user. Machine learning and Deep learning along with natural language
processing concepts help us in achieving our goal and performing our desired task. With assistant we don’t need to type
the command again and again for performing the particular task. After creation the model can be used any number of
times by any number of users easily. Basically, this virtual assistant we can control many things on a single platform.

2. LITERATURE SURVEY
1.Bassam A, Raja N. et al, have wrote about statement and speech for communication between humans and machines
analog signals are used which is converted by speech signal to digital wave. The technology is massively utilized and has
unlimited uses and also permit machines to reply accordingly to users command and voices. Speech recognition system is
growing day by day and also has unlimited uses.
2 B.S.
Atal and L.R. Rabiner et al, has explained regarding speech analysis, and the theory is getting evolved day by day. The
research performed describes a pattern recognition technique for the determination of voice. It determines that the voice
input is weather voiced speech, unvoiced, or silence. It completely depends upon the dimensions finishing on the signal.
The system although comes with restrictions and the main restriction here is the requirement for exercising the algorithm
on the exact set of dimensions picked, and also for recording circumstances.
3. V. Radha and C. Vimala et al, explained about the most suitable way of communication between humans is speech. Since

speech recognition is an utmost technique of recognition, hence it makes human beings identical and makes it easier for
machines to recognize them. This helps in autonomous speech recognition and also has a lot of reputation. Some of the
most used speech recognition techniques are Dynamic Time Warping (DTW), HMM. For feature mining of speech Mel
Frequency Cepstrum Coefficients (MFCC), it offers a group of characteristic vectors of speech waveform. Studies have
revealed that MFCC is more precise and real than other mining approaches in speech recognition. The research has been
done on MATLAB and the outcomes on investigation depict that the system is capable in identification of words at a great
satisfactorily accuracy.
4. T.Schultz and A. Waielet al, explained about the spreading of speech technology products around the world. The research
tells about the query on how to port huge vocabulary incessant speech recognition (LVCSR) systems in a fast and well-
organized manner. However, there is a need to evaluate the acoustic models for novel destination language by means of
speech information from different source languages. But the restricted data from destination language identification
outcomes using language dependent, independent and language adaptive acoustic models are deliberated in the
framework of Global Phone project which examines LVCSR methods in 15 languages.

© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 833
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072

5. J. B. Allen et al has described Language as the utmost and significant means of communication and speech is its major

interface. For the interface creation between humans and machines, the speech signals were converted into analog and
digital wave shape as for the machine to understand. Speech technologies today permit the machines to react
appropriately according to human speeches and offers valuable and appreciated services. The carried out research gave
the result in terms of speech identification procedure, its basic model, its application, and techniques and also describe
several other research techniques that are necessary for speech recognition system. SRS is an emerging technology and is
increasing its vitality day by day gradually and also has infinite applications.
6. Mugdha Bapat, Pushpak Bhattacharyya et al, described morphological analyzer for almost of the Indian languages. At the
starting phase the planning was about some extent homomorphism “boos trappable” encryption technique. The research
proved out to be a great success for Marathi language that resulted in engagement of the Finite State Systems for the
demonstration of language in a sophisticated way. Since Marathi has a really difficult morphotactics hence the growth of
FSA is one of significant assistances.
7. G.
Muhammad, M.N. Huda et al, presented an ASR model for the Bangla digits. To carry out this research the information
was gathered for general Bangladeshi public. For identification purpose Mel-frequency cepstral coefficients (MFCCs) and
hidden Markov model (HMM) were used. In the trial it was discovered that female spoken digits have higher accuracy than
male spoken digits.
8. SeanR Eddy et al researched on Hidden Markov Models. They are basically a common statistical designing approach for
issues like sequences or time series. These methods are extensively being used in the process of speech recognition. With
the help of HMM formalism, it is possible to create a relation between formal, completely probabilistic techniques to
profiles and gapped structure arrangements. Steady theory for insertion and deletion, constant structure for joining
structural and sequence data are some of the popular offerings of HMM. It also makes sequence arrangements more
refining. It also makes satisfactorily arrangements for difficult threading techniques for protein reverse fold.

3. FEATURES OF VOICE ASSISTANT

TASK PERFORMANCE
A task is a piece of work to be done or undertaken. It can be occurring once or on repetition. A task that is
occurring on repetition is known as recurring task. Its repetition can occur at some certain intervals or at a pre appointed
time to the system in some cases. Let us understand it better with an example, suppose our team lead wants the progress
of our work on every Thursday, so we will add it to the recurring task list. Once we mark the current week task as done at
the desired time we will start getting reminders about the task of the upcoming week . Similarly, Task Request can also be
created by the user. With the help of task request a user can assign task to different users. Another feature that is a task list
is associated to task request. This list contains information like who assigned the task, who are assigned the task, date of
assigning, and followed by reassigning of the task.
INTERNET SOLICITATION
The assistant allows the person to engage with the internet for accessing of information like weather, directions,
schedules, stock performance, news etc, and that also just using simple voice command. The growth of internet is creating
a vast new network - a Voice Web – that help in accessing internet content just by the use of human voice. It can be called
as a voice portal to access the web. It creates a platform for users with natural language interface to access the web
content.
SYSTEM ARCHITECTURE
The system architecture of this project shows flow of control through the system. The hardware and software
specifications are also depicted here. The architecture diagram is as follows

© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 834
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072

Fig -2: Architecture Of Virtual Assistant

HARDWARE AND SOFTWARE REQUIREMENTS

 HARDWARE  SOFTWARE
 A desktop / laptop  Windows 8 and higher
 Minimum 512 MB RAM  Selenium Web Automation
 Internet connectivity  SQLite
 USB debugging mode for development
and testing
 Pentium-pro processor or later

4. SYSTEM DESIGN AND IMPLEMENTATION

EXISTING MODEL

Out of all the existing projects in the market most of them only use speech recognition using neural network.
Although their system give result based on moderate accuracy. Few of the techniques used by them are-
 CONTEXT AWARE COMPUTING
Context-aware computing is a style of computing in which situational and environmental information about
people, places and things is used to anticipate immediate needs and proactively offer enriched, situation-aware and usable
content, functions and experiences. The main use of this technique is to recognise the word spoken by the peoples and also
presuppose the mispronounced words.

 MEL-FREQUENCY CEPSTRAL COEFFICIENTS

MFCC is the collection of coefficients; this technique aims to develop the features from the audio signal which
can be used for detecting the phones in the speech. It is widely used technique for extracting the features from the audio
signal.

 NATURAL LANGUAGE PROCESSING

NLP is the branch of computer science more widely it is the branch of artificial intelligence that helps in the
interaction between humans and machines. It is due to the existence of NLP only that makes possible for computers to
read text, hear speech, interpret it, measure sentiment and determine which parts are important.

© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 835
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072

PROPOSED MODEL

 SPEECH TO TEXT

It is software that enables the recognition of human language and also convert in into the language
understood by machines using computer linguistics. It is also known as speech recognition.

 TEXT ANALYZING

 Inputs provided are just letters for computer.

 Software converts the speech into machine understood language.
 Commands are understood by the computers, virtual assistants convert this text to command.
 Virtual assistants convert or relate the words to functions and parameters for the creation of a command to
be understood by the computer.

The major milestone of our project is trying to increase the accuracy of speech to text software. The model will
basically be able to convert any speech with modulations or different accents with a higher accuracy on the day to day
basis. The given model is combines voice recognition with neural network to increase the precision.

5. WORKING PRINCIPLES

The virtual assistant involve following principles for working:

Natural Language Processing:

A method used in artificial intelligence for communicating with the machines or an intelligent system is known
as natural language processing (NLP). Processing of natural language is required when humans want to make machines
like robots to follow their command and also respond to them in human language. Five steps of natural language
processing are-

Fig -3: Working Model Of NLP

AUTOMATIC SPEECH RECOGNITION

This feature helps the machine to understand the command as per user’s input. The architecture of speech
recognition system is given below:

© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 836
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072

Fig -4: Architecture of Automatic Speech Recognition

ARTIFICIAL INTELLEGENCE

Artificial Intelligence (AI) is a way of making the machines performs the tasks given by humans in a way a
human will do it. A machine can calculate, perceive analogies, learn from experiences, store and retrieve information in its
memory, solve problems, use natural language, classification, and generalization and even adapt to environment and many
more, this all has been made possible due to the presence of artificial intelligence.

Fig -5: Branches of Artificial Intelligence

INTER PROCESS COMMUNICATION

Inter process communication between operating system and the undergoing or ongoing processes.

6. CONCLUSION

The paper tells about the new emerging technology for the desktop users. The virtual assistant provides a smart working
experience for the desktop user over the web. This new service is based on internet of things, speech recognition and
various other modern technologies like artificial intelligence, natural language processing and deep learning. Virtual
Assistant reduces the interruption of user, reduce the working time performance, and provide single platform for doing all
sort of work such as sending messages, contacting, and various other information. The system has become an ideal
platform for millions of user around the globe. It also overcomes many of the drawbacks of the existing system. It is
basically more efficient than various other existing software in the market. Although it has some of its own limitations.
Though it has high efficiency and also may have higher time consumption for task completion. Also the algorithms used
make it quite a challenge to tweak it in the near future.

© 2022, IRJET | Impact Factor value: 7.529 | ISO 9001:2008 Certified Journal | Page 837
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 09 Issue: 05 | May 2022 www.irjet.net p-ISSN: 2395-0072

7. REFERENCES

[1]. G.O. Young, “Synthetic structure of industrial plastics (Book style with paper title and editor),” in Plastics, 2nd ed.
Vol. 3, J. Peters, Ed. New York: McGraw-Hill, 1964, pp. 15-24.

[2]. M. Bapat, H. Gune, and P. Bhattacharyya, “A paradigm-based finite state morphological analyzer for marathi,” in
Proceedings of the 1st Workshop on South and Southeast Asia Natural Language Processing (WSSANLP), pp. 26-
34, 2010.

[3]. Knote, R., Janson, A., Eigenbrod, L. and Sollner, M., 2018. The What and How of Smart Personal Assistants:
Principles and Application Domains for Is Research.

[4]. V. Radha and C. Vimala, “A review on speech recognition challenges and approaches,” doaj. Org, vol. 2, no. 1, pp. 1-
7, 2012.

[5]. G. Muhammad, Y. Alotaibi, M.N. Huda, et al, pronunciation variation for asr: A survey of the“Automatic speech
recognition for bangla digits, literature” Speech Communication, vol. 29, no. in Computers and Information
Technology, 2009.2, pp. 225-246, 1999.

8. BIOGRAPHIES

Divisha Pandey - She is currently a student of B. Tech fourth year , Dept. of Computer Science and
Engineering , Rameshwaram Institute of Technology & Management, Lucknow and working on Virtual
Assistant using Python and AI.

Shweta Dubey - She is currently a student of B. Tech fourth year, Dept. of Computer Science and
Engineering , Rameshwaram Institute of Technology & Management, Lucknow and working on Virtual
Assistant using Python and AI.

Afra Ali - She is currently a student of B. Tech fourth year, Dept. of Computer Science and
Engineering , Rameshwaram Institute of Technology & Management , Lucknow and working on Virtual
Assistant using Python and AI.

Muskan Srivastava - She is currently a student of B. Tech fourth year, Dept. of Computer Science and
Engineering , Rameshwaram Institute of Technology & Management , Lucknow and working on Virtual
Assistant using Python and AI.

Shyam Dwivedi - He is currently Working as an Assistant Professor and of Head of Department

in Rameshwaram Institute of Technology and Management , Lucknow , India . He is M.TECH –
2012 BIT Mesra , Ranchi , he has a teaching experience of 10 years and 1 – year in TCS Industrial
experience.

01a EBO 2022 Architecture v06 HANDOUT
No ratings yet
01a EBO 2022 Architecture v06 HANDOUT
19 pages
2021 Canovate Mobile DC
No ratings yet
2021 Canovate Mobile DC
46 pages
AI-Based Virtual Assistant Using Python A Systematic Review
No ratings yet
AI-Based Virtual Assistant Using Python A Systematic Review
7 pages
03 Network Basics For Cloud Computing
No ratings yet
03 Network Basics For Cloud Computing
26 pages
Infiniband
No ratings yet
Infiniband
39 pages
High Availability Power Systems Redundancy Options-Seminar Report
100% (2)
High Availability Power Systems Redundancy Options-Seminar Report
23 pages
Advancing Fiber Optic Connectivity
No ratings yet
Advancing Fiber Optic Connectivity
16 pages
Solution Components: Video Surveillance
No ratings yet
Solution Components: Video Surveillance
6 pages
Guideline Zica Outbreak Management
No ratings yet
Guideline Zica Outbreak Management
36 pages
Presentation StruxureWare DCE
No ratings yet
Presentation StruxureWare DCE
120 pages
Nvidia h100 Datasheet 2430615
No ratings yet
Nvidia h100 Datasheet 2430615
4 pages
Secure Power Product
No ratings yet
Secure Power Product
80 pages
Présentation STULZ - 111123 PDF
100% (1)
Présentation STULZ - 111123 PDF
53 pages
Classification of Data Center Operations Technology (OT) Management Tools
100% (1)
Classification of Data Center Operations Technology (OT) Management Tools
16 pages
Training Infographics
No ratings yet
Training Infographics
20 pages
GPU As A Service Market Size To Surpass USD 33.91 Billion
No ratings yet
GPU As A Service Market Size To Surpass USD 33.91 Billion
8 pages
Immersion Cooling As The Next
No ratings yet
Immersion Cooling As The Next
6 pages
InRow Cooling Catalogue
No ratings yet
InRow Cooling Catalogue
15 pages
Rack Solutions APC
No ratings yet
Rack Solutions APC
36 pages
Session 3 & 4 (CH 2 - Approaches To Valuation - DCF & RV)
100% (1)
Session 3 & 4 (CH 2 - Approaches To Valuation - DCF & RV)
44 pages
Next-Generation Data Center Facility
No ratings yet
Next-Generation Data Center Facility
22 pages
Joseph Guevarra - Doc Research The Effect of Mobile Phone
No ratings yet
Joseph Guevarra - Doc Research The Effect of Mobile Phone
12 pages
24.02.2023 - Tender For Project Management Consultancy Services
No ratings yet
24.02.2023 - Tender For Project Management Consultancy Services
88 pages
Telkom Cloud: Milono W. Wibowo Divisi Multimedia - PT. TELKOM 021-70257788
100% (1)
Telkom Cloud: Milono W. Wibowo Divisi Multimedia - PT. TELKOM 021-70257788
22 pages
Rack Power Distribution System
No ratings yet
Rack Power Distribution System
24 pages
Struxureware For Data Centers: Data Center Infrastructure Management (Dcim) Software
No ratings yet
Struxureware For Data Centers: Data Center Infrastructure Management (Dcim) Software
11 pages
Starline Mission-Critical BrochureJAN20 US
No ratings yet
Starline Mission-Critical BrochureJAN20 US
8 pages
Busway Specification
100% (1)
Busway Specification
6 pages
CAST Diagrams-Which Two Quadrants To Use?
No ratings yet
CAST Diagrams-Which Two Quadrants To Use?
2 pages
ABB Ability Datacenter Automation - ColoCONNECT Days 2017 PDF
No ratings yet
ABB Ability Datacenter Automation - ColoCONNECT Days 2017 PDF
26 pages
Huawei IDS1000 All-In-One Container Data Center Solution Brochure
No ratings yet
Huawei IDS1000 All-In-One Container Data Center Solution Brochure
4 pages
THERMAL 5. Cold Aisle Containment - VERTIV
No ratings yet
THERMAL 5. Cold Aisle Containment - VERTIV
56 pages
AFRALTI-TDM Training Workshop On Network Synchronization, Maputo Mozambique, November 2011
No ratings yet
AFRALTI-TDM Training Workshop On Network Synchronization, Maputo Mozambique, November 2011
46 pages
Synopsis
No ratings yet
Synopsis
12 pages
Faraday Law of Electromagnetic Induction
No ratings yet
Faraday Law of Electromagnetic Induction
5 pages
CCTV Traning
No ratings yet
CCTV Traning
64 pages
DCR Report 2022
No ratings yet
DCR Report 2022
24 pages
Bus Duct Trunking System
No ratings yet
Bus Duct Trunking System
15 pages
Pricing Strategy - Lesson 1
No ratings yet
Pricing Strategy - Lesson 1
59 pages
Proposal Template - 10 Slides - Corporate
No ratings yet
Proposal Template - 10 Slides - Corporate
14 pages
Business Icon Pack 223
No ratings yet
Business Icon Pack 223
3 pages
C3 - Conducted and Wireless Media - Question
No ratings yet
C3 - Conducted and Wireless Media - Question
11 pages
Cabling TIA
No ratings yet
Cabling TIA
6 pages
The Results of Our 2022 Global Data Center Survey - Uptime Institute's Largest and Most Important Survey of The Year.
No ratings yet
The Results of Our 2022 Global Data Center Survey - Uptime Institute's Largest and Most Important Survey of The Year.
33 pages
IBM - The Next-Generation Data Center
No ratings yet
IBM - The Next-Generation Data Center
16 pages
GRP261x User Guide
No ratings yet
GRP261x User Guide
102 pages
Brochure - AgileCub Container Data Center v2.1
No ratings yet
Brochure - AgileCub Container Data Center v2.1
12 pages
Sever Cooling Ebook
No ratings yet
Sever Cooling Ebook
24 pages
Cabling Specification
No ratings yet
Cabling Specification
37 pages
Ch03 Types and Application of Virtualization
No ratings yet
Ch03 Types and Application of Virtualization
17 pages
CIT 811 TMA 3 Quiz Question
100% (1)
CIT 811 TMA 3 Quiz Question
3 pages
Astreya FNL v1.1 Data-Center TrendBook
100% (1)
Astreya FNL v1.1 Data-Center TrendBook
19 pages
Telecoms Academy 2017 Course Catalogue
No ratings yet
Telecoms Academy 2017 Course Catalogue
36 pages
Nxtra Corporate Brochure PDF
No ratings yet
Nxtra Corporate Brochure PDF
10 pages
gxp2130 gxp2140 gxp2160 Quick User Guide PDF
No ratings yet
gxp2130 gxp2140 gxp2160 Quick User Guide PDF
1 page
Bad Teacher Categories
No ratings yet
Bad Teacher Categories
3 pages
JLL Booming Data Centre Industry in India A Golden Opportunity
No ratings yet
JLL Booming Data Centre Industry in India A Golden Opportunity
31 pages
Emerging Technologies in Information and Communications Technology
From Everand
Emerging Technologies in Information and Communications Technology
Fouad Sabry
No ratings yet
Fin Irjmets1674010501
No ratings yet
Fin Irjmets1674010501
4 pages
My Voice Assistant Using Python
No ratings yet
My Voice Assistant Using Python
6 pages
Speech Recognition System - A Review
No ratings yet
Speech Recognition System - A Review
10 pages
Unit 5 UA
No ratings yet
Unit 5 UA
19 pages
Lecture Notes - Speech Processing
No ratings yet
Lecture Notes - Speech Processing
80 pages
Emotion Sense:-Real-time Speech Emotion Recognition For Live Calls
No ratings yet
Emotion Sense:-Real-time Speech Emotion Recognition For Live Calls
7 pages
Hyper Nasality
No ratings yet
Hyper Nasality
27 pages
2021 An Overview of Voice Conversion and Its Challenges From Statistical Modeling To Deep Learning
No ratings yet
2021 An Overview of Voice Conversion and Its Challenges From Statistical Modeling To Deep Learning
26 pages
Capstone Project Sem-6
No ratings yet
Capstone Project Sem-6
29 pages
Discriminative Deep Learning Based Hybrid Spectro-Temporal Features For Synthetic Voice Spoofing Detection
No ratings yet
Discriminative Deep Learning Based Hybrid Spectro-Temporal Features For Synthetic Voice Spoofing Detection
12 pages
Research of Effective UAV Detection Using Acoustic Data Recognition
No ratings yet
Research of Effective UAV Detection Using Acoustic Data Recognition
91 pages
COVID-19 Detection From Speech, Breathing and Coug - 230925 - 185202
No ratings yet
COVID-19 Detection From Speech, Breathing and Coug - 230925 - 185202
19 pages
LLM4psych Multimodalities
No ratings yet
LLM4psych Multimodalities
31 pages
10 1109icsc45622 2019 8938371
No ratings yet
10 1109icsc45622 2019 8938371
7 pages
Speech Processing Lab Manual
No ratings yet
Speech Processing Lab Manual
23 pages
Project Report B12
No ratings yet
Project Report B12
80 pages
UNIT-V Automatic Speech Recognition 22.10,24
No ratings yet
UNIT-V Automatic Speech Recognition 22.10,24
15 pages
Audio To Text Cookbook
No ratings yet
Audio To Text Cookbook
3 pages
Hala Paper
No ratings yet
Hala Paper
6 pages
AI-based Arabic Language and Speech Tutor
No ratings yet
AI-based Arabic Language and Speech Tutor
8 pages
Dynamic Biometrics The Case For A Real Time Solution To The Problem of Access Control Privacy and Security
No ratings yet
Dynamic Biometrics The Case For A Real Time Solution To The Problem of Access Control Privacy and Security
12 pages
Deepfake Audio Detection and Justification With Ex
No ratings yet
Deepfake Audio Detection and Justification With Ex
19 pages
EC5011 Task2 2021E187,2020E023
No ratings yet
EC5011 Task2 2021E187,2020E023
3 pages
Comparison Between SVM Other Classifiers For Ser IJERTV2IS1457
No ratings yet
Comparison Between SVM Other Classifiers For Ser IJERTV2IS1457
6 pages
Technologies-11-00091 - Implementation of Deep Learning Models On An SoC-FPGA Device For Real-Time Music Genre Classification
No ratings yet
Technologies-11-00091 - Implementation of Deep Learning Models On An SoC-FPGA Device For Real-Time Music Genre Classification
18 pages
A Review On Speaker Recognition - Technology and Challenges
No ratings yet
A Review On Speaker Recognition - Technology and Challenges
14 pages
1 s2.0 S1355030624000984 Main
No ratings yet
1 s2.0 S1355030624000984 Main
12 pages
Preprocessing Signal
No ratings yet
Preprocessing Signal
6 pages
Measuring Neuropsychiatric Symptoms in Patients With Early Cognitive Decline Using Speech Analysis
No ratings yet
Measuring Neuropsychiatric Symptoms in Patients With Early Cognitive Decline Using Speech Analysis
7 pages
Tsa Ut V
No ratings yet
Tsa Ut V
9 pages
An Overview of Noise-Robust Automatic Speech Recognition
No ratings yet
An Overview of Noise-Robust Automatic Speech Recognition
33 pages
Dialect Recognition System For Bagri Rajasthani Language Using Optimized Featured Swarm Convolutional Neural Network (Ofscnn) Model
No ratings yet
Dialect Recognition System For Bagri Rajasthani Language Using Optimized Featured Swarm Convolutional Neural Network (Ofscnn) Model
20 pages