Sphinx Speech Recognition

The document discusses speech recognition in Python using the CMU Sphinx toolkit, specifically the Pocketsphinx library for offline applications. It outlines the installation process for necessary libraries and provides code examples for continuous speech recognition and keyword searching. The document concludes by emphasizing the utility of CMU Sphinx in various applications of speech recognition.

Uploaded by

Madhavan Jayarama Mohan


EXERCISE 2

Speech Recognition in Python using CMU Sphinx



“Hey, Siri!”, “Okay, Google!” and “Alexa, play some music” are phrases
that have become part of everyday life, because giving voice commands to
virtual assistants makes many tasks easier. But have you ever wondered
how these devices take commands by voice?
Do applications understand your voice? How does the computer decode
speech if it only understands 0s and 1s?
The answer is simple: it uses speech recognition software to decode the
user input received as speech through the device’s microphone. The task
of this software is to convert the speech to a string (text) so that the
computer can then process it.
One such toolkit is CMU Sphinx, an open-source toolkit for speech
recognition. It includes a lightweight recognizer library called
Pocketsphinx, which is used here to recognize speech. This library is
especially useful when you are offline. When you have internet access,
Google's speech recognition API is usually the better choice because of
its higher accuracy, but when you are building a project that works
offline or uses speech on an offline embedded device, use pocketsphinx.

Recognition Process

Let’s look at how this library works behind the scenes to recognize
speech. It takes a waveform and splits it into utterances at the
silences. It then tries to work out what is being said in each
utterance: it takes all possible combinations of words, tries to match
them with the audio, and chooses the best-matching combination.
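The first step, splitting a waveform into utterances at silences, can be sketched with a toy example. This is only an illustration, not how Pocketsphinx is implemented: the function name split_utterances, the amplitude threshold, and the min_silence length are all assumptions made for the sketch, and a real recognizer uses far more robust voice activity detection.

```python
def split_utterances(samples, threshold=0.1, min_silence=3):
    """Return (start, end) index pairs of non-silent runs; end is exclusive."""
    utterances = []
    start = None  # index where the current utterance began
    quiet = 0     # consecutive quiet samples seen inside an utterance
    for i, value in enumerate(samples):
        if abs(value) >= threshold:
            if start is None:
                start = i
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= min_silence:
                # Enough silence: close the utterance at the first quiet sample.
                utterances.append((start, i - quiet + 1))
                start = None
                quiet = 0
    if start is not None:
        # The signal ended while an utterance was still open.
        utterances.append((start, len(samples) - quiet))
    return utterances


# Hypothetical amplitude values: two bursts of sound separated by silence.
signal = [0.0, 0.5, 0.6, 0.0, 0.0, 0.0, 0.7, 0.8, 0.0]
print(split_utterances(signal))  # [(1, 3), (6, 8)]
```

Each returned pair marks one utterance that would then be matched against candidate word sequences.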

Installation of modules
Since pocketsphinx is an external library, i.e. it is not built into
Python, we install it on our machine with the pip installer and then use
import to access its functionality.
Open your terminal and type the following commands.
NOTE: make sure that you have the latest version of pip installed. If
not, type the following:
python -m pip install --upgrade pip setuptools wheel
If you have the latest version of pip, proceed directly and type the
following command into your terminal:
pip install pocketsphinx
Now that pocketsphinx is installed on your machine, let's move on.

Prerequisites

Two prerequisite libraries are used alongside pocketsphinx:
1. SpeechRecognition – used for speech recognition, with support for
several engines and APIs, online and offline.
2. PyAudio – used to play and record audio in Python.
It is recommended to install these two libraries with pip. PyAudio
depends on the PortAudio library; the brew command below installs it on
macOS with Homebrew, so on other systems use your own package manager if
the PyAudio installation requires it.
pip install SpeechRecognition
brew install portaudio
pip install pyaudio
The installation of all required external libraries is now complete, so
let's move on to the code.
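Before running the examples, it can help to confirm that the three packages can be imported. The following is a minimal sketch using only the standard library; the helper name missing_modules is an assumption made for this example, and the import names used (pocketsphinx, speech_recognition, pyaudio) are the module names these packages normally install under.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


required = ["pocketsphinx", "speech_recognition", "pyaudio"]
missing = missing_modules(required)
if missing:
    print("Not installed:", ", ".join(missing))
else:
    print("All required modules are installed.")
```

If any name is reported as not installed, repeat the corresponding pip command above.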

LiveSpeech

LiveSpeech is an iterator class available in pocketsphinx that can be
used for continuous recognition or keyword search from a microphone.
Here is the code for continuous recognition.

 Python3

# import LiveSpeech
from pocketsphinx import LiveSpeech

for phrase in LiveSpeech():
    # each phrase holds the words recognized in one utterance
    text = str(phrase)
    if text:
        print(text)
    else:
        # an empty result means nothing could be recognized
        print("Sorry! could not recognize what you said")

We use LiveSpeech in a basic for loop to fetch continuous speech input
from the user through the device microphone. Each converted string is
stored in phrase and printed, so every utterance the user speaks is
displayed.
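Because the loop over LiveSpeech runs until the program is interrupted, it is sometimes convenient to stop after a fixed number of phrases. The sketch below shows one way to do that; the helper collect_phrases and its limit parameter are assumptions made for this example, and a plain list of strings stands in for the microphone so the code runs without audio hardware.

```python
from itertools import islice


def collect_phrases(phrases, limit=5):
    """Return up to `limit` non-empty recognized phrases as plain strings."""
    texts = (str(phrase).strip() for phrase in phrases)
    # islice stops pulling from the source once `limit` phrases are collected,
    # so this also works with an iterator that never ends.
    return list(islice((text for text in texts if text), limit))


# Stand-in input; with a microphone this could be collect_phrases(LiveSpeech()).
print(collect_phrases(["hello world", "", "go forward", "stop"], limit=2))
# ['hello world', 'go forward']
```

Empty results are skipped, so only utterances that produced text count toward the limit.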

Keyword searching

We create a variable named speech of type pocketsphinx.LiveSpeech by
calling the LiveSpeech class with two arguments: keyphrase, the keyword
to search for, and kws_threshold, the detection threshold for that
keyword. We then use a for loop over speech, which continuously listens
for voice input. If the user utters the word ‘forward’, it is printed
along with its segments.
 Python3

# importing LiveSpeech
from pocketsphinx import LiveSpeech

speech = LiveSpeech(keyphrase='forward', kws_threshold=1e-20)

# a for loop to iterate over speech
for phrase in speech:
    # print the keyword with its segments whenever it is spoken
    print(phrase.segments(detailed=True))
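The detailed segments can be turned into readable timings. This sketch assumes each segment is a tuple of (word, score, start frame, end frame) and that the recognizer uses 100 frames per second; both are assumptions to verify against the pocketsphinx version you have installed. The helper describe_segments and the sample values are invented for illustration and are not real recognizer output.

```python
FRAMES_PER_SECOND = 100  # assumed frame rate; check your pocketsphinx configuration


def describe_segments(segments, frame_rate=FRAMES_PER_SECOND):
    """Format (word, score, start_frame, end_frame) tuples as readable lines."""
    lines = []
    for word, score, start_frame, end_frame in segments:
        start = start_frame / frame_rate
        end = end_frame / frame_rate
        lines.append(f"{word}: {start:.2f}s to {end:.2f}s (score {score})")
    return lines


# Made-up segment for illustration only.
sample = [("forward", -3000, 120, 170)]
for line in describe_segments(sample):
    print(line)  # forward: 1.20s to 1.70s (score -3000)
```

Inside the keyword loop, the same helper could be applied to phrase.segments(detailed=True) if the tuple layout matches.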

Test program

First, import speech_recognition under a short alias, here aud. Next,
create a variable a of type speech_recognition.Recognizer, which will
recognize the audio and convert it to text. Then define the microphone
as the input source and use a variable, say audio, to listen: it
captures the user's speech and stores it. Finally, inside a try block,
call recognize_sphinx with audio as the argument. This method converts
what the user said from speech to text, which we print to the console;
this step is the recognition itself.
If the voice input is unclear and cannot be recognized, the
UnknownValueError exception is handled. RequestError is handled for
problems with the recognizer itself, such as a missing Sphinx
installation.

 Python3

import speech_recognition as aud

# create a Recognizer instance to process the audio
a = aud.Recognizer()

# declare the device microphone as the source of audio input
with aud.Microphone() as source:
    print("Say something!")
    # audio holds the speech captured from the microphone
    audio = a.listen(source)

# invoke sphinx for speech recognition
try:
    # print the recognized text
    print("You said " + a.recognize_sphinx(audio))
except aud.UnknownValueError:
    # raised if the voice is unclear
    print("Could not understand")
except aud.RequestError as e:
    print("Error; {0}".format(e))
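The same recognizer can also read from a recorded WAV file instead of the microphone, which is handy for testing without audio hardware. The sketch below is one possible way to do it using the AudioFile and record calls of SpeechRecognition; the function name transcribe_wav and the file name sample.wav are hypothetical, and the import sits inside the function only so that the definition loads even where the package is not installed.

```python
def transcribe_wav(path):
    """Return the text recognized in a WAV file, or None if it was unclear."""
    # Imported here so that defining the function does not require the package.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.AudioFile(path) as source:
        # record() reads the whole file into an AudioData object
        audio = recognizer.record(source)
    try:
        return recognizer.recognize_sphinx(audio)
    except sr.UnknownValueError:
        # the speech could not be understood
        return None
    except sr.RequestError as exc:
        # the Sphinx engine is missing or not set up correctly
        raise RuntimeError("Sphinx is not available: {0}".format(exc)) from exc


# Example call with a hypothetical file name:
# print(transcribe_wav("sample.wav"))
```

Returning None for unclear speech keeps that case separate from a broken installation, which is raised as an error.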

Conclusion

This concludes our discussion of speech recognition using CMU Sphinx.
This useful library has many more applications.
