
Now that you have a basic chatbot, we're going to learn about interacting with it through voice. To achieve this, we'll be calling the IBM Watson Speech to Text and Text to Speech APIs from Python. The idea behind this lab is to show you how to interface with different Watson services, specifically leveraging the Python SDK. Before we begin, create a new Speech to Text service and Text to Speech service from within IBM Cloud, and copy your API Keys for the services somewhere.

To begin, head to https://labs.cognitiveclass.ai just like you did in Lab 4. Then, open a new Python notebook, and follow these steps:

1. Install packages - In this case, you're only going to need one extra Python package: the
`ibm_watson` package. Instead of calling the IBM Watson REST APIs manually, this package acts
as a wrapper. It removes a lot of the hard work, especially for the speech services. Type the following
code into the cell:

!pip install ibm_watson
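
If you'd like to confirm the package is available before moving on, one quick check (assuming Python 3.8 or newer, which ships importlib.metadata in the standard library) is to print the installed version:

from importlib.metadata import version

# Should print the installed ibm-watson release, e.g. "5.x.x"
print(version("ibm-watson"))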

2. Import the right modules - For this lab, you'll need to import the following:

1. os - to run commands in the environment via "os.popen".
2. glob.glob - to find audio files.
3. IPython - to play synthesized audio inside the notebook.
4. ibm_cloud_sdk_core.authenticators.IAMAuthenticator - to help with API Key-based
authentication.
5. ibm_watson:
a) SpeechToTextV1 - the Speech to Text service wrapper.
b) AssistantV2 - the Assistant service wrapper.
c) TextToSpeechV1 - the Text to Speech service wrapper.

To do so, type the following code into the next cell, and run the code:

import os
from glob import glob

import IPython

from ibm_cloud_sdk_core.authenticators import IAMAuthenticator
from ibm_watson import SpeechToTextV1
from ibm_watson import AssistantV2
from ibm_watson import TextToSpeechV1

3. Implementing Speech to Text - In order to implement the Speech to Text service, you need to
first instantiate your service wrapper. To do so, create a new instance of `SpeechToTextV1`. You'll
need to pass your API key through the IAMAuthenticator type, as well as the endpoint URL which
you can find just under the API Key on the service instance page on IBM Cloud.

You'll also need to define two more constants:

1. "SPEECH_EXTENSION" - the extension of the audio files that Speech to Text will need to
analyze.

2. "SPEECH_AUDIOTYPE" - the type of audio that Speech to Text will analyze - Watson
supports these formats.

Then, define another function called "recognize_audio()". This function is simple: it waits for a new
audio file to appear in the current working directory (matching `SPEECH_EXTENSION`). As soon as one
appears, it reads the file, deletes it from the filesystem, and then passes its contents to Watson.
Once the file is sent to Watson through the "recognition_service.recognize()" function, Watson
returns a JSON object that can be accessed through the "get_result()" function.

To parse this JSON, you navigate the hierarchy to get to the transcription that Watson is most
confident in. This is how it's done:

1. "["results"][0]" - this will get the first set of results from Watson's response.

2. "["alternatives"][0]" - of all the alternative transcriptions, it'll get the first (most likely) one.

3. "["transcript"]" - of all the data Watson returns, only take the transcript string ("str" type in Python).

To implement all of this, you'll use the following code in a new cell:

recognition_service = SpeechToTextV1(IAMAuthenticator('{YOUR_APIKEY}'))
recognition_service.set_service_url('{YOUR_ENDPOINT}')

SPEECH_EXTENSION = "*.webm"
SPEECH_AUDIOTYPE = "audio/webm"

def recognize_audio():
    # Wait until an audio file shows up in the working directory
    while len(glob(SPEECH_EXTENSION)) == 0:
        pass
    filename = glob(SPEECH_EXTENSION)[0]
    audio_file = open(filename, "rb")
    # Remove the file so the next recording starts fresh
    os.popen("rm " + filename)
    result = recognition_service.recognize(
        audio=audio_file, content_type=SPEECH_AUDIOTYPE).get_result()
    # Return the transcript Watson is most confident in
    return result["results"][0]["alternatives"][0]["transcript"]
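
One thing to be aware of: if Watson hears no transcribable speech (say, a silent recording), "results" comes back as an empty list and the indexing above raises an IndexError. Below is a defensive sketch, using a hypothetical name "recognize_audio_safe" and reusing the service and constants from the cell above:

def recognize_audio_safe():
    # Same flow as recognize_audio(), but returns "" instead of
    # raising IndexError when Watson finds no transcribable speech.
    while len(glob(SPEECH_EXTENSION)) == 0:
        pass
    filename = glob(SPEECH_EXTENSION)[0]
    with open(filename, "rb") as audio_file:
        result = recognition_service.recognize(
            audio=audio_file, content_type=SPEECH_AUDIOTYPE).get_result()
    os.remove(filename)  # portable alternative to os.popen("rm ...")
    results = result.get("results", [])
    if not results:
        return ""
    return results[0]["alternatives"][0]["transcript"]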

Since you're running this code in a JupyterLab Notebook, you'll need to record your audio via a special
method. On the very left of your screen, click the Palette option (the icon is a Color Palette). Then,
from the resulting list, click "Record Audio".

You'll be greeted with a little window; click the microphone when you're ready.
When you're done recording, click the stop button, and a "webm" file should appear in the
current working directory.

4. Conversing with Watson Assistant - In order to facilitate the communication with the Assistant
service, let's define a helper function! This function will take some text from the user, and return
Watson's response. Before this function can be defined, we need to instantiate the wrapper around
the Assistant service itself. In order to do so, create a new instance of "AssistantV2". You'll need to
provide your API Key via an IAMAuthenticator through the "authenticator" argument. You'll also need
to provide a version of the AssistantV2 service - in this case, we're using "2019-02-28"; check the
documentation for the current version. You'll also need to define the Assistant ID of your
assistant. Finally, you'll need to specify your endpoint URL, which you can find on your service
instance page right under the API Key.

Next, we'll ask the Assistant to create a new "session". With a session, Watson can
automatically keep track of the context of a conversation. This means you don't need to handle the
context and pass it back and forth with Watson manually. To differentiate between sessions, you have
a session ID, which we store in "session_id". You can now define the "message_assistant" function.
This function does two things:

1. Message the assistant with the user's utterance and the current session ID, and get a JSON
response.

2. Return the first text response Watson sends back.

To implement this, you'll use the following code in a new cell:

assistant = AssistantV2(version='2019-02-28',
                        authenticator=IAMAuthenticator('{YOUR_APIKEY}'))
assistant.set_service_url('{YOUR_ENDPOINT}')

ASSISTANT_ID = "{YOUR_ASSISTANT_ID}"
# Create a session so Watson tracks conversation context for us
session_id = assistant.create_session(
    assistant_id=ASSISTANT_ID).get_result()["session_id"]

def message_assistant(text):
    # Send the user's utterance within the current session
    response = assistant.message(assistant_id=ASSISTANT_ID,
                                 session_id=session_id,
                                 input={'message_type': 'text', 'text': text}
                                 ).get_result()
    # Return the first generic (text) response from Watson
    return response["output"]["generic"][0]["text"]
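
Before adding voice, it's worth sanity-checking the round trip with plain text; the greeting below is just an example utterance:

# Quick text-only test of the Assistant round trip
print(message_assistant("Hello"))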

5. Hearing Watson's response - To enable a truly intuitive, end-to-end interactive experience,
let's use Text to Speech to synthesize audio and have Watson speak! Start by initializing the
"TextToSpeechV1" wrapper. Pass it your API Key through an IAMAuthenticator, and your API
endpoint, which you can find right under the API Key in your service dashboard on IBM Cloud. Then,
define a new function called "speak_text". This is what it'll do:

1. Open a new file "temp.wav".

2. Take the text that Watson needs to speak and pass it to the "synthesis_service.synthesize()"
function. Tell it we want WAV audio back, and that we want the "en-US_AllisonV3Voice" voice. You
can find more voices in the Text to Speech documentation.

3. Write Watson's response to the "temp.wav" file.

4. Play the "temp.wav" file.

This is the code you'll use to implement Text to Speech:

synthesis_service = TextToSpeechV1(IAMAuthenticator('{YOUR_APIKEY}'))
synthesis_service.set_service_url('{YOUR_ENDPOINT}')

def speak_text(text):
    with open('temp.wav', 'wb') as audio_file:
        # Ask Watson for WAV audio of the text, spoken by Allison
        response = synthesis_service.synthesize(
            text, accept='audio/wav',
            voice="en-US_AllisonV3Voice").get_result()
        audio_file.write(response.content)
    # Play the synthesized audio inline in the notebook
    return IPython.display.Audio("temp.wav", autoplay=True)
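
As a quick test (the sentence itself is arbitrary), synthesize a fixed line; the returned Audio object plays automatically when it's the last expression in a cell:

# You should hear Watson speak this line
speak_text("Hello! I am Watson.")
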
6. Putting the pieces together - Because of the way these functions work, the whole pipeline is
as simple as chaining them together! By calling "recognize_audio()", you're waiting for the user to
provide some input. That input is then passed to the "message_assistant()" function, and its
output is passed to "speak_text()", which plays the response for the user. To interact with the chatbot in
this lab, simply run this cell for every utterance. To be specific:

1. Run the following cell.

2. Record audio.

3. Wait until you hear Watson's response.

4. Repeat until you're done.

This is the simple code you'll need in the last cell:

speak_text(message_assistant(recognize_audio()))

That's all! Now, by running this cell every time you wish to speak to Watson, you'll be able to interact
in a natural, vocal manner.
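
As an optional variation (a sketch, not part of the lab), you can wrap the chain in a loop so you don't have to re-run the cell for each turn. Since only a cell's last expression renders automatically, the playback needs an explicit display() call, and you stop the loop with the notebook's interrupt button:

from IPython.display import display

# Loop until interrupted (Kernel > Interrupt Kernel in JupyterLab)
while True:
    display(speak_text(message_assistant(recognize_audio())))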
