

Piri
A Raspberry Pi Speech Recognition System

Made by:
Sujit Royal (201201216)
Tanya Shah (201201217)
Ajay Gaur (201201218)
Miten Shah (201201219)
Khyati Vaghamshi (201201220)

HARDWARE REQUIREMENTS

Raspberry Pi
SD Card
USB keyboard
USB mouse
USB headset with microphone
USB hub
Monitor
Breadboard
Power Supply
Cables
LEDs
Internet connectivity: LAN cable

SPECIFICATIONS
Raspberry Pi Model B

Processor - Broadcom BCM2835
Processor core - ARM1176JZF-S CPU / VideoCore IV GPU
RAM - 512 MB
Storage - SD card
USB - 2 host ports
Video output - HDMI, composite RCA
Audio output - via HDMI, 3.5 mm audio jack
Power source - Micro USB, 5 V DC
GPIO - 8 pins
Ethernet - 10/100 Mbit/s
Clock speed - 700 MHz

RCA to HDMI cable

iBall i2025MV USB Multimedia Headphone With Mic

Headphone Driver Unit: 40 mm
Headphone Frequency Response: 20 Hz - 20,000 Hz
Headphone Sensitivity: 108 dB
Impedance: 32 ohms
Microphone Driver Unit: 9.7 mm
Microphone Sensitivity: -58 ± 2 dB
Output Power: 100 mW
Input/Output Plug: USB

APPLICATION HARDWARE SCHEMATIC

PROJECT IDEA
The user is given three options. The user may choose to:
1) Speak something and hear its translation.
2) Ask a query and receive a reply.
3) Give an order for the LEDs to glow.
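The three options above can be wired up as a simple menu dispatcher. This is an illustrative sketch, not the project's actual code; the function names are stand-ins for the modules described later in this document.

```python
# Illustrative stand-ins for the project's three modules
def handle_translation():
    return "translation"

def handle_query():
    return "query"

def handle_leds():
    return "leds"

# Map each menu choice to its module
MENU = {"1": handle_translation, "2": handle_query, "3": handle_leds}

def dispatch(choice):
    """Run the module matching the user's choice; None for unknown input."""
    action = MENU.get(choice.strip())
    return action() if action else None
```

On the Pi, a loop would read the choice from the keyboard and call dispatch() until the user quits.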

APPLICATION DESCRIPTION

Libraries to be installed

python-pip
pycurl
mplayer
flac
python2.7
libcurl
wolframalpha

APIs to be accessed
Google Speech API
Microsoft Bing Translator API
Wolfram Alpha API

Modules of the program


Speech-to-Text
Using the Google Speech API: record the speech into a FLAC file, upload it to the Google Speech engine, and receive the result back as text.
import os
import StringIO

import pycurl

filename = 'speech.flac'
key = 'APIkey'
url = ('https://www.google.com/speech-api/v2/recognize'
       '?output=json&lang=en-us&key=' + key)

# Send the FLAC file to the Google Speech API
c = pycurl.Curl()
c.setopt(pycurl.VERBOSE, 0)
c.setopt(pycurl.URL, url)
fout = StringIO.StringIO()
c.setopt(pycurl.WRITEFUNCTION, fout.write)

c.setopt(pycurl.POST, 1)
c.setopt(pycurl.HTTPHEADER, ['Content-Type: audio/x-flac; rate=16000'])

filesize = os.path.getsize(filename)
c.setopt(pycurl.POSTFIELDSIZE, filesize)
fin = open(filename, 'rb')
c.setopt(pycurl.READFUNCTION, fin.read)
c.perform()

# Receive the response back from the Google Speech API
response_code = c.getinfo(pycurl.RESPONSE_CODE)
response_data = fout.getvalue()

# Pull the transcript string out of the JSON response
start_loc = response_data.find("transcript")
tempstr = response_data[start_loc + 13:]
end_loc = tempstr.find("\"")
final_result = tempstr[:end_loc]

c.close()

# Display the recognized text
print "You said: " + final_result
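The snippet above assumes speech.flac already exists. One way to capture it on the Pi is to pipe arecord through the flac encoder; this is a sketch of one possible approach (the recording duration, sample format, and tool options are assumptions chosen to match the rate=16000 declared in the request header above), not the project's exact commands.

```python
import shutil
import subprocess

def build_record_commands(filename="speech.flac", seconds=5, rate=16000):
    """Build the arecord and flac command lines; the rate matches the
    rate=16000 declared in the Speech API request header."""
    record = ["arecord", "-d", str(seconds), "-f", "S16_LE",
              "-r", str(rate), "-t", "wav", "-"]
    encode = ["flac", "-", "-f", "-o", filename]
    return record, encode

def record_flac(filename="speech.flac", seconds=5):
    """Record from the default microphone into a FLAC file."""
    record, encode = build_record_commands(filename, seconds)
    if not (shutil.which("arecord") and shutil.which("flac")):
        raise RuntimeError("arecord and flac must be installed")
    # Pipe raw WAV from the microphone straight into the FLAC encoder
    rec = subprocess.Popen(record, stdout=subprocess.PIPE)
    subprocess.check_call(encode, stdin=rec.stdout)
    rec.stdout.close()
    rec.wait()
```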

Translation and Text-to-Speech

Using the Microsoft Bing Translator API and the Google Speech engine: the recognized text is uploaded to the Bing translator engine with the origin and destination languages passed as arguments. The original text and the translated text are then sent to the Google Speech engine to be converted into speech, and both sounds are played with the Mplayer libraries.
import json
import subprocess
import urllib

import requests

text = args.text_to_translate
origin_language = args.origin_language
destination_language = args.destination_language

def speakOriginText(phrase):
    googleSpeechURL = ("http://translate.google.com/translate_tts?tl=" +
                       origin_language + "&q=" + phrase)
    subprocess.call(["mplayer", googleSpeechURL], shell=False,
                    stdout=subprocess.PIPE, stderr=subprocess.PIPE)

def speakDestinationText(phrase):
    googleSpeechURL = ("http://translate.google.com/translate_tts?tl=" +
                       destination_language + "&q=" + phrase)
    print googleSpeechURL
    subprocess.call(["mplayer", googleSpeechURL], shell=False,
                    stdout=subprocess.PIPE, stderr=subprocess.PIPE)

# Request an OAuth access token for the Bing Translator API
oauth_args = {
    'client_id': 'ClientID',
    'client_secret': 'APIkey',
    'scope': 'http://api.microsofttranslator.com',
    'grant_type': 'client_credentials'
}
oauth_url = 'https://datamarket.accesscontrol.windows.net/v2/OAuth2-13'
oauth_junk = json.loads(
    requests.post(oauth_url, data=urllib.urlencode(oauth_args)).content)

translation_args = {
    'text': text,
    'to': destination_language,
    'from': origin_language
}
headers = {'Authorization': 'Bearer ' + oauth_junk['access_token']}
translation_url = 'http://api.microsofttranslator.com/V2/Ajax.svc/Translate?'
translation_result = requests.get(
    translation_url + urllib.urlencode(translation_args), headers=headers)
# Strip the BOM and surrounding quotes from the returned string
translation = translation_result.text[2:-1]

speakOriginText('Translating ' + translation_args["text"])
speakDestinationText(translation)
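The translation snippet reads text_to_translate and the two language codes from an args object that the excerpt never defines. A plausible reconstruction with argparse is shown below; the positional-argument names come from the attribute accesses in the snippet, while everything else is an assumption.

```python
import argparse

def build_parser():
    # Hypothetical parser matching the attribute names used in the snippet
    parser = argparse.ArgumentParser(
        description="Speak a translation of the given text")
    parser.add_argument("text_to_translate")
    parser.add_argument("origin_language",
                        help="source language code, e.g. 'en'")
    parser.add_argument("destination_language",
                        help="target language code, e.g. 'fr'")
    return parser

# Example invocation: translate.py "hello world" en fr
args = build_parser().parse_args(["hello world", "en", "fr"])
```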

Query Processing
Using Wolfram Alpha: Wolfram Alpha is a popular computational engine that answers natural-language queries; for example, it replies with the current time when asked "What time is it?". Here we take a query, send it to Wolfram Alpha for processing, and display the reply.
import sys

import wolframalpha

app_id = 'APIkey'
client = wolframalpha.Client(app_id)

query = ' '.join(sys.argv[1:])
res = client.query(query)

if len(res.pods) > 0:
    pod = res.pods[1]
    if pod.text:
        texts = pod.text
    else:
        texts = "I have no answer for that"
    texts = texts.encode('ascii', 'ignore')
    print texts
else:
    print "Sorry, I am not sure."

Glowing LEDs
Based on the user's intention, we toggle the LEDs.
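A minimal sketch of how a spoken order could be mapped to a GPIO pin, assuming one LED per pin; the pin numbers and command words are illustrative assumptions, not the project's actual wiring. The parser is kept separate from the hardware call so it can be exercised without a Pi.

```python
# Illustrative BCM pin assignment for the LEDs (an assumption)
LED_PINS = {"red": 17, "green": 22}

def parse_led_command(text):
    """Map recognized speech such as 'turn on the red LED' to a
    (pin, state) pair, or None when no LED order is found."""
    words = text.lower().split()
    state = True if "on" in words else False if "off" in words else None
    if state is None:
        return None
    for colour, pin in LED_PINS.items():
        if colour in words:
            return pin, state
    return None

def apply_led_command(text):
    """Drive the pin on a real Pi; needs the RPi.GPIO library."""
    order = parse_led_command(text)
    if order is None:
        return False
    import RPi.GPIO as GPIO  # only importable on the Pi itself
    pin, state = order
    GPIO.setmode(GPIO.BCM)
    GPIO.setup(pin, GPIO.OUT)
    GPIO.output(pin, GPIO.HIGH if state else GPIO.LOW)
    return True
```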


TEST RESULTS
Phase 1 - Testing Speech-to-Text
1) We successfully installed all the libraries and software needed.
2) We created the FLAC file to encode the speech.
3) Uploaded it to the Google Speech engine.
4) Checked whether the output text matched the speech.

PASSED.
Phase 2 - Testing Translation and Text-to-Speech
1) To translate the text into a different language we used the Microsoft Bing Translator, passing the origin and destination languages as arguments.
2) After translation, the original text obtained from speech-to-text conversion and the translated text were passed to the Google Speech engine to convert them into speech.
3) We used the Mplayer libraries to play both sounds.

PASSED.
Phase 3 - Testing Query Processing
1) We used the Wolfram Alpha API for its advanced query-handling features.
2) We passed the recognized text to it as a query.
3) It processes the query, returns the output in text form, and the reply is then converted to speech.

PASSED.
Phase 4 - Testing the Toggling of LEDs
1) Depending on the user's intention, the LEDs toggle.
PASSED.


CONTRIBUTION
Sujit Royal
1) Documentation
2) Background reading
Tanya Shah
1) Creation of hardware schematic
2) Application study and generation of requirements and specification
Ajay Gaur
1) Coding
2) Background reading
Miten Shah
1) Coding
2) Refining documentation
Khyati Vaghamshi
1) Testing of hardware and software
2) Creation of hardware schematic


REFERENCES
1) "Add the power of speech, hearing and vision to your robot", The MagPi, Issue 26, Aug 2014, pp. 18-21
2) Universal Translator - Dave Conroy, http://makezine.com/projects/universal-translator/
3) Raspberry Pi Voice Recognition - Oscar Liang, http://blog.oscarliang.net/raspberry-pi-voice-recognition-works-like-siri/
4) Jasper - Control anything with your voice, http://jasperproject.github.io/
5) eSpeak - http://espeak.sourceforge.net/
