0% found this document useful (0 votes)

9 views

Speech Understanding Content

Uploaded by

Chamod Kanishka

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Speech Understanding Content

Uploaded by

Chamod Kanishka

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

Office Hour:Reserve via email.

Contact
Email: [email protected]

This course introduces the basic signal

processing and artificial intelligence
concepts that underlie modern speech
understanding applications, then shows you
how to create your own speech
understanding applications.
Fundamental concepts to be introduced will
include waveforms segmentation and
Course Description labeling, sampling frequency, frequency
domain analysis, spectrogram and mel
spectrogram.
Open-source speech recognition and speech
synthesis toolkits will be introduced.
Methods will be introduced that use the
open-source toolkits to create a voice-
activated web browser and a personal
assistant.

By the end of this course, students should

know how to segment a waveform in the
time domain, how to identify the most
important frequencies present in a
Goals waveform, how to use open-source toolkits
to perform speech recognition and speech
synthesis, and how to create their own
voice-activated web browser and personal
assistant.

Students interested in developing artificial

Audience intelligence applications using open-source
toolkits.

Speech analysis, speech synthesis, speech

Topics
recognition, internationalization.

Prerequisite/Required Students must have the ability to program in

at least one object-oriented programming
language (python, ruby, C++, java, etc.).
knowledge The course will be taught primarily in
python, so students with a background in
that language will have an advantage.

Make Python Talk: Build Apps with Voice

Textbooks Control and Speech Recognition, by Mark
Liu, 2021

Educational Media None

Auxiliary readings will be assigned from

References several web tutorials, as described under
each of the lectures.

Lecture Contents

No
Topic/Activity Reading
.

1 Praat https://www.researchgate.net/publication/
270819326_PRAAT_--_Short_Tutorial_--_An_introduction,
pages 1-12
Setting up
Python,
2 Anaconda, and Chapter 1 and Section 2.1 (Variables and Values)
Spyder. Scalar
variables.

Python loops,
functions,
3 modules, lists, Remainder of chapter 2
dicts, and
tuples

4 Numpy, the first part of Section 3.1 (about PyAudio), and

Matplotlib, and https://people.csail.mit.edu/hubert/pyaudio/,
PyAudio https://numpy.org/doc/stable/user/absolute_beginners.ht
ml, and
https://matplotlib.org/stable/users/getting_started/
5 Do-it-yourself https://towardsdatascience.com/understanding-audio-
speech data-fourier-transform-fft-spectrogram-and-speech-
synthesis recognition-a4072d228520
using Numpy
and Librosa
6 Do-it-yourself https://librosa.org/doc/latest/tutorial.html,
speech https://librosa.org/doc/latest/generated/librosa.display.sp
recognition ecshow.html,
using Numpy https://librosa.org/doc/latest/generated/librosa.feature.m
and Librosa elspectrogram.html
The
SpeechRecogn Sections 3.1 (Install), 3.2 (Test), and 3.3 (Voice-Controlled
7
ition module, Web Search)
part 1

8 The Sections 3.4 (Open Files), 3.5 (Local Module)

SpeechRecogn
ition module,
part 2

Speech
9 Chapter 4
Synthesis

Speech :
Sections 5.1 (Local Package) and 5.2 (Guess the Number
10 Guess the
Game)
Number

Sections 6.1 (Primer on Web Scraping) and 6.2 (Scrape

11 Web Scraping
Live Web Pages)

Voice-
Sections 6.3 (Voice-Activated Podcasts), 6.4 (Radio), and
12 Activated
6.5 (Videos)
Podcasts

Personal
13 Sections 7.1 (Overview) through 7.5 (Tell a Joke)
Assistant

World
14 Chapter 16
Languages

World
15 None
Languages

Developing Apps with Python and Flet
From Everand
Developing Apps with Python and Flet
Williams Asiedu
No ratings yet
Personal Voice Assistant in Python
86% (22)
Personal Voice Assistant in Python
30 pages
SlotDesigner Manual
No ratings yet
SlotDesigner Manual
142 pages
Desktop Assistant Final
No ratings yet
Desktop Assistant Final
15 pages
Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
From Everand
Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
James Tudor
5/5 (1)
Python String Coding Interview Questions PDF
100% (1)
Python String Coding Interview Questions PDF
14 pages
Speech Understanding Content
No ratings yet
Speech Understanding Content
10 pages
How Speech Recognition Works: Hidden Markov Model
No ratings yet
How Speech Recognition Works: Hidden Markov Model
25 pages
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
No ratings yet
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
48 pages
Basic Guide to Programming Languages Python, JavaScript, and Ruby
From Everand
Basic Guide to Programming Languages Python, JavaScript, and Ruby
Kiet Huynh
No ratings yet
Lecture
No ratings yet
Lecture
7 pages
Voice Assistant Using Python 2
No ratings yet
Voice Assistant Using Python 2
20 pages
Doc-20231217-Wa0003. 20231217 234608 0000
No ratings yet
Doc-20231217-Wa0003. 20231217 234608 0000
11 pages
The spaCy Handbook: Simplifying Natural Language Processing
From Everand
The spaCy Handbook: Simplifying Natural Language Processing
Robert Johnson
No ratings yet
Speech Recognition System
No ratings yet
Speech Recognition System
16 pages
py report
No ratings yet
py report
8 pages
Ost PROJECT
No ratings yet
Ost PROJECT
8 pages
Lecture 1
No ratings yet
Lecture 1
48 pages
Voice - Assistant - Research Paper
No ratings yet
Voice - Assistant - Research Paper
6 pages
Automatic Speech Recognition Using Python
No ratings yet
Automatic Speech Recognition Using Python
18 pages
Personal Voice Assistant in Python
100% (1)
Personal Voice Assistant in Python
30 pages
Chat Bot 1
No ratings yet
Chat Bot 1
7 pages
Text Analysis with Python: A Research-Oriented Guide
From Everand
Text Analysis with Python: A Research-Oriented Guide
Mamta Mittal
No ratings yet
Voice - Assistant - Research Paper
No ratings yet
Voice - Assistant - Research Paper
6 pages
Minor
No ratings yet
Minor
25 pages
Voice Assistant presentation
No ratings yet
Voice Assistant presentation
10 pages
Project 2023
No ratings yet
Project 2023
34 pages
Voice Assistant
No ratings yet
Voice Assistant
14 pages
Virtual Assistance Project Brief
No ratings yet
Virtual Assistance Project Brief
8 pages
Mastering Python in 7 Days
From Everand
Mastering Python in 7 Days
Alex Wood
No ratings yet
Python Mini Manual
From Everand
Python Mini Manual
CodeCraft Dynamics
No ratings yet
Project Report
No ratings yet
Project Report
58 pages
Department of Mechanical Engineering: Mini Project Phase 1 Presentation
No ratings yet
Department of Mechanical Engineering: Mini Project Phase 1 Presentation
12 pages
Vioce Assistant by Python
No ratings yet
Vioce Assistant by Python
38 pages
Basic Course Material Winter 2015
100% (1)
Basic Course Material Winter 2015
19 pages
Voice Recognition Using Python
No ratings yet
Voice Recognition Using Python
24 pages
Mastering Sublime Text
From Everand
Mastering Sublime Text
Dan Peleg
No ratings yet
Project Testing
No ratings yet
Project Testing
11 pages
How to Learn PHP, MySQL and Javascript Quickly!: For Dummies
From Everand
How to Learn PHP, MySQL and Javascript Quickly!: For Dummies
Andrei Besedin
5/5 (1)
Group No. 5: AI Desktop Assistant
No ratings yet
Group No. 5: AI Desktop Assistant
10 pages
ai-voice-assistant-ppt-project-ppt (1)
No ratings yet
ai-voice-assistant-ppt-project-ppt (1)
23 pages
Coding The Future: A Comprehensive Guide To AI Development-By Tyler Welch
No ratings yet
Coding The Future: A Comprehensive Guide To AI Development-By Tyler Welch
180 pages
Choto Nunu Sumit Anand
No ratings yet
Choto Nunu Sumit Anand
13 pages
Your First Python Program
From Everand
Your First Python Program
Alexander Paz
No ratings yet
Coding The Future: A Comprehensive Guide To AI Development-By Tyler P Welch - The Astral Merchant
No ratings yet
Coding The Future: A Comprehensive Guide To AI Development-By Tyler P Welch - The Astral Merchant
31 pages
Jarvis Tutorial
No ratings yet
Jarvis Tutorial
3 pages
Master Python Without Prior Experience
From Everand
Master Python Without Prior Experience
CodeCraft Dynamics
No ratings yet
Ai Voice Assistant
No ratings yet
Ai Voice Assistant
14 pages
DT - Final
No ratings yet
DT - Final
5 pages
Final
No ratings yet
Final
12 pages
Speech Processing
No ratings yet
Speech Processing
5 pages
Effortless Python: Learn Python Quickly from Beginner to Pro
From Everand
Effortless Python: Learn Python Quickly from Beginner to Pro
Aarav Joshi
No ratings yet
synopsis
No ratings yet
synopsis
6 pages
Python Programming Techniques: The Art of Coding and Programming Explained
From Everand
Python Programming Techniques: The Art of Coding and Programming Explained
Lance Gifford
No ratings yet
Python Mastery: From Absolute Beginner to Pro
From Everand
Python Mastery: From Absolute Beginner to Pro
NIBEDITA Sahu
No ratings yet
Python Programming for Newbies
From Everand
Python Programming for Newbies
Abound Academy
No ratings yet
HG3052 CourseOutline SpeechSynthesisRecognition AY2019-20 SEM1 Update Sep10
No ratings yet
HG3052 CourseOutline SpeechSynthesisRecognition AY2019-20 SEM1 Update Sep10
6 pages
System Programming Essentials with Go: System calls, networking, efficiency, and security practices with practical projects in Golang
From Everand
System Programming Essentials with Go: System calls, networking, efficiency, and security practices with practical projects in Golang
Alex Rios
No ratings yet
Python 3 Fundamentals: A Complete Guide for Modern Programmers
From Everand
Python 3 Fundamentals: A Complete Guide for Modern Programmers
Robert Johnson
No ratings yet
CCS369 - TSS-Unit 5
No ratings yet
CCS369 - TSS-Unit 5
23 pages
Practical Guide to Python: From Basics to Advanced Programming
From Everand
Practical Guide to Python: From Basics to Advanced Programming
Arcadia J. Darell
No ratings yet
Understanding Python: Beginner's Guide to Programming
From Everand
Understanding Python: Beginner's Guide to Programming
Sabry Fattah
No ratings yet
Relational Algebra
100% (1)
Relational Algebra
40 pages
RTOS uCOS II
No ratings yet
RTOS uCOS II
66 pages
Cops - Potw PDF
No ratings yet
Cops - Potw PDF
1 page
Data Analyst Resume Entry Level
100% (2)
Data Analyst Resume Entry Level
5 pages
Business Computing Dissertation Ideas
100% (1)
Business Computing Dissertation Ideas
7 pages
A 1 - Official - CCS21103 - drWali-THIS
No ratings yet
A 1 - Official - CCS21103 - drWali-THIS
5 pages
Lab Print
No ratings yet
Lab Print
42 pages
Visual Basic Language Companion PDF
No ratings yet
Visual Basic Language Companion PDF
139 pages
Hadoop Course Contents PDF
No ratings yet
Hadoop Course Contents PDF
3 pages
2023-07-27 IDTA Tutorial V3.0 Specification AAS Part1 Metamodel
No ratings yet
2023-07-27 IDTA Tutorial V3.0 Specification AAS Part1 Metamodel
48 pages
Adil Practicall Final
No ratings yet
Adil Practicall Final
48 pages
TY - Lab-III CS-359 Core JAVA Slip (Rev 2021-22)
0% (1)
TY - Lab-III CS-359 Core JAVA Slip (Rev 2021-22)
30 pages
AgenticAi Roadmap
No ratings yet
AgenticAi Roadmap
9 pages
System Verilog
No ratings yet
System Verilog
132 pages
Lect11 DP Lcs
No ratings yet
Lect11 DP Lcs
6 pages
Owl Exporter
No ratings yet
Owl Exporter
22 pages
Training - Release Strategy
No ratings yet
Training - Release Strategy
9 pages
BITS Pilani: Distributed Computing
No ratings yet
BITS Pilani: Distributed Computing
73 pages
Moving Boat Final Report
0% (1)
Moving Boat Final Report
34 pages
1z0-819_2
No ratings yet
1z0-819_2
22 pages
Focus Manual
No ratings yet
Focus Manual
597 pages
F
No ratings yet
F
7 pages
How To Debug Transfer Rules and Update Rules
No ratings yet
How To Debug Transfer Rules and Update Rules
6 pages
Python TCS
0% (1)
Python TCS
6 pages
មេរៀនទី 11-Dynamic Data Structures
No ratings yet
មេរៀនទី 11-Dynamic Data Structures
19 pages
ADF Code Corner: 93. Put A Different Look To Your Train Stops
No ratings yet
ADF Code Corner: 93. Put A Different Look To Your Train Stops
8 pages
Bda Unit 4 060115 Big Data Analytics Unit 4
No ratings yet
Bda Unit 4 060115 Big Data Analytics Unit 4
19 pages
Abap Performance Tuning
No ratings yet
Abap Performance Tuning
12 pages

Speech Understanding Content

Uploaded by

Speech Understanding Content

Uploaded by

Office Hour:Reserve via email.

This course introduces the basic signal

By the end of this course, students should

Students interested in developing artificial

Speech analysis, speech synthesis, speech

Prerequisite/Required Students must have the ability to program in

Make Python Talk: Build Apps with Voice

Educational Media None

Auxiliary readings will be assigned from

4 Numpy, the first part of Section 3.1 (about PyAudio), and

8 The Sections 3.4 (Open Files), 3.5 (Local Module)

Sections 6.1 (Primer on Web Scraping) and 6.2 (Scrape

You might also like