0% found this document useful (0 votes)
95 views16 pages

EEE 6211 Digital Speech Processing: Course Instructor Dr. Mohammad Ariful Haque Professor, Dept. of EEE, BUET

This document provides an overview of the EEE 6211 Digital Speech Processing course at BUET. The course introduces key topics in speech processing including speech production and perception, analysis, coding, enhancement, synthesis, and recognition. Coursework includes presentations, projects, and a final exam. Reading materials include textbooks on digital speech processing and Matlab examples. Applications of speech processing span areas such as telephony, cellular communications, VoIP, speech synthesis in devices, and speech-enabled interfaces.

Uploaded by

Stevs Shamim
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
95 views16 pages

EEE 6211 Digital Speech Processing: Course Instructor Dr. Mohammad Ariful Haque Professor, Dept. of EEE, BUET

This document provides an overview of the EEE 6211 Digital Speech Processing course at BUET. The course introduces key topics in speech processing including speech production and perception, analysis, coding, enhancement, synthesis, and recognition. Coursework includes presentations, projects, and a final exam. Reading materials include textbooks on digital speech processing and Matlab examples. Applications of speech processing span areas such as telephony, cellular communications, VoIP, speech synthesis in devices, and speech-enabled interfaces.

Uploaded by

Stevs Shamim
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 16

EEE 6211

Digital Speech Processing

Course Instructor
Dr. Mohammad Ariful Haque
Professor, Dept. of EEE, BUET
Course Syllabus
• Introduction to Speech Processing and its
applications
• Speech production and hearing
• Speech Analysis
• Linear Prediction
• Speech Coding
• Speech Enhancement
• Text to speech synthesis
• Speech Recognition
Course Reading
• Course Textbook: Lawrence R. Rabiner and
Ronald W. Schafer, “Theory and Applications of
Digital Speech Processing”, Pearson Higher
Education, Inc., 2011
• Supplementary Textbook: Thomas F. Quatieri,
“Discrete-time Speech Signal Processing:
Principles and Practice”, Pearson Higher
Education, Inc., 2002
• Matlab Examples:
https://www.mathworks.com/academia/coursew
are/digital-speech-processing.html
Course Grading
• Presentation: 10%
• Project: 20%
• Final exam: 70%
Speech Signal
• The fundamental purpose of speech is human
communication; i.e., the transmission of
messages between a speaker and a listener.
• Speech signal can be converted to an electrical
waveform by a microphone.
Speech production/perception process
Speech production/perception process
Speech Applications
• Speech Coding
• Speech Synthesis
• Speech recognition and understanding
• Other speech applications
Speech coding
• Speech coding is the process of transforming a speech
signal into a representation for efficient transmission
and storage of speech.
• Speech coders often employ many aspects of both the
speech production and speech perception processes.
• Applications: Wired telephony, cellular communications,
voice over Internet protocol (VoIP), secure voice for
privacy and encryption, extremely narrowband
communications channels, telephone answering
machines, interactive voice response (IVR) systems, and
pre-recorded messages.
Text-to-speech synthesis
• Synthesis of speech is the process of generating a
speech signal using computational means for effective
human-machine interactions.
• There are many procedures for assembling the speech
sounds and compiling them into a proper sentence, but
the most promising one today is called “unit selection
and concatenation.” In this method, the computer
stores multiple versions of each of the basic linguistic
units of speech (phones, half phones, syllables, etc.),
and then decides which sequence of speech units
sounds best for the particular text message that is
being produced.
Text-to-speech synthesis
• Text-to-speech synthesis systems are an essential component of
modern human–machine communications systems and are used to
do things like read email messages over a telephone, provide voice
output from a GPS (global positioning system) in automobiles,
provide the voices for talking agents for completion of transactions
over the Internet, handle call center help desks and customer care
applications, serve as the voice for providing information from
handheld devices such as foreign language phrasebooks,
dictionaries, crossword puzzle helpers, and as the voice of
announcement machines that provide information such as stock
quotes, airline schedules, updates on arrivals and departures of
flights, etc.
• Another important application is in reading machines for the blind,
where an optical character recognition system provides the text
input to a speech synthesis system.
Speech recognition and other pattern
matching problems
• Speech recognition
• Speaker recognition
• Speaker verification
• Word spotting
• Automatic indexing of audio files
Other speech applications
• Speech Enhancement
• Language Translation
Speech-technology enabled devices
• Echo Plus connects to Alexa—a cloud-based voice service—to
play music, make calls, set timers and alarms, ask questions,
check traffic and weather, and more.
• Alexa can control your compatible smart lights, thermostats,
locks, garage doors, sprinklers, and more.
• Just ask for a song, artist, or genre and you can play music on
your Echo devices. Echo Plus can also play audiobooks, radio
stations, news briefs, and more.
• Call or message anyone hands-free with your Echo device.
• With seven microphones, beamforming technology, and noise
cancellation, Echo hears you from any direction—even while
music is playing.
• Alexa is always getting smarter and adding new features and
skills. Just ask Alexa to control your TV, request an Uber, order
a pizza, and more.

You might also like