Audio/Speech Signal Processing: An Overview
Application Fields
• Audio effects (FIR/IIR digital filtering and spectral modifications)
Audio/Speech Codecs
Voice Call flow through mobile
Echo Cancellation
Speech Codec
Noise Reduction
Approximate data transfer size for a 60-second call
Raw data (analog-to-digital converted samples only, no compression):
Total data size = Number of samples * Storage space for one sample
= Samples/sec * Number of seconds * Bits per sample
= 8000 * 60 * 8 bits = 3,840,000 bits = 3840 kbits (480 kilobytes)
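The arithmetic above can be checked with a short sketch. The function name `raw_call_size_bits` is hypothetical and chosen only for illustration; the values (8000 samples/sec, 60 seconds, 8 bits per sample) are the ones given on the slide.

```python
def raw_call_size_bits(sample_rate_hz: int, duration_s: int, bits_per_sample: int) -> int:
    """Size of uncompressed sampled audio, in bits."""
    return sample_rate_hz * duration_s * bits_per_sample


# Values from the slide: 8 kHz sampling, 60-second call, 8 bits per sample.
size_bits = raw_call_size_bits(8000, 60, 8)
size_kbits = size_bits / 1000        # kilobits (1 kbit = 1000 bits)
size_kbytes = size_bits / 8 / 1000   # kilobytes (1 byte = 8 bits)

print(size_bits, size_kbits, size_kbytes)
```

This raw figure is the baseline a speech codec is meant to reduce.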
[Figure: signal plots with Frequency and Time axes]
Audio and Speech Codecs
• Envelope/Stereo Processing
• Voice/Vocal Enhancement
• Bass Enhancement
• Sibilant/Fricative Smoothing
Signal delay:
y(t) = x(t) + decay*x(t-delay)
Raw Sound:
Echoed Sound:
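The signal delay equation can be sketched in discrete time as y[n] = x[n] + decay * x[n - delay], with the delay given in samples. This is a minimal illustration, not a production effect: the function name `add_echo` is hypothetical, samples before the start of the signal are assumed to be zero, and the output is kept at the input length, so the tail of the echo is cut off.

```python
def add_echo(x, delay, decay):
    """Return y[n] = x[n] + decay * x[n - delay], treating x[n] as 0 for n < 0.

    x     -- list of samples (the raw sound)
    delay -- echo delay in samples (non-negative integer)
    decay -- gain applied to the delayed copy
    """
    if delay < 0:
        raise ValueError("delay must be non-negative")
    y = []
    for n, sample in enumerate(x):
        # Before `delay` samples have elapsed there is nothing to echo yet.
        delayed = x[n - delay] if n >= delay else 0.0
        y.append(sample + decay * delayed)
    return y


# A single impulse makes the echo easy to see: it reappears 3 samples later
# at half the amplitude.
raw = [1.0, 0.0, 0.0, 0.0, 0.0, 0.0]
echoed = add_echo(raw, delay=3, decay=0.5)
print(echoed)
```

For a delay specified in seconds, the sample delay would be the delay time multiplied by the sampling rate, rounded to an integer.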
Bass Enhancement: Information in the Frequency Domain
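One way to picture bass enhancement in the frequency domain is to transform the signal, scale the low-frequency bins, and transform back. The sketch below is an assumption about how that could be done, not the method used in the slides: the function names, the cutoff, the gain, and the toy 64 Hz sampling rate are all hypothetical, and a naive O(n²) DFT is used only to keep the example free of external libraries. A practical implementation would use an FFT or a low-shelf filter.

```python
import cmath
import math


def dft(x):
    """Naive discrete Fourier transform (O(n^2)), for illustration only."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(X):
    """Naive inverse DFT; returns the real part, assuming a real input signal."""
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)).real / n
            for t in range(n)]


def boost_bass(x, sample_rate, cutoff_hz, gain):
    """Scale all frequency components at or below cutoff_hz (DC included) by gain."""
    n = len(x)
    X = dft(x)
    for k in range(n):
        # Bin k and its mirror bin n - k carry the same physical frequency.
        freq = min(k, n - k) * sample_rate / n
        if freq <= cutoff_hz:
            X[k] *= gain
    return idft(X)


# Toy signal: a 2 Hz "bass" tone plus a 10 Hz tone, sampled at 64 Hz for 1 second.
n = 64
low = [math.sin(2 * math.pi * 2 * t / n) for t in range(n)]
high = [math.sin(2 * math.pi * 10 * t / n) for t in range(n)]
signal = [a + b for a, b in zip(low, high)]

# Doubling everything at or below 4 Hz should double only the 2 Hz tone.
enhanced = boost_bass(signal, sample_rate=64, cutoff_hz=4, gain=2.0)
```

Because both tones fall exactly on DFT bins in this toy setup, the result matches `2 * low + high` up to rounding error; real audio would show some spectral leakage around the cutoff.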
Q&A Community:
Signal Processing Stack Exchange
http://dsp.stackexchange.com/
Research Labs:
• Fraunhofer Institute, Germany
• Dolby Laboratories
• Philips Research
• DTS/SRS Labs
Acknowledgment