0% found this document useful (0 votes)
26 views

Audio Compression

The document discusses audio compression techniques including MPEG audio compression. It describes the goals of audio compression as well as techniques like vocoding, LPC, mu-law compression and DPCM. It then focuses on MPEG audio compression, describing its layers, features, and techniques like quantization and psychoacoustic modeling.

Uploaded by

white paint
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
26 views

Audio Compression

The document discusses audio compression techniques including MPEG audio compression. It describes the goals of audio compression as well as techniques like vocoding, LPC, mu-law compression and DPCM. It then focuses on MPEG audio compression, describing its layers, features, and techniques like quantization and psychoacoustic modeling.

Uploaded by

white paint
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 23

Audio Compression

Usha Sree
CMSC 691M
10/12/04
Motivation

 Efficient Storage
 Streaming
 Interactive Multimedia Applications
Compression Goals
 Reduced bandwidth
 Make decoded signal sound as close as
possible to original signal
 Lowest Implementation Complexity
 Robust
 Scalable
Compression Techniques

 Voc File Compression


 Linear Predictive Coding
 Mu-law compression
 Differential Pulse Code Modulation
 MPEG
MPEG
 Moving Picture Experts Group
 Part of a multiple standard for
 Video compression
 Audio compression

 Audio, Video and Data synchronization

to an aggregate bit rate of1.5 Mbit/sec


MPEG Audio Compression
 Physically Lossy compression algorithm
 Perceptually lossless, transparent algorithm
 Exploits perceptual properties of human ear
 Psychoacoustic modeling
 MPEG Audio Standard ensures inter-operability,
defines coded bit stream syntax, defines
decoding process and guarantees decoder’s
accuracy.
MPEG Audio Features
 No assumptions about the nature of the audio
source
 Exploitation of human auditory system
perceptual limitations
 Removal of perceptually irrelevant parts of audio
signal
 It offers a sampling rate of 32, 44.1 and 48 kHz.
 Offers a choice of three independent layers
MPEG Audio Feautures cont.
 All three layers allow single chip real-time
decoder implementation
 Optional Cyclic Redundancy Check (CRC) error
detection
 Ancillary data may be included in the bit stream
 Also features such as random access, audio fast
forwarding and audio reverse are possible.
Overview
 Quantization, the key to MPEG audio
compression
 Transparent, perceptually lossless compression
 No distinction between original and 6-to-1
compressed audio clips
The Polyphase Filter Bank
 Key component common to all layers
 Divides the audio signal into 32 equal-width
frequency subbands
 The filters provide good time and reasonable
frequency resolution
 Critical bands associated with psychoacoustic
models
Psychoacoustics
 The aim is to remove irrelevant parts of the
audio signal
 The human auditory system is unable to hear
quantization noise under conditions of auditory
masking
 Masking occurs whenever a strong signal makes
a neighborhood of weaker audio signals
imperceptible
Noise masking threshold
 Human ear resolving
power is frequency
dependent
 Noise masking
threshold, at any
frequency, depends
only on the signal
energy within a
limited bandwidth
neighborhood that
frequency
The Psychoacoustic Model
 Analyzes the audio signal and computes the
amount of noise masking as a function of
frequency
 The encoder decides how best to represent the
input signal with a minimum number of bits
Basic Steps
 Time align audio data
 Convert audio to frequency domain
representation
 Process spectral values into tonal and non-tonal
components
 Apply a spreading function
 Set a lower bound for threshold values
 Find the threshold values for each subband
 Calculate the signal to mask ratio
MPEG Audio Layer I
 Simplest coding
 Suitable for bit rates above 128 kbits/sec per
channel
 Each frame contains header, an optional CRC
error check word and possibly ancillary data.
 Eg. Philips Digital Compact Cassette
MPEG Audio Layer II
 Intermediate complexity
 Bit rates around 128 kbits/sec per channel
 Digital Audio Broadcasting (DAB)
 Synchronized Video and Audio on CD-ROM
 Forms frames of 1152 samples per audio
channel.
MPEG Audio Layer III
 Based on Layer I&II filter banks
 Most complex coding
 Best audio quality
 Bit rates around 64 kbits/sec per channel
 Suitable for audio transmission over ISDN
 Compensates filter deficiencies by processing
outputs with a two different MDCT blocks.
Layer III enhancements
 Alias reduction
 Non uniform quantization
 Scalefactor bands
 Entropy coding of data values
 Use of a “bit reservoir”
MPEG and the Future?
 MPEG-1: Video CD and MP3.
 MPEG-2: Digital Television set top boxes and
DVD
 MPEG-4: Fixed and mobile web
 MPEG-7: description and search of audio and
visual content
 MPEG-21: Multimedia Framework
References
 Digital Audio Compression -
http://das.iocon.com/res/docs/pdf/Digital_Audio_Compression_01oct1993D
TJA03P8.pdf

 MPEG Audio Standard-www.cs.columbia.edu/~coms6181/slides/6R/


mpegaud.pdf
Thank You

You might also like