Basic Audio Compression Techniques
Aron Thomas
Some key concepts
1. Hearing Threshold
• It defines the minimum volume at which a sound is perceivable.
• Any sound below this threshold can be safely discarded.
• Ex. Soft background noise that is not audible can be removed without affecting the perceived audio quality.
2. Frequency Masking
• Occurs when a sound that would normally be audible is masked by another, louder sound at a nearby frequency.
• Audio compression techniques employ this property to remove sounds that
are masked, thereby reducing the file size.
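As a rough illustration of discarding inaudible content, the sketch below zeroes spectral bins that fall far below the loudest component. A single flat threshold is assumed here for simplicity; real codecs use a frequency-dependent hearing threshold that is raised further around strong maskers.

```python
import numpy as np

def drop_inaudible_bins(signal, threshold_db=-60.0):
    """Zero spectral bins whose level falls below a threshold.
    A flat threshold is an assumption; real codecs use a
    frequency-dependent threshold raised further by masking."""
    spectrum = np.fft.rfft(signal)
    mag = np.abs(spectrum)
    ref = mag.max() if mag.max() > 0 else 1.0
    level_db = 20 * np.log10(np.maximum(mag / ref, 1e-12))
    spectrum[level_db < threshold_db] = 0   # inaudible -> safe to delete
    return np.fft.irfft(spectrum, n=len(signal))

# A loud 440 Hz tone plus a very soft 5 kHz tone: the soft tone
# sits below the threshold and is removed entirely.
sr = 16000
t = np.arange(sr) / sr
x = np.sin(2 * np.pi * 440 * t) + 1e-5 * np.sin(2 * np.pi * 5000 * t)
y = drop_inaudible_bins(x)
```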
3. Temporal Masking
• Occurs when a strong sound is preceded or followed in time by a weaker
sound at a nearby or same frequency.
• Affects sounds occurring closely in time.
• Used to eliminate inaudible sounds occurring around louder ones.
4. Critical Bands
• The range of audible frequencies can be divided into critical bands.
• Critical bands are determined according to the sound perception of the ear.
• There are 27 critical bands; audio compression techniques assess each critical band for inaudible sound data and remove it.
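Critical bands are narrow at low frequencies and wide at high ones. One common way to model this is Zwicker's Bark-scale approximation, sketched here (the exact band count and edges vary between models, so this is illustrative):

```python
import math

def bark(f_hz):
    """Zwicker's approximation of the Bark (critical-band) scale:
    one integer step corresponds roughly to one critical band."""
    return 13.0 * math.atan(0.00076 * f_hz) + 3.5 * math.atan((f_hz / 7500.0) ** 2)

def critical_band(f_hz):
    """Index of the critical band containing frequency f_hz."""
    return int(bark(f_hz))

# Bands widen with frequency: 100 Hz and 300 Hz fall in different
# bands, while 5000 Hz and 5200 Hz share one.
```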
Audio Compression
• Audio compression reduces the storage size of audio files while preserving perceived sound quality.
• Compression techniques can be lossy or lossless.
• Two features of audio compression:
• It can be lossy.
• It requires fast decoding.
• Audio compression methods are therefore asymmetric: encoding may be slow, but decoding must be fast.
• This is also the reason why dictionary-based methods are not used for audio compression.
Conventional Methods
• RLE works well only where there are long runs of identical samples.
• 8-bit audio can produce such runs. 16-bit audio, however, has higher sample variability, so RLE is ineffective on it.
• Statistical methods do not respond well to 8-bit audio, but may respond well to 16-bit audio.
• Dictionary-based methods are not well suited for audio compression, as perceptually similar audio can differ by minute changes in its samples.
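The RLE point can be seen directly: coarse 8-bit quantization flattens tiny variations into runs, while 16-bit resolution preserves them. A minimal run-length encoder (the sample values below are invented for illustration):

```python
def rle_encode(samples):
    """Run-length encode a sequence into (value, run_length) pairs."""
    runs = []
    for s in samples:
        if runs and runs[-1][0] == s:
            runs[-1] = (s, runs[-1][1] + 1)
        else:
            runs.append((s, 1))
    return runs

# A quiet passage at 8-bit resolution: small variations collapse
# into long runs of identical samples.
eight_bit = [128, 128, 128, 128, 129, 129, 128, 128]
# The same passage at 16-bit resolution keeps the tiny variations,
# so almost every sample differs from its neighbour.
sixteen_bit = [32768, 32771, 32769, 32774, 32770, 32773, 32772, 32775]
```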
Lossy Audio Compression
• The data is first normalized to the range [−1, 1]; the compression formula is then applied, and the result is scaled to the range [−256, 256].
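The slide does not name the formula; μ-law companding (with μ = 255) is one common choice consistent with normalizing to [−1, 1] and scaling to [−256, 256], sketched here as an assumption:

```python
import math

MU = 255  # companding parameter; assumed, the slide does not name the formula

def compress(sample, peak=32768):
    """Normalize to [-1, 1], apply mu-law companding, scale to [-256, 256]."""
    x = sample / peak                                            # [-1, 1]
    y = math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)
    return round(256 * y)                                        # [-256, 256]

def expand(code, peak=32768):
    """Approximately invert compress() (exact up to rounding)."""
    y = code / 256
    x = math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)
    return x * peak
```

The logarithmic curve spends more of the coarse output range on quiet samples, where the ear is most sensitive to error.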
ADPCM Audio Compression
• Adjacent audio samples tend to be similar. Thus, one can code the differences between successive samples rather than their absolute values.
• Such a kind of compression method is referred to as Differential Pulse
Code Modulation.
• ADPCM stands for Adaptive Differential Pulse Code Modulation.
• It employs linear prediction. It uses previous samples to predict the
current sample, computes the difference between them, and
quantizes the difference.
• Decoding is done by multiplying each code by the quantization step and adding the resulting difference to the predicted value.
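A minimal sketch of this predict/quantize loop, assuming a fixed uniform quantizer for clarity (real ADPCM adapts the step size, and the worked example below uses a different quantizer, so its codes differ):

```python
STEP = 4  # quantization step; fixed here, adapted in real ADPCM

def encode(samples):
    """Code each sample as the quantized difference from the
    predicted (previous reconstructed) sample."""
    codes, predicted = [], samples[0]
    for x in samples:
        d = x - predicted
        c = round(d / STEP)      # quantize the difference
        codes.append(c)
        predicted += c * STEP    # track what the decoder will see
    return samples[0], codes

def decode(first, codes):
    """Multiply each code by the step, add it to the running prediction."""
    out, predicted = [], first
    for c in codes:
        predicted += c * STEP    # dequantize and accumulate
        out.append(predicted)
    return out
```

Predicting from the *reconstructed* sample (not the original) keeps encoder and decoder in lockstep, so quantization error does not accumulate.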
• Consider 16-bit input samples [1000, 1012, 1020, 1008]
Encoding
X[n]   Xp[n−1]   D[n]=X[n]−Xp[n−1]   Quantized C[n]   Dequantized Dq[n]   Xp[n]
1000   1000        0                 0000                0                1000
1012   1000       12                 0011               10                1010
1020   1010       10                 0010                8                1018
1008   1018      −10                 1101               −8                1010
Decoding
C[n]   Xp[n−1]   Dq[n]   X[n] (reconstructed)
0000   1000        0     1000
0011   1000       10     1010
0010   1010        8     1018
1101   1018       −8     1010
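The decoding column can be checked by accumulating the dequantized differences Dq[n] onto the running prediction:

```python
def adpcm_decode(start, dq_values):
    """Reconstruct each sample as Xp[n-1] + Dq[n]."""
    out, xp = [], start
    for dq in dq_values:
        xp += dq
        out.append(xp)
    return out

# Dq column from the decoding table, starting prediction 1000
result = adpcm_decode(1000, [0, 10, 8, -8])  # -> [1000, 1010, 1018, 1010]
```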
Lossless Audio Compression
• Lossless audio compression techniques work by removing redundancies in the audio signal.
• The resulting files are larger than those produced by lossy compression methods.
• Such files are used in music production, Blu-ray audio, and archival/long-term storage of audio.
• Some lossless audio codecs include FLAC, ALAC, and MLP.
MLP Audio
• Stands for Meridian Lossless Packing.
• Developed by Meridian Audio, it compresses high-fidelity digital audio by eliminating redundancy without loss of data.
• It supports up to 192 kHz sample rates and 63 channels, and is optimized for DVD-Audio.
• It can also handle variable sample rates.
• Steps include:
1. Lossless Processing
- Removes any unnecessary information in the audio signals
2. Matrixing
- Redundancy between similar audio signals across channels is removed using an affine transformation matrix.
3. IIR Filtering
- Predicts next samples based on the previous ones and stores the differences.
4. Entropy Encoding
- These differences are further compressed using entropy encoding through variable-length codes.
5. FIFO buffering
- This output is put into a FIFO buffer to smooth data output.
• Output from the buffer is divided into packets, and check bits and restart points are added.
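MLP's actual matrices are defined by the format specification; as a minimal illustration of lossless inter-channel decorrelation (step 2), one can store one channel plus the difference, which is exactly invertible:

```python
def matrix(left, right):
    """Replace (L, R) with (L, L - R): when channels are similar,
    the difference channel is near zero and entropy-codes cheaply."""
    return left, [l - r for l, r in zip(left, right)]

def dematrix(left, side):
    """Exactly invert the transform: R = L - (L - R)."""
    return left, [l - s for l, s in zip(left, side)]
```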
Shorten
• A simple, special-purpose, lossless compressor for waveform files.
• Works on any file whose samples rise and fall like a waveform.
• Performs best on low-amplitude, low-frequency waveforms.
• Compression process:
1. Partition the audio into blocks.
2. Predict each sample using the previous sample.
3. Encode the differences using a variable-size code.
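The steps above can be sketched as follows. Shorten's actual predictors and code parameters are configurable; a first-order predictor and a Rice code with an assumed parameter k are used here:

```python
def zigzag(d):
    """Map signed residuals to non-negative ints: 0,-1,1,-2,... -> 0,1,2,3,..."""
    return (d << 1) if d >= 0 else ((-d << 1) - 1)

def rice_encode(residuals, k):
    """Rice code: unary quotient, then k binary remainder bits per symbol."""
    bits = []
    for d in residuals:
        u = zigzag(d)
        q, r = u >> k, u & ((1 << k) - 1)
        bits.extend([1] * q + [0])                        # unary part
        bits.extend((r >> i) & 1 for i in range(k - 1, -1, -1))
    return bits

def compress_block(block, k=2):
    """Predict each sample from the previous one, Rice-code the residuals."""
    residuals = [block[0]] + [block[i] - block[i - 1] for i in range(1, len(block))]
    return rice_encode(residuals, k)

# A smooth, low-amplitude block: small residuals yield short codes.
bits = compress_block([0, 1, 2, 2, 1, 0, 1, 2])
```

Small residuals produce short unary prefixes, which is why prediction before entropy coding pays off on smooth waveforms.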