0% found this document useful (0 votes)

4 views35 pages

Multimedia Compression Techniques

The document provides an overview of various media compression techniques, including image (JPEG, GIF, Wavelet, Fractal), sound (MPEG Audio, MP3, AAC), and video (H.261, MPEG-2, MPEG-4, MPEG-7) compression methods. It details the principles behind these techniques, such as how JPEG utilizes human vision insensitivity to high frequencies for lossy compression, and how MPEG audio takes advantage of psychoacoustic models for efficient sound encoding. Additionally, it discusses the advantages of different compression methods and their applications in digital media.

Uploaded by

lookforeverything44

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views35 pages

Multimedia Compression Techniques

Uploaded by

lookforeverything44

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 35

Media Compression

Techniques
Table of Contents
◼ Image Compression Methods
◼ JPEG
◼ GIF 89a
◼ Wavelet Compression
◼ Fractal
◼ Sound Compression
◼ MPEG Audio Overview
◼ MPEG Layer-3 (MP3)
◼ MPEG AAC
◼ Video Compression Methods
◼ H.261
◼ MPEG/MPEG-2
◼ MPEG-4
◼ MPEG-7
JPEG Compression: Basics
◼ Human vision is insensitive to high spatial frequencies
◼ JPEG Takes advantage of this by compressing high frequencies
more coarsely and storing image as frequency data
◼ JPEG is a “lossy” compression scheme.

Losslessly compressed image, ~150KB JPEG compressed, ~14KB

Digital Image Representation
◼ JPEG can handle arbitrary color spaces (RGB,
CMYK, YCbCr (separates colors into grayscale
components)
◼ Luminance/Chrominance commonly used,
with Chrominance subsampled due to human
vision insensitivity
◼ Uncompressed spatial color data components
are stored in quantized values (8, 16, 24bit,
etc).
Flow Chart of JPEG
Compression Process
◼ Divide image into 8x8 pixel blocks
◼ Apply 2D Fourier Discrete Cosine Transform
(FDCT) Transform
◼ Apply coarse quantization to high spatial
frequency components
◼ Compress resulting data losslessly and store

8x8 Frequency JPEG

pixel FDCT Dependent Zig-zag Huffman
encoding syntax
blocks quantization scan generator

Quantization output
Table
Example of Frequency
Quantization with 8x8 blocks
128 128 128 128 128 128 128 128 -80 4 -6 6 2 -2 -2 0

118 111 112 117 120 123 123 122 24 -8 8 12 0 0 0 2

125 121 115 111 119 119 118 117 10 -4 0 -12 -4 4 4 -2

120 121 113 113 125 124 115 108 8 0 -2 -6 10 4 -2 0

120 120 116 119 124 120 115 110 18 4 -4 6 -8 -4 0 0

117 113 111 122 120 110 116 119 -2 8 6 -4 0 -2 0 0

109 113 111 122 120 110 116 119 12 0 6 0 0 0 -2 -2

111 121 124 118 115 121 117 113 0 8 0 -4 -2 0 0 0

Color space values (spatial data) Color space values (spatial data)

-5 0 0 0 0 0 0 0
16 11 10 16 24 40 51 61
2 -1 1 1 0 0 0 0
12 12 14 19 26 58 60 55
1 0 0 -1 0 0 0 0
14 13 16 24 40 57 69 56
1 0 0 0 0 0 0 0
14 17 22 29 51 87 80 62
1 0 0 0 0 0 0 0
18 22 37 56 68 109 103 77
0 0 0 0 0 0 0 0
24 35 55 64 81 104 113 92
0 0 0 0 0 0 0 0
49 64 78 87 103 121 120 101
0 0 0 0 0 0 0 0
72 92 95 98 112 100 103 99
Quantized spatial frequency values
Quantization Matrix to divide by
Scanning and Huffman
Encoding
◼ Spatial Frequencies scanned in zig-zag
pattern (note high frequencies mostly zero)
◼ Huffman encoding used to losslessly record
values in table

-5 0 0 0 0 0 0 0

2 -1 1 1 0 0 0 0 0,2,1,-1,0,0,1,0,1,1,0,0,1,0,0,0,-1,0,0,… 0
1 0 0 -1 0 0 0 0
Can be stored as:
1 0 0 0 0 0 0 0

1 0 0 0 0 0 0 0
(1,2),(0,1),(0,-1),(2,1),(1,1),(0,1),(0,1),(2,1),(3,1),EOB
0 0 0 0 0 0 0 0

0 0 0 0 0 0 0 0

0 0 0 0 0 0 0 0
Examples of varying JPEG
compression ratios

500KB image, minimum compression 40KB image, half compression 11KB image, max compression
Close-up details of different
JPEG compression ratios

Uncompressed image Half compression, blurring & Max compression, 8-pixel

(roughness between pixels halos around sharp edges blocks apparent, large
still visible) distortion in high-frequency
areas
JPEG Encoding modes
◼ Sequential mode
◼ Image scanned in a raster scan with single pass, 8-bit
resolution
◼ Sequential mode
◼ Step-by-step buildup of image from low to high frequency,
useful for applications with long loading times (internet,
portable devices, etc)
◼ Hierarchical mode
◼ Encoded using low spatial resolution image and encoding
higher resolution images based on interpolated difference,
for display on varying equipment
GIF 89a Image Compression
◼ Compuserve’s image compression
format
◼ Best for images with sharp edges, low
bits per channel, computer graphics
where JPEG spatial averaging is
inadequate
◼ Usually used with 8-bit images, whereas
JPEG is better for 16-bit images.
GIF 89a examples vs. JPEG

GIF Image, 7.5KB, JPEG, blotchy spots in

optimal encoding single-color areas
Wavelet Image Compression

◼ Optimal for images containing

sharp edges, or continuous
curves/lines (fingerprints)
◼ Compared with DCT, uses
more optimal set of functions
to represent sharp edges
than cosines.
◼ Wavelets are finite in extent
as opposed to sinusoidal
Several different families of wavelets.
functions
Source: “An Introduction to Wavelets”.
http://www.amara.com/IEEEwave/IEEEwavelet.html#contents
Wavelet vs. JPEG compression

Wavelet compression JPEG compression

file size: 1861 bytes file size: 1895 bytes
compression ratio - 105.6 compression ratio - 103.8

Source: “About Wavelet Compression”. http://www.barrt.ru/parshukov/about.htm.

Wavelet compression
advantages

Fig. 1. Fourier basis functions, time- Fig. 2. Daubechies wavelet basis functions, time-
frequency tiles, and coverage of the frequency tiles, and coverage of the time-
time-frequency plane. frequency plane

Source: “An Introduction to Wavelets”.

http://www.amara.com/IEEEwave/IEEEwavelet.html#contents
Fractal Based Image
Compression
◼ Image compressed in terms of self-
similarity rather than pixel resolution
◼ Can be digitally scaled to any resolution
when decoded
Table of Contents
◼ Image Compression Methods
◼ JPEG
◼ GIF 89a
◼ Wavelet Compression
◼ Fractal
◼ Sound Compression
◼ MPEG Audio Overview
◼ MPEG Layer-3 (MP3)
◼ MPEG AAC
◼ Video Compression Methods
◼ H.261
◼ MPEG/MPEG-2
◼ MPEG-4
◼ MPEG-7
MPEG Audio basics &
Psychoacoustic Model
◼ Human hearing limited to values lower than
~20kHz in most cases
◼ Human hearing is insensitive to quiet
frequency components to sound
accompanying other stronger frequency
components
◼ Stereo audio streams contain largely
redundant information
◼ MPEG audio compression takes advantage of
these facts to reduce extent and detail of
mostly inaudible frequency ranges
MPEG-Layer3 Overview

MP3 Compression Flow Chart

MPEG Layer-3 performance
sound quality bandwidth mode bitrate reduction ratio

telephone sound 2.5 kHz mono 8 kbps * 96:1

better than short wave 4.5 kHz mono 16 kbps 48:1

better than AM radio 7.5 kHz mono 32 kbps 24:1

similar to FM radio 11 kHz stereo 56...64 kbps 26...24:1

near-CD 15 kHz stereo 96 kbps 16:1

CD >15 kHz stereo 112..128kbps 14..12:1

MPEG-2 Advanced Audio Coding
(AAC) codec (next generation)
◼ Sampling frequencies from 8kHz to 96kHz
◼ 1 to 48 channels per stream
◼ Temporal Noise Shaping (TNS) smooths
quantization noise by making frequency
domain predictions
◼ Prediction: Allows predictable sound patterns
such as speech to be predicted and
compressed with better quality
MPEG-2 AAC Flowchart
Table of Contents
◼ Image Compression Methods
◼ JPEG
◼ GIF 89a
◼ Wavelet Compression
◼ Fractal
◼ Sound Compression
◼ MPEG Audio Overview
◼ MPEG Layer-3 (MP3)
◼ MPEG AAC
◼ Video Compression Methods
◼ H.261
◼ MPEG/MPEG-2
◼ MPEG-4
◼ MPEG-7
Video Compression with
Temporal Redundancy
◼ Using strictly spatial redundancy (JPEG)
gives video compression ratios from 7:1
to 27:1
◼ Taking advantage of temporal
redundancy in video gives 20:1 to
300:1 compression for H.261, or 30:1
to 100:1 for high quality MPEG-2
Videoconferencing
Compression with H.261
◼ H.261 is standard recommended for
videoconferencing over ISDN lines.
◼ Takes advantage of both spatial and
temporal redundancy in moving images
◼ Extremely similar to JPEG, but uses
initial frame plus motion vectors to
predict subsequent frames
H.261 Block Structure
◼ Basic unit of processing is in 8x8 pixel
blocks.
◼ Macro Blocks (MB, 16x16 pixels) are
used for motion estimation, 4 blocks of
luminance, 2 of chrominance
◼ Groups of Blocks (GOB) of 3x11 MB’s
are stored together with a header in
stream.
H.261 Block Structure of
bitstream

Block structure of H.261 video bitstream, Common Intermediate Format

(CIF), 360x288 pixels luminance, 180x144 pixels chrominance

Source: “H.261 Videoconferencing Codec” http://www.uh.edu/~hebert/ece6354/H261-report.pdf

H.261 Decoding (Similar to
encoding process)

Loop Filter

Encoded Bitstream Inverse Decompressed

Bitstream DEcoder Quantizer IDCT Video

Motion
Compensation

Reference
Frame
MPEG Video Compression
◼ Supports JPEG and H.261 through downward
compatibility
◼ Supports higher Chrominance resolution and pixel
resolution (720x480 is standard used for TV signals)
◼ Supports interlaced and noninterlaced modes
◼ Uses Bidirectional prediction in “Group Of Pictures”
to encode difference frames.

“Group Of Pictures” inter-frame dependencies in a stream

Source: “Parallelization of Software Mpeg Compression” http://www.evl.uic.edu/fwang/mpeg.html

MPEG 1 & 2 Bitstream

The MPEG data hierarchy

Source: http://www.doc.ic.ac.uk/~nd/surprise_96/journal/vol4/sab/report.html
MPEG-4
◼ Original goal was for 10 times better
compression than H.261
◼ Goals shifted to
◼ Flexible bitstreams for varying receiver capabilities
◼ Stream can contain new applications and
algorithms
◼ Content-based interactivity with data stream
◼ Network independence (used for Internet,
Wireless, POTS, etc)
◼ Object based representations
MPEG-4 audio-visual scene
composition
◼ Can place media objects anywhere in a scene
◼ Apply transforms to change appearance or
qualities of an object
◼ Group objects to form compound objects
◼ Apply streamed data to objects
◼ Interactively change viewer’s position in the
virtual scene

http://www.iis.fraunhofer.de/amm/techinf/mpeg4/mp4_overv.pdf
MPEG-4 “Audiovisual Scene”
Example

Source: “MPEG-4 Overview” http://www.chiariglione.org/mpeg/standards/mpeg-4/mpeg-4.htm

MPEG-7
◼ Media tagging format for doing searches on arbitrary media formats via
feature extraction algorithms
◼ Visual descriptors such as:
◼ Basic Structures
◼ Color
◼ Texture
◼ Shape
◼ Localization of spatio-temporal objects
◼ Motion
◼ Face Recognition
◼ Audio descriptors such as :
◼ Sound effects description
◼ Musical Instrument Timbre Description
◼ Spoken Content Description
◼ Melodic Descriptors (search by tune)
◼ Uniform Silence Segment
◼ Example application: Play a few notes on a keyboard and have
matched song retrieved.
Conclusion
◼ Media compression is indispensable
even as storage and streaming
capacities increase
◼ Future goals oriented towards
increasing ease of access to media
information (similar to google for text
based information)

General Systems Theory
100% (1)
General Systems Theory
7 pages
Overview of Multimedia Compression Technologies: Vinay Kumar
No ratings yet
Overview of Multimedia Compression Technologies: Vinay Kumar
34 pages
Multimedia Compression: Audio, Image and Video Require Vast Amounts of Data
100% (1)
Multimedia Compression: Audio, Image and Video Require Vast Amounts of Data
59 pages
Technical Notes On Digital Audio and Video Compression
No ratings yet
Technical Notes On Digital Audio and Video Compression
2 pages
Compression: DMET501 - Introduction To Media Engineering
No ratings yet
Compression: DMET501 - Introduction To Media Engineering
26 pages
JPEG Standard, MPEG and Recognition
No ratings yet
JPEG Standard, MPEG and Recognition
32 pages
Ch-5 Data Compression
No ratings yet
Ch-5 Data Compression
58 pages
Video Compression Makes Big Gains
No ratings yet
Video Compression Makes Big Gains
4 pages
Iamge Video Compression V2
No ratings yet
Iamge Video Compression V2
29 pages
Data Representation (Part 3)
No ratings yet
Data Representation (Part 3)
26 pages
Data Compression
No ratings yet
Data Compression
19 pages
Chapter - 5 Data Compression
No ratings yet
Chapter - 5 Data Compression
8 pages
Compression Techniques
No ratings yet
Compression Techniques
7 pages
Data Compression Techniques
No ratings yet
Data Compression Techniques
21 pages
Beginner Guide For MPEG-2 Standard
No ratings yet
Beginner Guide For MPEG-2 Standard
12 pages
Wk8 MPEG Part1
No ratings yet
Wk8 MPEG Part1
36 pages
Siddu 02
No ratings yet
Siddu 02
25 pages
Compression Techniques
No ratings yet
Compression Techniques
17 pages
Itc - Mpeg Case Study
No ratings yet
Itc - Mpeg Case Study
29 pages
Itc - Mpeg Case Study
No ratings yet
Itc - Mpeg Case Study
12 pages
Week 14 Compression Techniques
100% (1)
Week 14 Compression Techniques
81 pages
Data Represintation and Storage
No ratings yet
Data Represintation and Storage
19 pages
MPEG Video Compression
No ratings yet
MPEG Video Compression
14 pages
Compression
No ratings yet
Compression
39 pages
MMS - Unit3-Part-1
No ratings yet
MMS - Unit3-Part-1
45 pages
Presentation (1) - Read-Only
No ratings yet
Presentation (1) - Read-Only
17 pages
Dolby Ac3
No ratings yet
Dolby Ac3
43 pages
MC-12 (MPEG Video Compression)
No ratings yet
MC-12 (MPEG Video Compression)
22 pages
Video Compression MPEG
100% (3)
Video Compression MPEG
31 pages
Special Topics Data Compression
No ratings yet
Special Topics Data Compression
51 pages
Asynchronous (Cervo Ramboyong)
No ratings yet
Asynchronous (Cervo Ramboyong)
16 pages
Video Compression MPEG
No ratings yet
Video Compression MPEG
25 pages
MPEG Audio - Compression - 2
No ratings yet
MPEG Audio - Compression - 2
5 pages
Analysis and Implementation of Video Compression Using MPEG Standard
No ratings yet
Analysis and Implementation of Video Compression Using MPEG Standard
13 pages
Mpeg 4 1109
No ratings yet
Mpeg 4 1109
38 pages
JPEG, Basic Ideas, Standards H.261, MPEG-1, MPEG-2 AVC, HEVC, Container Formats
No ratings yet
JPEG, Basic Ideas, Standards H.261, MPEG-1, MPEG-2 AVC, HEVC, Container Formats
20 pages
Data Compression
No ratings yet
Data Compression
29 pages
Compression
No ratings yet
Compression
32 pages
JPEG and H.26x Standards
No ratings yet
JPEG and H.26x Standards
30 pages
Mul c5
No ratings yet
Mul c5
115 pages
Wavelet-Based Image Compression: By: Heriniaina Andrianirina #1817662 Akakpo Agbago #1817699
No ratings yet
Wavelet-Based Image Compression: By: Heriniaina Andrianirina #1817662 Akakpo Agbago #1817699
19 pages
JPEG and MPEG Image Compression
No ratings yet
JPEG and MPEG Image Compression
3 pages
MP3 Audio & Introduction To MPEG-4 (Part 6) : Klara Nahrstedt Spring 2011
100% (1)
MP3 Audio & Introduction To MPEG-4 (Part 6) : Klara Nahrstedt Spring 2011
37 pages
Lesson 06
No ratings yet
Lesson 06
38 pages
Audio Compression Standards: James Rodney P. Santiago
No ratings yet
Audio Compression Standards: James Rodney P. Santiago
51 pages
85 Video Compress
No ratings yet
85 Video Compress
28 pages
Digital Video
No ratings yet
Digital Video
26 pages
CG Unit 5
No ratings yet
CG Unit 5
76 pages
ضغط الصوت
No ratings yet
ضغط الصوت
31 pages
Image and Video Compression
No ratings yet
Image and Video Compression
8 pages
Image and Video Compression: Communication Systems EECE 453 Hassan Mansour
No ratings yet
Image and Video Compression: Communication Systems EECE 453 Hassan Mansour
8 pages
CH 2. Size 10.5 Finalised
No ratings yet
CH 2. Size 10.5 Finalised
25 pages
Image and Video Compression Standards.4
No ratings yet
Image and Video Compression Standards.4
24 pages
02 2 CH8
No ratings yet
02 2 CH8
34 pages
Multimedia
No ratings yet
Multimedia
14 pages
Video Coding
No ratings yet
Video Coding
23 pages
Lecture8 Dcbe4
No ratings yet
Lecture8 Dcbe4
33 pages
Computer Networks 2 10CS64 UNIT 7 Notes
No ratings yet
Computer Networks 2 10CS64 UNIT 7 Notes
22 pages
Video/Image Compression Technologies: An Overview
No ratings yet
Video/Image Compression Technologies: An Overview
37 pages
Image Compression Standards: by A.Raju EC094201 M.Tech, ACS
No ratings yet
Image Compression Standards: by A.Raju EC094201 M.Tech, ACS
28 pages
Mega Drive Architecture: Architecture of Consoles: A Practical Analysis, #3
From Everand
Mega Drive Architecture: Architecture of Consoles: A Practical Analysis, #3
Rodrigo Copetti
No ratings yet
Water and Steam Chemistry, Deposits and Corrosion.
No ratings yet
Water and Steam Chemistry, Deposits and Corrosion.
41 pages
Ensayos de Permeabilidad
No ratings yet
Ensayos de Permeabilidad
27 pages
6715 4545 Global Support Toll Free Numbers
No ratings yet
6715 4545 Global Support Toll Free Numbers
1 page
Sol 5
No ratings yet
Sol 5
7 pages
TCS Digital - Quantitative Aptitude
No ratings yet
TCS Digital - Quantitative Aptitude
39 pages
Cambridge Ext2 Ch1 Complex Numbers IWEB
No ratings yet
Cambridge Ext2 Ch1 Complex Numbers IWEB
62 pages
9 Fraunhofer Snail Trails
No ratings yet
9 Fraunhofer Snail Trails
4 pages
Mathlogicp1 PDF
No ratings yet
Mathlogicp1 PDF
122 pages
Unit 6
No ratings yet
Unit 6
9 pages
PROKON Footing-Design-Calculation PDF
100% (3)
PROKON Footing-Design-Calculation PDF
45 pages
Basic Programming Sample Paper
No ratings yet
Basic Programming Sample Paper
11 pages
Macros 4 B
No ratings yet
Macros 4 B
5 pages
SKYMARK C-Series Water-Cooled Self-Contained Units Catalog (1112)
No ratings yet
SKYMARK C-Series Water-Cooled Self-Contained Units Catalog (1112)
24 pages
Suggestion For ICT-2024
No ratings yet
Suggestion For ICT-2024
5 pages
Classification of Air Masses and Fronts - Geography Optional - UPSC - Digitally Learn
No ratings yet
Classification of Air Masses and Fronts - Geography Optional - UPSC - Digitally Learn
14 pages
Chapter 17 Exercise
0% (1)
Chapter 17 Exercise
3 pages
Omkw 1
No ratings yet
Omkw 1
32 pages
ISI-Entrance Solutions: Economics
No ratings yet
ISI-Entrance Solutions: Economics
4 pages
Design Process en 1993-1!3!2006
No ratings yet
Design Process en 1993-1!3!2006
22 pages
SAP Business Explorer Tools
No ratings yet
SAP Business Explorer Tools
12 pages
E GMAT SC Complete StudyPlan
No ratings yet
E GMAT SC Complete StudyPlan
6 pages
CAL Script For MDG - Governing Profit Center
No ratings yet
CAL Script For MDG - Governing Profit Center
29 pages
Using The Leica TC 307 v2
No ratings yet
Using The Leica TC 307 v2
4 pages
Statics - Chapter 5
No ratings yet
Statics - Chapter 5
12 pages
Protech Controller LF-313LD
100% (4)
Protech Controller LF-313LD
2 pages
Precedence, Dominance and C-Command: Binding Theory
100% (1)
Precedence, Dominance and C-Command: Binding Theory
6 pages
Chassis Control Systems
100% (2)
Chassis Control Systems
312 pages
Preheat Calculation 2 PDF
No ratings yet
Preheat Calculation 2 PDF
3 pages
A Report On Switchyard Equipment Testing (132
No ratings yet
A Report On Switchyard Equipment Testing (132
11 pages

Multimedia Compression Techniques

Uploaded by

Multimedia Compression Techniques

Uploaded by

Media Compression

Losslessly compressed image, ~150KB JPEG compressed, ~14KB

8x8 Frequency JPEG

118 111 112 117 120 123 123 122 24 -8 8 12 0 0 0 2

125 121 115 111 119 119 118 117 10 -4 0 -12 -4 4 4 -2

120 121 113 113 125 124 115 108 8 0 -2 -6 10 4 -2 0

120 120 116 119 124 120 115 110 18 4 -4 6 -8 -4 0 0

117 113 111 122 120 110 116 119 -2 8 6 -4 0 -2 0 0

109 113 111 122 120 110 116 119 12 0 6 0 0 0 -2 -2

111 121 124 118 115 121 117 113 0 8 0 -4 -2 0 0 0

Uncompressed image Half compression, blurring & Max compression, 8-pixel

GIF Image, 7.5KB, JPEG, blotchy spots in

◼ Optimal for images containing

Wavelet compression JPEG compression

Source: “About Wavelet Compression”. http://www.barrt.ru/parshukov/about.htm.

Source: “An Introduction to Wavelets”.

MP3 Compression Flow Chart

telephone sound 2.5 kHz mono 8 kbps * 96:1

better than short wave 4.5 kHz mono 16 kbps 48:1

better than AM radio 7.5 kHz mono 32 kbps 24:1

similar to FM radio 11 kHz stereo 56...64 kbps 26...24:1

near-CD 15 kHz stereo 96 kbps 16:1

CD >15 kHz stereo 112..128kbps 14..12:1

Block structure of H.261 video bitstream, Common Intermediate Format

Source: “H.261 Videoconferencing Codec” http://www.uh.edu/~hebert/ece6354/H261-report.pdf

Encoded Bitstream Inverse Decompressed

“Group Of Pictures” inter-frame dependencies in a stream

Source: “Parallelization of Software Mpeg Compression” http://www.evl.uic.edu/fwang/mpeg.html

The MPEG data hierarchy

Source: “MPEG-4 Overview” http://www.chiariglione.org/mpeg/standards/mpeg-4/mpeg-4.htm

You might also like