Multimedia Compression Techniques
Multimedia Compression Techniques
Techniques
Table of Contents
◼ Image Compression Methods
◼ JPEG
◼ GIF 89a
◼ Wavelet Compression
◼ Fractal
◼ Sound Compression
◼ MPEG Audio Overview
◼ MPEG Layer-3 (MP3)
◼ MPEG AAC
◼ Video Compression Methods
◼ H.261
◼ MPEG/MPEG-2
◼ MPEG-4
◼ MPEG-7
JPEG Compression: Basics
◼ Human vision is insensitive to high spatial frequencies
◼ JPEG Takes advantage of this by compressing high frequencies
more coarsely and storing image as frequency data
◼ JPEG is a “lossy” compression scheme.
Quantization output
Table
Example of Frequency
Quantization with 8x8 blocks
128 128 128 128 128 128 128 128 -80 4 -6 6 2 -2 -2 0
Color space values (spatial data) Color space values (spatial data)
-5 0 0 0 0 0 0 0
16 11 10 16 24 40 51 61
2 -1 1 1 0 0 0 0
12 12 14 19 26 58 60 55
1 0 0 -1 0 0 0 0
14 13 16 24 40 57 69 56
1 0 0 0 0 0 0 0
14 17 22 29 51 87 80 62
1 0 0 0 0 0 0 0
18 22 37 56 68 109 103 77
0 0 0 0 0 0 0 0
24 35 55 64 81 104 113 92
0 0 0 0 0 0 0 0
49 64 78 87 103 121 120 101
0 0 0 0 0 0 0 0
72 92 95 98 112 100 103 99
Quantized spatial frequency values
Quantization Matrix to divide by
Scanning and Huffman
Encoding
◼ Spatial Frequencies scanned in zig-zag
pattern (note high frequencies mostly zero)
◼ Huffman encoding used to losslessly record
values in table
-5 0 0 0 0 0 0 0
2 -1 1 1 0 0 0 0 0,2,1,-1,0,0,1,0,1,1,0,0,1,0,0,0,-1,0,0,… 0
1 0 0 -1 0 0 0 0
Can be stored as:
1 0 0 0 0 0 0 0
1 0 0 0 0 0 0 0
(1,2),(0,1),(0,-1),(2,1),(1,1),(0,1),(0,1),(2,1),(3,1),EOB
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
Examples of varying JPEG
compression ratios
500KB image, minimum compression 40KB image, half compression 11KB image, max compression
Close-up details of different
JPEG compression ratios
Fig. 1. Fourier basis functions, time- Fig. 2. Daubechies wavelet basis functions, time-
frequency tiles, and coverage of the frequency tiles, and coverage of the time-
time-frequency plane. frequency plane
Loop Filter
Motion
Compensation
Reference
Frame
MPEG Video Compression
◼ Supports JPEG and H.261 through downward
compatibility
◼ Supports higher Chrominance resolution and pixel
resolution (720x480 is standard used for TV signals)
◼ Supports interlaced and noninterlaced modes
◼ Uses Bidirectional prediction in “Group Of Pictures”
to encode difference frames.
Source: http://www.doc.ic.ac.uk/~nd/surprise_96/journal/vol4/sab/report.html
MPEG-4
◼ Original goal was for 10 times better
compression than H.261
◼ Goals shifted to
◼ Flexible bitstreams for varying receiver capabilities
◼ Stream can contain new applications and
algorithms
◼ Content-based interactivity with data stream
◼ Network independence (used for Internet,
Wireless, POTS, etc)
◼ Object based representations
MPEG-4 audio-visual scene
composition
◼ Can place media objects anywhere in a scene
◼ Apply transforms to change appearance or
qualities of an object
◼ Group objects to form compound objects
◼ Apply streamed data to objects
◼ Interactively change viewer’s position in the
virtual scene
http://www.iis.fraunhofer.de/amm/techinf/mpeg4/mp4_overv.pdf
MPEG-4 “Audiovisual Scene”
Example