Data Compression
Data compression implies sending or storing a smaller
number of bits. Although many methods are used for this
purpose, in general these methods can be divided into two
broad categories: lossless and lossy methods.
[Figure: codes/codewords pass through the decoder (decompression) to produce the output data]
Compression Techniques
- Entropy encoding: run-length coding, Huffman coding, arithmetic coding
- Prediction: DPCM, DM
- Transformation: FFT, DCT
- Sub-band coding
- Vector quantization
- JPEG
15-1 LOSSLESS COMPRESSION
Run-length encoding
Run-length encoding is probably the simplest method of
compression. It can be used to compress data made of any
combination of symbols.
It does not need to know the frequency of occurrence of
symbols and can be very efficient if data is represented as 0s
and 1s.
The general idea behind this method is to replace
consecutive repeating occurrences of a symbol by one
occurrence of the symbol followed by the number of
occurrences.
The method can be even more efficient if the data uses only
two symbols (for example 0 and 1) in its bit pattern and one
symbol is more frequent than the other.
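The idea above can be sketched in a few lines of Python (an illustrative implementation, not part of the original slides): each run of a repeated symbol is replaced by the symbol and the number of occurrences.

```python
def rle_encode(data):
    """Run-length encode: each run of a repeated symbol becomes a
    (symbol, run length) pair."""
    if not data:
        return []
    runs = []
    current, length = data[0], 1
    for symbol in data[1:]:
        if symbol == current:
            length += 1            # still inside the same run
        else:
            runs.append((current, length))
            current, length = symbol, 1
    runs.append((current, length)) # flush the final run
    return runs

def rle_decode(runs):
    """Inverse operation: expand each (symbol, run length) pair."""
    return "".join(symbol * length for symbol, length in runs)
```

For example, `rle_encode("AAAABBBAABBBBB")` yields `[("A", 4), ("B", 3), ("A", 2), ("B", 5)]`; the longer the runs, the fewer pairs are needed relative to the input length.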
Run-length encoding example
Run-length encoding for two symbols
Huffman coding
Huffman coding assigns shorter codes to symbols that occur
more frequently and longer codes to those that occur less
frequently.
For example, imagine we have a text file that uses only five
characters (A, B, C, D, E).
Before we can assign bit patterns to each character, we
assign each character a weight based on its frequency of use.
In this example, assume that the frequency of the characters
is as shown in the table below.
Steps to produce a binary code tree by reducing redundancy.
Frequency of character
Algorithm
a. Make a leaf node for each code symbol and attach the
symbol's probability of occurrence (its frequency count) to
the node; arrange the symbols in ascending order of
frequency.
b. Take the two nodes with the smallest probabilities
(frequencies) and connect them into a new node.
1. Add 0 or 1 to each of the two branches.
2. The probability of the new node is the sum of the
probabilities of the two connected nodes.
c. If there is only one node left, the code construction is
complete. If not, go back to (b).
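Steps (a)-(c) can be sketched in Python with a min-heap standing in for the sorted list of nodes. The frequencies used here are assumed for illustration only; the algorithm fixes the code lengths, while the exact bit patterns depend on which branch is labelled 0 and which 1.

```python
import heapq
from itertools import count

def build_huffman_codes(freqs):
    """Huffman's algorithm: start with one leaf per symbol, repeatedly
    merge the two lowest-frequency nodes (labelling one branch 0 and
    the other 1), and stop when a single node, the root, remains."""
    tiebreak = count()  # keeps heap comparisons away from the code tables
    heap = [(f, next(tiebreak), {sym: ""}) for sym, f in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)    # two smallest frequencies
        f2, _, right = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in left.items()}        # left branch: 0
        merged.update({s: "1" + c for s, c in right.items()}) # right branch: 1
        heapq.heappush(heap, (f1 + f2, next(tiebreak), merged))
    return heap[0][2]

# Assumed frequencies for the five characters (illustrative only):
codes = build_huffman_codes({"A": 17, "B": 12, "C": 12, "D": 27, "E": 32})
```

With these assumed frequencies the algorithm produces 2-bit codes for A, D, and E and 3-bit codes for B and C, so the more frequent characters get the shorter codes.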
Rules:
1. If you assign weight ‘0’ to the left edges, then assign weight ‘1’ to the
right edges.
2. If you assign weight ‘1’ to the left edges, then assign weight ‘0’ to the
right edges.
3. Either of the two conventions may be followed, but the convention
adopted at encoding time must also be used at decoding time.
A character’s code is found by starting at the root and
following the branches that lead to that character.
The code itself is the bit value of each branch on the path,
taken in sequence.
Huffman encoding
Decoding
The recipient's job of decoding the received data is
straightforward, because no codeword is a prefix of any
other: the decoder reads bits until they match a codeword,
emits the corresponding character, and starts again. The
figure below shows how decoding takes place.
Huffman decoding
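A minimal decoder sketch in Python (illustrative; the code table below is an assumed example of one possible assignment): because the code is prefix-free, a symbol can be emitted the moment the bits read so far match a table entry.

```python
def huffman_decode(bits, codes):
    """Decode a bit string using a prefix-free code table {symbol: code}."""
    reverse = {code: sym for sym, code in codes.items()}
    decoded, buffer = [], ""
    for bit in bits:
        buffer += bit              # walk one branch deeper in the tree
        if buffer in reverse:      # reached a leaf: emit its symbol
            decoded.append(reverse[buffer])
            buffer = ""
    if buffer:
        raise ValueError("bit string ends in the middle of a codeword")
    return "".join(decoded)

# Assumed code table for illustration:
codes = {"A": "00", "B": "010", "C": "011", "D": "10", "E": "11"}
# huffman_decode("0001010", codes) -> "ABD"
```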
Question:
Given a file that consists of the following set of characters
along with corresponding frequencies.
Characters Frequencies
a 10
e 15
i 12
o 3
u 4
s 13
t 1
Using the Huffman coding scheme for data compression,
determine:
1. Huffman Code for each character
2. Draw the Huffman tree
3. Length of Huffman encoded message (in bits)
4. Encode the message aeiou.
Lempel Ziv encoding
Lempel Ziv (LZ) encoding is an example of a category of
algorithms called dictionary-based encoding. The idea is to
create a dictionary (a table) of strings used during the
communication session. If both the sender and the receiver
have a copy of the dictionary, then previously-encountered
strings can be substituted by their index in the dictionary to
reduce the amount of information transmitted.
Compression
In this phase there are two concurrent events: building an
indexed dictionary and compressing a string of symbols. The
algorithm extracts the smallest substring that cannot be
found in the dictionary from the remaining uncompressed
string. It then stores a copy of this substring in the dictionary
as a new entry and assigns it an index value. Compression
occurs when the substring, except for the last character, is
replaced with the index found in the dictionary. The process
then inserts the index and the last character of the substring
into the compressed string.
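The compression phase described above can be sketched as follows (an illustrative Python version; the convention that index 0 stands for the empty prefix is an assumption of this sketch):

```python
def lz_compress(text):
    """Dictionary-based (LZ78-style) compression as described above:
    take the shortest substring not yet in the dictionary, store it as
    a new entry, and emit (index of its known prefix, last character)."""
    dictionary = {}    # substring -> index (1-based; 0 = empty prefix)
    compressed = []
    current = ""       # longest dictionary match found so far
    for ch in text:
        candidate = current + ch
        if candidate in dictionary:
            current = candidate        # still a known substring: extend it
        else:
            # Substring not found: emit (prefix index, last character),
            # then add the new substring to the dictionary.
            compressed.append((dictionary.get(current, 0), ch))
            dictionary[candidate] = len(dictionary) + 1
            current = ""
    if current:  # input ended while still matching a dictionary entry
        compressed.append((dictionary[current], ""))
    return compressed
```

For example, `lz_compress("ABAABA")` yields `[(0, "A"), (0, "B"), (1, "A"), (2, "A")]`: each pair names an earlier dictionary entry instead of repeating its characters.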
An example of Lempel Ziv encoding
Decompression
Decompression is the inverse of the compression process.
The process extracts the substrings from the compressed
string and tries to replace the indexes with the corresponding
entry in the dictionary, which is empty at first and built up
gradually. The idea is that when an index is received, there is
already an entry in the dictionary corresponding to that
index.
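An illustrative Python sketch of this phase, assuming each compressed token is a (prefix index, character) pair with index 0 meaning the empty string:

```python
def lz_decompress(pairs):
    """Rebuild the original string while reconstructing, entry by
    entry, the same dictionary the compressor built: every received
    pair both yields output and becomes the next dictionary entry."""
    dictionary = {0: ""}   # index -> substring; 0 is the empty prefix
    pieces = []
    for index, ch in pairs:
        entry = dictionary[index] + ch       # known prefix + new character
        pieces.append(entry)
        dictionary[len(dictionary)] = entry  # store under the next index
    return "".join(pieces)
```

For example, `lz_decompress([(0, "A"), (0, "B"), (1, "A"), (2, "A")])` rebuilds `"ABAABA"`: by the time an index arrives, the corresponding entry is already in the dictionary.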
An example of Lempel Ziv decoding
File Formats
TIFF files
Scanning as TIFF
LZW compression
EPS files (vector)
Encapsulated PostScript
DCS files
PICT files (Macintosh)
BMP files (Windows)
WMF files (Windows)
GIF file format
PNG file format
JPEG files
PDF files