Data Compression Basics: Discrete Source
Discrete source
  Information = uncertainty
  Quantification of uncertainty
  Source entropy
Variable length codes
  Motivation
  Prefix condition
  Huffman coding algorithm
Information
What do we mean by information?
  “A numerical measure of the uncertainty of an experimental outcome” – Webster Dictionary
How to quantitatively measure and represent information?
  Shannon proposes a probabilistic approach
Let us first look at how we assess the amount of information in our daily lives using common sense
Information = Uncertainty
Zero information
  Pittsburgh Steelers won Super Bowl XL (past news, no uncertainty)
  Afridi plays for Pakistan (celebrity fact, no uncertainty)
Little information
  It will be very cold in Lahore tomorrow (not much uncertainty since this is winter time)
  It is going to rain in Malaysia next week (not much uncertainty since it rains nine months a year in South East Asia)
Large information
  An earthquake is going to hit Indonesia in July 2006 (are you sure? an unlikely event)
  Someone has shown P=NP (Wow! Really? Who did it?)
Shannon’s Picture on Communication (1948)
[Block diagram: source → source encoder → channel encoder → channel → channel decoder → source decoder → destination; the channel encoder, channel, and channel decoder together form the “super-channel”.]
Examples of source:
Human speeches, photos, text messages, computer programs …
Examples of channel:
storage media, telephone lines, wireless transmission …
Source-Channel Separation Principle
The source coding (compression) stage and the channel coding (error protection) stage can be designed independently without loss of optimality, which is why data compression can be studied on its own.
Discrete Source
A discrete source is characterized by a discrete random variable X
Examples
  Coin flipping: P(X=H) = P(X=T) = 1/2
  Dice tossing: P(X=k) = 1/6, k = 1, …, 6
  Playing-card drawing: P(X=S) = P(X=H) = P(X=D) = P(X=C) = 1/4
What is the redundancy with a discrete source?
Two Extreme Cases
[Diagram: tossing a fair coin → source encoder → channel → source decoder]
Self-information
I(p) = −log₂ p
  p = 1: I(p) = 0, the event must happen (no uncertainty)
  p → 0: I(p) → ∞, the event is unlikely to happen (infinite amount of uncertainty)
Intuitively, I(p) measures the amount of uncertainty associated with event x
Weighted Self-information

  p      I(p)   I_w(p) = p·I(p)
  0      ∞      0
  1/2    1      1/2
  1      0      0
Maximum of Weighted Self-information*
I_w(p) reaches its maximum value 1/(e ln 2) at p = 1/e.
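As a quick sanity check of this claim (a standard calculus step, not spelled out on the slide), differentiating I_w(p) = −p log₂ p and setting the derivative to zero:

```latex
\frac{d}{dp} I_w(p)
  = \frac{d}{dp}\left(-\frac{p \ln p}{\ln 2}\right)
  = -\frac{\ln p + 1}{\ln 2}
  = 0
  \;\Rightarrow\; p = e^{-1} = \frac{1}{e},
\qquad
I_w\!\left(\frac{1}{e}\right)
  = -\frac{1}{e}\log_2\frac{1}{e}
  = \frac{\log_2 e}{e}
  = \frac{1}{e \ln 2}
  \approx 0.531 \text{ bits}.
```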
Quantification of Uncertainty of a Discrete Source
Source entropy:
  H(X) = −Σ_{i=1}^{N} p_i log₂ p_i   (bits/sample, or bps)
The probabilities p_i serve as weighting coefficients on the self-information terms −log₂ p_i.
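As a small illustration of this formula, here is a sketch in Python (the helper name entropy is mine, not from the slides), applied to the discrete sources introduced earlier:

```python
import math

def entropy(probs):
    """Source entropy H(X) = -sum_i p_i * log2(p_i), in bits per sample.
    Zero-probability outcomes contribute nothing (0 * log 0 is taken as 0)."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(entropy([0.5, 0.5]))     # coin flipping: 1.0 bit/sample
print(entropy([1/6] * 6))      # dice tossing:  log2(6) ≈ 2.585 bits/sample
print(entropy([0.25] * 4))     # card suits:    2.0 bits/sample
```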
Source Entropy Examples
Entropy of Binary Bernoulli Source
For a binary source with P(X=1) = p and P(X=0) = 1−p, the entropy is H(p) = −p log₂ p − (1−p) log₂(1−p); it peaks at 1 bit when p = 1/2 and falls to 0 as p → 0 or p → 1.
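A minimal sketch of this function in Python (the name binary_entropy is mine), showing the peak at p = 1/2:

```python
import math

def binary_entropy(p):
    """H(p) = -p*log2(p) - (1-p)*log2(1-p) for a Bernoulli(p) source."""
    if p in (0.0, 1.0):
        return 0.0   # a deterministic source carries no uncertainty
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

for p in (0.01, 0.1, 0.25, 0.5, 0.75, 0.9, 0.99):
    print(f"p = {p:<5} H(p) = {binary_entropy(p):.3f} bits")
# Output is symmetric about p = 0.5, where H reaches its maximum of 1 bit.
```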
Source Entropy Examples
Example 2: (4-way random walk)
  prob(x = S) = 1/2,  prob(x = N) = 1/4
  prob(x = E) = prob(x = W) = 1/8
  H(X) = −(1/2 log₂ 1/2 + 1/4 log₂ 1/4 + 1/8 log₂ 1/8 + 1/8 log₂ 1/8) = 1.75 bps
Source Entropy Examples (Con’t)
  p = prob(x = red) = 1/2,  1 − p = prob(x = blue) = 1/2
Consider the event that the first red appears on the k-th pick:
  Prob(event) = Prob(blue in the first k−1 picks) · Prob(red in the k-th pick)
              = (1/2)^(k−1) · (1/2) = (1/2)^k
Source Entropy Calculation
If we consider all possible events, the sum of their probabilities will be one.
  Check: Σ_{k=1}^{∞} (1/2)^k = 1
Then we can define a discrete random variable X with P(x = k) = (1/2)^k
Entropy:
  H(X) = −Σ_{k=1}^{∞} p_k log₂ p_k = Σ_{k=1}^{∞} k · (1/2)^k = 2 bps
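The two infinite sums above can be checked numerically; a short sketch in Python (truncating the series at a large K, an assumption that is harmless because the tail is negligible):

```python
import math

K = 200  # truncation point; terms beyond this are vanishingly small
probs = [0.5 ** k for k in range(1, K + 1)]

total = sum(probs)                          # should be ~1 (normalization check)
H = -sum(p * math.log2(p) for p in probs)   # should be ~2 bits/sample
print(f"sum of probabilities ≈ {total:.12f}")
print(f"H(X) ≈ {H:.12f} bits/sample")
```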
Properties of Source Entropy
Nonnegative and concave
Achieves the maximum when the source observes a uniform distribution (i.e., P(x=k) = 1/N, k = 1, …, N)
Goes to zero (minimum) as the source becomes more and more skewed (i.e., P(x=k) → 1, P(x≠k) → 0)
What is the use of H(X)?
Shannon’s source coding theorem: for a memoryless discrete source, H(X) is the minimum average number of bits per symbol required to represent the source without loss.
Notes:
1. Memoryless means that the events are independently generated (e.g., the outcomes of flipping a coin N times are independent events)
2. Source redundancy can then be understood as the difference between the raw data rate and the source entropy
Code Redundancy*
  r = l̄ − H(X) ≥ 0
  (practical performance vs. theoretical bound)
Average code length:
  l̄ = Σ_{i=1}^{N} p_i l_i, where l_i is the length of the codeword assigned to the i-th symbol
Source entropy:
  H(X) = Σ_{i=1}^{N} p_i log₂(1/p_i)
Note: if we represent each symbol by q bits (fixed-length codes), then the redundancy is simply q − H(X) bps
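A short sketch of these quantities in Python (function names are mine), using the 4-way random walk source from the earlier example:

```python
import math

def avg_code_length(probs, lengths):
    """Average code length l_bar = sum_i p_i * l_i (bits per symbol)."""
    return sum(p * l for p, l in zip(probs, lengths))

def redundancy(probs, lengths):
    """Code redundancy r = l_bar - H(X) >= 0."""
    H = -sum(p * math.log2(p) for p in probs if p > 0)
    return avg_code_length(probs, lengths) - H

probs = [0.5, 0.25, 0.125, 0.125]           # S, N, E, W
print(redundancy(probs, [2, 2, 2, 2]))      # 2-bit fixed-length code: r = 0.25 bps
print(redundancy(probs, [1, 2, 3, 3]))      # variable-length code:    r = 0.0  bps
```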
How to achieve source entropy?
[Diagram: source X with distribution P(X) → entropy coding → binary bit stream]
Data Compression Basics
Discrete source
  Information = uncertainty
  Quantification of uncertainty
  Source entropy
Variable length codes
  Motivation
  Prefix condition
  Huffman coding algorithm
Variable Length Codes (VLC)
Recall: self-information I(p) = −log₂ p
Motivation: more probable symbols carry less self-information and should therefore be assigned shorter codewords, while rare symbols get longer ones.
4-way Random Walk Example

  symbol k   p_k     fixed-length codeword   variable-length codeword
  S          0.5     00                      0
  N          0.25    01                      10
  E          0.125   10                      110
  W          0.125   11                      111
Toy Example (Con’t)
• source entropy:
  H(X) = −Σ_{k=1}^{4} p_k log₂ p_k
       = 0.5×1 + 0.25×2 + 0.125×3 + 0.125×3
       = 1.75 bits/symbol
• average code length:
  l̄ = N_b / N_s (bps), where N_b is the total number of bits and N_s is the total number of symbols
• fixed-length code: l̄ = 2 bps > H(X);  variable-length code: l̄ = 1.75 bps = H(X)
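The ratio N_b/N_s can also be measured empirically. Here is a sketch in Python (the long simulated sequence and random seed are my own test setup) that encodes a random symbol stream with both codes from the 4-way random walk table:

```python
import random

probs  = {"S": 0.5, "N": 0.25, "E": 0.125, "W": 0.125}
fixed  = {"S": "00", "N": "01", "E": "10",  "W": "11"}
varlen = {"S": "0",  "N": "10", "E": "110", "W": "111"}

random.seed(0)
symbols = random.choices(list(probs), weights=list(probs.values()), k=100_000)

for name, code in (("fixed-length", fixed), ("variable-length", varlen)):
    n_bits = sum(len(code[s]) for s in symbols)        # N_b
    print(f"{name}: {n_bits / len(symbols):.4f} bits/symbol")
# Expected: ~2.00 for the fixed-length code, ~1.75 for the variable-length code.
```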
Problems with VLC
When codewords have fixed lengths, the boundaries between codewords are always identifiable. For codewords with variable lengths, the boundaries can become ambiguous.

  symbol   VLC
  S        0
  N        1
  E        10
  W        11

  encode:  SSNWSE…  →  0 0 1 11 0 10 …
  decode:  0 0 11 1 0 10 …  →  SSWNSE…   (one possible parsing)
           0 0 1 11 0 10 …  →  SSNWSE…   (another possible parsing)
Uniquely Decodable Codes
To avoid ambiguity in decoding, we need to enforce certain conditions on a VLC to make it uniquely decodable.
Since ambiguity arises when one codeword becomes the prefix of another, it is natural to consider the prefix condition.
Example: p, pr, pre, pref, prefi are all prefixes of “prefix”
Prefix condition
No codeword is allowed to be the prefix of any other codeword.
Binary Codeword Tree
[Binary tree rooted at “root”: level 1 contains the nodes 1 and 0 (2 codewords); level 2 contains 11, 10, 01, 00 (2² codewords); …; level k contains 2^k codewords.]
Prefix Condition Examples

  symbol x   codeword 1   codeword 2
  S          0            0
  N          1            10
  E          10           110
  W          11           111

[Codeword trees: in codeword 1 the node 1 is an ancestor of 10 and 11, so the prefix condition is violated; in codeword 2 the codewords 0, 10, 110, 111 are all leaves, so the prefix condition is satisfied.]
How to satisfy prefix condition?
Basic rule: If a node is used as a codeword, then none of its descendants can be used as codewords.
Example: picking 0, 10, 110, 111 from the binary codeword tree follows this rule (once 0 is chosen, all nodes below it are excluded; once 10 is chosen, all nodes below it are excluded; and so on).
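A small sketch of a prefix-condition check in Python (the function name is_prefix_free is mine); it flags the two example codes from the previous slides accordingly:

```python
def is_prefix_free(codewords):
    """Return True if no codeword is a prefix of any other codeword."""
    for i, a in enumerate(codewords):
        for j, b in enumerate(codewords):
            if i != j and b.startswith(a):
                return False
    return True

code1 = ["0", "1", "10", "11"]      # "1" is a prefix of "10" and "11"
code2 = ["0", "10", "110", "111"]   # every codeword is a leaf of the tree
print(is_prefix_free(code1))        # False
print(is_prefix_free(code2))        # True
```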
Property of Prefix Codes
Kraft’s inequality: Σ_{i=1}^{N} 2^(−l_i) ≤ 1
Check with the two example codes:
  codeword 1 (lengths 1, 1, 2, 2): 2^(−1) + 2^(−1) + 2^(−2) + 2^(−2) = 1.5 > 1 (violated, so it cannot be a prefix code)
  codeword 2 (lengths 1, 2, 3, 3): 2^(−1) + 2^(−2) + 2^(−3) + 2^(−3) = 1 ≤ 1 (satisfied)
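The same check as a one-function sketch in Python (the name kraft_sum is mine):

```python
def kraft_sum(lengths):
    """Kraft sum: sum_i 2^(-l_i); a prefix code must have kraft_sum <= 1."""
    return sum(2.0 ** -l for l in lengths)

print(kraft_sum([1, 1, 2, 2]))   # codeword 1: 1.5 -> violates Kraft's inequality
print(kraft_sum([1, 2, 3, 3]))   # codeword 2: 1.0 -> satisfies it
```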
Two Goals of VLC design
• satisfy the prefix condition (so the code is uniquely decodable)
• achieve optimal code length (i.e., minimal redundancy)
For an event x with probability p(x), the optimal code length is ⌈−log₂ p(x)⌉, where ⌈x⌉ denotes the smallest integer no smaller than x (e.g., ⌈3.4⌉ = 4)
code redundancy: r = l̄ − H(X) ≥ 0
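A quick sketch of the ⌈−log₂ p(x)⌉ rule in Python, applied to the 4-way random walk probabilities; the resulting lengths match the variable-length code used earlier:

```python
import math

probs = {"S": 0.5, "N": 0.25, "E": 0.125, "W": 0.125}
for sym, p in probs.items():
    optimal_len = math.ceil(-math.log2(p))   # ceil(-log2 p(x)) bits
    print(f"{sym}: p = {p:<6} optimal length = {optimal_len} bits")
# S -> 1, N -> 2, E -> 3, W -> 3, matching the codewords 0, 10, 110, 111.
```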
Golomb Codes for Geometric Distribution
Optimal VLC for the geometric source: P(X=k) = (1/2)^k, k = 1, 2, …

  k   codeword
  1   0
  2   10
  3   110
  4   1110
  5   11110
  6   111110
  7   1111110
  8   11111110
  …   ……

The codeword for k is k−1 ones followed by a single 0.
[Figure: the corresponding binary codeword tree, with each codeword one level deeper than the previous one.]
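A minimal sketch of this code in Python (encoder/decoder names are mine): the encoder emits k−1 ones and a terminating 0, and the decoder counts ones up to each 0:

```python
def encode(k):
    """Codeword for k = 1, 2, ...: (k - 1) ones followed by a single 0."""
    return "1" * (k - 1) + "0"

def decode(bits):
    """Split a bit string back into symbols k (possible because the code is prefix-free)."""
    out, run = [], 0
    for b in bits:
        if b == "1":
            run += 1
        else:                  # a '0' terminates the current codeword
            out.append(run + 1)
            run = 0
    return out

stream = "".join(encode(k) for k in [1, 1, 3, 2, 5])
print(stream)            # "001101011110"
print(decode(stream))    # [1, 1, 3, 2, 5]
```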