Lecture 4 - Arithmetic Coding and Lempel-Ziv

Motivation for Arithmetic Coding

Motivations for arithmetic coding:

1) The Huffman coding algorithm generates prefix codes with minimum
average codeword length. But this length is usually strictly greater
than H(X1).

2) To improve the coding efficiency, one can use a block memoryless
code by working with the extended alphabet X^n. But the computational
complexity grows exponentially as n increases.

Thus for small n, Huffman coding is inefficient; for large n, it is
impractical due to its exponential coding complexity.

Solution: Arithmetic coding is one of the algorithms that address this
issue. It can achieve the entropy rate of a stationary source with
linear coding complexity.
Shannon-Fano-Elias Codes
Let (X1, ⋯, Xn) be a random vector with joint pmf p(u1, u2, ⋯, un),
ui ∈ X = {x0, ⋯, xJ-1}. We partition the interval [0, 1] into disjoint
sub-intervals I(u1, u2, ⋯, un), u1u2⋯un ∈ X^n, such that the following
properties hold:
1) The length of the interval I(u1, u2, ⋯, un) is equal to p(u1, u2, ⋯, un).
2) ∪_{u1⋯un ∈ X^n} I(u1⋯un) = [0, 1]
3) The intervals I(u1, u2, ⋯, un) are arranged according to the natural
lexicographic order on the sequences u1u2⋯un.


[Figure: for n = 1, [0, 1] is partitioned from left to right into I(x0),
I(x1), I(x2), ⋯, I(xJ-1); for n = 2, each interval is further partitioned,
giving I(x0x0), I(x0x1), ⋯, I(x0xJ-1), I(x1x0), ⋯, I(xJ-1xJ-1).]
Shannon-Fano-Elias Codes (cont'd)

I(x0x0⋯x0x0) = [0, p(x0x0⋯x0x0)]
I(x0x0⋯x0x1) = [p(x0x0⋯x0x0), p(x0x0⋯x0x0) + p(x0x0⋯x0x1)]
⋮
I(xJ-1xJ-1⋯xJ-1) = [1 − p(xJ-1xJ-1⋯xJ-1), 1]

To get the codeword corresponding to u1u2⋯un, let I(u1u2⋯un) = [a, b].
Represent the midpoint (a+b)/2 by its binary expansion

(a+b)/2 = 0.B1B2⋯BL⋯ = ∑_{i=1}^∞ Bi 2^{-i},  Bi ∈ {0, 1}.

Let L = ⌈−log p(u1⋯un)⌉ + 1 = ⌈−log(b−a)⌉ + 1.

The binary sequence B1B2⋯BL is the codeword of u1u2⋯un. The length of the
codeword assigned to u1u2⋯un is therefore ⌈−log p(u1⋯un)⌉ + 1.
Shannon-Fano-Elias Codes: Decoding

Let ⌊(a+b)/2⌋_L = 0.B1B2⋯BL denote the real number obtained by rounding off
(a+b)/2 to its first L bits. We can prove that ⌊(a+b)/2⌋_L is inside the
interval [a, b]:

⌊(a+b)/2⌋_L ≤ (a+b)/2

(a+b)/2 − ⌊(a+b)/2⌋_L = 0.00⋯0 BL+1 BL+2 ⋯
                      = ∑_{i=L+1}^∞ Bi 2^{-i}
                      < 2^{-L} = 2^{-(⌈−log p(u1⋯un)⌉+1)}
                      ≤ 2^{-(−log p(u1⋯un)+1)}
                      = p(u1⋯un)/2 = (b−a)/2

Hence ⌊(a+b)/2⌋_L is inside [a, b]. Furthermore,

[⌊(a+b)/2⌋_L, ⌊(a+b)/2⌋_L + 2^{-L}] ⊂ [a, b]

After receiving the codeword B1B2⋯BL, the decoder searches through all
u1u2⋯un ∈ X^n until the unique u1u2⋯un is found for which I(u1u2⋯un)
contains ⌊(a+b)/2⌋_L = 0.B1B2⋯BL, and then decodes B1B2⋯BL as that unique
u1u2⋯un.
Shannon-Fano-Elias Codes: Example

x     p(x)    I(x)            L(x) = ⌈−log p(x)⌉ + 1    midpoint (binary)    C(x)
x0    0.25    [0, 0.25]       3                         0.001                001
x1    0.5     [0.25, 0.75]    2                         0.10                 10
x2    0.125   [0.75, 0.875]   4                         0.1101               1101
x3    0.125   [0.875, 1]      4                         0.1111               1111

The Shannon-Fano-Elias code is a prefix code.
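To make the construction concrete, here is a minimal Python sketch of the
n = 1 case that reproduces the table above; the function name and the pmf
representation are illustrative choices, not part of the original algorithm
statement. Exact rational arithmetic (Fraction) is used so interval
endpoints and midpoints carry no rounding error.

```python
from fractions import Fraction
from math import ceil, log2

def sfe_code(pmf):
    """Shannon-Fano-Elias codewords for (symbol, probability) pairs
    listed in lexicographic order (the n = 1 case)."""
    codes, a = {}, Fraction(0)
    for sym, p in pmf:
        b = a + p                       # I(sym) = [a, b], of length p
        L = ceil(-log2(p)) + 1          # codeword length
        mid, bits = (a + b) / 2, ""
        for _ in range(L):              # binary expansion of the midpoint,
            mid *= 2                    # truncated to the first L bits
            bits += str(int(mid))
            mid -= int(mid)
        codes[sym], a = bits, b
    return codes

pmf = [("x0", Fraction(1, 4)), ("x1", Fraction(1, 2)),
       ("x2", Fraction(1, 8)), ("x3", Fraction(1, 8))]
print(sfe_code(pmf))  # {'x0': '001', 'x1': '10', 'x2': '1101', 'x3': '1111'}
```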


Arithmetic Coding
• The encoding complexity of the Shannon-Fano-Elias coding
algorithm mainly lies in the process of determining the interval
I(u1u2···un).
• Similarly, given B1B2 ···BL, the decoding complexity of the Shannon-
Fano-Elias coding algorithm mainly lies in the process of finding the
unique interval I(u1u2···un) such that the point 0.B1B2 ···BL is in
I(u1u2···un).
• In arithmetic coding, both of the processes can be realized
sequentially with linear complexity.
• The idea of arithmetic coding originated with Elias and was later made
practical by Rissanen, Pasco, Moffat and Witten.
Arithmetic Coding (Continued)
1) To determine the interval I(u1u2⋯un), we decompose the joint
probability p(u1u2⋯un) as
p(u1u2⋯un) = p(u1) p(u2|u1) p(u3|u1u2) ⋯ p(un|u1⋯un-1)
We then construct a sequence of embedded intervals
I(u1) ⊃ I(u1u2) ⊃ ⋯ ⊃ I(u1u2⋯un)
2) Partition the interval [0, 1] into disjoint subintervals I(xj),
0 ≤ j ≤ J−1:

[Figure: [0, 1] partitioned from left to right into I(x0), I(x1), I(x2), ⋯, I(xJ-1).]

The length of the interval I(xj) is equal to p(xj). Then I(u1) = I(xj) if
u1 = xj.
3) If I(u1u2⋯ui) = [ai, bi], we then partition [ai, bi] into disjoint
sub-intervals I(u1⋯uixj), 0 ≤ j ≤ J−1, according to the conditional pmf
p(xj|u1⋯ui), 0 ≤ j ≤ J−1:

[Figure: [ai, bi] partitioned from left to right into I(u1⋯uix0), I(u1⋯uix1), ⋯, I(u1⋯uixJ-1).]
Arithmetic Coding (Continued)
The length of the interval I(u1⋯uixj) is equal to
p(u1⋯uixj) = p(u1⋯ui) p(xj|u1⋯ui) = (length of [ai, bi]) × p(xj|u1⋯ui)
Then I(u1⋯uiui+1) = I(u1⋯uixj) if ui+1 = xj.
4) Repeat step 3) until the interval I(u1⋯un) is determined. This last
interval is the desired one.
5) To get the codeword corresponding to u1⋯un, we apply the same
procedure as in Shannon-Fano-Elias coding. Let
I(u1u2⋯un) = [a, b]
and let L = ⌈−log p(u1⋯un)⌉ + 1. Rounding off the midpoint (a+b)/2 to the
first L bits, we get
⌊(a+b)/2⌋_L = 0.B1B2⋯BL
The sequence B1B2⋯BL is the codeword corresponding to u1⋯un.
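The sequential refinement in steps 1)–5) is short to express in code. Below
is a minimal Python sketch for a memoryless source, with illustrative names;
practical arithmetic coders use finite-precision integer arithmetic with
renormalization, which this sketch deliberately omits.

```python
from fractions import Fraction
from math import ceil, log2

def arithmetic_encode(seq, pmf, order):
    """Sequentially refine [0, 1]; pmf maps each symbol to its probability,
    `order` lists the alphabet in lexicographic order."""
    a, b = Fraction(0), Fraction(1)
    for u in seq:
        width, lo = b - a, Fraction(0)
        for x in order:                 # locate u's slot in the partition of [a, b]
            if x == u:
                break
            lo += pmf[x]
        a, b = a + width * lo, a + width * (lo + pmf[u])
    L = ceil(-log2(b - a)) + 1          # L = ceil(-log p(u1...un)) + 1
    mid, bits = (a + b) / 2, ""
    for _ in range(L):                  # round the midpoint off to L bits
        mid *= 2
        bits += str(int(mid))
        mid -= int(mid)
    return bits

pmf = {"0": Fraction(2, 5), "1": Fraction(3, 5)}
print(arithmetic_encode("10110", pmf, "01"))  # '100100' (the example below)
```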


Arithmetic Coding: Decoding
The decoding process can be realized sequentially.
1) Partition [0, 1) into disjoint sub-intervals I(xj), 0 ≤ j ≤ J−1. If
0.B1B2⋯BL ∈ I(xj), set u1 = xj.
2) Having decoded u1u2⋯ui, we then partition I(u1u2⋯ui) into
disjoint subintervals I(u1u2⋯uixj), 0 ≤ j ≤ J−1. If 0.B1B2⋯BL ∈
I(u1u2⋯uixj), then set ui+1 = xj.
3) Repeat step 2) until the sequence u1u2···un is decoded.
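A matching decoder sketch under the same memoryless assumptions as the
encoder sketch above, again with illustrative names; as the remarks below
note, the decoder must know the sequence length n in advance.

```python
from fractions import Fraction

def arithmetic_decode(bits, n, pmf, order):
    """Recover n symbols from the codeword B1...BL."""
    point = Fraction(int(bits, 2), 2 ** len(bits))  # the number 0.B1B2...BL
    a, b = Fraction(0), Fraction(1)
    out = []
    for _ in range(n):
        width, lo = b - a, Fraction(0)
        for x in order:                 # find the sub-interval containing the point
            hi = lo + pmf[x]
            if point < a + width * hi:
                out.append(x)
                a, b = a + width * lo, a + width * hi
                break
            lo = hi
    return "".join(out)

pmf = {"0": Fraction(2, 5), "1": Fraction(3, 5)}
print(arithmetic_decode("100100", 5, pmf, "01"))    # '10110'
```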
Arithmetic Coding

1) In arithmetic coding, the length n of the sequence u1u2⋯un to be
compressed is assumed to be known to both the encoder and the decoder.
2) The length of the codeword assigned to u1u2⋯un is

L = ⌈−log p(u1⋯un)⌉ + 1

Since L < −log p(u1⋯un) + 2, the expected codeword length satisfies
E[L] < H(X1, ⋯, Xn) + 2. Thus the average codeword length in bits/symbol
converges to the entropy rate of a stationary source as n approaches
infinity.
Arithmetic Coding (Example)
Let {Xi} be a discrete memoryless source with common pmf
p(0) = 2/5, p(1) = 3/5, and alphabet X = {0, 1}.
Let u1u2⋯u5 = 10110. We have
I(1) = [2/5, 1]
I(10) = [2/5, 16/25]
I(101) = [62/125, 16/25]
I(1011) = [346/625, 16/25]
I(10110) = [346/625, 1838/3125]
The length of I(10110) is 108/3125, so

L = ⌈−log(108/3125)⌉ + 1 = 6

Midpoint = 1784/3125 = 0.100100⋯ in binary,
and the codeword is 100100.


Arithmetic Coding

Source symbol    Probability    Initial subinterval
x0               0.2            [0.0, 0.2)
x1               0.2            [0.2, 0.4)
x2               0.4            [0.4, 0.8)
x3               0.2            [0.8, 1.0)

Let the message to be encoded be x0x1x2x2x3. Encoding narrows the interval
step by step:

after x0: [0, 0.2)
after x1: [0.04, 0.08)
after x2: [0.056, 0.072)
after x2: [0.0624, 0.0688)
after x3: [0.06752, 0.0688)

From the final interval [0.06752, 0.0688), we can get the codeword
length L and the corresponding codeword.
Adaptive Arithmetic Coding
In the above description of arithmetic coding, we assumed that both the
encoder and decoder know in advance the joint pmf of the random vector
(X1, X2, ⋯, Xn).
In practice, the pmf is often unknown and has to be estimated online.
For simplicity, let X = {0, 1}. The initial pmf is equally likely, i.e.,
p(0) = p(1) = 1/2
After u1u2⋯ui is processed, the conditional pmf given u1u2⋯ui is given by

p(1|u1u2⋯ui) = (number of 1s in u1u2⋯ui + 1) / (i + 2)
p(0|u1u2⋯ui) = (number of 0s in u1u2⋯ui + 1) / (i + 2)

Let u1u2⋯u8 = 11001010. Then according to the above,

p(u1u2⋯u8) = p(11001010) = (1/2)·(2/3)·(1/4)·(2/5)·(3/6)·(3/7)·(4/8)·(4/9)
Adaptive Arithmetic Coding
Another choice for the conditional pmf given u1u2⋯ui is as follows:

p(1|u1u2⋯ui) = (number of 1s in u1u2⋯ui + 1/2) / (i + 1)
p(0|u1u2⋯ui) = (number of 0s in u1u2⋯ui + 1/2) / (i + 1)

p(11001010) = (1/2)·((3/2)/2)·((1/2)/3)·((3/2)/4)·((5/2)/5)·((5/2)/6)·((7/2)/7)·((7/2)/8)
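Both update rules are one-liners in code. The sketch below (names
illustrative) computes p(u1⋯un) under either rule; the add-1/2 rule is
commonly known as the Krichevsky-Trofimov estimator.

```python
from fractions import Fraction

def add_one(prefix, bit):
    # p(bit | prefix) = (count of bit in prefix + 1) / (len(prefix) + 2)
    return Fraction(prefix.count(bit) + 1, len(prefix) + 2)

def add_half(prefix, bit):
    # p(bit | prefix) = (count of bit in prefix + 1/2) / (len(prefix) + 1)
    return Fraction(2 * prefix.count(bit) + 1, 2 * (len(prefix) + 1))

def sequence_prob(u, rule):
    """Product of the conditional probabilities p(u_i | u_1 ... u_{i-1})."""
    p = Fraction(1)
    for i in range(len(u)):
        p *= rule(u[:i], u[i])
    return p

u = "11001010"
print(sequence_prob(u, add_one))   # (1/2)(2/3)(1/4)(2/5)(3/6)(3/7)(4/8)(4/9)
print(sequence_prob(u, add_half))  # (1/2)(3/4)(1/6)(3/8)(1/2)(5/12)(1/2)(7/16)
```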
Lempel-Ziv Algorithm

• The adaptive arithmetic coding presented at the end of the last section
is universal: it does not require knowledge of the source statistics, yet
it can achieve the ultimate compression rate of any discrete memoryless
source.
• Lempel-Ziv is another universal source coding algorithm, developed by
Ziv and Lempel.
• One Lempel-Ziv algorithm is LZ77, known as the sliding-window
Lempel-Ziv algorithm, published in 1977.
• One year later, they proposed a variant of LZ77, the incremental
parsing Lempel-Ziv algorithm, i.e., LZ78.
• In this course we will look at LZ78.
Lempel-Ziv Parsing
• LZ78 adopts an incremental parsing procedure, which parses the source
sequence u1u2⋯un into non-overlapping variable-length blocks.
• The first substring in the incremental parsing of u1u2⋯un is u1. The
second substring is the shortest prefix of u2⋯un that has not appeared
so far in the parsing.
• Assume that u1, u2⋯u_{n_2}, u_{n_2+1}⋯u_{n_3}, ⋯, u_{n_{i-1}+1}⋯u_{n_i}
are the substrings created so far in the parsing process. The next
substring, denoted u_{n_i+1}⋯u_{n_{i+1}}, is the shortest prefix of
u_{n_i+1}⋯un that has not appeared in {u1, u2⋯u_{n_2}, u_{n_2+1}⋯u_{n_3},
⋯, u_{n_{i-1}+1}⋯u_{n_i}}, if such a prefix exists.
• Otherwise u_{n_i+1}⋯u_{n_{i+1}} = u_{n_i+1}⋯un with n_{i+1} = n, and the
incremental parsing procedure terminates. (A parsing sketch in code
follows the examples below.)
Lempel-Ziv Parsing: Example
Example 1

1 0 10 11 100 111 00 1110 001 110 01

The incremental parsing procedure yields the following partition

1, 0, 10, 11, 100, 111, 00, 1110, 001, 110, 01

Example 2

1 10 11 0 00 110 1

1, 10, 11, 0, 00, 110, 1


In this example, the last substring 1 has already appeared.
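A minimal Python sketch of the incremental parsing rule, with illustrative
names; it reproduces the partitions of both examples above, including the
repeated last phrase in Example 2.

```python
def lz78_parse(u):
    """Split u into phrases, each the shortest prefix of the remaining
    input not seen before; the last phrase may repeat an earlier one."""
    phrases, seen, start = [], set(), 0
    while start < len(u):
        end = start + 1
        while u[start:end] in seen and end < len(u):
            end += 1                    # grow until the phrase is new (or input ends)
        phrases.append(u[start:end])
        seen.add(u[start:end])
        start = end
    return phrases

print(lz78_parse("10101110011100111000111001"))
# ['1', '0', '10', '11', '100', '111', '00', '1110', '001', '110', '01']
print(lz78_parse("110110001101"))
# ['1', '10', '11', '0', '00', '110', '1']
```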
Lempel-Ziv Parsing

• The concatenation of all phrases is equal to the original source sequence.


• All phrases are distinct, except that the last phrase could be equal to one of
the preceding ones. In Example 2, the last phrase is equal to the first one. All
phrases except the last one are distinct.
• Let Λ denote an empty string. Think of Λ as an initial phrase before the first
phrase in the incremental parsing. Each new phrase in the parsing is the
concatenation of a previous phrase with a new output letter from the source
sequence.
For example, the first phrase 1 is the concatenation of the empty string Λ
with the new symbol 1. Similarly, the phrase 110 is the concatenation of
the phrase 11 with the new symbol 0.
Lempel-Ziv Encoding

Let X = {x0, ⋯, xJ-1}. The Lempel-Ziv encoding of the sequence u1u2⋯un can
be implemented sequentially as follows.
1. The first phrase u1 is uniquely determined by the pair (0, u1), where
the index 0 corresponds to the initial empty phrase Λ. Represent the pair
(0, u1) by the integer 0×J + index(u1) = index(u1), where index(u1) = j if
u1 = xj, 0 ≤ j ≤ J−1. Encode the first phrase into the binary
representation of this integer, padded with zeros on the left if necessary
so that the total length of the codeword is ⌈log J⌉.
2. Having determined the ith phrase, we know that the ith phrase is equal
to the concatenation of the mth phrase with a new symbol xj for some
0 ≤ m ≤ i−1 and 0 ≤ j ≤ J−1. Represent the ith phrase by the binary
representation of the integer m×J + j, padded with zeros on the left if
necessary so that the total length of the codeword is ⌈log(iJ)⌉.
3. Repeat step 2 until all phrases are encoded.
Lempel-Ziv Encoding: Example

Partitioned phrases: 1 10 11 0 00 110 1
X = {0, 1}, J = 2.

Phrase   (m, j)   codeword   length
1        (0, 1)   1          1
10       (1, 0)   10         2
11       (1, 1)   011        3
0        (0, 0)   000        3
00       (4, 0)   1000       4
110      (3, 0)   0110       4
1        (0, 1)   0001       4

So Lempel-Ziv coding transforms the original source sequence
1 10 11 0 00 110 1
into
1 10 011 000 1000 0110 0001
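The table can be generated mechanically from the parsed phrases. Below is a
minimal Python sketch of steps 1–3 with illustrative names; it takes the
phrase list as input (e.g., the output of the lz78_parse sketch above).

```python
from math import ceil, log2

def lz78_encode(phrases, alphabet):
    """Codeword for the i-th phrase: the integer m*J + j in binary,
    zero-padded on the left to ceil(log2(i*J)) bits."""
    J = len(alphabet)
    index = {x: j for j, x in enumerate(alphabet)}
    phrase_id, codewords = {}, []
    for i, phrase in enumerate(phrases, start=1):
        m = phrase_id.get(phrase[:-1], 0)   # 0 is the index of the empty phrase
        value = m * J + index[phrase[-1]]
        width = ceil(log2(i * J))
        codewords.append(format(value, "b").zfill(width))
        phrase_id[phrase] = i
    return codewords

print(lz78_encode(["1", "10", "11", "0", "00", "110", "1"], "01"))
# ['1', '10', '011', '000', '1000', '0110', '0001']
```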
Lempel-Ziv Encoding

• In the example, instead of compression, we get expansion. The problem is
that the source sequence in the example is too short. In fact, LZ78 can
achieve the entropy rate of any stationary source as the length of the
source sequence grows without bound.
• If there are t phrases in the incremental parsing of u1u2⋯un, then the
length of the whole Lempel-Ziv codeword for u1u2⋯un is

∑_{i=1}^{t} ⌈log(iJ)⌉
Lempel-Ziv Decoding

• The decoding process is easy and can also be done sequentially, since
the decoder knows in advance that the length of the codeword
corresponding to the ith phrase is ⌈log(iJ)⌉.
• After receiving the whole codeword, the decoder parses it into
non-overlapping substrings of lengths ⌈log(iJ)⌉, 1 ≤ i ≤ t. From the ith
substring, the decoder recovers the integer mJ + j and hence the pair
(m, j). Then the ith phrase is the concatenation of the mth phrase with
the symbol xj. (A decoder sketch in code follows the example below.)
Lempel-Ziv Decoding: Example

Codewords   1      10     011    000    1000   0110   0001
Integers    1      2      3      0      8      6      1
Pairs       (0,1)  (1,0)  (1,1)  (0,0)  (4,0)  (3,0)  (0,1)
Phrases     1      10     11     0      00     110    1
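A matching decoder sketch with illustrative names: it splits the received
bit string by the known lengths ⌈log(iJ)⌉, recovers each pair (m, j) by
integer division, and rebuilds the phrases, reproducing the table above.

```python
from math import ceil, log2

def lz78_decode(bitstream, alphabet, t):
    """Recover t phrases; the i-th codeword occupies ceil(log2(i*J)) bits."""
    J = len(alphabet)
    phrases, pos = [""], 0              # phrases[0] is the empty phrase
    for i in range(1, t + 1):
        width = ceil(log2(i * J))
        value = int(bitstream[pos:pos + width], 2)
        pos += width
        m, j = divmod(value, J)         # the pair (m, j)
        phrases.append(phrases[m] + alphabet[j])
    return "".join(phrases[1:])

print(lz78_decode("110011000100001100001", "01", t=7))  # '110110001101'
```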
Performance of Lempel-Ziv Coding
Theorem 2.6.1

Let {Xi} be a discrete stationary source, and let r(X1⋯Xn) be the ratio
between the length of the whole Lempel-Ziv codeword for X1⋯Xn and the
length n of X1⋯Xn, i.e., the compression rate in bits per symbol. Then

E[r(X1⋯Xn)] → H∞(X)  as n → ∞
