Arithmetic Coding
Assign non-overlapping intervals on the (0,1] axis to each symbol in the
alphabet. The length of each interval must be proportional to the probability of
the corresponding symbol. As an example, if we have 3 symbols x1, x2, x3
with probabilities 0.5, 0.3, and 0.2, then the intervals can be: x1 (0,0.5], x2
(0.5,0.8], x3 (0.8,1]. Each symbol thus owns a slice of the unit interval whose
width equals its probability.
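As a minimal sketch of this assignment (the function name build_intervals and the dictionary layout are illustrative, not part of the original text), the interval edges are simply the running cumulative probabilities:

def build_intervals(probs):
    """Map each symbol to a half-open interval (low, low + p] of width p."""
    intervals = {}
    low = 0.0
    for symbol, p in probs.items():
        intervals[symbol] = (low, low + p)
        low += p
    return intervals

# Alphabet of the example above: probabilities 0.5, 0.3 and 0.2.
print(build_intervals({"x1": 0.5, "x2": 0.3, "x3": 0.2}))
# {'x1': (0.0, 0.5), 'x2': (0.5, 0.8), 'x3': (0.8, 1.0)}  (up to floating-point rounding)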
The philosophy should be getting clear now. Suppose our source emits a single
symbol: x1. How could we encode it using these intervals? Simply by producing a
number in (0,0.5], the interval of x1. What would happen if our message consisted
of two consecutive symbols, x1 x2? Then the Arithmetic coder would split the
interval of x1 (which was (0,0.5]) into subintervals with the same proportional
lengths for each symbol: x1 (0,0.25], x2 (0.25,0.4], x3 (0.4,0.5].
The code that represents the sequence x1 x2 is therefore any number in the
interval (0.25,0.4]. You can think of this coding process as splitting the intervals
on the (0,1] axis finer and finer. So, for example, what is the code for the
sequence x1 x2 x2? The answer: split the last interval, (0.25,0.4], again in the
same proportions, giving x1 (0.25,0.325], x2 (0.325,0.37], x3 (0.37,0.4], and pick
the subinterval of the new symbol x2.
So after three symbols, the encoder must produce a number between 0.325
and 0.37; we can say that the output is the interval (0.325,0.37]. For the last
time in this example, let's emit one more symbol and encode the sequence x1
x2 x2 x3. Splitting (0.325,0.37] once more and taking the subinterval of x3 (the
top 20% of it), the interval for the sequence x1 x2 x2 x3 becomes (0.361,0.37].
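The whole narrowing process above can be reproduced with a short Python sketch (the helper name narrow is purely illustrative; the interval table is the one from the example):

INTERVALS = {"x1": (0.0, 0.5), "x2": (0.5, 0.8), "x3": (0.8, 1.0)}

def narrow(interval, symbol):
    """Shrink (low, high] to the slice owned by symbol, keeping proportions."""
    low, high = interval
    width = high - low
    s_low, s_high = INTERVALS[symbol]
    return (low + width * s_low, low + width * s_high)

interval = (0.0, 1.0)
for symbol in ["x1", "x2", "x2", "x3"]:
    interval = narrow(interval, symbol)
    print(symbol, interval)

# Prints, up to floating-point rounding:
# x1 (0.0, 0.5)
# x2 (0.25, 0.4)
# x2 (0.325, 0.37)
# x3 (0.361, 0.37)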
How do we turn such a number into bits? Recall that a binary fraction represents a
number between 0 and 1, where the n-th bit after the point carries the weight 1/2^n.
For example, 0101 stands for 0*(1/2) + 1*(1/4) + 0*(1/8) + 1*(1/16) = 5/16 =
0.3125
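As a quick sketch for checking such conversions (the function name bits_to_fraction is just illustrative):

def bits_to_fraction(bits):
    """Interpret a bit string b1 b2 ... as the binary fraction 0.b1b2..."""
    value = 0.0
    for n, bit in enumerate(bits, start=1):
        value += int(bit) / 2 ** n   # the n-th bit weighs 1/2^n
    return value

print(bits_to_fraction("0101"))  # 0.3125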
Exercise: Write the number in full decimal precision whose binary representation is
10011:
The next important step in Arithmetic encoding is to find a number within the final
interval that can be represented with as few bits as possible. For our example, the
final interval is (0.361,0.37]. The fewest bits are obtained with the number
1/4 + 1/16 + 1/32 + 1/64 + 1/128 = 0.3671875 ≈ 0.3672, which written as a binary
fraction is the 7-bit string 0101111.
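A minimal sketch of this step, assuming we simply test codeword lengths k = 1, 2, 3, ... and look for a multiple of 2^-k inside (low, high] (the function name shortest_codeword is illustrative):

import math

def shortest_codeword(low, high):
    """Return the shortest bit string b1...bk with low < 0.b1...bk <= high."""
    k = 1
    while True:
        n = math.floor(low * 2 ** k) + 1   # smallest numerator with n/2^k > low
        if n / 2 ** k <= high:
            return format(n, "0{}b".format(k))
        k += 1

print(shortest_codeword(0.361, 0.37))  # '0101111', i.e. 0.3671875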
Exercise: Try to obtain another number in this interval with a different binary
representation (this is possible).
For this example, 0101111 is the encoded bitstream that expresses x1 x2 x2 x3, so
we used 7 bits. With an alphabet of three symbols, a fixed-length code normally
requires 2 bits per symbol, so the 4-symbol sequence would take 8 bits. We
therefore save one bit even on this short sequence.
The efficiency of Arithmetic coding becomes clearer when you encode longer
sequences. For example, if the next symbol is again x2, we have the sequence x1
x2 x2 x3 x2, and the interval becomes the x2 slice of (0.361,0.37], namely
(0.361 + 0.009*0.5, 0.361 + 0.009*0.8] = (0.3655, 0.3682]. As you see, our previous
encoding number (0.3671875) is still in this range, so 0101111 is the
compressed bitstream of x1 x2 x2 x3 x2 too! Now the saving is 10 - 7 = 3 bits.
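This can be checked numerically with a tiny self-contained sketch (the concrete numbers are those of the running example):

low, high = 0.361, 0.37                               # interval after x1 x2 x2 x3
width = high - low                                    # 0.009
low, high = low + width * 0.5, low + width * 0.8      # the x2 slice
print(low, high)                                      # approximately 0.3655 0.3682
print(low < 0.3671875 <= high)                        # True: 0101111 is still valid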
But!
Clearly, the last situation reveals a property of Arithmetic coding that has to be
handled carefully: a single bitstream corresponds to many symbol sequences. For
instance, here our code number 0.3671875 corresponds to {x1}, {x1 x2}, {x1 x2 x2},
{x1 x2 x2 x3}, {x1 x2 x2 x3 x2}, and so on; you can find arbitrarily long sequences
whose interval contains the codeword 0101111. So where should we stop? At the
point where we are told to! We have to indicate how many symbols we are encoding.
The compressed data, therefore, contains extra information regarding the size of the
source.
So, up to now, we have observed two difficulties with Arithmetic encoding. First,
the encoder must choose and represent a number inside a final interval that becomes
arbitrarily small, which in principle calls for arbitrarily precise real arithmetic.
Second, the same code number corresponds to many symbol sequences of different
lengths.
The second difficulty was overcome by transmitting the extra information of how
many symbols have been encoded.
The first difficulty is usually overcome by selecting one of the already-computed
interval edges as the representative number; selecting the lower edge of the
interval, for example, is a common practice. Nevertheless, there are numerical
algorithms which efficiently perform Arithmetic encoding using finite-precision
(binary) arithmetic. The finite-precision algorithm is both too involved to cover in
this course and patented by IBM. Therefore, anyone who implements the finite-precision
Arithmetic encoder (implementations are available in many books) has to resolve the
patent issues with IBM before being able to sell the software.
The pseudo-code for the classic Arithmetic encoder is:

Low = 0;
High = 1;
Range = 1;
while input symbols X are coming:
    High = Low + Range * HighValue(X);
    Low = Low + Range * LowValue(X);
    Range = High - Low;

(Note that High is updated before Low, because its update must use the old value of Low.)
Of course, the HighValue and LowValue numbers of the symbols are simply the edges
of the intervals assigned to them, i.e. the cumulative probabilities. For the previous
example: HighValue(x1)=0.5, LowValue(x1)=0; HighValue(x2)=0.8, LowValue(x2)=0.5;
HighValue(x3)=1, LowValue(x3)=0.8.
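A runnable Python sketch of this encoder loop, using the LowValue/HighValue table of the running example (the names TABLE and encode are just illustrative):

TABLE = {"x1": (0.0, 0.5), "x2": (0.5, 0.8), "x3": (0.8, 1.0)}  # (LowValue, HighValue)

def encode(symbols):
    """Return the final (Low, High] interval of a symbol sequence."""
    low, high, rng = 0.0, 1.0, 1.0
    for x in symbols:
        low_value, high_value = TABLE[x]
        high = low + rng * high_value    # must use the old Low
        low = low + rng * low_value
        rng = high - low
    return low, high

print(encode(["x1", "x2", "x2", "x3"]))
# approximately (0.361, 0.37); any number in this interval encodes the sequence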
Decoding reverses this process: given the code number, the decoder finds the symbol
whose interval contains it, outputs that symbol, narrows the interval exactly as the
encoder did, and repeats. We can continue decoding as long as required; we must stop
once we reach the transmitted number of encoded symbols. For the above case, decoding
the number 0.3671875 (our codeword 0101111) for five symbols gives the sequence
{x1, x2, x2, x3, x2}.
Finally, let us give the pseudo-code for the classic Arithmetic decoder:

Low = 0;
High = 1;
Range = 1;
repeat until the transmitted number of symbols has been decoded:
    find the symbol X for which LowValue(X) < (Code - Low) / Range <= HighValue(X);
    output X;
    High = Low + Range * HighValue(X);
    Low = Low + Range * LowValue(X);
    Range = High - Low;
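A runnable Python sketch of the decoder, mirroring the pseudo-code above (the names TABLE and decode are illustrative; the code value 0.3671875 and the length 5 come from the running example):

TABLE = {"x1": (0.0, 0.5), "x2": (0.5, 0.8), "x3": (0.8, 1.0)}  # (LowValue, HighValue)

def decode(code, n_symbols):
    """Decode n_symbols symbols from the real-valued code number."""
    low, high, rng = 0.0, 1.0, 1.0
    out = []
    for _ in range(n_symbols):
        value = (code - low) / rng                    # position of the code in (0, 1]
        for x, (low_value, high_value) in TABLE.items():
            if low_value < value <= high_value:       # which symbol owns this slice?
                out.append(x)
                high = low + rng * high_value
                low = low + rng * low_value
                rng = high - low
                break
    return out

print(decode(0.3671875, 5))  # ['x1', 'x2', 'x2', 'x3', 'x2']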
Homework: Obtain the 6-symbol decoded version of the number 01101110 for the
symbols and probabilities of the above exercise.
For the practical engineer: The above explanation of the arithmetic coder gives
an idea of its underlying philosophy. The practical implementation, on the other
hand, is a little different: instead of obtaining real numbers between 0 and 1
(which must then be converted into a binary representation), the probability
splitting can be carried out directly on binary numbers. The web page
http://www.cbloom.com/algs/statisti.html#A5.0 gives excellent explanations of
practical arithmetic coder implementations, and it also describes the probability
splitting concepts together with the binary implementation.
You can find more information, usage, and implementation details about Arithmetic
coding in the following links:
http://lena.cs.utu.fi/tko/reports/R-92-6.html
http://www.zipworld.com.au/~isanta/uni/arithmetic.htm
http://ltssg3.epfl.ch/pub_files/brigger/thesis_html/node94.html
http://www.eas.asu.edu/~morrell/551fall95/project1/project1.html
http://student.monterey.edu/dh/dunkeljodyd/world/cst332/a1present/10.htm
http://www.mdc.net/~eberhard/ari/arithmetic.html