0% found this document useful (0 votes)

174 views6 pages

Understanding Range Coding Techniques

Range coding is an entropy coding method that produces a stream of bits to represent symbols based on their probabilities more efficiently than variable-length codes like Huffman coding. It works by assigning sub-ranges of a large range of integers to each symbol, then progressively narrowing the range after encoding each symbol. When the message is fully encoded, the final sub-range identifies the message. Range coding can achieve greater compression than Huffman coding and adapts well to changing probabilities.

Uploaded by

nigel989

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

174 views6 pages

Understanding Range Coding Techniques

Uploaded by

nigel989

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Range coding

Range coding (or range encoding) is an entropy coding method defined by G. Nigel N. Martin in a 1979
paper,[1] which effectively rediscovered the FIFO arithmetic code first introduced by Richard Clark Pasco
in 1976.[2] Given a stream of symbols and their probabilities, a range coder produces a space-efficient
stream of bits to represent these symbols and, given the stream and the probabilities, a range decoder
reverses the process.

Range coding is very similar to arithmetic coding, except that coding is done with digits in any base,
instead of with bits, and so it is faster when using larger bases (e.g. a byte) at small cost in compression
efficiency.[3] After the expiration of the first (1978) arithmetic coding patent,[4] range coding appeared to
clearly be free of patent encumbrances. This particularly drove interest in the technique in the open source
community. Since that time, patents on various well-known arithmetic coding techniques have also expired.

How range coding works

Range coding conceptually encodes all the symbols of the message
into one number, unlike Huffman coding which assigns each
symbol a bit-pattern and concatenates all the bit-patterns together.
Thus range coding can achieve greater compression ratios than the
one-bit-per-symbol lower bound on Huffman coding and it does not
suffer the inefficiencies that Huffman does when dealing with
probabilities that are not an exact power of two.
Graphical representation of the
The central concept behind range coding is this: given a large- coding process. The message being
enough range of integers, and a probability estimation for the encoded here is "AABA<EOM>"
symbols, the initial range can easily be divided into sub-ranges
whose sizes are proportional to the probability of the symbol they
represent. Each symbol of the message can then be encoded in turn, by reducing the current range down to
just that sub-range which corresponds to the next symbol to be encoded. The decoder must have the same
probability estimation the encoder used, which can either be sent in advance, derived from already
transferred data or be part of the compressor and decompressor.

When all symbols have been encoded, merely identifying the sub-range is enough to communicate the
entire message (presuming of course that the decoder is somehow notified when it has extracted the entire
message). A single integer is actually sufficient to identify the sub-range, and it may not even be necessary
to transmit the entire integer; if there is a sequence of digits such that every integer beginning with that
prefix falls within the sub-range, then the prefix alone is all that's needed to identify the sub-range and thus
transmit the message.

Example

Suppose we want to encode the message "AABA<EOM>", where <EOM> is the end-of-message symbol.
For this example it is assumed that the decoder knows that we intend to encode exactly five symbols in the
base 10 number system (allowing for 105 different combinations of symbols with the range [0, 100000))
using the probability distribution {A: .60; B: .20; <EOM>: .20}. The encoder breaks down the range [0,
100000) into three subranges:
A: [ 0, 60000)
B: [ 60000, 80000)
<EOM>: [ 80000, 100000)

Since our first symbol is an A, it reduces our initial range down to [0, 60000). The second symbol choice
leaves us with three sub-ranges of this range. We show them following the already-encoded 'A':

AA: [ 0, 36000)
AB: [ 36000, 48000)
A<EOM>: [ 48000, 60000)

With two symbols encoded, our range is now [0, 36000) and our third symbol leads to the following
choices:

AAA: [ 0, 21600)
AAB: [ 21600, 28800)
AA<EOM>: [ 28800, 36000)

This time it is the second of our three choices that represent the message we want to encode, and our range
becomes [21600, 28800). It may look harder to determine our sub-ranges in this case, but it is actually not:
we can merely subtract the lower bound from the upper bound to determine that there are 7200 numbers in
our range; that the first 4320 of them represent 0.60 of the total, the next 1440 represent the next 0.20, and
the remaining 1440 represent the remaining 0.20 of the total. Adding back the lower bound gives us our
ranges:

AABA: [21600, 25920)

AABB: [25920, 27360)
AAB<EOM>: [27360, 28800)

Finally, with our range narrowed down to [21600, 25920), we have just one more symbol to encode. Using
the same technique as before for dividing up the range between the lower and upper bound, we find the
three sub-ranges are:

AABAA: [21600, 24192)

AABAB: [24192, 25056)
AABA<EOM>: [25056, 25920)

And since <EOM> is our final symbol, our final range is [25056, 25920). Because all five-digit integers
starting with "251" fall within our final range, it is one of the three-digit prefixes we could transmit that
would unambiguously convey our original message. (The fact that there are actually eight such prefixes in
all implies we still have inefficiencies. They have been introduced by our use of base 10 rather than base 2.)

The central problem may appear to be selecting an initial range large enough that no matter how many
symbols we have to encode, we will always have a current range large enough to divide into non-zero sub-
ranges. In practice, however, this is not a problem, because instead of starting with a very large range and
gradually narrowing it down, the encoder works with a smaller range of numbers at any given time. After
some number of digits have been encoded, the leftmost digits will not change. In the example after coding
just three symbols, we already knew that our final result would start with "2". More digits are shifted in on
the right as digits on the left are sent off. This is illustrated in the following code:

int low = 0;
int range = 100000;
void Run()
{
Encode(0, 6, 10); // A
Encode(0, 6, 10); // A
Encode(6, 2, 10); // B
Encode(0, 6, 10); // A
Encode(8, 2, 10); // <EOM>

// emit final digits - see below

while (range < 10000)
EmitDigit();

low += 10000;
EmitDigit();
}

void EmitDigit()
{
[Link](low / 10000);
low = (low % 10000) * 10;
range *= 10;
}

void Encode(int start, int size, int total)

{
// adjust the range based on the symbol interval
range /= total;
low += start * range;
range *= size;

// check if left-most digit is same throughout range

while (low / 10000 == (low + range) / 10000)
EmitDigit();

// readjust range - see reason for this below

if (range < 1000)
{
EmitDigit();
EmitDigit();
range = 100000 - low;
}
}

To finish off we may need to emit a few extra digits. The top digit of low is probably too small so we need
to increment it, but we have to make sure we don't increment it past low+range. So first we need to
make sure range is large enough.

// emit final digits

while (range < 10000)
EmitDigit();

low += 10000;
EmitDigit();

One problem that can occur with the Encode function above is that range might become very small but
low and low+range still have differing first digits. This could result in the interval having insufficient
precision to distinguish between all of the symbols in the alphabet. When this happens we need to fudge a
little, output the first couple of digits even though we might be off by one, and re-adjust the range to give us
as much room as possible. The decoder will be following the same steps so it will know when it needs to
do this to keep in sync.

// this goes just before the end of Encode() above

if (range < 1000)
{
EmitDigit();
EmitDigit();
range = 100000 - low;
}

Base 10 was used in this example, but a real implementation would just use binary, with the full range of
the native integer data type. Instead of 10000 and 1000 you would likely use hexadecimal constants
such as 0x1000000 and 0x10000. Instead of emitting a digit at a time you would emit a byte at a time
and use a byte-shift operation instead of multiplying by 10.

Decoding uses exactly the same algorithm with the addition of keeping track of the current code value
consisting of the digits read from the compressor. Instead of emitting the top digit of low you just throw it
away, but you also shift out the top digit of code and shift in a new digit read from the compressor. Use
AppendDigit below instead of EmitDigit.

int code = 0;
int low = 0;
int range = 1;

void InitializeDecoder()
{
AppendDigit(); // with this example code, only 1 of these is actually needed
AppendDigit();
AppendDigit();
AppendDigit();
AppendDigit();
}

void AppendDigit()
{
code = (code % 10000) * 10 + ReadNextDigit();
low = (low % 10000) * 10;
range *= 10;
}

void Decode(int start, int size, int total) // Decode is same as Encode with EmitDigit
replaced by AppendDigit
{
// adjust the range based on the symbol interval
range /= total;
low += start * range;
range *= size;

// check if left-most digit is same throughout range

while (low / 10000 == (low + range) / 10000)
AppendDigit();

// readjust range - see reason for this below

if (range < 1000)
{
AppendDigit();
AppendDigit();
range = 100000 - low;
}
}

In order to determine which probability intervals to apply, the decoder needs to look at the current value of
code within the interval [low, low+range) and decide which symbol this represents.

void Run()
{
int start = 0;
int size;
int total = 10;
InitializeDecoder(); // need to get range/total >0
while (start < 8) // stop when receive EOM
{
int v = GetValue(total); // code is in what symbol range?
switch (v) // convert value to symbol
{
case 0:
case 1:
case 2:
case 3:
case 4:
case 5: start=0; size=6; [Link]("A"); break;
case 6:
case 7: start=6; size=2; [Link]("B"); break;
default: start=8; size=2; [Link]("");
}
Decode(start, size, total);
}
}

int GetValue(int total)

{
return (code - low) / (range / total);
}

For the AABA<EOM> example above, this would return a value in the range 0 to 9. Values 0 through 5
would represent A, 6 and 7 would represent B, and 8 and 9 would represent <EOM>.

Relationship with arithmetic coding

Arithmetic coding is the same as range coding, but with the integers taken as being the numerators of
fractions. These fractions have an implicit, common denominator, such that all the fractions fall in the range
[0,1). Accordingly, the resulting arithmetic code is interpreted as beginning with an implicit "0". As these
are just different interpretations of the same coding methods, and as the resulting arithmetic and range codes
are identical, each arithmetic coder is its corresponding range encoder, and vice versa. In other words,
arithmetic coding and range coding are just two, slightly different ways of understanding the same thing.

In practice, though, so-called range encoders tend to be implemented pretty much as described in Martin's
paper,[1] while arithmetic coders more generally tend not to be called range encoders. An often noted
feature of such range encoders is the tendency to perform renormalization a byte at a time, rather than one
bit at a time (as is usually the case). In other words, range encoders tend to use bytes as coding digits, rather
than bits. While this does reduce the amount of compression that can be achieved by a very small amount, it
is faster than when performing renormalization for each bit.

See also
Arithmetic coding
Asymmetric numeral systems
Data compression
Entropy encoding
Huffman coding
Multiscale Electrophysiology Format
Shannon–Fano coding

References
1. G. Nigel N. Martin, Range encoding: An algorithm for removing redundancy from a digitized
message ([Link] Video & Data
Recording Conference, Southampton, UK, July 24–27, 1979.
2. "Source coding algorithms for fast data compression" Richard Clark Pasco, Stanford, CA
1976
3. "On the Overhead of Range Coders ([Link]
Timothy B. Terriberry, Technical Note 2008
4. U.S. Patent 4,122,440 ([Link] — (IBM) Filed March
4, 1977, Granted 24 October 1978 (Now expired)

External links
Range Encoder ([Link]
com/rangecoder/)
"Range coder" by Arturo Campos ([Link]
[Link]/ac_range.html)
"Anatomy of Range Encoder" by Andrew Polar ([Link]
ml)
Fast implementation of range coding and rANS of James K. Bonfield ([Link]
nfield/rans_static)
A fast open source implementation of a 24-bit SSE 4.1 interleaved Range Coder ([Link]
[Link]/richgel999/sserangecoding)

Retrieved from "[Link]

Arithmetic Coding
No ratings yet
Arithmetic Coding
6 pages
05 Arith 1
No ratings yet
05 Arith 1
54 pages
Arithmetic Coding in Image Compression
No ratings yet
Arithmetic Coding in Image Compression
34 pages
Example:: (I) SWISS MISS 10 Symbols
100% (1)
Example:: (I) SWISS MISS 10 Symbols
12 pages
Arithmetic Coding for CS Students
No ratings yet
Arithmetic Coding for CS Students
36 pages
Image Compression
No ratings yet
Image Compression
10 pages
Arithmetic Encoder/Decoder Example: Encode B
No ratings yet
Arithmetic Encoder/Decoder Example: Encode B
2 pages
Arithmetic Coding Explained
No ratings yet
Arithmetic Coding Explained
12 pages
Arithmetic Coding Implementation Guide
No ratings yet
Arithmetic Coding Implementation Guide
11 pages
Entropy Coding Techniques Explained
No ratings yet
Entropy Coding Techniques Explained
45 pages
Verilog-Based Lossless Data Compression
No ratings yet
Verilog-Based Lossless Data Compression
6 pages
Understanding Arithmetic Coding
No ratings yet
Understanding Arithmetic Coding
5 pages
10.7 Arithmetic Coding: Figure 10.9 Assignment of Ranges Between 0 and 1
No ratings yet
10.7 Arithmetic Coding: Figure 10.9 Assignment of Ranges Between 0 and 1
4 pages
Data Compression with Arithmetic Coding
No ratings yet
Data Compression with Arithmetic Coding
11 pages
Arithmetic Coding: Presented By: Einat & Kim
No ratings yet
Arithmetic Coding: Presented By: Einat & Kim
48 pages
Coding Theory
No ratings yet
Coding Theory
49 pages
Understanding Arithmetic Coding Basics
No ratings yet
Understanding Arithmetic Coding Basics
26 pages
Module IV
No ratings yet
Module IV
37 pages
Understanding Arithmetic Coding Techniques
No ratings yet
Understanding Arithmetic Coding Techniques
15 pages
Testing - Document - Sourjyendra - Data Compression Techniques - Lecture 4 - Integer Codes 2 - University of Helsinky - Slides (DCT2015-Lecture4)
No ratings yet
Testing - Document - Sourjyendra - Data Compression Techniques - Lecture 4 - Integer Codes 2 - University of Helsinky - Slides (DCT2015-Lecture4)
56 pages
Multimedia Coding Techniques
No ratings yet
Multimedia Coding Techniques
44 pages
Multimedia Systems: Chapter 7: Data Compression
No ratings yet
Multimedia Systems: Chapter 7: Data Compression
25 pages
Audio and Video Coding PDF
No ratings yet
Audio and Video Coding PDF
72 pages
Arithmetic Coding: Algorithm & Issues
No ratings yet
Arithmetic Coding: Algorithm & Issues
7 pages
Data Compression Unit-5
No ratings yet
Data Compression Unit-5
17 pages
Chapter 2
No ratings yet
Chapter 2
13 pages
Arithmetic Coding Techniques Explained
No ratings yet
Arithmetic Coding Techniques Explained
24 pages
Data Compression Unit III
No ratings yet
Data Compression Unit III
23 pages
Data Compression Unit III
No ratings yet
Data Compression Unit III
22 pages
Range Coder
No ratings yet
Range Coder
11 pages
Image Compression
100% (1)
Image Compression
38 pages
Cs Book
No ratings yet
Cs Book
5 pages
20.5 Arithmetic Coding
No ratings yet
20.5 Arithmetic Coding
6 pages
Mu-Law Encoding in C++ Explained
No ratings yet
Mu-Law Encoding in C++ Explained
3 pages
Input Source Encoder Channel Encoder Binary Interface
No ratings yet
Input Source Encoder Channel Encoder Binary Interface
29 pages
Ec8093-Digital Image Processing: Dr.K.Kalaivani Associate Professor Dept. of EIE Easwari Engineering College
No ratings yet
Ec8093-Digital Image Processing: Dr.K.Kalaivani Associate Professor Dept. of EIE Easwari Engineering College
37 pages
Huffman Coding Algorithm
No ratings yet
Huffman Coding Algorithm
3 pages
Chapter Three Source Coding: 1-Sampling Theorem
No ratings yet
Chapter Three Source Coding: 1-Sampling Theorem
19 pages
Compression II
No ratings yet
Compression II
51 pages
ENSC 424 - Multimedia Communications Engineering: Topic 6: Arithmetic Coding 1
No ratings yet
ENSC 424 - Multimedia Communications Engineering: Topic 6: Arithmetic Coding 1
23 pages
Lec05 Arithmetic Coding II
No ratings yet
Lec05 Arithmetic Coding II
44 pages
Testing - Document - Sourjyendra - Data Compression Techniques - Lecture 6 - Arithmetic Coding (2015)
No ratings yet
Testing - Document - Sourjyendra - Data Compression Techniques - Lecture 6 - Arithmetic Coding (2015)
18 pages
Dkkexer8 Ans
No ratings yet
Dkkexer8 Ans
7 pages
Introduction To Data Compression - Guy E. Blelloch PDF
No ratings yet
Introduction To Data Compression - Guy E. Blelloch PDF
54 pages
Lec 6
No ratings yet
Lec 6
31 pages
Entropy Coding Techniques Guide
No ratings yet
Entropy Coding Techniques Guide
10 pages
Source Coding & Theorems Guide
No ratings yet
Source Coding & Theorems Guide
29 pages
Mesleki Yeterlilik
No ratings yet
Mesleki Yeterlilik
106 pages
Basics of Information Theory
No ratings yet
Basics of Information Theory
21 pages
Ibook - Pub Basic Arithmetic Coding Based Approach To Compress A Character String
No ratings yet
Ibook - Pub Basic Arithmetic Coding Based Approach To Compress A Character String
8 pages
Uniquely Decodable Codes in Source Coding
No ratings yet
Uniquely Decodable Codes in Source Coding
26 pages
Source Encoder and Decoder
No ratings yet
Source Encoder and Decoder
3 pages
Support Vector Machine
No ratings yet
Support Vector Machine
12 pages
Applications of Artificial Intelligence
No ratings yet
Applications of Artificial Intelligence
44 pages
Automated Theorem Proving
No ratings yet
Automated Theorem Proving
8 pages
Reservoir Computing Explained
No ratings yet
Reservoir Computing Explained
8 pages
Principles of Structural Health Monitoring
100% (1)
Principles of Structural Health Monitoring
7 pages
Advances in Computer-Aided Diagnosis
No ratings yet
Advances in Computer-Aided Diagnosis
20 pages
Brain-Computer Interface Overview
No ratings yet
Brain-Computer Interface Overview
45 pages
Duck - Ai - 2024 12 17 - 03 53 58
No ratings yet
Duck - Ai - 2024 12 17 - 03 53 58
3 pages
76 Command Set
No ratings yet
76 Command Set
27 pages
Comsats University Islamabad (Attock Campus) Class Assignment #04 Department of Management Sciences
No ratings yet
Comsats University Islamabad (Attock Campus) Class Assignment #04 Department of Management Sciences
6 pages
Coding Integer-: Jpeg2000 Compression Standard Is The Coding
No ratings yet
Coding Integer-: Jpeg2000 Compression Standard Is The Coding
5 pages
Notepad++ Regex Guide
No ratings yet
Notepad++ Regex Guide
9 pages
BBEdit Grep Quick Reference Guide (2023) by Charles Poynton, PHD
No ratings yet
BBEdit Grep Quick Reference Guide (2023) by Charles Poynton, PHD
1 page
Base64 Decode and Encode - Online
No ratings yet
Base64 Decode and Encode - Online
1 page
Data Compression Exam Winter 2023
No ratings yet
Data Compression Exam Winter 2023
1 page
Extended Binary Coded Decimal Interchange Code PDF
No ratings yet
Extended Binary Coded Decimal Interchange Code PDF
2 pages
Ascii
No ratings yet
Ascii
1 page
Android - [email protected] IComponentStore Software
No ratings yet
Android - [email protected] IComponentStore Software
4 pages
Alt Codes
No ratings yet
Alt Codes
13 pages
Erased Log by Sos
No ratings yet
Erased Log by Sos
2 pages
ASCII Table - Table of ASCII Codes, Characters and Symbols
No ratings yet
ASCII Table - Table of ASCII Codes, Characters and Symbols
9 pages
Data Compression
No ratings yet
Data Compression
23 pages
Warrior Collection Data Overview
No ratings yet
Warrior Collection Data Overview
15 pages
Character Set
No ratings yet
Character Set
9 pages
ASCII Code Binary Hex Chart
No ratings yet
ASCII Code Binary Hex Chart
3 pages
Android Debug Log Analysis
No ratings yet
Android Debug Log Analysis
11 pages
Earth, Wind & Fire - September (Bass Transcription)
100% (2)
Earth, Wind & Fire - September (Bass Transcription)
2 pages
Código de Conduta (2009) 720p BDRip
No ratings yet
Código de Conduta (2009) 720p BDRip
2 pages
CCS353 Unit1 Basics of Data Compression
No ratings yet
CCS353 Unit1 Basics of Data Compression
9 pages
Compression
No ratings yet
Compression
3 pages
XML Encoding and Structure Guide
No ratings yet
XML Encoding and Structure Guide
6 pages
Trace
No ratings yet
Trace
51 pages
Lab Chapter 3 Mod 4
No ratings yet
Lab Chapter 3 Mod 4
6 pages
Data Compression Home Assignment Questions
No ratings yet
Data Compression Home Assignment Questions
3 pages
Data Compression: Objective Questions
No ratings yet
Data Compression: Objective Questions
7 pages
Huffman Coding Example Explained
No ratings yet
Huffman Coding Example Explained
26 pages
Alphabetical Series: Directions (1-28) : Each of The Following Questions Is Based On The Following Alphabet Series
No ratings yet
Alphabetical Series: Directions (1-28) : Each of The Following Questions Is Based On The Following Alphabet Series
5 pages

Understanding Range Coding Techniques

Uploaded by

Understanding Range Coding Techniques

Uploaded by

Range coding

How range coding works

AABA: [21600, 25920)

AABAA: [21600, 24192)

// emit final digits - see below

void Encode(int start, int size, int total)

// check if left-most digit is same throughout range

// readjust range - see reason for this below

// emit final digits

// this goes just before the end of Encode() above

// check if left-most digit is same throughout range

// readjust range - see reason for this below

int GetValue(int total)

Relationship with arithmetic coding

Retrieved from "[Link]

You might also like