Chapter 2
Entropy Coding

3) Entropy coding
Entropy coding is a type of lossless source coding, also called variable-length statistical
coding. The name comes from the fact that the coding process exploits the statistical
properties of the source: it assigns the shortest code words to the most frequent symbols.
c) Redundancy of a code : is the difference between the average code length (code rate) n̄
and the entropy of the source H(X): ρ = n̄ − H(X). It can also be defined in relative form as
ρ = (n̄ − H(X)) / H(X), where it represents the percentage of additional bits compared to an
optimal code.
d) Variance of a code : measures the dispersion of the code-word lengths around the average
length and is calculated as σ² = Σᵢ Pᵢ (nᵢ − n̄)², where nᵢ is the length of the code word of
symbol i and n̄ = Σᵢ Pᵢ nᵢ is the average code length.
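For example (an illustrative source, not from the original notes), consider a source with
probabilities {0.5, 0.25, 0.25} encoded with code words of lengths {1, 2, 2}. Then
H(X) = −(0.5 log₂ 0.5 + 0.25 log₂ 0.25 + 0.25 log₂ 0.25) = 1.5 bits and
n̄ = 0.5×1 + 0.25×2 + 0.25×2 = 1.5 bits, so ρ = n̄ − H(X) = 0 (the code is optimal), while
σ² = 0.5(1 − 1.5)² + 0.25(2 − 1.5)² + 0.25(2 − 1.5)² = 0.25.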
1) Shannon-Fano encoder : follows an entropy coding process that satisfies the prefix
condition. Compression/decompression is achieved according to a tree, built by recursively
splitting the probability-sorted symbols into two groups of nearly equal total probability,
as in the sketch below:
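The following is a minimal illustrative Python sketch (not the notes' own code; the name
shannon_fano and the example probabilities are hypothetical). It sorts the symbols by
decreasing probability and recursively splits the list where the two groups' total
probabilities are closest:

    def shannon_fano(probabilities):
        # Recursively split the probability-sorted symbol list into two
        # groups of (nearly) equal total probability: the first group
        # gets bit 0, the second bit 1. Leaves of this tree are the codes.
        def build(items, prefix):
            if len(items) == 1:
                codes[items[0][0]] = prefix or "0"
                return
            total = sum(p for _, p in items)
            acc, best_cut, best_diff = 0.0, 1, float("inf")
            for cut in range(1, len(items)):
                acc += items[cut - 1][1]
                diff = abs(2 * acc - total)   # |left total - right total|
                if diff < best_diff:
                    best_cut, best_diff = cut, diff
            build(items[:best_cut], prefix + "0")
            build(items[best_cut:], prefix + "1")

        codes = {}
        build(sorted(probabilities.items(), key=lambda kv: -kv[1]), "")
        return codes

    print(shannon_fano({"a": 0.4, "b": 0.3, "c": 0.15, "d": 0.15}))
    # {'a': '0', 'b': '10', 'c': '110', 'd': '111'}

No code word is a prefix of another, since each code corresponds to a distinct leaf of the
splitting tree.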
The problem with the SHANNON-FANO and HUFFMAN encoders lies in the allocation of an
integer number of bits to each code word, which is not always optimal; this is why other
coding processes, such as the arithmetic code below, are proposed. For example, if the
probability of a symbol is 0.9, the optimal number of bits to encode this character is
nᵢ = −log₂ Pᵢ ≈ 0.15 bits, yet a Huffman encoder will assign either 1 or 2 bits to this
symbol, which is much longer than the theoretical value!
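A quick numerical check of this value, using only Python's standard math module:

    import math
    # information content of a symbol of probability 0.9
    print(-math.log2(0.9))   # ≈ 0.152 bits, yet Huffman must spend at least 1 bit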
3) Arithmetic code
The arithmetic coding process does not replace each symbol with a specific code, as the
Huffman encoder does; instead it replaces a whole stream of symbols with a single
floating-point number. The output of this coding process is a number in [0, 1[.
With this method, we use partitions of an interval [a, b] (initially [0, 1[): the size of
each sub-interval is proportional to the frequency of the corresponding symbol. Each symbol
to be compressed reduces the current interval [a, b] to its sub-interval [a', b']; the
latter is itself partitioned and then undergoes the same processing as [a, b]. We thus
finally obtain a very small interval [A, B] containing the code value.
The calculation of the bounds of each interval is as follows:

Coding
    Low = 0.0; High = 1.0;
    While (C = next character)
    Begin
        Range = High - Low;
        High = Low + Range * High_Range(C);
        Low = Low + Range * Low_Range(C);
    End

Decoding
    Number = input code;
    Repeat
        Symbol = Find_symbol(the symbol whose interval contains Number);
        Range = High_Range(Symbol) - Low_Range(Symbol);
        Number = Number - Low_Range(Symbol);
        Number = Number / Range;
    Until the whole message is decoded
    Character   Probability   Interval
    Space       1/10          0.0 ≤ x < 0.1
    A           1/10          0.1 ≤ x < 0.2
    B           1/10          0.2 ≤ x < 0.3
    E           1/10          0.3 ≤ x < 0.4
    G           1/10          0.4 ≤ x < 0.5
    I           1/10          0.5 ≤ x < 0.6
    L           2/10          0.6 ≤ x < 0.8
    S           1/10          0.8 ≤ x < 0.9
    T           1/10          0.9 ≤ x < 1.0
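As an illustration of coding and decoding with the two loops above, here is a minimal
floating-point sketch in Python (not the notes' own code). It assumes the message
"BILL GATES", which matches the symbol table above; the INTERVALS dictionary plays the
role of Low_Range/High_Range. Real arithmetic coders use integer arithmetic with
incremental renormalization, since plain floats run out of precision on long messages.

    INTERVALS = {            # symbol -> (Low_Range, High_Range)
        ' ': (0.0, 0.1), 'A': (0.1, 0.2), 'B': (0.2, 0.3),
        'E': (0.3, 0.4), 'G': (0.4, 0.5), 'I': (0.5, 0.6),
        'L': (0.6, 0.8), 'S': (0.8, 0.9), 'T': (0.9, 1.0),
    }

    def encode(message):
        low, high = 0.0, 1.0
        for c in message:
            rng = high - low                 # size of the current interval
            lo_r, hi_r = INTERVALS[c]
            high = low + rng * hi_r          # shrink [low, high[ to the
            low = low + rng * lo_r           # sub-interval of symbol c
        return (low + high) / 2              # any value in [low, high[ works;
                                             # the midpoint is safest to decode

    def decode(number, length):
        out = []
        for _ in range(length):
            # find the symbol whose interval contains the current number
            for sym, (lo_r, hi_r) in INTERVALS.items():
                if lo_r <= number < hi_r:
                    break
            out.append(sym)
            number = (number - lo_r) / (hi_r - lo_r)   # undo one coding step
        return ''.join(out)

    code = encode("BILL GATES")
    print(code)                 # ≈ 0.2572167754, inside the final interval
    print(decode(code, 10))     # BILL GATES

Note that this decoder must know the message length (10 here); in practice a dedicated
end-of-message symbol is used instead.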
Adaptive Codes
The compression methods seen so far use a fixed statistical model to encode individual
symbols. They perform the compression by encoding the symbols into bit strings that use
fewer bits than the original symbols. The quality of the compression therefore depends on
the program's ability to develop a good model; moreover, the model must accurately predict
the probabilities of the symbols, which is not always feasible.
Adaptive codes are more suitable coding algorithms for data streaming, as they adapt to
localized changes in the symbol statistics.
They start with a minimal dictionary of symbol codes (or with none at all) and update it
with each new character.
1) Adaptive Huffman Coding: see the additional document
2) LZW
An improvement of the LZ77 and LZ78 codes, created in 1984 by Terry Welch. This algorithm
assumes the existence of an initial dictionary comprising all the unitary symbols of the
message.
So, it starts with a dictionary of all single characters, indexed from 0, and upgrades the
dictionary as it processes the text: each time a string not yet in the dictionary is read,
a new code is generated. The compression algorithm follows the steps below (a runnable
sketch comes after the list):
1) Read the longest string "x" of consecutive symbols that is present in the dictionary.
2) Write the index of "x" to the output file.
3) Read the symbol "i" that follows "x".
4) Add "xi" to the dictionary.
5) Repeat this algorithm until i is empty (end of the message).
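A minimal Python sketch of these five steps (illustrative, not the notes' own code;
lzw_encode is a hypothetical name). It reproduces the final code of the example that
follows:

    def lzw_encode(message, alphabet):
        # initial dictionary: one entry per unitary symbol
        dictionary = {ch: i for i, ch in enumerate(alphabet)}
        output = []
        x = ""                                    # longest match so far
        for i in message:
            if x + i in dictionary:               # step 1: extend the match
                x += i
            else:
                output.append(dictionary[x])      # step 2: write index of x
                dictionary[x + i] = len(dictionary)   # step 4: add "xi"
                x = i                             # restart the match from i
        if x:
            output.append(dictionary[x])          # flush the last match
        return output

    alphabet = "_aenrstvu"                        # indexes 0..8, as below
    print(lzw_encode("un_ver_vert_va_vers_un_verre_vert", alphabet))
    # [8, 3, 0, 7, 2, 4, 11, 13, 6, 11, 1, 15, 4, 5, 0, 9, 20, 4, 2, 20, 6]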
Example : un_ver_vert_va_vers_un_verre_vert (French: "a green worm goes towards a green glass")
Initial Dictionary
    Symbol   _   a   e   n   r   s   t   v   u
    Index    0   1   2   3   4   5   6   7   8
Coding
    Index    9    10   11   12   13   14   15    16    17   18
    Symbol   un   n_   _v   ve   er   r_   _ve   ert   t_   _va
    Coded    8    3    0    7    2    4    11    13    6    11

    Index    19   20     21   22   23   24    25      26   27   28
    Symbol   a_   _ver   rs   s_   _u   un_   _verr   re   e_   _vert
    Coded    1    15     4    5    0    9     20      4    2    20
Final code: 8 3 0 7 2 4 11 13 6 11 1 15 4 5 0 9 20 4 2 20 6
In the decoding process, we decode using the dictionary, which is rebuilt on the fly: each
new entry is the string corresponding to the preceding code plus the first symbol of the
string corresponding to the current code, as in the sketch below.
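A matching decoder sketch (again illustrative; lzw_decode is a hypothetical name). It
includes the classic special case of a code that is not yet in the dictionary, which does
not occur in this example but can occur in general:

    def lzw_decode(codes, alphabet):
        dictionary = {i: ch for i, ch in enumerate(alphabet)}
        previous = dictionary[codes[0]]
        out = [previous]
        for code in codes[1:]:
            if code in dictionary:
                current = dictionary[code]
            else:
                # code just created by the encoder and not yet known here
                current = previous + previous[0]
            out.append(current)
            # new entry = preceding string + first symbol of current string
            dictionary[len(dictionary)] = previous + current[0]
            previous = current
        return "".join(out)

    codes = [8, 3, 0, 7, 2, 4, 11, 13, 6, 11, 1, 15, 4, 5, 0, 9, 20, 4, 2, 20, 6]
    print(lzw_decode(codes, "_aenrstvu"))
    # un_ver_vert_va_vers_un_verre_vert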