Lecture 5

A = 0
B = 10
C = 110
D = 111

This code can be instantaneously decoded since no complete codeword is a prefix of a larger codeword. This is in contrast to the previous example, where A is a prefix of both B and D. This code is also a 'comma code': the symbol zero indicates the end of a codeword, except for the all-ones word, whose length is known.

Example
Consider an alphabet of 4 symbols represented by binary digits as follows:
A=0
B = 01
C = 011
D = 111
The code is identical to the previous example but with the bits time-reversed. It is still
uniquely decodable but no longer instantaneous, since earlier codewords are now prefixes
of later ones.
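
The prefix condition can be checked mechanically. The following minimal Python sketch (illustrative only; the function name is ours) tests whether a set of codewords is instantaneous, i.e. prefix-free:

def is_instantaneous(codewords):
    # A code is instantaneous exactly when no codeword is a prefix of another.
    return not any(a != b and b.startswith(a) for a in codewords for b in codewords)

print(is_instantaneous(["0", "10", "110", "111"]))  # True: the comma code above
print(is_instantaneous(["0", "01", "011", "111"]))  # False: its time-reversed version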

Shannon Code
For messages x1, x2, x3, ..., xn with probabilities p(x1), p(x2), p(x3), ..., p(xn):

1) li = −log2 p(xi)            if p(xi) = (1/2)^r for some integer r, i.e. p(xi) ∈ {1/2, 1/4, 1/8, ...}
2) li = Int[−log2 p(xi)] + 1   if p(xi) ≠ (1/2)^r

Also define:

Fi = Σ_{k=1}^{i−1} p(xk),  with F1 = 0.

The codeword of xi is then the binary equivalent of Fi taken to li bits:

Ci = (Fi)2 truncated to li bits,

where Ci is the binary equivalent of Fi up to li bits. In encoding, the messages must be
arranged in decreasing order of probability.
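
The procedure can be summarised in a short Python sketch (a minimal illustration, not an optimised implementation; the name shannon_code is ours, and the radix r is a parameter so the same sketch also covers the ternary case later in this lecture). It assumes the probabilities are already sorted in decreasing order:

import math

def shannon_code(probs, r=2):
    # probs: probabilities sorted in decreasing order.
    # li = -log_r(p) when p is an exact power of 1/r, otherwise Int[-log_r(p)] + 1;
    # Ci is the radix-r expansion of Fi (sum of the preceding probabilities) to li digits.
    codewords = []
    F = 0.0
    for p in probs:
        x = -math.log(p) / math.log(r)
        li = round(x) if abs(x - round(x)) < 1e-9 else math.floor(x) + 1
        digits, frac = [], F
        for _ in range(li):          # repeated multiplication gives the digits of Fi
            frac *= r
            digits.append(str(int(frac)))
            frac -= int(frac)
        codewords.append("".join(digits))
        F += p
    return codewords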



Example
Develop the Shannon code for the following set of messages,
p( x) = [0.3 0.2 0.15 0.12 0.1 0.08 0.05]

then find:
(a) Code efficiency,
(b) p(0) at the encoder output.
Solution

xi    p(xi)   li   Fi     Ci      0i (zeros in Ci)
x1    0.30    2    0      00      2
x2    0.20    3    0.30   010     2
x3    0.15    3    0.50   100     2
x4    0.12    4    0.65   1010    2
x5    0.10    4    0.77   1100    2
x6    0.08    4    0.87   1101    1
x7    0.05    5    0.95   11110   1

(Each Ci is obtained from the binary expansion of Fi: repeatedly multiply Fi by 2 and take the integer part as the next bit, for li bits.)



(a) To find the code efficiency, we have

LC = Σ_{i=1}^{7} li p(xi) = 3.1 bits/message.

H(X) = −Σ_{i=1}^{7} p(xi) log2 p(xi) = 2.6029 bits/message.

η = H(X)/LC × 100% = 83.965%

(b) p(0) at the encoder output is

p(0) = Σ_{i=1}^{7} 0i p(xi) / LC = (0.6 + 0.4 + 0.3 + 0.24 + 0.2 + 0.08 + 0.05) / 3.1

p(0) = 0.603
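
As a quick check, applying the shannon_code sketch defined earlier (an illustrative helper, not part of the original notes) to this example reproduces the table and the figures:

import math

p = [0.3, 0.2, 0.15, 0.12, 0.10, 0.08, 0.05]
codes = shannon_code(p, r=2)                     # helper from the earlier sketch
Lc = sum(len(c) * pi for c, pi in zip(codes, p))
H = -sum(pi * math.log2(pi) for pi in p)
p0 = sum(c.count("0") * pi for c, pi in zip(codes, p)) / Lc
print(codes)                    # ['00', '010', '100', '1010', '1100', '1101', '11110']
print(Lc, H, 100 * H / Lc, p0)  # 3.1, ~2.603, ~83.97 %, ~0.603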
Example
Repeat the previous example using ternary coding.
Solution
1) li = −log3 p(xi)            if p(xi) = (1/3)^r for some integer r, i.e. p(xi) ∈ {1/3, 1/9, 1/27, ...}
2) li = Int[−log3 p(xi)] + 1   if p(xi) ≠ (1/3)^r

and Ci = (Fi)3 truncated to li ternary digits.

xi    p(xi)   li   Fi     Ci     0i (zeros in Ci)
x1    0.30    2    0      00     2
x2    0.20    2    0.30   02     1
x3    0.15    2    0.50   11     0
x4    0.12    2    0.65   12     0
x5    0.10    3    0.77   202    1
x6    0.08    3    0.87   212    0
x7    0.05    3    0.95   221    0

(Each Ci is the ternary expansion of Fi: repeatedly multiply Fi by 3 and take the integer part as the next digit, for li digits.)

(a) To find the code efficiency, we have

LC = Σ_{i=1}^{7} li p(xi) = 2.23 ternary units/message.

H(X) = −Σ_{i=1}^{7} p(xi) log3 p(xi) = 1.642 ternary units/message.

η = H(X)/LC × 100% = 73.632%

(b) p(0) at the encoder output is

p(0) = Σ_{i=1}^{7} 0i p(xi) / LC = (0.6 + 0.2 + 0.1) / 2.23

p(0) = 0.404
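
The same check for the ternary code, reusing the shannon_code sketch with r = 3:

p = [0.3, 0.2, 0.15, 0.12, 0.10, 0.08, 0.05]
codes = shannon_code(p, r=3)                     # helper from the earlier sketch
print(codes)                                     # ['00', '02', '11', '12', '202', '212', '221']
Lc = sum(len(c) * pi for c, pi in zip(codes, p))
p0 = sum(c.count("0") * pi for c, pi in zip(codes, p)) / Lc
print(Lc, p0)                                    # 2.23 ternary units/message, ~0.404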



Shannon-Fano Code:

In Shannon–Fano coding, the symbols are arranged in order from most probable to
least probable, and then divided into two sets whose total probabilities are as close
as possible to being equal. All symbols then have the first digits of their codes
assigned; symbols in the first set receive "0" and symbols in the second set receive
"1". As long as any sets with more than one member remain, the same process is
repeated on those sets, to determine successive digits of their codes.
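
The splitting procedure can be written as a short recursive Python sketch (illustrative; the name shannon_fano is ours, and the split point is chosen so that the two group totals are as close to equal as possible, matching the description above):

def shannon_fano(symbols):
    # symbols: list of (name, probability) pairs sorted in decreasing order of probability.
    if len(symbols) == 1:
        return {symbols[0][0]: ""}
    total = sum(p for _, p in symbols)
    running, best_k, best_diff = 0.0, 1, float("inf")
    for k in range(1, len(symbols)):          # find the split with the most equal halves
        running += symbols[k - 1][1]
        diff = abs(2 * running - total)
        if diff < best_diff:
            best_k, best_diff = k, diff
    codes = {}
    for name, code in shannon_fano(symbols[:best_k]).items():
        codes[name] = "0" + code              # first set gets a leading 0
    for name, code in shannon_fano(symbols[best_k:]).items():
        codes[name] = "1" + code              # second set gets a leading 1
    return codes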

Example:

Design a suitable Shannon-Fano binary code for the five symbols with the counts and
probabilities given below. Calculate the average code length, source entropy
and efficiency.

Symbol   Count   Probability   Binary code   Length
A        15      0.385         00            2
B        7       0.1795        01            2
C        6       0.154         10            2
D        6       0.154         110           3
E        5       0.128         111           3

The average codeword length:

L = Σ_{j=1}^{m} P(xj) lj

L = 2 × 0.385 + 2 × 0.1795 + 2 × 0.154 + 3 × 0.154 + 3 × 0.128 = 2.28 bits/symbol



The source entropy is:

H(Y) = −Σ_{j=1}^{m} P(yj) log2 P(yj)

H(Y) = −[0.385 ln 0.385 + 0.1795 ln 0.1795 + 2 × 0.154 ln 0.154 + 0.128 ln 0.128] / ln 2

H(Y) = 2.18567 bits/symbol

The code efficiency:

η = H(Y)/L × 100 = 2.18567/2.28 × 100 = 95.86%
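
Running the shannon_fano sketch above on this example (with the probabilities taken as exact fractions of the total count, so the numbers differ slightly from the rounded hand calculation) gives the same codes:

import math

counts = {"A": 15, "B": 7, "C": 6, "D": 6, "E": 5}
total = sum(counts.values())
symbols = sorted(((s, n / total) for s, n in counts.items()),
                 key=lambda sp: sp[1], reverse=True)
codes = shannon_fano(symbols)                 # helper from the sketch above
L = sum(len(codes[s]) * p for s, p in symbols)
H = -sum(p * math.log2(p) for _, p in symbols)
print(codes)                # {'A': '00', 'B': '01', 'C': '10', 'D': '110', 'E': '111'}
print(L, H, 100 * H / L)    # ~2.28, ~2.186, ~95.8 %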

Example
Develop the Shannon-Fano code for the following set of messages,
p(x) = [0.35 0.2 0.15 0.12 0.1 0.08], then find the code efficiency.
Solution
xi    p(xi)   Code   li
x1    0.35    00     2
x2    0.20    01     2
x3    0.15    100    3
x4    0.12    101    3
x5    0.10    110    3
x6    0.08    111    3

LC = Σ_{i=1}^{6} li p(xi) = 2.45 bits/symbol

H(X) = −Σ_{i=1}^{6} p(xi) log2 p(xi) = 2.396 bits/symbol

η = H(X)/LC × 100% = 97.796%

Example
Repeat the previous example using r = 3 (ternary coding).
Solution

xi    p(xi)   Code   li
x1    0.35    0      1
x2    0.20    10     2
x3    0.15    11     2
x4    0.12    20     2
x5    0.10    21     2
x6    0.08    22     2

LC = Σ_{i=1}^{6} li p(xi) = 1.65 ternary units/symbol

H(X) = −Σ_{i=1}^{6} p(xi) log3 p(xi) = 1.512 ternary units/symbol

η = H(X)/LC × 100% = 91.636%

Huffman Code

The Huffman coding algorithm comprises two steps, reduction and splitting. These
steps can be summarized as follows:



1) Reduction
   a) List the symbols in descending order of probability.
   b) Reduce the r least probable symbols to one symbol with a probability equal to their combined probability.
   c) Reorder in descending order of probability at each stage.
   d) Repeat the reduction step until only r symbols remain.

2) Splitting
   a) Assign 0, 1, ..., r − 1 to the r final symbols and work backwards.
   b) Expand or lengthen the code to cope with each successive split.
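
For the binary case (r = 2) the whole reduction can be driven by a priority queue. The sketch below is a minimal Python illustration (the name huffman_code is ours); the individual 0/1 assignments, and hence the exact codewords, can differ from a hand construction when probabilities tie, but the average codeword length is the same:

import heapq
from itertools import count

def huffman_code(probs):
    # probs: dict mapping symbol -> probability.
    tiebreak = count()                        # keeps the heap from comparing dicts on ties
    heap = [(p, next(tiebreak), {s: ""}) for s, p in probs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        p1, _, c1 = heapq.heappop(heap)       # the two least probable entries ...
        p2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in c1.items()}         # ... are merged, prefixing 0
        merged.update({s: "1" + c for s, c in c2.items()})   # and 1 to their codewords
        heapq.heappush(heap, (p1 + p2, next(tiebreak), merged))
    return heap[0][2]

For the example below, huffman_code({'a1': 0.2, 'a2': 0.4, 'a3': 0.2, 'a4': 0.1, 'a5': 0.1}) yields an average length of 2.2 bits/symbol whatever the tie-breaking, since every Huffman code for a given source has the same (minimum) average length.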

Example: Design Huffman codes for A = {a1, a2, ..., a5}, having the probabilities
{0.2, 0.4, 0.2, 0.1, 0.1}.



The average codeword length:

L = 0.4 × 1 + 0.2 × 2 + 0.2 × 3 + 0.1 × 4 + 0.1 × 4 = 2.2 bits/symbol

The source entropy:

H(Y) = −[0.4 ln 0.4 + 2 × 0.2 ln 0.2 + 2 × 0.1 ln 0.1] / ln 2 = 2.12193 bits/symbol

The code efficiency:

η = 2.12193/2.2 × 100 = 96.45%

Huffman codes can also be designed for minimum variance of the codeword lengths (by placing each combined symbol as high as possible among entries of equal probability). The average codeword length is still 2.2 bits/symbol, but the variances of the two codes are different, as the comparison below shows.
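
A rough comparison of the two variances. The lengths {2, 2, 2, 3, 3} for the minimum-variance code are an assumption here (the corresponding tree is not reproduced above); {1, 2, 3, 4, 4} are the lengths used in the average-length calculation, with the probabilities sorted in decreasing order:

p = [0.4, 0.2, 0.2, 0.1, 0.1]
l_first = [1, 2, 3, 4, 4]        # lengths of the first Huffman code above
l_minvar = [2, 2, 2, 3, 3]       # assumed lengths of the minimum-variance code
for lengths in (l_first, l_minvar):
    L = sum(pi * li for pi, li in zip(p, lengths))
    var = sum(pi * (li - L) ** 2 for pi, li in zip(p, lengths))
    print(L, var)                # 2.2, 1.36   and then   2.2, 0.16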

Example

Develop the Huffman code for the following set of symbols

Symbol A B C D E F G H

Probability 0.1 0.18 0.4 0.05 0.06 0.1 0.07 0.04



Solution
The reduction stages are shown below; each column lists the probabilities in descending order after the two smallest entries of the previous column have been combined. In the splitting step, 0 and 1 are assigned to the two members of each combined pair.

Symbol  Stage 1  Stage 2  Stage 3  Stage 4  Stage 5  Stage 6  Stage 7  Stage 8
C       0.40     0.40     0.40     0.40     0.40     0.40     0.60     1.0
B       0.18     0.18     0.18     0.19     0.23     0.37     0.40
A       0.10     0.10     0.13     0.18     0.19     0.23
F       0.10     0.10     0.10     0.13     0.18
G       0.07     0.09     0.10     0.10
E       0.06     0.07     0.09
D       0.05     0.06
H       0.04

So we obtain the following codes:

Symbol        A     B     C   D       E      F      G      H
Probability   0.1   0.18  0.4 0.05    0.06   0.1    0.07   0.04
Codeword      011   001   1   00010   0101   0000   0100   00011
li            3     3     1   5       4      4      4      5
H(X) = −Σ_{i=1}^{8} p(xi) log2 p(xi) = 2.552 bits/symbol

LC = Σ_{i=1}^{8} li p(xi) = 2.61 bits/symbol

η = H(X)/LC × 100% = 97.778%
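
A short numerical check of these figures from the codeword table above:

import math

p = {"A": 0.1, "B": 0.18, "C": 0.4, "D": 0.05,
     "E": 0.06, "F": 0.1, "G": 0.07, "H": 0.04}
code = {"A": "011", "B": "001", "C": "1", "D": "00010",
        "E": "0101", "F": "0000", "G": "0100", "H": "00011"}
Lc = sum(p[s] * len(c) for s, c in code.items())       # 2.61 bits/symbol
H = -sum(pi * math.log2(pi) for pi in p.values())      # ~2.552 bits/symbol
print(Lc, H, 100 * H / Lc)                             # efficiency ~97.78 %
print(sum(2 ** -len(c) for c in code.values()))        # Kraft sum = 1.0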

Note:
For an r-ary Huffman code on n symbols, the quantity (n − r)/(r − 1) must be an integer; otherwise, add redundant (dummy) symbols with probability zero until the condition is satisfied. For example, with n = 6 and r = 3, (6 − 3)/(3 − 1) = 1.5 is not an integer, so one dummy symbol is added to make n = 7.
Data Compression:
In computer science and information theory, data compression, source coding, or bit-
rate reduction involves encoding information using fewer bits than the original
representation. Compression can be either lossy or lossless.

Lossless data compression algorithms usually exploit statistical redundancy to


represent data more concisely without losing information, so that the process is
reversible. Lossless compression is possible because most real-world data has statistical
redundancy. For example, an image may have areas of color that do not change over
several pixels.

Lossy data compression is the converse of lossless data compression. In these


schemes, some loss of information is acceptable. Dropping nonessential detail from the
data source can save storage space. There is a corresponding trade-off between
preserving information and reducing size.

Run-Length Encoding (RLE):


Run-Length Encoding is a very simple lossless data compression technique that
replaces runs of two or more of the same character with a number representing the
length of the run, followed by the original character; single characters are coded as
runs of 1. RLE is useful for highly redundant data, e.g. indexed images with many pixels
of the same color in a row.
Example:



Input: AAABBCCCCDEEEEEEAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAA
Output: 3A2B4C1D6E38A

The runs fed to an RLE encoder are of variable length while each output token has a fixed form (count, character), unlike Huffman coding, where the input symbols are fixed and the output codewords vary in length.
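
A minimal encoder/decoder pair in Python (illustrative; the function names are ours, and the count-then-character format matches the example above):

import re
from itertools import groupby

def rle_encode(text):
    # each run of a repeated character becomes "<count><character>"
    return "".join(f"{len(list(group))}{ch}" for ch, group in groupby(text))

def rle_decode(encoded):
    # invert "<count><character>" pairs back into the expanded runs
    return "".join(ch * int(n) for n, ch in re.findall(r"(\d+)(\D)", encoded))

msg = "AAABBCCCCD" + "E" * 6 + "A" * 38
print(rle_encode(msg))                        # 3A2B4C1D6E38A
assert rle_decode(rle_encode(msg)) == msg     # the round trip is lossless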
Example: Consider these repeated pixel values in an image: 0 0 0 0 0 0 0 0 0 0 0 0 5 5 5 5 0 0 0 0 0 0 0 0. We could represent them more efficiently as (12, 0)(4, 5)(8, 0); 24 bytes reduced to 6 gives a compression ratio of 24/6 = 4:1.
Example: Original sequence (1 row): 111122233333311112222 can be encoded as (4,1),(3,2),(6,3),(4,1),(4,2); 21 bytes reduced to 10 gives a compression ratio of 21/10 = 21:10.
Example: Original sequence (1 row): HHHHHHHUFFFFFFFFFFFFFF can be encoded as (7,H),(1,U),(14,F); 22 bytes reduced to 6 gives a compression ratio of 22/6 = 11:3.
Savings Ratio: the savings ratio is related to the compression ratio and is a measure of
the amount of redundancy between two representations (compressed and uncompressed). Let:
N1 = the total number of bytes required to store the uncompressed (raw) source image.
N2 = the total number of bytes required to store the compressed data.
The compression ratio Cr is then defined as:

Cr = N1 / N2

• Larger compression ratios indicate more effective compression.
• Smaller compression ratios indicate less effective compression.
• Compression ratios less than one indicate that the compressed representation is actually larger than the uncompressed one (the data has a high degree of irregularity).

The savings ratio Sr is then defined as:

Sr = (N1 − N2) / N1

• Higher savings ratios indicate more effective compression; negative ratios are possible and indicate that the compressed image occupies more memory than the original.



Example: if a 5 Megabyte image is compressed into a 1 Megabyte image, the savings
ratio is (5 − 1)/5 = 4/5 = 80%.
This indicates that 80% of the uncompressed data has been eliminated in the
compressed encoding.
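
The same arithmetic as a tiny sketch:

def compression_ratio(n1, n2):
    return n1 / n2                  # Cr = N1 / N2

def savings_ratio(n1, n2):
    return (n1 - n2) / n1           # Sr = (N1 - N2) / N1

print(compression_ratio(5, 1))      # 5.0 : 5 MB compressed to 1 MB
print(savings_ratio(5, 1))          # 0.8 : 80 % of the raw data eliminated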

