ITC 4 SourceCoding
ITC 4 SourceCoding
• Source Coding
The conversion of the output of a Discrete Memoryless Source (DMS) into a sequence of binary symbols is called
source coding
The aim of source coding is to minimize the average bit rate required for representation of the source by reducing
redundancy of the information source
L≥H
i.e.
Lmin = H
Lmin H
η= =
L L
H
γ =1−η =1−
L
1
• Entropy Bound
For m symbol xi with occurrence probability pi
0 ≤ H ≤ log2 m
Proof of the left hand side inequality 0 ≤ H
pi ∈ [0, 1]
1
⇒ ≥1
pi
1
⇒ log2 ≥0
pi
1
⇒ pi log2 ≥0
pi
∑ 1
⇒ pi log2 ≥0
pi
⇒ H≥0
ln x ≤ x − 1
Consider two group of probability {pi } , {qi } on {xi }
By axiom of probability distribution , sum of probability should be 1
∑ ∑
qi = pi = 1
∑ qi ∑ ln p 1 ∑
qi
qi
i
pi log = pi = pi ln
pi ln 2 ln 2 pi
Using the inequality
∑ ( )
qi 1 ∑ qi 1 ∑ qi
pi log = pi ln ≤ pi −1
pi ln 2 pi ln 2 pi
1 ∑
= qi − pi
ln 2
1 (∑ ∑ )
= qi − pi
ln 2
By axiom of probability distribution
=0
Thus
∑ qi 1 ∑ qi
pi log = pi ln
pi ln 2 pi
( )
1 ∑ qi
≤ pi −1
ln 2 pi
=0
2
Therefore
∑ qi
pi log ≤0
pi
Let
1
qi = (Equal probability distribution for m-message)
m
∑ qi
pi log ≤0
pi
∑ 1
⇐⇒ pi log ≤0
pi m
∑ 1 ∑
⇐⇒ pi log2 − pi log2 m ≤ 0
pi
∑ 1 ∑
⇐⇒ pi log2 − log2 m pi ≤ 0
pi | {z }
| {z }
1
H
H ≤ log2 m
Thus
0 ≤ H ≤ log2 m
L≥H
i.e. When the coding is optimal ( using shortest amount of code to represent information )
Lmin = H
Consider the inequality
∑ qi
pi log ≤0
pi
1
And let qi = ∑ 2 i 1 , notice that
l
m
i=1
2li
∑m 1
∑ ∑
m 1
i=1
qi = 2li
=∑ 2li = 1
∑m 1 m 1
i=1 i=1 i=1
2 li 2li
And thus
∑ qi
pi log ≤0
pi
∑ 1
1
2li
⇐⇒ pi log ∑ ≤0
pi m 1
i=1
2li
3
∑ ( ) ∑
1 1 1
m
⇐⇒ pi
log2 + log2 li − log2 ≤0
li
pi 2
| {z } i=1
2
−li
( ) ( )
∑ 1 ∑ ∑m∑ 1
⇐⇒ pi log2 − pi li − pi log2 ≤0
pi 2li
| {z } | {z } | {z } i=1
L 1
H
∑m
1
⇐⇒ H − L − log2 ≤0
i=1
2li
Using Kraf t Inequality
∑m
1
≤1
i=1
2li
Thus
∑m
1
log2 ≤ log2 1 = 0
i=1
2li
Therefore
∑m
1
H − L ≤ log2
i=1
2li
H −L≤0
Thus
L≥H
Equality holds when
Lmin = H
−EN D−