
Shannon’s Source Coding Theorem

Kim Boström∗

Institut für Physik, Universität Potsdam, 14469 Potsdam, Germany

∗ Electronic address: [email protected]

The idea of Shannon's famous source coding theorem [1] is to encode only typical messages. Since the typical messages form a tiny subset of all possible messages, we need fewer resources to encode them. We will show that the probability for the occurrence of non-typical strings tends to zero in the limit of large message lengths. Thus we have the paradoxical situation that although we "forget" to encode most messages, we lose no information in the limit of very long strings. In fact, we make use of redundancy, i.e. we do not encode "unnecessary" information represented by strings which almost never occur.

Recall that a random message of length N is a string x ≡ x_1 · · · x_N of letters, which are independently drawn from an alphabet A = {a_1, . . . , a_K} with a priori probabilities

    p(a_k) = p_k ∈ (0, 1],   k = 1, . . . , K,   (1)

where Σ_k p_k = 1. Each given string x of a random message is an instance or realization of the message ensemble X ≡ X_1 · · · X_N, where each random letter X_n is identical to a fixed letter ensemble X,

    X_n = X,   n = 1, . . . , N.   (2)

A particular message x = x_1 · · · x_N appears with the probability

    p(x_1 · · · x_N) = p(x_1) · · · p(x_N),   (3)

which expresses the fact that the letters are statistically independent of each other.
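To make the setup concrete, here is a minimal sketch in Python. The alphabet and the probabilities are illustrative choices of my own (not taken from the text); the sketch draws i.i.d. messages as in Eq. (2) and evaluates the product probability of Eq. (3).

```python
import numpy as np

rng = np.random.default_rng(0)

# Example letter ensemble (an assumption for illustration): A = {a, b, c}
letters = np.array(["a", "b", "c"])
probs = np.array([0.5, 0.3, 0.2])   # a priori probabilities p_k, summing to 1

def sample_message(N):
    """Draw a message x = x_1 ... x_N with i.i.d. letters, as in Eq. (2)."""
    return rng.choice(letters, size=N, p=probs)

def message_probability(x):
    """p(x_1 ... x_N) = p(x_1) * ... * p(x_N), Eq. (3)."""
    p_of = dict(zip(letters, probs))
    return float(np.prod([p_of[letter] for letter in x]))

x = sample_message(10)
print("message:", "".join(x), "  p(x) =", message_probability(x))
```

Because p(x) shrinks exponentially with N, one usually works with log-probabilities, which is exactly the bookkeeping the entropy performs below.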
[FIG. 1: Lossy coding. Schematic: the code words cover only the typical messages, which form a small subset of all possible messages.]

Now consider a very long message x. Typically, the letter a_k will appear with the frequency N_k ≈ N p_k. Hence, the probability of such a typical message is roughly

    p(x) ≈ p_typ ≡ p_1^{N p_1} · · · p_K^{N p_K} = ∏_{k=1}^{K} p_k^{N p_k}.   (4)

We see that the typical messages are uniformly distributed, each with probability p_typ. This indicates that the set T of typical messages has the size

    |T| ≈ 1 / p_typ.   (5)

If we encode each member of T by a binary string, we need

    I_N = log |T| = −N Σ_{k=1}^{K} p_k log p_k ≡ N H(X)   (6)

bits, where H(X) is the Shannon entropy of the letter ensemble. Thus for very long messages the average number of bits per letter reads

    I ≡ (1/N) I_N = H(X).   (7)

This is Shannon's source coding theorem in a nutshell.
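A quick numerical check of Eqs. (4), (6) and (7), again with an illustrative alphabet of my own choosing: the entropy fixes both the typical probability p_typ and the estimated size of T.

```python
import numpy as np

probs = np.array([0.5, 0.3, 0.2])   # example p_k (an assumption, not from the text)
N = 100                              # message length

H = -np.sum(probs * np.log2(probs))              # Shannon entropy in bits, Eq. (6)
log2_p_typ = np.sum(N * probs * np.log2(probs))  # log2 of p_typ, Eq. (4)

print(f"H(X)             = {H:.4f} bits/letter")
print(f"log2 p_typ       = {log2_p_typ:.1f}  (= -N*H = {-N*H:.1f})")
print(f"log2 |T|  (est.) = {-log2_p_typ:.1f}  ->  I_N ≈ N*H = {N*H:.1f} bits, Eq. (6)")
print(f"bits per letter  = {(-log2_p_typ)/N:.4f}, Eq. (7)")
```

Note that |T| itself is astronomically large (about 2^{NH}); only its logarithm, the bit count, is ever handled in practice.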
Now let us get a bit more into detail. In order to rigorously prove the theorem we need the concept of a random variable and the law of large numbers. Given the letter ensemble X, a function f : A → R defines a discrete, real random variable. The realizations of f(X) are the real numbers f(x), x ∈ A. The average of f(X) is defined as

    ⟨f(X)⟩ := Σ_{x∈A} p(x) f(x) = Σ_{k=1}^{K} p_k f(a_k),   (8)

and the variance is given by

    Δ²f(X) := ⟨f²(X)⟩ − ⟨f(X)⟩².   (9)

For the sequence f(X) ≡ f(X_1), . . . , f(X_N) we define its arithmetic average as

    A := (1/N) Σ_{n=1}^{N} f(X_n),   (10)

which is also a random variable. Since the X_n are identical copies of the letter ensemble X, the average of A is equal to the average of f(X),

    ⟨A⟩ = (1/N) Σ_{n=1}^{N} ⟨f(X_n)⟩ = ⟨f(X)⟩,   (11)
and the variance of A reads

    Δ²A = ⟨A²⟩ − ⟨A⟩²   (12)
        = (1/N²) Σ_{n,m} ⟨f(X_n) f(X_m)⟩ − (1/N²) Σ_{n,m} ⟨f(X_n)⟩⟨f(X_m)⟩   (13)
        = (1/N²) Σ_{n} [ ⟨f²(X_n)⟩ − ⟨f(X_n)⟩² ]   (14)
        = (1/N) Δ²f(X).   (15)

Step (14) uses the statistical independence of the X_n: all cross terms with n ≠ m cancel. The relative standard deviation of A thus reads

    ΔA / ⟨A⟩ = (1/√N) · Δf(X) / ⟨f(X)⟩.   (16)

Concluding, in the limit of large N the arithmetic average of the sequence f(X) and the ensemble average of f(X) coincide. This is the law of large numbers. It is responsible for the validity of statistical experiments. Without this law, we could never verify statistical properties of a system by performing many experiments. In particular, quantum mechanics would be free of any physical meaning.
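As a sanity check of Eq. (16), here is a small simulation (with an example alphabet and a function f chosen by me, not taken from the text) showing that the spread of the arithmetic average shrinks like 1/√N:

```python
import numpy as np

rng = np.random.default_rng(1)
probs = np.array([0.5, 0.3, 0.2])      # example p_k (assumption)
f_vals = np.array([1.0, 2.0, 5.0])     # an arbitrary real function f(a_k) (assumption)

mean_f = np.sum(probs * f_vals)                  # <f(X)>, Eq. (8)
var_f = np.sum(probs * f_vals**2) - mean_f**2    # Delta^2 f(X), Eq. (9)

for N in [10, 100, 1000]:
    # Draw many messages and compute the arithmetic average A for each, Eq. (10)
    samples = rng.choice(f_vals, size=(2000, N), p=probs)
    A = samples.mean(axis=1)
    # Empirical spread of A vs. the prediction Delta f(X) / sqrt(N), Eqs. (15)-(16)
    print(f"N={N:5d}  std(A)={A.std():.4f}   predicted={np.sqrt(var_f / N):.4f}")
```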
Let us reformulate the law of large numbers in the ε, δ-language. For δ > 0 we define the typical set T of a random sequence X as the set of realizations x ≡ x_1 · · · x_N such that

    ⟨f(X)⟩ − δ ≤ (1/N) Σ_{n=1}^{N} f(x_n) ≤ ⟨f(X)⟩ + δ.   (17)

The law of large numbers implies that for every ε, δ > 0 there is a natural number N_0 such that for all N > N_0 the total probability of all typical sequences fulfills

    P_T ≡ Σ_{x∈T} p(x) ≥ 1 − ε.   (18)

The total probability P_T represents the probability for a randomly chosen sequence x to lie in the typical set T. Now consider the special random variable

    f(X) := − log p(X).   (19)

The average of f(X) equals the Shannon entropy of the ensemble X,

    ⟨f(X)⟩ = − Σ_{x∈A} p(x) log p(x) = H(X).   (20)

The typical set now contains all messages x whose probability fulfills

    H − δ ≤ − (1/N) Σ_{n=1}^{N} log p(x_n) ≤ H + δ,   (21)

or equivalently

    2^{−N(H+δ)} ≤ p(x) ≤ 2^{−N(H−δ)},   (22)

where H ≡ H(X). By the law of large numbers, the probability for a randomly drawn message x to be a member of T reads

    P_T ≡ Σ_{x∈T} p(x) ≥ 1 − ε.   (23)

If we encode only typical sequences, the probability of error

    P_err := 1 − P_T ≤ ε   (24)

can be made arbitrarily small by choosing N large enough.
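The typicality condition (21)-(22) and the error bound (24) can be checked empirically. In this sketch (alphabet, δ and message lengths are my own illustrative choices) we draw messages, test whether −(1/N) log2 p(x) lies within δ of H, and estimate P_T:

```python
import numpy as np

rng = np.random.default_rng(2)
probs = np.array([0.5, 0.3, 0.2])       # example p_k (assumption)
H = -np.sum(probs * np.log2(probs))     # entropy, Eq. (20)
delta = 0.05
K = len(probs)

for N in [50, 500, 5000]:
    # letter indices of many i.i.d. messages
    msgs = rng.choice(K, size=(1000, N), p=probs)
    # empirical -(1/N) log2 p(x) for each message, cf. Eq. (21)
    rate = -np.log2(probs[msgs]).mean(axis=1)
    typical = np.abs(rate - H) <= delta
    P_T = typical.mean()                # estimate of Eq. (23)
    print(f"N={N:5d}  P_T ≈ {P_T:.3f}   P_err ≈ {1 - P_T:.3f}   (cf. Eq. 24)")
```

As N grows, the empirical P_err drops toward zero, which is exactly what makes discarding the atypical messages harmless.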
Now let us determine how many typical sequences there are. The left-hand side of (22) gives

    p(x) ≥ 2^{−N(H+δ)}   (25)
    ⇔ Σ_{x∈T} p(x) ≥ |T| 2^{−N(H+δ)}.   (26)

Since the total probability Σ_{x∈T} p(x) cannot exceed 1, relation (26) implies |T| ≤ 2^{N(H+δ)}. The right-hand side of (22) gives

    p(x) ≤ 2^{−N(H−δ)}   (27)
    ⇔ Σ_{x∈T} p(x) ≤ |T| 2^{−N(H−δ)},   (28)

which together with (23) yields

    |T| 2^{−N(H−δ)} ≥ 1 − ε   (29)
    ⇔ |T| ≥ (1 − ε) 2^{N(H−δ)}.   (30)

Relations (26) and (30) can thus be combined into the crucial relation

    (1 − ε) 2^{N(H−δ)} ≤ |T| ≤ 2^{N(H+δ)}.   (31)

For N → ∞ we can choose ε, δ → 0 and obtain the desired expression

    |T| → 2^{N H(X)},   (32)

thus we need I_N → N H(X) bits to encode the message. Equivalently, the information content per letter reads I = H(X) bits.
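To get a feeling for the numbers in (31)-(32), here is a small worked comparison; the biased binary source is my own illustrative choice. The typical set is a vanishing fraction of all K^N messages, yet it carries essentially all the probability.

```python
import numpy as np

# Illustrative binary source (assumption): p(0) = 0.9, p(1) = 0.1
probs = np.array([0.9, 0.1])
H = -np.sum(probs * np.log2(probs))     # ≈ 0.469 bits per letter
N = 1000

log2_all = N * np.log2(len(probs))      # log2 of the number of all messages, K^N
log2_typ = N * H                        # log2 |T| for large N, Eq. (32)

print(f"H(X) ≈ {H:.3f} bits/letter")
print(f"all messages:      2^{log2_all:.0f}")
print(f"typical messages:  2^{log2_typ:.0f}   (Eqs. 31-32)")
print(f"fraction typical:  2^{log2_typ - log2_all:.0f}")
print(f"bits needed: I_N ≈ {log2_typ:.0f} instead of {log2_all:.0f}")
```

Compression to roughly 469 bits per 1000 letters is possible because the ≈ 2^{−531} fraction of messages that are typical carries nearly all the probability.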
Finally, let us investigate whether we can further improve the compression. Relation (30) gives a lower bound for the size of the typical set. Let us compress below H bits per letter by fixing some ε′ > 0 and encoding only sequences that lie in a "subtypical set" T′ ⊂ T whose size reads

    |T′| ≤ (1 − ε) 2^{N(H−δ−ε′)} < 2^{N(H−δ−ε′)}.   (33)

The right-hand side of (22) states that the probability of a typical sequence is bounded from above by

    p(x) ≤ p_max ≡ 2^{−N(H−δ)}.   (34)
If we encode only those typical sequences that lie in the subtypical set T′, the probability that a randomly drawn sequence lies in T′ fulfills

    P_{T′} = Σ_{x∈T′} p(x)   (35)
           ≤ |T′| · p_max = 2^{N(H−δ−ε′)} 2^{−N(H−δ)}   (36)
           = 2^{−Nε′}.   (37)

Because ε′ > 0, the probability of a successful encoding goes to 0 for N → ∞,

    P_{T′} → 0.   (38)

Concluding, if we compress the messages below N H(X) bits, we are not able to encode all typical messages, and for N → ∞ we lose all information.
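The bound (37) collapses very quickly. A one-line check with an illustrative choice of ε′ (my own numbers, not from the text):

```python
# Decay of the bound in Eq. (37): P_T' <= 2^(-N * eps'),
# for an illustrative choice eps' = 0.01 (assumption).
eps_prime = 0.01
for N in [100, 1_000, 10_000, 100_000]:
    print(f"N = {N:7d}   P_T' <= 2^{-N * eps_prime:.0f}")
```

Already at N = 10^4 the success probability is at most 2^{−100} ≈ 10^{−30}: compressing below the entropy rate fails essentially with certainty.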

A good review of the issue can also be found in [2, 3].

[1] C. E. Shannon and W. Weaver, A Mathematical Theory of Communication, The Bell System Technical Journal 27, 379–423, 623–656 (1948).
[2] D. J. C. MacKay, Information Theory, Inference, and Learning Algorithms, http://wol.ra.phy.cam.ac.uk/mackay/itprnn/book.html (1995–2000).
[3] J. Preskill, Lecture notes, http://www.theory.caltech.edu/people/preskill/ph219/ (1997–1999).