Noisy Channel Theorem


1 The Noisy Coding Theorem

1.1 The Idea of the Noisy Coding Theorem


We have a discrete memoryless source, with M words and entropy H.
We have a binary symmetric channel with crossover probability p. The
channel has capacity C, which we have yet to define.
Can we reliably send streams of source words through this channel? We are
allowed to code the source words as bitstrings of some fixed length before sending
them through the channel. We can use whatever error-detecting and error-correcting
capabilities we have built into the code.
We set an acceptable failure rate, ε. If we want 99.5% reliability, we set
ε = 0.5%. We are asking that at least 1 - ε = 99.5% of the words that are sent
can be correctly read (after using the code to detect and correct errors). We
will think of setting ε to be small, near 0.
Is there an error-correcting and detecting code that can do that?
Let's imagine a code where codewords are bitstrings of length n, where we
are prepared to accept a very large n in order to make a fancy code.
There's a tradeoff that comes with increasing the length of the codewords.
When you send a longer codeword, there are probably going to be more bit
errors. In fact, the number of bit errors is on average np for each word
sent.
When you code with longer bit strings, there are more possible codewords.
We only need a fixed number |W| = M of codewords, and there are 2^n bit
strings that could be codewords. Thus we can spread out the codewords
more and more, so there is a greater Hamming distance between any two
of them. Thus we can detect and correct more and more errors, up to half
the minimum Hamming distance between codewords.
So we have to ask: As n, the length of the codewords, increases, does the
Hamming distance between codewords grow faster than the probable number
of bit errors? If yes, then there exists a good code. If no, then there is no such
code. When a good code does exist (in this sense), no promises are made about
how easy encoding and decoding might be.
To prove a theorem addressing these questions, we have to (1) figure out
how far apart codewords can be and (2) estimate error probabilities.
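
To make the tradeoff concrete, here is a minimal Python sketch (with made-up numbers: a codeword length n = 100, crossover probability p = 0.01, and a code assumed to correct up to t = 5 errors per codeword; none of these values come from the text) estimating the probability that a codeword suffers more bit errors than the code can correct. On a memoryless binary symmetric channel the number of bit errors in n transmitted bits is Binomial(n, p).

    from math import comb

    def prob_too_many_errors(n, p, t):
        # Probability that a length-n codeword suffers more than t bit errors
        # on a binary symmetric channel with crossover probability p.
        # Bit errors are independent, so the error count is Binomial(n, p).
        return 1.0 - sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(t + 1))

    # Hypothetical numbers for illustration only.
    print(prob_too_many_errors(100, 0.01, 5))   # roughly 0.0006, comfortably below ε = 0.005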
There's another consideration relating to both the source and the channel.
The higher the entropy H of the source, the more information we want to
send, and the more difficult it becomes.
The lower the cross-over probability p of the channel, the fewer the errors,
making it easier to detect and correct them.
If the source entropy is higher, we might need a lower channel cross-over
probability.

Shannon figured this all out. His theorem asserts that if the source entropy
is less than the channel capacity, then ε-reliable communication is possible for
every ε > 0. But if the source entropy is greater than the channel capacity, then
there is a positive lower limit on the error rate that can be achieved.

1.2 Mutual Information


Consider a probability distribution on a cross product X × Y, where X =
{x_1, ..., x_M} and Y = {y_1, ..., y_N}. We have a set of joint probabilities
P(x_i & y_j) = P(i, j) that sum to 1 and that can conveniently be displayed in a
table of joint probabilities. We are thinking of X as source symbols (e.g. 0 and
1) and Y as received symbols (perhaps also 0 and 1), but the joint probabilities
are NOT transition probabilities (and cannot be, because the row probabilities
do not add to 1). From this table we can produce marginal probabilities for
each x_i separately, which we think of as source probabilities for a memoryless
source X. The marginal probabilities for the y_j give the probabilities for what
is received after transmission through the channel.
Here's a running example.

Joint Probabilities
sent \ received     y1      y2      Marginal X (source prob.)
x1                  0.1     0.3     0.4
x2                  0.4     0.2     0.6
Marginal Y          0.5     0.5

From the joint probabilities we can compute transition (conditional) prob-
abilities, P(y_j | x_i) = P(j | i). Simply normalize each row by dividing by its
marginal probability. Thus

P(y_j | x_i) = P(x_i & y_j) / P(x_i).

In our example we have

Transition Probabilities
sent \ received     y1      y2      Row sum
x1                  0.25    0.75    1
x2                  0.67    0.33    1
Summarizing: From a joint probability distribution on X × Y we found
both source probabilities for X and transition probabilities from X to Y. The
process is reversible, which is very important for the sequel. If you begin with
just source and transition probabilities, then you can easily construct a full
table of joint probabilities. Just multiply each row of the transition table by its
corresponding source probability. The joint probability table is constructed from
information about both the source (the source probabilities) and the channel
(the transition probabilities). From the joint probabilities you can find marginal
probabilities for the received variables. For the same channel, different sources
lead to different joint probabilities.
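
The bookkeeping described above is easy to mechanize. The following Python sketch, using the running example, recovers the source and transition probabilities from the joint table and then rebuilds the joint table from them (the variable names are mine, not from the text).

    # Rows are sent symbols x1, x2; columns are received symbols y1, y2.
    joint = [[0.1, 0.3],
             [0.4, 0.2]]

    source = [sum(row) for row in joint]               # marginal P(x_i): [0.4, 0.6]
    received = [sum(col) for col in zip(*joint)]       # marginal P(y_j): [0.5, 0.5]
    transition = [[p_xy / p_x for p_xy in row]         # P(y_j | x_i): normalize each row
                  for row, p_x in zip(joint, source)]

    # The reverse direction: multiply each transition row by its source probability.
    joint_again = [[t * p_x for t in row] for row, p_x in zip(transition, source)]

    print(transition)    # [[0.25, 0.75], [0.667, 0.333]], approximately
    print(joint_again)   # recovers the joint table above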
Given either a joint probability model on X × Y OR (equivalently) a source
S and a channel Ch, we define the mutual information

I(x_i, y_j) = log_2( P(y_j | x_i) / P(y_j) ).

The mutual information is the base-2 logarithm of the ratio of a conditional
probability and a marginal probability for the same quantity y_j. In terms of
sources and channels, it is the log of the ratio of a transition probability to y_j
(from x_i) and the probability that y_j will be received when a random word is
transmitted from the source.
In our example we have

Mutual Information I(x_i, y_j)
sent \ received     y1                            y2
x1                  log_2(0.25/0.5) = -1.000      log_2(0.75/0.5) = 0.585
x2                  log_2(0.67/0.5) = 0.415       log_2(0.33/0.5) = -0.585

When y_j is more likely to occur after x_i than it is overall, the mutual informa-
tion is positive. The occurrence of x_i indicates an increased probability of y_j.
When y_j is less likely to occur after x_i than it is overall, the mutual information
is negative. The occurrence of x_i indicates a decreased probability of y_j. If the
mutual information is zero, I(x_i, y_j) = 0, then P(y_j | x_i) = P(y_j), so knowing
that x_i occurred does not enable us to revise our estimate of the probability of y_j.
Mutual information is symmetric in the two variables: I(x_i, y_j) = I(y_j, x_i).
Indeed, since

P(y_j | x_i) P(x_i) = P(x_i & y_j) = P(x_i | y_j) P(y_j),

we have

P(y_j | x_i) / P(y_j) = P(x_i | y_j) / P(x_i).
Finally, we can define the Average Mutual Information of a joint distri-
bution on X × Y (equivalently, of a channel specified by transition probabilities
P(y_j | x_i) together with a source specified by its word probabilities P(x_i)) to be
the expected value of the mutual information:

I(X, Y) = Σ_{i=1}^{M} Σ_{j=1}^{N} P(x_i & y_j) I(x_i, y_j).

In our example, the average mutual information is

I(X, Y) = (0.1)(-1.000) + (0.3)(0.585) + (0.4)(0.415) + (0.2)(-0.585)
        = 0.1245.
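
For readers who want to check the arithmetic, here is a short Python sketch computing the element-level and average mutual information directly from the joint table of the running example. It uses the equivalent form I(x_i, y_j) = log_2( P(x_i & y_j) / (P(x_i) P(y_j)) ), which follows from substituting P(y_j | x_i) = P(x_i & y_j) / P(x_i) into the definition.

    from math import log2

    joint = [[0.1, 0.3],     # rows: x1, x2; columns: y1, y2
             [0.4, 0.2]]
    p_x = [sum(row) for row in joint]
    p_y = [sum(col) for col in zip(*joint)]

    # I(x_i, y_j) = log_2( P(y_j | x_i) / P(y_j) ) = log_2( P(x_i & y_j) / (P(x_i) P(y_j)) )
    I = [[log2(joint[i][j] / (p_x[i] * p_y[j])) for j in range(2)] for i in range(2)]

    avg = sum(joint[i][j] * I[i][j] for i in range(2) for j in range(2))
    print(I)     # [[-1.0, 0.585], [0.415, -0.585]], approximately
    print(avg)   # approximately 0.1245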

1.3 Capacity of a Binary Symmetric Channel
Main Example.
Our goal is to compute the channel capacity of a binary symmetric channel,
Ch(ε), with cross-over probability ε.
We work with a source, S(p), with words {0, 1} and probabilities P(0) = p
and P(1) = q = 1 - p.

Joint Probabilities
Binary symmetric channel Ch(ε), source (p, q)

sent \ received     0                 1                 Marginal X (source prob.)
0                   p(1-ε)            pε                p
1                   qε                q(1-ε)            q
Marginal Y          p(1-ε) + qε       pε + q(1-ε)

Mutual Information
Binary symmetric channel Ch(ε), source (p, q)

sent \ received     0                                   1
0                   log_2( (1-ε) / (p(1-ε) + qε) )      log_2( ε / (pε + q(1-ε)) )
1                   log_2( ε / (p(1-ε) + qε) )          log_2( (1-ε) / (pε + q(1-ε)) )

The average mutual information for the source and binary symmetric chan-
nel, obtained from the joint probabilities and the element-level mutual informa-
tion, is

I(ε, p) = p(1-ε) log_2( (1-ε) / (p(1-ε) + qε) ) + pε log_2( ε / (pε + q(1-ε)) )
        + qε log_2( ε / (p(1-ε) + qε) ) + q(1-ε) log_2( (1-ε) / (pε + q(1-ε)) )
        = (1-ε) log_2(1-ε) + ε log_2(ε) - (p(1-ε) + qε) log_2( p(1-ε) + qε )
          - (pε + q(1-ε)) log_2( pε + q(1-ε) ).

Notice that for a single channel, the average mutual information depends on
the source probabilities p = P(0) and q = P(1), so it is not a property of the
channel alone. Some sources give greater average mutual information and others
smaller average mutual information for the same cross-over probability ε. If
among all sources we choose one for which the channel has the greatest average
mutual information, that maximal value is a property of the channel alone.
Definition. The Capacity of a discrete memoryless channel is the average
mutual information computed with the source that gives the greatest such
average.
To find the capacity of the binary symmetric channel, we have to maximize
I(ε, p) with respect to p, on the domain 0 ≤ p ≤ 1. It's a calculus problem.
Differentiating with respect to p (remembering that q = 1 - p) and equating
the result to zero shows that the maximum average mutual information occurs at
p = q = 1/2. Hence

Channel capacity = (1-ε) log_2(1-ε) + ε log_2(ε) - (1/2) log_2(1/2) - (1/2) log_2(1/2)
                 = 1 + ε log_2(ε) + (1-ε) log_2(1-ε).
The capacity of the channel depends on the cross-over probability ε. If
the cross-over error rate is 50%, then the channel has no capacity to transmit
information reliably. The lower the cross-over error rate, the more efficiently
information can be sent. If ε = 0, so that there are no errors at all, then it takes
just 1 bit through the channel to transmit 1 bit of information.

[Figure: the channel capacity 1 + ε log_2(ε) + (1-ε) log_2(1-ε) plotted as a function of the cross-over probability ε for 0 ≤ ε ≤ 1; the capacity is 1 at ε = 0 and ε = 1 and falls to 0 at ε = 0.5.]
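
The maximization can also be checked numerically. The following Python sketch implements the expression for I(ε, p) derived above and the closed-form capacity, then confirms with a simple grid search (instead of calculus) that the source p = q = 1/2 maximizes the average mutual information; the value ε = 0.1 is an arbitrary illustration, not taken from the text.

    from math import log2

    def xlog2(x):
        # Convention: 0 * log2(0) = 0.
        return x * log2(x) if x > 0 else 0.0

    def avg_mutual_info(eps, p):
        # I(eps, p) for the binary symmetric channel, from the expression above.
        q = 1 - p
        a = p * (1 - eps) + q * eps       # probability of receiving 0
        b = p * eps + q * (1 - eps)       # probability of receiving 1
        return xlog2(1 - eps) + xlog2(eps) - xlog2(a) - xlog2(b)

    def capacity(eps):
        # Closed form: 1 + eps*log2(eps) + (1 - eps)*log2(1 - eps).
        return 1 + xlog2(eps) + xlog2(1 - eps)

    eps = 0.1   # illustrative cross-over probability
    best_p = max((k / 1000 for k in range(1001)), key=lambda p: avg_mutual_info(eps, p))
    print(best_p)                        # 0.5
    print(avg_mutual_info(eps, 0.5))     # approximately 0.531
    print(capacity(eps))                 # the same value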

1.4 Channel coding theorem


We aim to define the capacity of a discrete memoryless channel, a mea-
sure of how much information we can reliably send through the channel (assum-
ing we are smart about the way we code the data before transmitting). There
is some intuition that comes from the definition, but ultimately the definition
of channel capacity is justified by Shannon's great coding theorem.

The hope is to take a word stream from the source, encode it in some fancy
way so that, after it is sent through the noisy channel, we can detect and correct
as many transmission errors as possible and then decode, in the end producing
a message that is virtually error free, nearly identical to the word stream
produced by the source. Can we do this?
Theorem (Shannon's noisy coding theorem, aka channel coding theorem).
Given a memoryless source and a discrete memoryless channel:
1. If the source entropy is less than the channel capacity, then the error
probability can be reduced to any desired level by using a sufficiently
complex encoder and decoder. There exist codes that can do the job.

2. If the source entropy is greater than the channel capacity, arbitrarily small
error probability cannot be achieved. There is a limit to how effective a
code can be.
In the cases where the noisy coding theorem asserts existence of good codes,
the proof of the theorem gives no indication of how to create these codes. The
theorem is a pure existence theorem. Moreover, the theorem does not claim
that the codes with these minimal error rates are easy to implement. There
has to be some good underlying structure to the code in order for encoding and
decoding to be efficient.
