
Information Theory and Coding

Module 2
H(X,Y) Joint entropy (combined randomness of X and Y)
P(X,Y) Joint probability (combined probability of occurrence of X and Y)
H(X|Y) Conditional entropy (The uncertainty in X, Y is known)
H(Y|X) Conditional entropy (The uncertainty in Y, X is known)
P(X|Y) Conditional probability (The probability of occurrence of X, Y is known)
P(Y|X) Conditional probability (The probability of occurrence of Y, X is known)
I(X;Y) Mutual information (amount of information shared between X and Y)
H(X;Y) Mutual entropy (total information contained in X and Y.)
Joint entropy
The joint entropy of two random variables X and Y, denoted H(X,Y), measures the total uncertainty or
information contained in the pair (X,Y); p(x,y) is the joint probability.
It generalizes the entropy of a single random variable to the case where two variables are involved:

$$H(X,Y) = -\sum_{x \in X} \sum_{y \in Y} p(x,y)\,\log p(x,y)$$
Properties:
1) The joint entropy of a set of random variables is a nonnegative number, i.e. H(X,Y) ≥ 0.
2) The joint entropy of a set of variables is greater than or equal to the maximum of all of the individual entropies of the variables in the set, i.e. H(X,Y) ≥ max [H(X), H(Y)].
3) The joint entropy of a set of variables is less than or equal to the sum of the individual entropies of the variables in the set, i.e. H(X,Y) ≤ H(X) + H(Y).

Relations to other entropy measures:


• Joint entropy is used in the definition of conditional entropy: H(X|Y) = H(X,Y) – H(Y)
• It is also used in the definition of mutual information: I(X;Y) = H(X) + H(Y) – H(X,Y)



Joint entropy: Example

Marginal Probability
P(X=S)=0.1+0.4=0.5
P(X=R)=0.4+0.1=0.5
P(Y=Y)=0.1+0.4=0.5
P(Y=N)=0.4+0.1=0.5
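
The joint probability table itself was on the slide image; assuming it is p(S,Y) = 0.1, p(S,N) = 0.4, p(R,Y) = 0.4, p(R,N) = 0.1 (the only table consistent with the marginals above), a minimal Python sketch of the joint-entropy computation is:

```python
import math

# Assumed joint table (consistent with the marginal probabilities above):
#           Y=Y   Y=N
#   X=S     0.1   0.4
#   X=R     0.4   0.1
joint = {('S', 'Y'): 0.1, ('S', 'N'): 0.4,
         ('R', 'Y'): 0.4, ('R', 'N'): 0.1}

# H(X,Y) = -sum p(x,y) log2 p(x,y)
H_XY = -sum(p * math.log2(p) for p in joint.values() if p > 0)
print(round(H_XY, 4))   # ≈ 1.7219 bits
```

With these numbers H(X,Y) ≈ 1.72 bits, while H(X) = H(Y) = 1 bit, so H(X,Y) ≤ H(X) + H(Y), as property 3 requires.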



Conditional entropy
• Conditional entropy quantifies the amount of information needed to describe the outcome of a random variable Y
given that the value of another random variable X is known.
• Entropy of Y when X is known H(Y|X).

$$H(Y|X) = -\sum_{x \in X} \sum_{y \in Y} p(x,y)\,\log p(y|x) = -\sum_{x \in X} \sum_{y \in Y} p(x,y)\,\log \frac{p(x,y)}{p(x)}$$

• Entropy of X when Y is known H(X|Y).

$$H(X|Y) = -\sum_{x \in X} \sum_{y \in Y} p(x,y)\,\log p(x|y) = -\sum_{x \in X} \sum_{y \in Y} p(x,y)\,\log \frac{p(x,y)}{p(y)}$$
• NOTE (Bayes’ theorem / product rule):

$$p(x,y) = p(x|y)\,p(y) = p(y|x)\,p(x)$$



Conditional entropy
Properties:
1) H(Y|X) = 0 if the value of Y is completely determined by the value of X.
2) H(Y|X) = H(Y) and H(X|Y) = H(X) if X and Y are independent random variables.
3) Chain rule of conditional entropy: H(Y|X) = H(X,Y) - H(X) and H(X|Y) = H(X,Y) - H(Y).
4) H(Y|X) ≤ H(Y)
5) H(X,Y) = H(Y|X) + H(X|Y) + I(X;Y)
6) H(X,Y) = H(X) + H(Y) - I(X;Y)
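
As a quick illustration (not from the slides), the sketch below checks the chain rule and properties 4, 5 and 6 numerically for a small, hypothetical joint distribution:

```python
import math

def h(probs):
    """Entropy in bits of a list of probabilities."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Hypothetical joint distribution p(x, y), used only to check the identities above.
joint = [[0.1, 0.4],
         [0.4, 0.1]]

px = [sum(row) for row in joint]                                   # marginal of X
py = [sum(joint[i][j] for i in range(2)) for j in range(2)]        # marginal of Y

H_XY = h([p for row in joint for p in row])
H_X, H_Y = h(px), h(py)

H_Y_given_X = H_XY - H_X          # property 3 (chain rule)
H_X_given_Y = H_XY - H_Y
I_XY = H_X + H_Y - H_XY           # property 6 rearranged

assert H_Y_given_X <= H_Y + 1e-12                                  # property 4
assert abs(H_XY - (H_Y_given_X + H_X_given_Y + I_XY)) < 1e-12      # property 5
print(H_Y_given_X, H_X_given_Y, I_XY)
```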



Chain rule
Chain rule: proof
Chain rule: proof
Conditional entropy: example
Relative entropy
Relative entropy: example
Mutual Information
It is a measure of the mutual dependence between the two variables. More specifically, it quantifies the "amount of
information" obtained about one random variable through observing the other random variable.

I(X;Y) = H(X) – H(X|Y) = H(Y) – H(Y|X) bits/symbol

Properties:
1) I(X;Y) = I(Y;X)
2) I(X;Y) ≥ 0
3) I(X;Y) = H(X) + H(Y) – H(X,Y)
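
A small sketch (hypothetical joint distribution, not from the slides) that evaluates I(X;Y) directly from its double-sum definition and checks properties 1) and 2):

```python
import math

# Hypothetical joint distribution p(x, y), used only to check the properties above.
joint = [[0.3, 0.2],
         [0.1, 0.4]]
px = [sum(row) for row in joint]                                   # marginal of X
py = [sum(joint[i][j] for i in range(2)) for j in range(2)]        # marginal of Y

def mutual_information(joint, px, py):
    """I(X;Y) = sum_x sum_y p(x,y) * log2( p(x,y) / (p(x) p(y)) )."""
    return sum(joint[i][j] * math.log2(joint[i][j] / (px[i] * py[j]))
               for i in range(len(px)) for j in range(len(py))
               if joint[i][j] > 0)

transpose = [[joint[i][j] for i in range(2)] for j in range(2)]    # p(y, x)
I_xy = mutual_information(joint, px, py)
I_yx = mutual_information(transpose, py, px)

assert abs(I_xy - I_yx) < 1e-12      # property 1: symmetry
assert I_xy >= 0                     # property 2: nonnegativity
print(round(I_xy, 4))                # ≈ 0.1245 bits
```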



Mutual Information
Prove:
1) I(X;Y) = I(Y;X)
2) I(X;Y) ≥ 0



Discrete Memoryless Channel (DMC)
• A DMC is a statistical model with an input A and an output B.
• The channel accepts an input symbol from A and responds with an output symbol from B.
• The channel is ‘discrete’ in the sense that the numbers of symbols in A and B are finite.
• Memoryless: the current output depends only on the current input, not on previous inputs.
• P(ai) is assumed to be known.
• Each possible input-output path can be represented by a conditional probability P(bj | ai), the channel transition probability.
• The conditional probabilities that describe an information channel can be represented conveniently using a matrix:
Discrete Memoryless Channel (DMC)
P is the channel matrix; for notational convenience we may sometimes write

Pij = P(bj | ai)

The channel matrix exhibits the following properties and structure:


• Each row of P contains the probabilities of all possible outputs from the same input
to the channel.
• Each column of P contains the probabilities of all possible inputs to a particular
output from the channel.
• If we transmit the symbol ai we must receive an output symbol with probability 1, that is,

$$\sum_{j} P(b_j \mid a_i) = 1 \quad \text{for every input } a_i,$$

i.e. the probability terms in each row must sum to 1.
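
A minimal sketch of this row-sum check, using a hypothetical 2-input, 3-output channel matrix (values chosen for illustration only):

```python
# A channel matrix must have each row summing to 1.
P = [[0.7, 0.2, 0.1],   # P(b1|a1), P(b2|a1), P(b3|a1)
     [0.1, 0.3, 0.6]]   # P(b1|a2), P(b2|a2), P(b3|a2)

for i, row in enumerate(P):
    assert abs(sum(row) - 1.0) < 1e-9, f"row {i} does not sum to 1"
print("valid channel matrix")
```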
Discrete Memoryless Channel (DMC)
Noiseless: If the channel is noiseless there will be no error in transmission, and the channel matrix (for a binary channel) is the identity matrix:

$$P = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$$

Noisy: If the channel is noisy and introduces a bit inversion 1% of the time, the channel matrix is

$$P = \begin{bmatrix} 0.99 & 0.01 \\ 0.01 & 0.99 \end{bmatrix}$$
Binary symmetric channel (BSC)

• In a BSC the input to the channel is the binary digits {0, 1}.

• The channel is assumed memoryless.

• Ideally, if there is no noise, a transmitted 0 is detected by the receiver as a 0, and a transmitted 1 is detected by the receiver as a 1.

• The most common effect of noise is to force the detector to detect the wrong bit (bit inversion), that is, a 0 is detected as a 1, and a 1 is detected as a 0. Here q is the probability of error (also called the bit error probability, bit error rate (BER), or “crossover” probability) and p = 1 - q is the probability of correct detection.

• The BSC is an important channel for digital communication systems, since the noise present in physical transmission media commonly causes bit inversions.
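
As a small illustration, the BSC channel matrix can be built directly from the crossover probability q (a sketch; bsc_matrix is a name chosen here, not one used on the slides):

```python
def bsc_matrix(q):
    """Channel matrix of a binary symmetric channel with crossover probability q."""
    p = 1 - q                 # probability of correct detection
    return [[p, q],           # row for input 0: P(0|0), P(1|0)
            [q, p]]           # row for input 1: P(0|1), P(1|1)

print(bsc_matrix(0.01))       # the 1% bit-inversion channel from the earlier slide
```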
Binary erasure channel (BEC)
• Another effect that noise (or, more usually, loss of signal) may have is to prevent the receiver from deciding whether the transmitted symbol was a 0 or a 1.

• In this case the output alphabet includes an additional symbol, ?, called the “erasure” symbol, which denotes a bit that could not be detected.

• Strictly speaking, a BEC does not model the effect of bit inversion; a transmitted bit is either received correctly (probability p) or received as an “erasure” (probability q = 1 - p).

• The BEC is becoming an increasingly important model for wireless mobile and satellite communication channels, which suffer mainly from dropouts and loss of signal, leading to the receiver failing to detect any signal.
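
A similar sketch for the BEC (output symbols ordered 0, ?, 1); for a uniform input, the computed I(X;Y) comes out as 1 - q, which is a useful sanity check:

```python
import math

def bec_matrix(q):
    """Channel matrix of a binary erasure channel with erasure probability q.
    Output symbols are ordered (0, ?, 1)."""
    p = 1 - q
    return [[p, q, 0.0],      # row for input 0
            [0.0, q, p]]      # row for input 1

def mutual_information(px, P):
    """I(X;Y) = H(Y) - H(Y|X) for input distribution px and channel matrix P."""
    def h(probs):
        return -sum(v * math.log2(v) for v in probs if v > 0)
    py = [sum(px[i] * P[i][j] for i in range(len(px))) for j in range(len(P[0]))]
    return h(py) - sum(px[i] * h(P[i]) for i in range(len(px)))

q = 0.2
print(mutual_information([0.5, 0.5], bec_matrix(q)))   # 0.8 = 1 - q bits for uniform input
```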
Jensen-Shannon Divergence (JSD)
The Jensen-Shannon Divergence (JSD) is a widely used method for measuring the similarity or dissimilarity
between two probability distributions. Unlike other divergence measures, such as the Kullback-Leibler divergence
(D_KL), the JSD is symmetric and always yields a finite value. This makes it particularly suitable for comparing
distributions in real-world applications.
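
For reference, the standard definition (with M the midpoint distribution and D_KL the Kullback-Leibler divergence, i.e. the relative entropy discussed earlier):

$$\mathrm{JSD}(P \parallel Q) = \tfrac{1}{2} D_{KL}(P \parallel M) + \tfrac{1}{2} D_{KL}(Q \parallel M), \qquad M = \tfrac{1}{2}(P + Q)$$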
Jensen-Shannon Divergence (JSD): Numerical
Find the similarity between X=[0.2,0.28,0.14,0.25,0.14] and Y=[0.25,0.25,0.1,0.3,0.1] using Jensen-Shannon Divergence
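
A minimal Python sketch of this computation (note that the given X sums to 1.01, so both vectors are renormalized first, which is an assumption about the intended data):

```python
import math

def kl_divergence(p, q):
    """D_KL(p || q) in bits."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def jsd(p, q):
    """Jensen-Shannon divergence between distributions p and q."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

X = [0.2, 0.28, 0.14, 0.25, 0.14]   # sums to 1.01 as printed on the slide
Y = [0.25, 0.25, 0.1, 0.3, 0.1]

sx, sy = sum(X), sum(Y)
X = [x / sx for x in X]             # renormalize (assumption)
Y = [y / sy for y in Y]

print(jsd(X, Y))                    # small value => the distributions are similar
```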
BSC: Numerical
For the given binary channel, with P(y1|x1) = 0.9 and P(y2|x2) = 0.8 (the transition probabilities shown on the channel diagram):
a) Find the channel matrix.
b) Find P(y1) and P(y2) if P(x1) = P(x2) = 0.5.
c) Find the joint probabilities P(x1, y2) and P(x2, y1) when P(x1) = P(x2) = 0.5.
d) Find the mutual information I(Y;X).
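
A sketch of the full calculation, assuming the diagram gives P(y1|x1) = 0.9 and P(y2|x2) = 0.8 (so the off-diagonal entries are 0.1 and 0.2):

```python
import math

def h(probs):
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Assumed channel matrix read from the diagram:
# P(y1|x1)=0.9, P(y2|x1)=0.1, P(y1|x2)=0.2, P(y2|x2)=0.8
P = [[0.9, 0.1],
     [0.2, 0.8]]
px = [0.5, 0.5]

# b) output probabilities P(yj) = sum_i P(xi) P(yj|xi)
py = [sum(px[i] * P[i][j] for i in range(2)) for j in range(2)]      # [0.55, 0.45]

# c) joint probabilities P(xi, yj) = P(xi) P(yj|xi)
p_x1_y2 = px[0] * P[0][1]                                            # 0.05
p_x2_y1 = px[1] * P[1][0]                                            # 0.10

# d) I(Y;X) = H(Y) - H(Y|X)
I = h(py) - sum(px[i] * h(P[i]) for i in range(2))
print(py, p_x1_y2, p_x2_y1, round(I, 4))                             # I ≈ 0.3973 bits
```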
BSC: Numerical
BSC: Numerical
Let X and Y be two independent random variables with probabilities P(X) = {0.2, 0.25, 0.2, 0.2, 0.15} and P(Y) = {0.1, 0.25, 0.25, 0.4}.
Find the joint entropy H(X,Y).

Since independent H(X,Y)=H(X)+H(Y)
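
A minimal check of this sum (the entropy helper below is not from the slides):

```python
import math

def entropy(p):
    """Shannon entropy in bits."""
    return -sum(pi * math.log2(pi) for pi in p if pi > 0)

PX = [0.2, 0.25, 0.2, 0.2, 0.15]
PY = [0.1, 0.25, 0.25, 0.4]

# For independent X and Y: H(X,Y) = H(X) + H(Y)
print(entropy(PX) + entropy(PY))   # ≈ 2.304 + 1.861 ≈ 4.16 bits
```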


BSC: Numerical
BSC: Numerical
For the given BSC, with P(x1) = α and P(x2) = 1 - α, crossover probability p (P(y2|x1) = P(y1|x2) = p) and P(y1|x1) = P(y2|x2) = 1 - p:
1) Show that the mutual information is given by I(X;Y) = H(Y) + p log2 p + (1-p) log2 (1-p).
2) Calculate I(X;Y) for α = 0.5 and p = 0.1.
3) Calculate I(X;Y) for α = 0.5 and p = 0.5, and comment on the results.
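
A sketch of parts 2) and 3), using the identity from part 1) with H(Y) computed from the output distribution:

```python
import math

def h2(p):
    """Binary entropy function in bits."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def bsc_mutual_information(alpha, p):
    py1 = alpha * (1 - p) + (1 - alpha) * p   # P(y1)
    # I(X;Y) = H(Y) - H(Y|X) = H(Y) + p*log2(p) + (1-p)*log2(1-p)
    return h2(py1) - h2(p)

print(bsc_mutual_information(0.5, 0.1))   # ≈ 0.531 bits
print(bsc_mutual_information(0.5, 0.5))   # 0 bits: the output carries no information about the input
```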
BSC: Numerical
A binary channel has the following noise characteristics:
If the input symbols are transmitted with probabilities ¾ & ¼ respectively, find H(X), H(Y),
H(X,Y), H(Y|X), H(X|Y).
BSC: Numerical
The joint probability matrix for a channel is given. Compute H(X), H(Y), H(X,Y), H(X|Y) and H(Y|X).
BSC: Numerical
A source delivers the binary digits 0 and 1 with equal probability into a noisy channel at a
rate of 1000 digits / second. Owing to noise on the channel the probability of receiving a
transmitted ‘0’ as a ‘1’ is 1/16, while the probability of transmitting a ‘1’ and receiving a
‘0’ is 1/32. Determine the rate at which information is received.
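
A sketch of the computation: the information rate is the symbol rate multiplied by I(X;Y) per transmitted digit.

```python
import math

def h(probs):
    return -sum(p * math.log2(p) for p in probs if p > 0)

symbol_rate = 1000                # binary digits per second
px = [0.5, 0.5]                   # equally likely 0s and 1s
# Transition probabilities: P(receive 1 | send 0) = 1/16, P(receive 0 | send 1) = 1/32
P = [[15/16, 1/16],
     [1/32, 31/32]]

py = [sum(px[i] * P[i][j] for i in range(2)) for j in range(2)]
I = h(py) - sum(px[i] * h(P[i]) for i in range(2))   # bits per transmitted digit
print(symbol_rate * I)                               # ≈ 730 bits/second
```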
BSC: Numerical
A transmitter produces three symbols A, B, C whose joint probabilities are shown. Calculate H(X,Y).
BSC: Numerical (Assignment)
BSC: Numerical (Assignment)
