Generalized entropy

Fundamental inequality
Convex function and Jensen’s inequality
Convexity/Concavity of information measures

Lecture 3: More properties of entropy and mutual information

September 6th, 2022

Outline

1 Generalized entropy

2 Fundamental inequality

3 Convex function and Jensen’s inequality

4 Convexity/Concavity of information measures

Definition (Rényi entropy)

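Given a parameter $\alpha > 0$ with $\alpha \neq 1$, and a discrete random variable $X$ with alphabet $\mathcal{X}$ and distribution $P_X$, the Rényi entropy of order $\alpha$ of $X$ is given by
\[
H_\alpha(X) = \frac{1}{1-\alpha} \log\Big( \sum_{x \in \mathcal{X}} P_X(x)^{\alpha} \Big).
\]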
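The definition can be evaluated numerically. The following is a minimal sketch, assuming base-2 logarithms and a pmf given as a plain list; the function names `renyi_entropy` and `shannon_entropy` and the example pmf are illustrative choices, not part of the lecture.

```python
import math

def renyi_entropy(p, alpha):
    """Rényi entropy of order alpha (in bits) of a probability vector p."""
    if alpha <= 0 or alpha == 1:
        raise ValueError("alpha must be positive and different from 1")
    # Sum of P_X(x)^alpha; zero-probability symbols contribute nothing.
    power_sum = sum(px ** alpha for px in p if px > 0)
    return math.log2(power_sum) / (1 - alpha)

def shannon_entropy(p):
    """Shannon entropy (in bits), with the convention 0 log 0 = 0."""
    return -sum(px * math.log2(px) for px in p if px > 0)

# Hypothetical example pmf, chosen only for illustration.
p = [0.5, 0.25, 0.125, 0.125]
for alpha in (0.5, 0.9, 0.999, 1.001, 2.0):
    print(alpha, renyi_entropy(p, alpha))
print("Shannon:", shannon_entropy(p))
```

For a uniform pmf the value does not depend on $\alpha$, and for the example pmf the printed values move toward the Shannon entropy as $\alpha$ approaches 1.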

Definition (Rényi divergence)

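Given a parameter $0 < \alpha < 1$, and two discrete random variables $X$ and $\hat{X}$ with common alphabet $\mathcal{X}$ and distributions $P_X$ and $P_{\hat{X}}$, respectively, the Rényi divergence of order $\alpha$ between $X$ and $\hat{X}$ is given by
\[
D_\alpha(X \| \hat{X}) = \frac{1}{\alpha-1} \log\Big( \sum_{x \in \mathcal{X}} P_X(x)^{\alpha} \, P_{\hat{X}}(x)^{1-\alpha} \Big).
\]
This definition can be extended to $\alpha > 1$ if $P_{\hat{X}}(x) > 0$ for all $x \in \mathcal{X}$.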
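A small numerical sketch of the Rényi divergence may help. It assumes base-2 logarithms and pmfs given as equal-length lists; the name `renyi_divergence` and the choice to return infinity when the $\alpha > 1$ extension is not applicable are assumptions of this sketch.

```python
import math

def renyi_divergence(p, q, alpha):
    """Rényi divergence of order alpha (in bits) between pmfs p and q."""
    if alpha <= 0 or alpha == 1:
        raise ValueError("alpha must be positive and different from 1")
    total = 0.0
    for px, qx in zip(p, q):
        if px == 0:
            continue  # 0^alpha = 0 for alpha > 0
        if qx == 0:
            if alpha > 1:
                return math.inf  # the extension to alpha > 1 needs q(x) > 0
            continue  # for alpha < 1, q(x)^(1 - alpha) = 0
        total += px ** alpha * qx ** (1 - alpha)
    if total == 0:
        return math.inf  # p and q have disjoint supports
    return math.log2(total) / (alpha - 1)

print(renyi_divergence([1.0, 0.0], [0.5, 0.5], 0.5))
print(renyi_divergence([0.5, 0.5], [0.25, 0.75], 2.0))
```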

Lemma
When $\alpha \to 1$, we have the following:
\[
\lim_{\alpha \to 1} H_\alpha(X) = H(X)
\]
and
\[
\lim_{\alpha \to 1} D_\alpha(X \| \hat{X}) = D(X \| \hat{X}).
\]
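One way to see both limits is L'Hôpital's rule. The following is a sketch in natural logarithms, assuming a finite alphabet, sums restricted to symbols with $P_X(x) > 0$, and $P_{\hat{X}}(x) > 0$ wherever $P_X(x) > 0$; the helper functions $g$ and $h$ are introduced here only for the derivation.

```latex
% Entropy: numerator g and denominator (1 - alpha) both vanish at alpha = 1.
\begin{align*}
g(\alpha) &= \ln \sum_{x} P_X(x)^{\alpha}, \qquad g(1) = \ln 1 = 0, \\
g'(\alpha) &= \frac{\sum_{x} P_X(x)^{\alpha} \ln P_X(x)}{\sum_{x} P_X(x)^{\alpha}}, \\
\lim_{\alpha \to 1} H_\alpha(X)
  &= \lim_{\alpha \to 1} \frac{g(\alpha)}{1-\alpha}
   = -g'(1)
   = -\sum_{x} P_X(x) \ln P_X(x)
   = H(X).
\end{align*}

% Divergence: same argument with denominator (alpha - 1).
\begin{align*}
h(\alpha) &= \ln \sum_{x} P_X(x)^{\alpha} P_{\hat{X}}(x)^{1-\alpha}, \qquad h(1) = 0, \\
h'(1) &= \sum_{x} P_X(x) \ln \frac{P_X(x)}{P_{\hat{X}}(x)}, \\
\lim_{\alpha \to 1} D_\alpha(X \| \hat{X})
  &= \lim_{\alpha \to 1} \frac{h(\alpha)}{\alpha - 1}
   = h'(1)
   = D(X \| \hat{X}).
\end{align*}
```

Changing the base of the logarithm multiplies both sides by the same constant, so the limits hold in any base.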

Fundamental inequality

Lemma (Fundamental inequality (FI))

For any $x > 0$ and $D > 1$, we have that
\[
\log_D(x) \le \log_D(e) \cdot (x - 1),
\]
with equality if and only if $x = 1$.
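The slides state the lemma without proof. One standard argument, sketched here with a helper function $g$ that is not part of the lecture, uses elementary calculus:

```latex
\begin{align*}
g(x) &= (x - 1) - \ln x, \qquad x > 0, \\
g'(x) &= 1 - \frac{1}{x}, \qquad g''(x) = \frac{1}{x^{2}} > 0 .
\end{align*}
% g is strictly convex and its only stationary point is x = 1, where g(1) = 0.
% Hence g(x) >= 0 with equality iff x = 1, i.e. ln x <= x - 1.
% Multiplying by log_D(e) > 0 (valid because D > 1) gives the lemma:
\[
\log_D(x) = \log_D(e) \, \ln x \le \log_D(e) \cdot (x - 1).
\]
```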

Substituting $x = 1/y$ into FI shows that, for any $y > 0$, we also have
\[
\log_D(y) \ge \log_D(e)\Big(1 - \frac{1}{y}\Big),
\]
also with equality if and only if $y = 1$. The inequalities above use the base-$D$ logarithm. Specifically, for the base-2 logarithm they become
\[
\log_2(e)\Big(1 - \frac{1}{x}\Big) \le \log_2(x) \le \log_2(e) \cdot (x - 1),
\]
with equality if and only if $x = 1$.
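The two-sided bound is easy to check numerically. This is a minimal sketch for base 2; the function name `fi_bounds` and the sample points are arbitrary choices made for illustration.

```python
import math

LOG2_E = math.log2(math.e)

def fi_bounds(x):
    """Return (lower, middle, upper) = (log2(e)(1 - 1/x), log2(x), log2(e)(x - 1))."""
    if x <= 0:
        raise ValueError("x must be positive")
    return LOG2_E * (1 - 1 / x), math.log2(x), LOG2_E * (x - 1)

for x in (0.1, 0.5, 1.0, 2.0, 10.0):
    lower, middle, upper = fi_bounds(x)
    print(f"x={x}: {lower:.4f} <= {middle:.4f} <= {upper:.4f}")
```

All three quantities coincide at $x = 1$, and the gaps widen as $x$ moves away from 1.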

Information inequality

Theorem
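Let $X$ and $\hat{X}$ be two random variables on a common alphabet $\mathcal{X}$, with probability mass functions $P_X$ and $P_{\hat{X}}$. Then
\[
D(X \| \hat{X}) \ge 0,
\]
with equality if and only if $P_X(x) = P_{\hat{X}}(x)$ for all $x \in \mathcal{X}$, i.e., $X$ and $\hat{X}$ have the same distribution.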
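The information inequality can be illustrated with a short computation. This sketch assumes base-2 logarithms and the usual conventions $0 \log (0/q) = 0$ and $p \log (p/0) = +\infty$; the name `kl_divergence` is an illustrative choice.

```python
import math

def kl_divergence(p, q):
    """D(p || q) in bits for pmfs p and q given as equal-length sequences."""
    total = 0.0
    for px, qx in zip(p, q):
        if px == 0:
            continue  # convention: 0 log(0/q) = 0
        if qx == 0:
            return math.inf  # convention: p log(p/0) = +infinity
        total += px * math.log2(px / qx)
    return total

print(kl_divergence([0.5, 0.5], [0.25, 0.75]))  # positive: the pmfs differ
print(kl_divergence([0.5, 0.5], [0.5, 0.5]))    # zero: identical pmfs
```

Individual terms of the sum can be negative, but the total never is.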

Proof.
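\begin{align*}
D(X \| \hat{X}) &= \sum_{x \in \mathcal{X}} P_X(x) \log_2 \frac{P_X(x)}{P_{\hat{X}}(x)} \\
&\ge (\log_2 e) \sum_{x \in \mathcal{X}} P_X(x) \Big(1 - \frac{P_{\hat{X}}(x)}{P_X(x)}\Big) \\
&= (\log_2 e) \Big( \sum_{x \in \mathcal{X}} P_X(x) - \sum_{x \in \mathcal{X}} P_{\hat{X}}(x) \Big) \\
&= 0,
\end{align*}
where the second step follows from FI, and equality holds if and only if
\[
\frac{P_X(x)}{P_{\hat{X}}(x)} = 1
\]
for all $x \in \mathcal{X}$.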
Corollary
For any two random variables X, Y ,

I(X; Y ) ≥ 0,

with equality if and only if X and Y are independent.
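The corollary follows because $I(X;Y)$ is the divergence between the joint pmf and the product of its marginals. A minimal sketch, assuming the joint pmf is given as a list of rows `joint[x][y]` and logarithms are base 2; the function name and the two example tables are illustrative.

```python
import math

def mutual_information(joint):
    """I(X;Y) in bits from a joint pmf given as rows joint[x][y]."""
    px = [sum(row) for row in joint]          # marginal of X
    py = [sum(col) for col in zip(*joint)]    # marginal of Y
    total = 0.0
    for x, row in enumerate(joint):
        for y, pxy in enumerate(row):
            if pxy > 0:
                total += pxy * math.log2(pxy / (px[x] * py[y]))
    return total

# Product of the marginals (0.4, 0.6) and (0.25, 0.75): X and Y independent.
independent = [[0.1, 0.3], [0.15, 0.45]]
# Y is a copy of a fair bit X.
correlated = [[0.5, 0.0], [0.0, 0.5]]

print(mutual_information(independent))
print(mutual_information(correlated))
```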

Corollary

D(p(y|x)∥q(y|x)) ≥ 0,
with equality if and only if p(y|x) = q(y|x) for all y and x such
that p(x) > 0.

Corollary

I(X; Y |Z) ≥ 0,
with equality if and only if X and Y are conditionally independent
given Z.

Upper bound on entropy

Theorem
If a random variable $X$ takes values from a finite set $\mathcal{X}$, then
\[
H(X) \le \log_2 |\mathcal{X}|,
\]
where $|\mathcal{X}|$ is the size of the set $\mathcal{X}$. Equality holds if and only if $X$ is equiprobable, i.e., uniformly distributed over $\mathcal{X}$ ($P_X(x) = \frac{1}{|\mathcal{X}|}$ for all $x \in \mathcal{X}$).
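A quick numerical illustration of the bound, as a sketch with base-2 logarithms; the function name `entropy` and the three example pmfs are chosen here for illustration only.

```python
import math

def entropy(p):
    """Shannon entropy in bits, with the convention 0 log 0 = 0."""
    return -sum(px * math.log2(px) for px in p if px > 0)

examples = {
    "deterministic": [1.0, 0.0, 0.0, 0.0],
    "skewed": [0.7, 0.1, 0.1, 0.1],
    "uniform": [0.25, 0.25, 0.25, 0.25],
}
for name, p in examples.items():
    # Every value stays at or below log2 of the alphabet size.
    print(f"{name}: H = {entropy(p):.4f}, log2|X| = {math.log2(len(p)):.4f}")
```

The deterministic pmf gives zero entropy and the uniform pmf attains the bound.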

Proof.

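\begin{align*}
\log_2 |\mathcal{X}| - H(X)
&= \sum_{x \in \mathcal{X}} P_X(x) \cdot \log_2 |\mathcal{X}| + \sum_{x \in \mathcal{X}} P_X(x) \log_2 P_X(x) \\
&= \sum_{x \in \mathcal{X}} P_X(x) \cdot \log_2 \big[ |\mathcal{X}| \cdot P_X(x) \big] \\
&\ge \sum_{x \in \mathcal{X}} P_X(x) \cdot \log_2(e) \Big(1 - \frac{1}{|\mathcal{X}| \cdot P_X(x)}\Big) \\
&= \log_2(e) \sum_{x \in \mathcal{X}} \Big(P_X(x) - \frac{1}{|\mathcal{X}|}\Big) \\
&= \log_2(e)(1 - 1) = 0,
\end{align*}
where the inequality follows from FI, with equality if and only if $|\mathcal{X}| \cdot P_X(x) = 1$ for all $x \in \mathcal{X}$.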

Intuitively, entropy tells us how random X is.

X is deterministic if and only if H(X) = 0.

If X is uniform (equiprobable), H(X) is maximized and equal
to log2 |X |.

Theorem
H(X|Y ) ≤ H(X),
with equality if and only if X and Y are independent.
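The slides give no proof here. A one-line argument, assuming the identity $I(X;Y) = H(X) - H(X|Y)$ is available from an earlier lecture, is:

```latex
\[
H(X) - H(X \mid Y) = I(X;Y) \ge 0,
\]
% with equality iff I(X;Y) = 0, i.e. iff X and Y are independent.
```

Note that the statement is about the average over $Y$: for a particular value $y$, $H(X \mid Y = y)$ may exceed $H(X)$.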

Theorem
Let $X_1, X_2, \dots, X_n$ be drawn according to $p(x_1, x_2, \dots, x_n)$. Then
\[
H(X_1, X_2, \dots, X_n) \le \sum_{i=1}^{n} H(X_i),
\]
with equality if and only if the $X_i$ are independent.
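A proof sketch, assuming the chain rule for entropy from an earlier lecture and the fact that conditioning cannot increase entropy:

```latex
\begin{align*}
H(X_1, X_2, \dots, X_n)
&= \sum_{i=1}^{n} H(X_i \mid X_{i-1}, \dots, X_1) && \text{(chain rule)} \\
&\le \sum_{i=1}^{n} H(X_i) && \text{(conditioning reduces entropy)}.
\end{align*}
% Equality requires each X_i to be independent of (X_{i-1}, ..., X_1),
% which holds for every i exactly when the X_i are mutually independent.
```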

Theorem (Log-sum inequality)

For non-negative numbers $a_1, a_2, \dots, a_n$ and $b_1, b_2, \dots, b_n$,
\[
\sum_{i=1}^{n} a_i \log \frac{a_i}{b_i} \ge \Big( \sum_{i=1}^{n} a_i \Big) \log \frac{\sum_{i=1}^{n} a_i}{\sum_{i=1}^{n} b_i},
\]
with equality if and only if $\frac{a_i}{b_i} = \frac{\sum_{j=1}^{n} a_j}{\sum_{j=1}^{n} b_j}$, which is a constant that does not depend on $i$.
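Both sides of the log-sum inequality can be computed directly. This sketch assumes strictly positive $a_i$ and $b_i$ (so no conventions for zeros are needed) and base-2 logarithms; `log_sum_sides` is a hypothetical helper name.

```python
import math

def log_sum_sides(a, b):
    """Return (lhs, rhs) of the log-sum inequality in bits, for positive a_i, b_i."""
    lhs = sum(ai * math.log2(ai / bi) for ai, bi in zip(a, b))
    total_a, total_b = sum(a), sum(b)
    rhs = total_a * math.log2(total_a / total_b)
    return lhs, rhs

# Ratios a_i / b_i differ: strict inequality.
print(log_sum_sides([1.0, 2.0, 3.0], [3.0, 2.0, 1.0]))
# Ratios a_i / b_i all equal 1/2: both sides coincide.
print(log_sum_sides([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))
```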

Convex and concave function

Definition
A function $f(x)$ is said to be convex over an interval $(a, b)$ if for every $x_1, x_2 \in (a, b)$ and $0 \le \lambda \le 1$,
\[
f(\lambda x_1 + (1 - \lambda) x_2) \le \lambda f(x_1) + (1 - \lambda) f(x_2).
\]
A function $f$ is said to be strictly convex if equality holds only if $\lambda = 0$ or $\lambda = 1$.

Definition
A function f is concave if −f is convex.

A function is convex if it always lies below any chord. A function is concave if it always lies above any chord.

Theorem
If the function f has a second derivative that is non-negative
(positive) over an interval, the function is convex (strictly convex)
over that interval.
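As a worked instance of this test, consider $f(t) = t \log_D t$ with $D > 1$, the function used in this lecture to derive the log-sum inequality from Jensen's inequality. The computation below is a sketch added for illustration:

```latex
\begin{align*}
f(t) &= t \log_D t, \qquad t > 0, \\
f'(t) &= \log_D t + \log_D e, \\
f''(t) &= \frac{\log_D e}{t} > 0 \quad \text{for all } t > 0 .
\end{align*}
% The second derivative is strictly positive, so f is strictly convex on (0, infinity).
```

The same test shows that $-\log_D t$ is strictly convex, since its second derivative is $\log_D(e)/t^2 > 0$.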

Jensen’s inequality

Theorem
If $f$ is a convex function and $X$ is a random variable, then
\[
E f(X) \ge f(EX).
\]
Moreover, if $f$ is strictly convex, equality in the above inequality implies that $X = EX$ with probability 1, i.e., $X$ is a constant.
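Jensen's inequality can be checked on a small discrete example. The sketch below uses the strictly convex function $f(t) = t^2$, for which the gap $E f(X) - f(EX)$ is the variance of $X$; the values, probabilities, and helper names are assumptions made for illustration.

```python
def expectation(values, probs, func=None):
    """E[func(X)] for a discrete X; func defaults to the identity."""
    if func is None:
        return sum(p * v for v, p in zip(values, probs))
    return sum(p * func(v) for v, p in zip(values, probs))

def square(t):
    """A strictly convex test function."""
    return t * t

values = [1.0, 2.0, 6.0]
probs = [0.5, 0.3, 0.2]

lhs = expectation(values, probs, square)   # E f(X)
rhs = square(expectation(values, probs))   # f(E X)
print(lhs, rhs, lhs - rhs)
```

When all values coincide, $X$ is constant and the two sides agree, matching the equality condition.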

All the inequalities in the last section can also be proved using Jensen's inequality.

Let $f$ be a strictly convex function, $\alpha_i \ge 0$, and $\sum_{i=1}^{n} \alpha_i = 1$. Jensen's inequality states that
\[
\sum_{i=1}^{n} \alpha_i f(t_i) \ge f\Big( \sum_{i=1}^{n} \alpha_i t_i \Big).
\]
Equality holds if and only if $t_i$ takes the same constant value for all $i$ with $\alpha_i > 0$.

To prove the log-sum inequality, set $\alpha_i = b_i / \sum_{j=1}^{n} b_j$, $t_i = a_i / b_i$, and $f(t) = t \cdot \log_D(t)$; Jensen's inequality then gives the desired result.
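The substitution can be written out explicitly. The following sketch assumes all $b_i > 0$ and uses the shorthand $A = \sum_i a_i$ and $B = \sum_j b_j$, which is introduced here and not in the slides:

```latex
\begin{align*}
\sum_{i=1}^{n} \alpha_i f(t_i)
&= \sum_{i=1}^{n} \frac{b_i}{B} \cdot \frac{a_i}{b_i} \log_D \frac{a_i}{b_i}
 = \frac{1}{B} \sum_{i=1}^{n} a_i \log_D \frac{a_i}{b_i}, \\
f\Big( \sum_{i=1}^{n} \alpha_i t_i \Big)
&= f\Big( \sum_{i=1}^{n} \frac{b_i}{B} \cdot \frac{a_i}{b_i} \Big)
 = f\Big( \frac{A}{B} \Big)
 = \frac{A}{B} \log_D \frac{A}{B} .
\end{align*}
% Jensen's inequality says the first line is at least the second.
% Multiplying both by B > 0 gives the log-sum inequality:
\[
\sum_{i=1}^{n} a_i \log_D \frac{a_i}{b_i} \ge A \log_D \frac{A}{B},
\]
% with equality iff t_i = a_i / b_i is the same for every i.
```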

Theorem
$H(P_X)$ is a concave function of $P_X$, namely
\[
H(\lambda P_X + (1 - \lambda) P_{\tilde{X}}) \ge \lambda H(P_X) + (1 - \lambda) H(P_{\tilde{X}})
\]
for all $\lambda \in [0, 1]$.
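Concavity can be illustrated by mixing two pmfs and comparing entropies. This is a numerical sketch on one hypothetical pair of pmfs, not a proof; the helper names are illustrative and logarithms are base 2.

```python
import math

def entropy(p):
    """Shannon entropy in bits, with the convention 0 log 0 = 0."""
    return -sum(px * math.log2(px) for px in p if px > 0)

def mix(p, q, lam):
    """Pointwise mixture lam * p + (1 - lam) * q of two pmfs."""
    return [lam * a + (1 - lam) * b for a, b in zip(p, q)]

def concavity_gap(p, q, lam):
    """H(mixture) minus the mixture of the entropies; non-negative by concavity."""
    return entropy(mix(p, q, lam)) - (lam * entropy(p) + (1 - lam) * entropy(q))

p = [0.9, 0.1, 0.0]
q = [0.1, 0.2, 0.7]
for lam in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(lam, concavity_gap(p, q, lam))
```

The gap vanishes at $\lambda = 0$ and $\lambda = 1$ and is positive in between for this pair.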

Theorem
Let $(X, Y)$ have joint distribution $P_X(x) P_{Y|X}(y|x)$. Then $I(X; Y)$ is a concave function of $P_X$ (for fixed $P_{Y|X}$), and a convex function of $P_{Y|X}$ (for fixed $P_X$).
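Both claims can be illustrated on binary symmetric channels. This is a sketch under stated assumptions: the channel is a row-stochastic list of lists `channel[x][y]`, logarithms are base 2, and the specific channels, input pmfs, and helper names are chosen here purely for illustration.

```python
import math

def mutual_information(px, channel):
    """I(X;Y) in bits for input pmf px and channel[x][y] = P_{Y|X}(y|x)."""
    n_out = len(channel[0])
    # Output distribution P_Y(y) = sum_x P_X(x) P_{Y|X}(y|x).
    py = [sum(p * row[y] for p, row in zip(px, channel)) for y in range(n_out)]
    total = 0.0
    for p, row in zip(px, channel):
        for y, w in enumerate(row):
            if p > 0 and w > 0:
                total += p * w * math.log2(w / py[y])
    return total

def mix(u, v, lam):
    return [lam * a + (1 - lam) * b for a, b in zip(u, v)]

def mix_channels(w1, w2, lam):
    return [mix(r1, r2, lam) for r1, r2 in zip(w1, w2)]

bsc_01 = [[0.9, 0.1], [0.1, 0.9]]   # crossover probability 0.1
bsc_09 = [[0.1, 0.9], [0.9, 0.1]]   # crossover probability 0.9
uniform = [0.5, 0.5]
lam = 0.5

# Concavity in P_X for the fixed channel bsc_01.
px1, px2 = [0.9, 0.1], [0.2, 0.8]
concave_lhs = mutual_information(mix(px1, px2, lam), bsc_01)
concave_rhs = (lam * mutual_information(px1, bsc_01)
               + (1 - lam) * mutual_information(px2, bsc_01))

# Convexity in P_{Y|X} for the fixed input pmf `uniform`.
convex_lhs = mutual_information(uniform, mix_channels(bsc_01, bsc_09, lam))
convex_rhs = (lam * mutual_information(uniform, bsc_01)
              + (1 - lam) * mutual_information(uniform, bsc_09))

print(concave_lhs, ">=", concave_rhs)
print(convex_lhs, "<=", convex_rhs)
```

The convexity example is extreme on purpose: an equal mixture of the two channels has crossover probability 0.5 and carries no information, although each channel alone is quite informative.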

Theorem
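$D(P_X \| P_{\hat{X}})$ is convex in the pair $(P_X, P_{\hat{X}})$, i.e., if $(P_X, P_{\hat{X}})$ and $(Q_X, Q_{\hat{X}})$ are two pairs of probability mass functions, then
\[
D(\lambda P_X + (1 - \lambda) Q_X \,\|\, \lambda P_{\hat{X}} + (1 - \lambda) Q_{\hat{X}}) \le \lambda \cdot D(P_X \| P_{\hat{X}}) + (1 - \lambda) \cdot D(Q_X \| Q_{\hat{X}}),
\]
for all $\lambda \in [0, 1]$.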
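Joint convexity can be checked numerically on two pairs of pmfs. This sketch assumes strictly positive second arguments so the divergence is finite, and base-2 logarithms; the pairs and the helper names are hypothetical.

```python
import math

def kl_divergence(p, q):
    """D(p || q) in bits; assumes q(x) > 0 wherever p(x) > 0."""
    return sum(a * math.log2(a / b) for a, b in zip(p, q) if a > 0)

def mix(u, v, lam):
    """Pointwise mixture lam * u + (1 - lam) * v."""
    return [lam * a + (1 - lam) * b for a, b in zip(u, v)]

# Two hypothetical pairs of pmfs on a binary alphabet.
p1, q1 = [0.8, 0.2], [0.5, 0.5]
p2, q2 = [0.1, 0.9], [0.6, 0.4]

def convexity_gap(lam):
    """Right-hand side minus left-hand side of the convexity inequality."""
    lhs = kl_divergence(mix(p1, p2, lam), mix(q1, q2, lam))
    rhs = lam * kl_divergence(p1, q1) + (1 - lam) * kl_divergence(p2, q2)
    return rhs - lhs

for lam in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(lam, convexity_gap(lam))
```

Both arguments are mixed with the same $\lambda$, which is what convexity in the pair means.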
