Does Dirichlet Prior Smoothing Solve the Shannon Entropy Estimation Problem?

Han, Yanjun; Jiao, Jiantao; Weissman, Tsachy

doi:10.1109/TIT.2017.2733537

Computer Science > Information Theory

arXiv:1502.00327 (cs)

[Submitted on 1 Feb 2015 (v1), last revised 18 Sep 2017 (this version, v3)]

Title:Does Dirichlet Prior Smoothing Solve the Shannon Entropy Estimation Problem?

Authors:Yanjun Han, Jiantao Jiao, Tsachy Weissman

View PDF

Abstract:The Dirichlet prior is widely used in estimating discrete distributions and functionals of discrete distributions. In terms of Shannon entropy estimation, one approach is to plug-in the Dirichlet prior smoothed distribution into the entropy functional, while the other one is to calculate the Bayes estimator for entropy under the Dirichlet prior for squared error, which is the conditional expectation. We show that in general they do \emph{not} improve over the maximum likelihood estimator, which plugs-in the empirical distribution into the entropy functional. No matter how we tune the parameters in the Dirichlet prior, this approach cannot achieve the minimax rates in entropy estimation, as recently characterized by Jiao, Venkat, Han, and Weissman, and Wu and Yang. The performance of the minimax rate-optimal estimator with $n$ samples is essentially \emph{at least} as good as that of the Dirichlet smoothed entropy estimators with $n\ln n$ samples.
We harness the theory of approximation using positive linear operators for analyzing the bias of plug-in estimators for general functionals under arbitrary statistical models, thereby further consolidating the interplay between these two fields, which was thoroughly developed and exploited by Jiao, Venkat, Han, and Weissman. We establish new results in approximation theory, and apply them to analyze the bias of the Dirichlet prior smoothed plug-in entropy estimator. This interplay between bias analysis and approximation theory is of relevance and consequence far beyond the specific problem setting in this paper.

Comments:	27 pages, 1 figure, published on IEEE Transactions on Information Theory, merged with https://arxiv.org/abs/1406.6959
Subjects:	Information Theory (cs.IT)
Cite as:	arXiv:1502.00327 [cs.IT]
	(or arXiv:1502.00327v3 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.1502.00327
Journal reference:	IEEE Transactions on Information Theory, vol. 63, no. 10, pp. 6774-6798, Oct. 2017
Related DOI:	https://doi.org/10.1109/TIT.2017.2733537

Submission history

From: Jiantao Jiao [view email]
[v1] Sun, 1 Feb 2015 23:27:11 UTC (21 KB)
[v2] Tue, 10 Mar 2015 07:49:50 UTC (17 KB)
[v3] Mon, 18 Sep 2017 21:01:24 UTC (41 KB)

Computer Science > Information Theory

Title:Does Dirichlet Prior Smoothing Solve the Shannon Entropy Estimation Problem?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Theory

Title:Does Dirichlet Prior Smoothing Solve the Shannon Entropy Estimation Problem?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators