Variational inference for Monte Carlo objectives

Mnih, Andriy; Rezende, Danilo J.

arXiv:1602.06725 (cs)

[Submitted on 22 Feb 2016 (v1), last revised 1 Jun 2016 (this version, v2)]

Abstract: Recent progress in deep latent variable models has largely been driven by the development of flexible and scalable variational inference methods. Variational training of this type involves maximizing a lower bound on the log-likelihood, using samples from the variational posterior to compute the required gradients. Recently, Burda et al. (2016) derived a tighter lower bound using a multi-sample importance sampling estimate of the likelihood and showed that optimizing it yields models that use more of their capacity and achieve higher likelihoods. This development showed the importance of such multi-sample objectives and explained the success of several related approaches.
We extend the multi-sample approach to discrete latent variables and analyze the difficulty encountered when estimating the gradients involved. We then develop the first unbiased gradient estimator designed for importance-sampled objectives and evaluate it on training generative and structured output prediction models. The resulting estimator, which is based on low-variance per-sample learning signals, is both simpler and more effective than the NVIL estimator proposed for the single-sample variational objective, and is competitive with the currently used biased estimators.
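
As a rough illustration of the two ideas named in the abstract, the sketch below computes a multi-sample importance-sampled bound, log((1/K) Σ w_i), from log importance weights, and then builds one learning signal per sample. The function names are hypothetical, and the leave-one-out baseline (replacing one sample's log-weight with the average of the others) is an assumed construction for "per-sample learning signals" that the abstract itself does not spell out. It uses only the Python standard library.

```python
import math


def log_mean_exp(log_values):
    """Numerically stable log of the mean of exp(log_values)."""
    m = max(log_values)
    total = sum(math.exp(v - m) for v in log_values)
    return m + math.log(total / len(log_values))


def multi_sample_bound(log_weights):
    """Multi-sample bound estimate: log((1/K) * sum_i w_i), given log w_i."""
    return log_mean_exp(log_weights)


def leave_one_out_signals(log_weights):
    """One learning signal per sample (assumed construction, see lead-in).

    For sample j, the baseline is the bound recomputed with sample j's
    log-weight replaced by the mean of the other log-weights, so the
    baseline does not depend on sample j itself.
    """
    k = len(log_weights)
    if k < 2:
        raise ValueError("need at least two samples for a leave-one-out baseline")
    full = multi_sample_bound(log_weights)
    signals = []
    for j in range(k):
        others = log_weights[:j] + log_weights[j + 1:]
        substitute = sum(others) / (k - 1)  # log of the geometric mean of the other weights
        baseline = log_mean_exp(others + [substitute])
        signals.append(full - baseline)
    return signals


# Hypothetical log importance weights for K = 3 samples.
example_log_weights = [-3.0, -1.0, -2.0]
bound = multi_sample_bound(example_log_weights)
signals = leave_one_out_signals(example_log_weights)
```

In this toy input the sample with the largest weight receives the largest signal, and the multi-sample bound is at least as large as the average of the individual log-weights, which is what a single-sample objective would average over.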

Comments:	Appears in Proceedings of the 33rd International Conference on Machine Learning (ICML), New York, NY, USA, 2016. JMLR: W&CP volume 48
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1602.06725 [cs.LG]
	(or arXiv:1602.06725v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1602.06725

Submission history

From: Andriy Mnih
[v1] Mon, 22 Feb 2016 11:06:06 UTC (189 KB)
[v2] Wed, 1 Jun 2016 16:36:06 UTC (198 KB)
