GSNs : Generative Stochastic Networks

Alain, Guillaume; Bengio, Yoshua; Yao, Li; Yosinski, Jason; Thibodeau-Laufer, Eric; Zhang, Saizheng; Vincent, Pascal

Computer Science > Machine Learning

arXiv:1503.05571 (cs)

[Submitted on 18 Mar 2015 (v1), last revised 23 Mar 2015 (this version, v2)]

Title:GSNs : Generative Stochastic Networks

Authors:Guillaume Alain, Yoshua Bengio, Li Yao, Jason Yosinski, Eric Thibodeau-Laufer, Saizheng Zhang, Pascal Vincent

View PDF

Abstract:We introduce a novel training principle for probabilistic models that is an alternative to maximum likelihood. The proposed Generative Stochastic Networks (GSN) framework is based on learning the transition operator of a Markov chain whose stationary distribution estimates the data distribution. Because the transition distribution is a conditional distribution generally involving a small move, it has fewer dominant modes, being unimodal in the limit of small moves. Thus, it is easier to learn, more like learning to perform supervised function approximation, with gradients that can be obtained by back-propagation. The theorems provided here generalize recent work on the probabilistic interpretation of denoising auto-encoders and provide an interesting justification for dependency networks and generalized pseudolikelihood (along with defining an appropriate joint distribution and sampling mechanism, even when the conditionals are not consistent). We study how GSNs can be used with missing inputs and can be used to sample subsets of variables given the rest. Successful experiments are conducted, validating these theoretical results, on two image datasets and with a particular architecture that mimics the Deep Boltzmann Machine Gibbs sampler but allows training to proceed with backprop, without the need for layerwise pretraining.

Comments:	arXiv admin note: substantial text overlap with arXiv:1306.1091
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1503.05571 [cs.LG]
	(or arXiv:1503.05571v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1503.05571

Submission history

From: Guillaume Alain [view email]
[v1] Wed, 18 Mar 2015 20:06:07 UTC (11,819 KB)
[v2] Mon, 23 Mar 2015 16:44:52 UTC (5,908 KB)

Computer Science > Machine Learning

Title:GSNs : Generative Stochastic Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:GSNs : Generative Stochastic Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators