GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations

Engelcke, Martin; Kosiorek, Adam R.; Jones, Oiwi Parker; Posner, Ingmar

arXiv:1907.13052 (cs)

[Submitted on 30 Jul 2019 (v1), last revised 23 Nov 2020 (this version, v4)]

Abstract: Generative latent-variable models are emerging as promising tools in robotics and reinforcement learning. Yet, even though tasks in these domains typically involve distinct objects, most state-of-the-art generative models do not explicitly capture the compositional nature of visual scenes. Two recent exceptions, MONet and IODINE, decompose scenes into objects in an unsupervised fashion. Their underlying generative processes, however, do not account for component interactions. Hence, neither of them allows for principled sampling of novel scenes. Here we present GENESIS, the first object-centric generative model of 3D visual scenes capable of both decomposing and generating scenes by capturing relationships between scene components. GENESIS parameterises a spatial GMM over images which is decoded from a set of object-centric latent variables that are either inferred sequentially in an amortised fashion or sampled from an autoregressive prior. We train GENESIS on several publicly available datasets and evaluate its performance on scene generation, decomposition, and semi-supervised learning.
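To make the phrase "spatial GMM over images" concrete, the following is a minimal sketch of a per-pixel Gaussian mixture log-likelihood. It is not the authors' implementation: the function name `spatial_gmm_log_likelihood`, the array shapes, and the shared fixed standard deviation `sigma` are assumptions chosen for illustration, and the decoder that would produce the component means and mask logits from the latent variables is omitted entirely.

```python
import numpy as np

def spatial_gmm_log_likelihood(image, means, mask_logits, sigma=1.0):
    """Log-likelihood of an image under a per-pixel Gaussian mixture.

    image:       (H, W, C) array of pixel values.
    means:       (K, H, W, C) per-component reconstructions (assumed decoder output).
    mask_logits: (K, H, W) unnormalised per-pixel mixing logits (assumed decoder output).
    sigma:       shared standard deviation; a hypothetical default, not a value from the paper.
    """
    # Normalise the logits over the K components so that the mixing
    # weights sum to one at every pixel (a numerically stable log-softmax).
    m = mask_logits.max(axis=0, keepdims=True)
    log_pi = mask_logits - (m + np.log(np.exp(mask_logits - m).sum(axis=0, keepdims=True)))

    # Gaussian log-density of each pixel under each component,
    # summed over the colour channels.
    log_norm = -0.5 * np.log(2.0 * np.pi * sigma ** 2)
    log_px = (log_norm - 0.5 * ((image[None] - means) / sigma) ** 2).sum(axis=-1)

    # Mix over components with log-sum-exp, then sum over all pixels.
    joint = log_pi + log_px
    jm = joint.max(axis=0)
    per_pixel = jm + np.log(np.exp(joint - jm[None]).sum(axis=0))
    return float(per_pixel.sum())
```

With a single component the expression reduces to an ordinary pixel-wise Gaussian likelihood; with several components, each pixel is explained by a weighted combination of the component reconstructions, which is what allows a scene to be split into object-like parts.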

Comments: Published at the International Conference on Learning Representations (ICLR) 2020
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as: arXiv:1907.13052 [cs.LG]
 (or arXiv:1907.13052v4 [cs.LG] for this version)
 https://doi.org/10.48550/arXiv.1907.13052

Submission history

From: Martin Engelcke
[v1] Tue, 30 Jul 2019 16:22:39 UTC (505 KB)
[v2] Sun, 29 Sep 2019 20:19:08 UTC (1,905 KB)
[v3] Mon, 3 Feb 2020 14:02:16 UTC (2,877 KB)
[v4] Mon, 23 Nov 2020 10:31:22 UTC (2,866 KB)
