Learning Robust Dynamics through Variational Sparse Gating

Jain, Arnav Kumar; Sujit, Shivakanth; Joshi, Shruti; Michalski, Vincent; Hafner, Danijar; Ebrahimi-Kahou, Samira

Computer Science > Machine Learning

arXiv:2210.11698 (cs)

[Submitted on 21 Oct 2022]

Abstract: Learning world models from their sensory inputs enables agents to plan for actions by imagining their future outcomes. World models have previously been shown to improve sample-efficiency in simulated environments with few objects, but have not yet been applied successfully to environments with many objects. In environments with many objects, often only a small number of them are moving or interacting at the same time. In this paper, we investigate integrating this inductive bias of sparse interactions into the latent dynamics of world models trained from pixels. First, we introduce Variational Sparse Gating (VSG), a latent dynamics model that updates its feature dimensions sparsely through stochastic binary gates. Moreover, we propose a simplified architecture, Simple Variational Sparse Gating (SVSG), that removes the deterministic pathway of previous models, resulting in a fully stochastic transition function that leverages the VSG mechanism. We evaluate the two model architectures in the BringBackShapes (BBS) environment, which features a large number of moving objects and partial observability, demonstrating clear improvements over prior models.
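
The abstract does not state the update equations, so the following is only an illustrative sketch of what "updating feature dimensions sparsely through stochastic binary gates" could look like, not the paper's implementation. It assumes one Bernoulli gate per latent dimension that either keeps the previous value or overwrites it with a candidate value. The names `sparse_gated_update`, `candidate`, and `gate_logits` are hypothetical, and in a real model the candidate and the logits would come from learned networks rather than constants.

```python
import math
import random


def sparse_gated_update(h_prev, candidate, gate_logits, rng):
    """Overwrite a latent dimension only where its sampled binary gate is 1.

    Assumption (not from the paper): each gate is an independent Bernoulli
    variable whose probability is the sigmoid of the corresponding logit.
    """
    h_next, gates = [], []
    for h, c, logit in zip(h_prev, candidate, gate_logits):
        p_update = 1.0 / (1.0 + math.exp(-logit))  # probability of updating
        g = 1 if rng.random() < p_update else 0    # Bernoulli sample
        gates.append(g)
        h_next.append(c if g else h)               # keep old value when g == 0
    return h_next, gates


rng = random.Random(0)
h_prev = [0.0] * 8          # previous latent state
candidate = [1.0] * 8       # hypothetical proposed new values
gate_logits = [-2.0] * 8    # low update probability, so few dimensions change
h_next, gates = sparse_gated_update(h_prev, candidate, gate_logits, rng)
print(gates, h_next)
```

With strongly negative logits most gates stay closed and most dimensions are copied through unchanged, which is the sparsity bias the abstract describes. A trainable version would also need a gradient estimator for the discrete gates, which this sketch leaves out.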

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2210.11698 [cs.LG]
	(or arXiv:2210.11698v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.11698

Submission history

From: Arnav Kumar Jain
[v1] Fri, 21 Oct 2022 02:56:51 UTC (4,673 KB)
