ED2: Environment Dynamics Decomposition World Models for Continuous Control

Hao, Jianye; Yuan, Yifu; Wang, Cong; Wang, Zhen

Computer Science > Machine Learning

arXiv:2112.02817 (cs)

[Submitted on 6 Dec 2021 (v1), last revised 15 Feb 2024 (this version, v2)]

Title:ED2: Environment Dynamics Decomposition World Models for Continuous Control

Authors:Jianye Hao, Yifu Yuan, Cong Wang, Zhen Wang

View PDF

Abstract:Model-based reinforcement learning (MBRL) achieves significant sample efficiency in practice in comparison to model-free RL, but its performance is often limited by the existence of model prediction error. To reduce the model error, standard MBRL approaches train a single well-designed network to fit the entire environment dynamics, but this wastes rich information on multiple sub-dynamics which can be modeled separately, allowing us to construct the world model more accurately. In this paper, we propose the Environment Dynamics Decomposition (ED2), a novel world model construction framework that models the environment in a decomposing manner. ED2 contains two key components: sub-dynamics discovery (SD2) and dynamics decomposition prediction (D2P). SD2 discovers the sub-dynamics in an environment automatically and then D2P constructs the decomposed world model following the sub-dynamics. ED2 can be easily combined with existing MBRL algorithms and empirical results show that ED2 significantly reduces the model error, increases the sample efficiency, and achieves higher asymptotic performance when combined with the state-of-the-art MBRL algorithms on various continuous control tasks. Our code is open source and available at this https URL.

Comments:	10 pages, 13 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2112.02817 [cs.LG]
	(or arXiv:2112.02817v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2112.02817

Submission history

From: Cong Wang [view email]
[v1] Mon, 6 Dec 2021 07:11:19 UTC (1,907 KB)
[v2] Thu, 15 Feb 2024 16:05:26 UTC (4,444 KB)

Computer Science > Machine Learning

Title:ED2: Environment Dynamics Decomposition World Models for Continuous Control

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ED2: Environment Dynamics Decomposition World Models for Continuous Control

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators