Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments

Al-Shedivat, Maruan; Bansal, Trapit; Burda, Yuri; Sutskever, Ilya; Mordatch, Igor; Abbeel, Pieter

Computer Science > Machine Learning

arXiv:1710.03641 (cs)

[Submitted on 10 Oct 2017 (v1), last revised 23 Feb 2018 (this version, v2)]

Title:Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments

Authors:Maruan Al-Shedivat, Trapit Bansal, Yuri Burda, Ilya Sutskever, Igor Mordatch, Pieter Abbeel

View PDF

Abstract:Ability to continuously learn and adapt from limited experience in nonstationary environments is an important milestone on the path towards general intelligence. In this paper, we cast the problem of continuous adaptation into the learning-to-learn framework. We develop a simple gradient-based meta-learning algorithm suitable for adaptation in dynamically changing and adversarial scenarios. Additionally, we design a new multi-agent competitive environment, RoboSumo, and define iterated adaptation games for testing various aspects of continuous adaptation strategies. We demonstrate that meta-learning enables significantly more efficient adaptation than reactive baselines in the few-shot regime. Our experiments with a population of agents that learn and compete suggest that meta-learners are the fittest.

Comments:	Published as a conference paper at ICLR 2018
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1710.03641 [cs.LG]
	(or arXiv:1710.03641v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1710.03641

Submission history

From: Maruan Al-Shedivat [view email]
[v1] Tue, 10 Oct 2017 15:00:37 UTC (3,784 KB)
[v2] Fri, 23 Feb 2018 17:27:36 UTC (3,832 KB)

Computer Science > Machine Learning

Title:Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators