Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning

Agrawal, Aishwarya; Malinowski, Mateusz; Hill, Felix; Eslami, Ali; Vinyals, Oriol; Kulkarni, Tejas

Computer Science > Machine Learning

arXiv:1812.00898 (cs)

[Submitted on 3 Dec 2018]

Title:Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning

Authors:Aishwarya Agrawal, Mateusz Malinowski, Felix Hill, Ali Eslami, Oriol Vinyals, Tejas Kulkarni

View PDF

Abstract:Advances in Deep Reinforcement Learning have led to agents that perform well across a variety of sensory-motor domains. In this work, we study the setting in which an agent must learn to generate programs for diverse scenes conditioned on a given symbolic instruction. Final goals are specified to our agent via images of the scenes. A symbolic instruction consistent with the goal images is used as the conditioning input for our policies. Since a single instruction corresponds to a diverse set of different but still consistent end-goal images, the agent needs to learn to generate a distribution over programs given an instruction. We demonstrate that with simple changes to the reinforced adversarial learning objective, we can learn instruction conditioned policies to achieve the corresponding diverse set of goals. Most importantly, our agent's stochastic policy is shown to more accurately capture the diversity in the goal distribution than a fixed pixel-based reward function baseline. We demonstrate the efficacy of our approach on two domains: (1) drawing MNIST digits with a paint software conditioned on instructions and (2) constructing scenes in a 3D editor that satisfies a certain instruction.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1812.00898 [cs.LG]
	(or arXiv:1812.00898v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1812.00898

Submission history

From: Aishwarya Agrawal [view email]
[v1] Mon, 3 Dec 2018 16:51:35 UTC (800 KB)

Computer Science > Machine Learning

Title:Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators