Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives

Goyal, Anirudh; Sodhani, Shagun; Binas, Jonathan; Peng, Xue Bin; Levine, Sergey; Bengio, Yoshua

arXiv:1906.10667 (cs)

[Submitted on 25 Jun 2019]

Abstract: Reinforcement learning agents that operate in diverse and complex environments can benefit from a structured decomposition of their behavior. This is often addressed in the context of hierarchical reinforcement learning, where the aim is to decompose a policy into lower-level primitives or options and a higher-level meta-policy that triggers the appropriate behaviors for a given situation. However, the meta-policy must still produce appropriate decisions in all states. In this work, we propose a policy design that decomposes into primitives, similarly to hierarchical reinforcement learning, but without a high-level meta-policy. Instead, each primitive decides for itself whether it wishes to act in the current state. We use an information-theoretic mechanism to enable this decentralized decision: each primitive chooses how much information it needs about the current state to make a decision, and the primitive that requests the most information about the current state acts in the world. The primitives are regularized to use as little information as possible, which leads to natural competition and specialization. We experimentally demonstrate that this policy architecture improves over both flat and hierarchical policies in terms of generalization.
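The selection rule described in the abstract can be illustrated with a small sketch. The abstract does not say how "requested information" is measured, so the code below assumes one plausible choice: each primitive encodes the state as a diagonal Gaussian, and its information cost is the KL divergence from that encoding to a standard normal prior. The function names, the `beta` weight, and the example encoder outputs are all hypothetical and are not taken from the paper.

```python
import math


def gaussian_kl_to_standard_normal(mean, log_var):
    """KL( N(mean, diag(exp(log_var))) || N(0, I) ) in nats.

    Assumption: this KL term stands in for "how much information
    a primitive requests about the current state".
    """
    return 0.5 * sum(
        math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mean, log_var)
    )


def select_primitive(encodings):
    """Return (index of the acting primitive, per-primitive costs).

    `encodings` holds one (mean, log_var) pair per primitive, standing
    in for the output of that primitive's own state encoder.
    """
    costs = [gaussian_kl_to_standard_normal(m, lv) for m, lv in encodings]
    # The primitive that requests the most information acts.
    winner = max(range(len(costs)), key=costs.__getitem__)
    return winner, costs


def information_penalty(costs, beta=0.01):
    """Regularizer pushing every primitive to use little information.

    `beta` is a hypothetical trade-off weight added to the RL loss.
    """
    return beta * sum(costs)


# Hypothetical encoder outputs for three primitives in one state.
encodings = [
    ([0.0, 0.0], [0.0, 0.0]),   # equals the prior: requests nothing
    ([1.0, -1.0], [0.0, 0.0]),  # shifted mean: requests the most
    ([0.1, 0.0], [0.0, 0.0]),   # nearly uninformative
]
winner, costs = select_primitive(encodings)
penalty = information_penalty(costs)
```

In this toy state the second primitive wins because its encoding departs furthest from the prior. The penalty term charges every primitive for the information it uses, which is one way to produce the competition and specialization the abstract mentions. A hard argmax is used here for clarity; a differentiable relaxation would be needed for gradient-based training.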

Comments:	Preprint, Under Review
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1906.10667 [cs.LG]
	(or arXiv:1906.10667v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.10667

Submission history

From: Shagun Sodhani
[v1] Tue, 25 Jun 2019 17:04:48 UTC (2,770 KB)
