Reinforcement learning with world model

Liu, Jingbin; Gu, Xinyang; Liu, Shuai

Computer Science > Artificial Intelligence

arXiv:1908.11494 (cs)

[Submitted on 30 Aug 2019 (v1), last revised 26 Oct 2020 (this version, v4)]

Title:Reinforcement learning with world model

Authors:Jingbin Liu, Xinyang Gu, Shuai Liu

View PDF

Abstract:Nowadays, model-free reinforcement learning algorithms have achieved remarkable performance on many decision making and control tasks, but high sample complexity and low sample efficiency still hinder the wide use of model-free reinforcement learning algorithms. In this paper, we argue that if we intend to design an intelligent agent that learns fast and transfers well, the agent must be able to reflect key elements of intelligence, like intuition, Memory, PredictionandCuriosity. We propose an agent framework that integrates off-policy reinforcement learning with world model learning, so as to embody the important features of intelligence in our algorithm design. We adopt the state-of-art model-free reinforcement learning algorithm, Soft Actor-Critic, as the agent intuition, and world model learning through RNN to endow the agent with memory, curiosity, and the ability to predict. We show that these ideas can work collaboratively with each other and our agent (RMC) can give new state-of-art results while maintaining sample efficiency and training stability. Moreover, our agent framework can be easily extended from MDP to POMDP problems without performance loss.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1908.11494 [cs.AI]
	(or arXiv:1908.11494v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1908.11494

Submission history

From: Xinyang Gu [view email]
[v1] Fri, 30 Aug 2019 00:29:32 UTC (3,537 KB)
[v2] Tue, 3 Sep 2019 04:25:25 UTC (3,538 KB)
[v3] Wed, 11 Sep 2019 02:31:44 UTC (3,538 KB)
[v4] Mon, 26 Oct 2020 05:52:25 UTC (3,534 KB)

Computer Science > Artificial Intelligence

Title:Reinforcement learning with world model

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Reinforcement learning with world model

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators