KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

Zhang, Peng; Hao, Jianye; Wang, Weixun; Tang, Hongyao; Ma, Yi; Duan, Yihai; Zheng, Yan

Computer Science > Artificial Intelligence

arXiv:2002.07418 (cs)

[Submitted on 18 Feb 2020 (v1), last revised 21 May 2020 (this version, v2)]

Title:KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

Authors:Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng

View PDF

Abstract:Reinforcement learning agents usually learn from scratch, which requires a large number of interactions with the environment. This is quite different from the learning process of human. When faced with a new task, human naturally have the common sense and use the prior knowledge to derive an initial policy and guide the learning process afterwards. Although the prior knowledge may be not fully applicable to the new task, the learning process is significantly sped up since the initial policy ensures a quick-start of learning and intermediate guidance allows to avoid unnecessary exploration. Taking this inspiration, we propose knowledge guided policy network (KoGuN), a novel framework that combines human prior suboptimal knowledge with reinforcement learning. Our framework consists of a fuzzy rule controller to represent human knowledge and a refine module to fine-tune suboptimal prior knowledge. The proposed framework is end-to-end and can be combined with existing policy-based reinforcement learning algorithm. We conduct experiments on both discrete and continuous control tasks. The empirical results show that our approach, which combines human suboptimal knowledge and RL, achieves significant improvement on learning efficiency of flat RL algorithms, even with very low-performance human prior knowledge.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2002.07418 [cs.AI]
	(or arXiv:2002.07418v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2002.07418

Submission history

From: Peng Zhang [view email]
[v1] Tue, 18 Feb 2020 07:58:27 UTC (1,023 KB)
[v2] Thu, 21 May 2020 07:02:41 UTC (1,023 KB)

Computer Science > Artificial Intelligence

Title:KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators