Safe Deep RL in 3D Environments using Human Feedback

Rahtz, Matthew; Varma, Vikrant; Kumar, Ramana; Kenton, Zachary; Legg, Shane; Leike, Jan

Computer Science > Machine Learning

arXiv:2201.08102 (cs)

[Submitted on 20 Jan 2022 (v1), last revised 21 Jan 2022 (this version, v2)]

Title:Safe Deep RL in 3D Environments using Human Feedback

Authors:Matthew Rahtz, Vikrant Varma, Ramana Kumar, Zachary Kenton, Shane Legg, Jan Leike

View PDF

Abstract:Agents should avoid unsafe behaviour during both training and deployment. This typically requires a simulator and a procedural specification of unsafe behaviour. Unfortunately, a simulator is not always available, and procedurally specifying constraints can be difficult or impossible for many real-world tasks. A recently introduced technique, ReQueST, aims to solve this problem by learning a neural simulator of the environment from safe human trajectories, then using the learned simulator to efficiently learn a reward model from human feedback. However, it is yet unknown whether this approach is feasible in complex 3D environments with feedback obtained from real humans - whether sufficient pixel-based neural simulator quality can be achieved, and whether the human data requirements are viable in terms of both quantity and quality. In this paper we answer this question in the affirmative, using ReQueST to train an agent to perform a 3D first-person object collection task using data entirely from human contractors. We show that the resulting agent exhibits an order of magnitude reduction in unsafe behaviour compared to standard reinforcement learning.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2201.08102 [cs.LG]
	(or arXiv:2201.08102v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2201.08102

Submission history

From: Matthew Rahtz [view email]
[v1] Thu, 20 Jan 2022 10:26:34 UTC (31,623 KB)
[v2] Fri, 21 Jan 2022 16:10:14 UTC (39,643 KB)

Computer Science > Machine Learning

Title:Safe Deep RL in 3D Environments using Human Feedback

Submission history

Access Paper:

Ancillary files (details):

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Safe Deep RL in 3D Environments using Human Feedback

Submission history

Access Paper:

Ancillary files (details):

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators