Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines

Li, Andrew C.; Chen, Zizhao; Vaezipoor, Pashootan; Klassen, Toryn Q.; Icarte, Rodrigo Toro; McIlraith, Sheila A.

Computer Science > Machine Learning

arXiv:2211.10902 (cs)

[Submitted on 20 Nov 2022 (v1), last revised 23 Nov 2022 (this version, v2)]

Title:Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines

Authors:Andrew C. Li, Zizhao Chen, Pashootan Vaezipoor, Toryn Q. Klassen, Rodrigo Toro Icarte, Sheila A. McIlraith

View PDF

Abstract:Natural and formal languages provide an effective mechanism for humans to specify instructions and reward functions. We investigate how to generate policies via RL when reward functions are specified in a symbolic language captured by Reward Machines, an increasingly popular automaton-inspired structure. We are interested in the case where the mapping of environment state to a symbolic (here, Reward Machine) vocabulary -- commonly known as the labelling function -- is uncertain from the perspective of the agent. We formulate the problem of policy learning in Reward Machines with noisy symbolic abstractions as a special class of POMDP optimization problem, and investigate several methods to address the problem, building on existing and new techniques, the latter focused on predicting Reward Machine state, rather than on grounding of individual symbols. We analyze these methods and evaluate them experimentally under varying degrees of uncertainty in the correct interpretation of the symbolic vocabulary. We verify the strength of our approach and the limitation of existing methods via an empirical investigation on both illustrative, toy domains and partially observable, deep RL domains.

Comments:	NeurIPS Deep Reinforcement Learning Workshop 2022
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Formal Languages and Automata Theory (cs.FL)
Cite as:	arXiv:2211.10902 [cs.LG]
	(or arXiv:2211.10902v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2211.10902

Submission history

From: Andrew Li [view email]
[v1] Sun, 20 Nov 2022 08:13:48 UTC (379 KB)
[v2] Wed, 23 Nov 2022 05:05:41 UTC (379 KB)

Computer Science > Machine Learning

Title:Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators