Adaptive Multi-Goal Exploration

Tarbouriech, Jean; Domingues, Omar Darwiche; Ménard, Pierre; Pirotta, Matteo; Valko, Michal; Lazaric, Alessandro

Computer Science > Machine Learning

arXiv:2111.12045 (cs)

[Submitted on 23 Nov 2021 (v1), last revised 24 Feb 2022 (this version, v2)]

Title:Adaptive Multi-Goal Exploration

Authors:Jean Tarbouriech, Omar Darwiche Domingues, Pierre Ménard, Matteo Pirotta, Michal Valko, Alessandro Lazaric

View PDF

Abstract:We introduce a generic strategy for provably efficient multi-goal exploration. It relies on AdaGoal, a novel goal selection scheme that leverages a measure of uncertainty in reaching states to adaptively target goals that are neither too difficult nor too easy. We show how AdaGoal can be used to tackle the objective of learning an $\epsilon$-optimal goal-conditioned policy for the (initially unknown) set of goal states that are reachable within $L$ steps in expectation from a reference state $s_0$ in a reward-free Markov decision process. In the tabular case with $S$ states and $A$ actions, our algorithm requires $\tilde{O}(L^3 S A \epsilon^{-2})$ exploration steps, which is nearly minimax optimal. We also readily instantiate AdaGoal in linear mixture Markov decision processes, yielding the first goal-oriented PAC guarantee with linear function approximation. Beyond its strong theoretical guarantees, we anchor AdaGoal in goal-conditioned deep reinforcement learning, both conceptually and empirically, by connecting its idea of selecting "uncertain" goals to maximizing value ensemble disagreement.

Comments:	AISTATS 2022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2111.12045 [cs.LG]
	(or arXiv:2111.12045v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2111.12045

Submission history

From: Jean Tarbouriech [view email]
[v1] Tue, 23 Nov 2021 17:59:50 UTC (204 KB)
[v2] Thu, 24 Feb 2022 10:31:34 UTC (3,194 KB)

Computer Science > Machine Learning

Title:Adaptive Multi-Goal Exploration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Adaptive Multi-Goal Exploration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators