Learning to Walk Autonomously via Reset-Free Quality-Diversity

Lim, Bryan; Reichenbach, Alexander; Cully, Antoine

doi:10.1145/3512290.3528715

Computer Science > Machine Learning

arXiv:2204.03655 (cs)

[Submitted on 7 Apr 2022]

Title:Learning to Walk Autonomously via Reset-Free Quality-Diversity

Authors:Bryan Lim, Alexander Reichenbach, Antoine Cully

View PDF

Abstract:Quality-Diversity (QD) algorithms can discover large and complex behavioural repertoires consisting of both diverse and high-performing skills. However, the generation of behavioural repertoires has mainly been limited to simulation environments instead of real-world learning. This is because existing QD algorithms need large numbers of evaluations as well as episodic resets, which require manual human supervision and interventions. This paper proposes Reset-Free Quality-Diversity optimization (RF-QD) as a step towards autonomous learning for robotics in open-ended environments. We build on Dynamics-Aware Quality-Diversity (DA-QD) and introduce a behaviour selection policy that leverages the diversity of the imagined repertoire and environmental information to intelligently select of behaviours that can act as automatic resets. We demonstrate this through a task of learning to walk within defined training zones with obstacles. Our experiments show that we can learn full repertoires of legged locomotion controllers autonomously without manual resets with high sample efficiency in spite of harsh safety constraints. Finally, using an ablation of different target objectives, we show that it is important for RF-QD to have diverse types solutions available for the behaviour selection policy over solutions optimised with a specific objective. Videos and code available at this https URL.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
Cite as:	arXiv:2204.03655 [cs.LG]
	(or arXiv:2204.03655v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2204.03655
Related DOI:	https://doi.org/10.1145/3512290.3528715

Submission history

From: Bryan Lim [view email]
[v1] Thu, 7 Apr 2022 14:07:51 UTC (15,926 KB)

Computer Science > Machine Learning

Title:Learning to Walk Autonomously via Reset-Free Quality-Diversity

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning to Walk Autonomously via Reset-Free Quality-Diversity

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators