Smooth Exploration for Robotic Reinforcement Learning

Raffin, Antonin; Kober, Jens; Stulp, Freek

Computer Science > Machine Learning

arXiv:2005.05719 (cs)

[Submitted on 12 May 2020 (v1), last revised 20 Jun 2021 (this version, v2)]

Title:Smooth Exploration for Robotic Reinforcement Learning

Authors:Antonin Raffin, Jens Kober, Freek Stulp

View PDF

Abstract:Reinforcement learning (RL) enables robots to learn skills from interactions with the real world. In practice, the unstructured step-based exploration used in Deep RL -- often very successful in simulation -- leads to jerky motion patterns on real robots. Consequences of the resulting shaky behavior are poor exploration, or even damage to the robot. We address these issues by adapting state-dependent exploration (SDE) to current Deep RL algorithms. To enable this adaptation, we propose two extensions to the original SDE, using more general features and re-sampling the noise periodically, which leads to a new exploration method generalized state-dependent exploration (gSDE). We evaluate gSDE both in simulation, on PyBullet continuous control tasks, and directly on three different real robots: a tendon-driven elastic robot, a quadruped and an RC car. The noise sampling interval of gSDE permits to have a compromise between performance and smoothness, which allows training directly on the real robots without loss of performance. The code is available at this https URL.

Comments:	Code: this https URL Training scripts: this https URL
Subjects:	Machine Learning (cs.LG); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as:	arXiv:2005.05719 [cs.LG]
	(or arXiv:2005.05719v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2005.05719
Journal reference:	Proceedings of the 5th Conference on Robot Learning, PMLR 164:1634-1644, 2022

Submission history

From: Antonin Raffin [view email]
[v1] Tue, 12 May 2020 12:28:25 UTC (4,664 KB)
[v2] Sun, 20 Jun 2021 09:49:35 UTC (3,437 KB)

Computer Science > Machine Learning

Title:Smooth Exploration for Robotic Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Smooth Exploration for Robotic Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators