PUMA: Deep Metric Imitation Learning for Stable Motion Primitives

Pérez-Dattari, Rodrigo; Della Santina, Cosimo; Kober, Jens

Computer Science > Robotics

arXiv:2310.12831 (cs)

[Submitted on 19 Oct 2023 (v1), last revised 1 Oct 2024 (this version, v3)]

Title:PUMA: Deep Metric Imitation Learning for Stable Motion Primitives

Authors:Rodrigo Pérez-Dattari, Cosimo Della Santina, Jens Kober

View PDF HTML (experimental)

Abstract:Imitation Learning (IL) is a powerful technique for intuitive robotic programming. However, ensuring the reliability of learned behaviors remains a challenge. In the context of reaching motions, a robot should consistently reach its goal, regardless of its initial conditions. To meet this requirement, IL methods often employ specialized function approximators that guarantee this property by construction. Although effective, these approaches come with a set of limitations: 1) they are unable to fully exploit the capabilities of modern Deep Neural Network (DNN) architectures, 2) some are restricted in the family of motions they can model, resulting in suboptimal IL capabilities, and 3) they require explicit extensions to account for the geometry of motions that consider orientations. To address these challenges, we introduce a novel stability loss function, drawing inspiration from the triplet loss used in the deep metric learning literature. This loss does not constrain the DNN's architecture and enables learning policies that yield accurate results. Furthermore, it is not restricted to a specific state space geometry; therefore, it can easily incorporate the geometry of the robot's state space. We provide a proof of the stability properties induced by this loss and empirically validate our method in various settings. These settings include Euclidean and non-Euclidean state spaces, as well as first-order and second-order motions, both in simulation and with real robots. More details about the experimental results can be found in: this https URL.

Comments:	21 pages, 15 figures, 4 tables
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2310.12831 [cs.RO]
	(or arXiv:2310.12831v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2310.12831

Submission history

From: Rodrigo Pérez Dattari [view email]
[v1] Thu, 19 Oct 2023 15:35:37 UTC (23,760 KB)
[v2] Sun, 25 Feb 2024 18:16:23 UTC (23,708 KB)
[v3] Tue, 1 Oct 2024 10:56:44 UTC (23,706 KB)

Computer Science > Robotics

Title:PUMA: Deep Metric Imitation Learning for Stable Motion Primitives

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:PUMA: Deep Metric Imitation Learning for Stable Motion Primitives

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators