LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

Hoque, Ryan; Balakrishna, Ashwin; Putterman, Carl; Luo, Michael; Brown, Daniel S.; Seita, Daniel; Thananjeyan, Brijen; Novoseller, Ellen; Goldberg, Ken

Computer Science > Robotics

arXiv:2104.00053 (cs)

[Submitted on 31 Mar 2021 (v1), last revised 20 Jul 2021 (this version, v2)]

Title:LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

Authors:Ryan Hoque, Ashwin Balakrishna, Carl Putterman, Michael Luo, Daniel S. Brown, Daniel Seita, Brijen Thananjeyan, Ellen Novoseller, Ken Goldberg

View PDF

Abstract:Corrective interventions while a robot is learning to automate a task provide an intuitive method for a human supervisor to assist the robot and convey information about desired behavior. However, these interventions can impose significant burden on a human supervisor, as each intervention interrupts other work the human is doing, incurs latency with each context switch between supervisor and autonomous control, and requires time to perform. We present LazyDAgger, which extends the interactive imitation learning (IL) algorithm SafeDAgger to reduce context switches between supervisor and autonomous control. We find that LazyDAgger improves the performance and robustness of the learned policy during both learning and execution while limiting burden on the supervisor. Simulation experiments suggest that LazyDAgger can reduce context switches by an average of 60% over SafeDAgger on 3 continuous control tasks while maintaining state-of-the-art policy performance. In physical fabric manipulation experiments with an ABB YuMi robot, LazyDAgger reduces context switches by 60% while achieving a 60% higher success rate than SafeDAgger at execution time.

Comments:	IEEE CASE 2021
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2104.00053 [cs.RO]
	(or arXiv:2104.00053v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2104.00053

Submission history

From: Ryan Hoque [view email]
[v1] Wed, 31 Mar 2021 18:22:53 UTC (21,509 KB)
[v2] Tue, 20 Jul 2021 21:47:14 UTC (43,001 KB)

Computer Science > Robotics

Title:LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:LazyDAgger: Reducing Context Switching in Interactive Imitation Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators