skip to main content
10.1145/1957656.1957814acmconferencesArticle/Chapter ViewAbstractPublication PageshriConference Proceedingsconference-collections
research-article

Robot self-initiative and personalization by learning through repeated interactions

Published: 06 March 2011 Publication History

Abstract

We have developed a robotic system that interacts with the user, and through repeated interactions, adapts to the user so that the system becomes semi-autonomous and acts proactively. In this work we show how to design a system to meet a user's preferences, show how robot pro-activity can be learned and provide an integrated system using verbal instructions. All these behaviors are implemented in a real platform that achieves all these behaviors and is evaluated in terms of user acceptability and efficiency of interaction.

References

[1]
P. Abbeel and A. Y. Ng. Apprenticeship learning via inverse reinforcement learning. In Proceedings of the 21st International Conference on Machine Learning (ICML'04), pages 1--8, 2004.
[2]
B. Argall, S. Chernova, and M. Veloso. A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57(5):469--483, 2009.
[3]
S. Chernova and M. Veloso. Learning equivalent action choices from demonstration. In Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on, pages 1216--1221. IEEE, 2008.
[4]
M. desJardins, E. Eaton, and K. Wagstaff. Learning user preferences for sets of objects. In Proceedings of the 23rd international conference on Machine learning, page 280. ACM, 2006.
[5]
T. W. Fong, I. Nourbakhsh, and K. Dautenhahn. A survey of socially interactive robots. Robotics and Autonomous Systems, 2003.
[6]
D. Grollman and O. Jenkins. Dogged learning for robots. In Robotics and Automation, 2007 IEEE International Conference on, pages 2483--2488, 2007.
[7]
Y. Guo and C. Gomes. Learning optimal subsets with implicit user preferences. In Proceedings of the 21st international joint conference on Artifical intelligence, pages 1052--1057, 2009.
[8]
B. Hardin and M. Goodrich. On using mixed-initiative control: a perspective for managing large-scale robotic teams. In Proceedings of the 4th ACM/IEEE international conference on Human robot interaction, pages 165--172. ACM, 2009.
[9]
R. Jaulmes, J. Pineau, and D. Precup. Active learning in partially observable markov decision processes. In NIPS Workshop on Value of Information in Inference, Learning and Decision-Making, 2005.
[10]
S. Kirkpatrick, C. Gelatt Jr, and M. Vecchi. Optimization by simulated annealing. Science, 220(4598):671, 1983.
[11]
W. B. Knox and P. Stone. Combining manual feedback with subsequent mdp reward signals for reinforcement learning. In Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS'10), pages 5--12, 2010.
[12]
C. Lachi, G. Teti, G. Tambumni, E. Datteri, and P. Dario. Adaptable semi-autonomy in personal robots. In IEEE International Worksop on Robot and Human Interactive Communication, 2001.
[13]
S. LaValle. Planning algorithms. Cambridge Univ Pr, 2006.
[14]
M. Lopes, F. Melo, L. Montesano, and J. Santos-Victor. Abstraction levels for robotic imitation: Overview and computational approaches. In O. Sigaud and J. Peters, editors, From Motor to Interaction Learning in Robots. Springer, 2009.
[15]
M. Lopes, F. S. Melo, and L. Montesano. Active learning for reward estimation in inverse reinforcement learning. In European Conference on Machine Learning (ECML/PKDD), Bled, Slovenia, 2009.
[16]
L. Montesano and M. Lopes. Learning grasping affordances from local visual descriptors. In IEEE 8TH International Conference on Development and Learning, China, 2009.
[17]
Nuance. Nuance Speech Recognition System Version 8.5, 2004.
[18]
S. Russell and P. Norvig. Artificial Intelligence: A Modern Approach. Prentice Hall, second edition, 2003.
[19]
K. Severinson-Eklundh, A. Green, and H. Hüttenrauch. Social and collaborative aspects of interaction with a service robot. Robotics and Autonomous Systems, 42(3-4):223--234, 2003.
[20]
D. Silver, J. A. Bagnell, and A. Stentz. Learning from demonstration for autonomous navigation in complex unstructured terrain. International Journal of Robotics Research, 29(1):1565--1592, October 2010.
[21]
A. L. Thomaz and C. Breazeal. Teachable robots: Understanding human teaching behavior to build more effective robot learners. Artificial Intelligence Journal, 172:716--737, 2008.
[22]
C. Zhu, R. Byrd, P. Lu, and J. Nocedal. Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization. ACM Transactions on Mathematical Software, 23(4):550--560, 1997.
[23]
J. Zhu and T. Hastie. Kernel logistic regression and the import vector machine. In Adv. Neural Information Proc. Systems, pages 1081--1088, 2002.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
HRI '11: Proceedings of the 6th international conference on Human-robot interaction
March 2011
526 pages
ISBN:9781450305617
DOI:10.1145/1957656
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

  • RA: IEEE Robotics and Automation Society
  • Human Factors & Ergonomics Soc: Human Factors & Ergonomics Soc
  • The Association for the Advancement of Artificial Intelligence (AAAI)
  • IEEE Systems, Man and Cybernetics Society

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 March 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. human-robot interaction
  2. learning by demonstration

Qualifiers

  • Research-article

Conference

HRI'11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 268 of 1,124 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)16
  • Downloads (Last 6 weeks)0
Reflects downloads up to 27 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media