skip to main content
10.1145/2451176.2451201acmconferencesArticle/Chapter ViewAbstractPublication PagesiuiConference Proceedingsconference-collections
demonstration

Teaching agents with human feedback: a demonstration of the TAMER framework

Published: 19 March 2013 Publication History

Abstract

Incorporating human interaction into agent learning yields two crucial benefits. First, human knowledge can greatly improve the speed and final result of learning compared to pure trial-and-error approaches like reinforcement learning. And second, human users are empowered to designate "correct" behavior. In this abstract, we present research on a system for learning from human interaction - the TAMER framework - then point to extensions to TAMER, and finally describe a demonstration of these systems.

References

[1]
Isbell, C., Kearns, M., Singh, S., Shelton, C., Stone, P., and Kormann, D. Cobot in LambdaMOO: An Adaptive Social Statistics Agent. Proceedings of The 5th Annual International Conference on Autonomous Agents and Multiagent Systems (AAMAS) (2006).
[2]
Knox, W., Glass, B., Love, B., Maddox, W., and Stone, P. How humans teach agents: A new experimental perspective. International Journal of Social Robotics, Special Issue on Robot Learning from Demonstration (2012).
[3]
Knox, W., and Stone, P. Combining manual feedback with subsequent MDP reward signals for reinforcement learning. Proceedings of The 9th Annual International Conference on Autonomous Agents and Multiagent Systems (AAMAS) (2010).
[4]
Knox, W. B. Learning from Human-Generated Reward. PhD thesis, Department of Computer Science, The University of Texas at Austin, August 2012.
[5]
Knox, W. B., and Stone, P. Interactively shaping agents via human reinforcement: The TAMER framework. In The 5th International Conference on Knowledge Capture (September 2009).
[6]
Knox, W. B., and Stone, P. Reinforcement learning from human reward: Discounting in episodic tasks. In 21st IEEE International Symposium on Robot and Human Interactive Communication (Ro-Man) (September 2012).
[7]
Knox, W. B., and Stone, P. Reinforcement learning with human and MDP reward. In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems (AAMAS) (June 2012).
[8]
Knox, W. B., and Stone, P. Learning non-myopically from human-generated reward. In International Conference on Intelligent User Interfaces (IUI) (March 2013).
[9]
León, A., Morales, E., Altamirano, L., and Ruiz, J. Teaching a robot to perform task through imitation and on-line feedback. Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications (2011), 549--556.
[10]
Pilarski, P., Dawson, M., Degris, T., Fahimi, F., Carey, J., and Sutton, R. Online human training of a myoelectric prosthesis controller via actor-critic reinforcement learning. In IEEE International Conference on Rehabilitation Robotics (ICORR), IEEE (2011), 1--7.
[11]
Suay, H., and Chernova, S. Effect of human guidance and state space size on interactive reinforcement learning. In 20th IEEE International Symposium on Robot and Human Interactive Communication (Ro-Man) (2011), 1--6.
[12]
Tenorio-Gonzalez, A., Morales, E., and Villaseñor-Pineda, L. Dynamic reward shaping: training a robot by voice. Advances in Artificial Intelligence - IBERAMIA (2010), 483--492.
[13]
Thomaz, A., and Breazeal, C. Teachable robots: Understanding human teaching behavior to build more effective robot learners. Artificial ntelligence 172, 6-7 (2008), 716--737.

Cited By

View all

Index Terms

  1. Teaching agents with human feedback: a demonstration of the TAMER framework

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    IUI '13 Companion: Proceedings of the companion publication of the 2013 international conference on Intelligent user interfaces companion
    March 2013
    140 pages
    ISBN:9781450319669
    DOI:10.1145/2451176

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 19 March 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. end-user programming
    2. human-agent interaction
    3. interactive machine learning
    4. modeling and prediction of user behavior
    5. reinforcement learning

    Qualifiers

    • Demonstration

    Conference

    IUI '13
    Sponsor:
    IUI '13: 18th International Conference on Intelligent User Interfaces
    March 19 - 22, 2013
    California, Santa Monica, USA

    Acceptance Rates

    Overall Acceptance Rate 746 of 2,811 submissions, 27%

    Upcoming Conference

    IUI '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 01 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media