DOI: 10.1109/ROMAN.2018.8525621

Generation of Gestures During Presentation for Humanoid Robots

Published: 27 August 2018

Abstract

In presentations, gestures play a particularly important role in conveying information effectively. It has been demonstrated that body language expressing the presenter's enthusiasm and intent affects the success of the presentation and the impression left on the audience. For these reasons, presentation robots are required to perform such movements; however, designing these movements manually is a difficult task. In this research, we propose a method that models the relationship between speech prosody and motion using a recurrent neural network and generates appropriate motions directly from prosodic information. This study also proposes a method for generating motions that convey the meaning of specific words. We implement the proposed method on the "Pepper" robot to evaluate its performance.
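
The abstract describes mapping speech prosody to motion with a recurrent neural network. As an illustration only, the sketch below shows one way such a model could be set up: an LSTM that consumes per-frame prosodic features (assumed here to be F0 and energy) and emits joint-angle trajectories. The class name, feature set, joint count, and loss are hypothetical placeholders, not details taken from the paper.

```python
import torch
import torch.nn as nn

class ProsodyToGesture(nn.Module):
    """Hypothetical sketch: an LSTM mapping per-frame prosodic features
    (e.g. F0 and energy) to robot joint-angle trajectories."""

    def __init__(self, n_prosody: int = 2, n_joints: int = 10, hidden: int = 128):
        super().__init__()
        self.rnn = nn.LSTM(n_prosody, hidden, num_layers=2, batch_first=True)
        self.out = nn.Linear(hidden, n_joints)

    def forward(self, prosody: torch.Tensor) -> torch.Tensor:
        # prosody: (batch, frames, n_prosody)
        h, _ = self.rnn(prosody)       # (batch, frames, hidden)
        return self.out(h)             # (batch, frames, n_joints) joint angles


# Toy usage: one utterance of 200 frames with F0 and energy tracks.
model = ProsodyToGesture()
prosody = torch.randn(1, 200, 2)
joints = model(prosody)                # predicted joint-angle trajectory
# Training would regress against recorded joint angles, e.g. with MSE:
loss = nn.MSELoss()(joints, torch.zeros_like(joints))
```

In the setting described by the abstract, the regression targets would come from human motion aligned with the speech, and the output would drive the robot's joints; the specifics above are assumptions for illustration.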


      Published In

      2018 27th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN)
      Aug 2018
      1195 pages

      Publisher

      IEEE Press
