Are Large Language Models Aligned with People's Social Intuitions for Human-Robot Interactions?

Wachowiak, Lennart; Coles, Andrew; Celiktutan, Oya; Canal, Gerard

Computer Science > Robotics

arXiv:2403.05701 (cs)

[Submitted on 8 Mar 2024 (v1), last revised 9 Jul 2024 (this version, v2)]

Title:Are Large Language Models Aligned with People's Social Intuitions for Human-Robot Interactions?

Authors:Lennart Wachowiak, Andrew Coles, Oya Celiktutan, Gerard Canal

View PDF HTML (experimental)

Abstract:Large language models (LLMs) are increasingly used in robotics, especially for high-level action planning. Meanwhile, many robotics applications involve human supervisors or collaborators. Hence, it is crucial for LLMs to generate socially acceptable actions that align with people's preferences and values. In this work, we test whether LLMs capture people's intuitions about behavior judgments and communication preferences in human-robot interaction (HRI) scenarios. For evaluation, we reproduce three HRI user studies, comparing the output of LLMs with that of real participants. We find that GPT-4 strongly outperforms other models, generating answers that correlate strongly with users' answers in two studies $\unicode{x2014}$ the first study dealing with selecting the most appropriate communicative act for a robot in various situations ($r_s$ = 0.82), and the second with judging the desirability, intentionality, and surprisingness of behavior ($r_s$ = 0.83). However, for the last study, testing whether people judge the behavior of robots and humans differently, no model achieves strong correlations. Moreover, we show that vision models fail to capture the essence of video stimuli and that LLMs tend to rate different communicative acts and behavior desirability higher than people.

Comments:	Accepted at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2403.05701 [cs.RO]
	(or arXiv:2403.05701v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2403.05701

Submission history

From: Lennart Wachowiak [view email]
[v1] Fri, 8 Mar 2024 22:23:23 UTC (4,754 KB)
[v2] Tue, 9 Jul 2024 11:27:40 UTC (4,754 KB)

Computer Science > Robotics

Title:Are Large Language Models Aligned with People's Social Intuitions for Human-Robot Interactions?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Are Large Language Models Aligned with People's Social Intuitions for Human-Robot Interactions?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators