Should robots display what they hear? Mishearing as a practical accomplishment

  • Damien Rudaz
  • , Christian Licoppe

Research output: Contribution to journalArticlepeer-review

Abstract

As a contribution to research on transparency and failures in human–robot interaction (HRI), our study investigates whether the informational ecology configured by publicly displaying a robot’s automatic speech recognition (ASR) results is consequential in how miscommunications emerge and are dealt with. After a preliminary quantitative analysis of our participants’ gaze behavior during an experiment where they interacted with a conversational robot, we rely on a micro-analytic approach to detail how the interpretation of this robot’s conduct as inadequate was configured by what it displayed as having “heard” on its tablet. We investigate cases where an utterance or gesture by the robot was treated by participants as sequentially relevant only as long as they had not read the automatic speech recognition transcript but re-evaluated it as troublesome once they had read it. In doing so, we contribute to HRI by showing that systematically displaying an ASR transcript can play a crucial role in participants’ interpretation of a co-constructed action (such as shaking hands with a robot) as having “failed”. We demonstrate that “mistakes” and “errors” can be approached as practical accomplishments that emerge as such over the course of interaction rather than as social or technical phenomena pre-categorized by the researcher in reference to criteria exogenous to the activity being analyzed. In the end, while narrowing down on two video fragments, we find that this peculiar informational ecology did not merely impact how the robot was responded to. Instead, it modified the very definition of “mutual understanding” that was enacted and oriented to as relevant by the human participants in these fragments. Besides social robots, we caution that systematically providing such transcripts is a design decision not to be taken lightly; depending on the setting, it may have unintended consequences on interactions between humans and any form of conversational interface.

Original languageEnglish
Article number1597276
JournalFrontiers in Robotics and AI
Volume12
DOIs
Publication statusPublished - 1 Jan 2025

Keywords

  • action ascription
  • automatic speech recognition
  • conversation analysis
  • errors and mistakes
  • ethnomethodology
  • mishearing
  • repair
  • transparency

Fingerprint

Dive into the research topics of 'Should robots display what they hear? Mishearing as a practical accomplishment'. Together they form a unique fingerprint.

Cite this