Passer à la navigation principale Passer à la recherche Passer au contenu principal

SRG3: Speech-driven Robot Gesture Generation with GAN

  • ENSTA ParisTech

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

The human gestures occur spontaneously and usually they are aligned with speech, which leads to a natural and expressive interaction. Speech-driven gesture generation is important in order to enable a social robot to exhibit social cues and conduct a successful human-robot interaction. In this paper, the generation process involves mapping acoustic speech representation to the corresponding gestures for a humanoid robot. The paper proposes a new GAN (Generative Adversarial Network) architecture for speech to gesture generation. Instead of the fixed mapping from one speech to one gesture pattern, our end-to-end GAN structure can generate multiple mapped gestures patterns from one speech (with multiple noises) just like humans do. The generated gestures can be applied to social robots with arms. The evaluation result shows the effectiveness of our generative model for speech-driven robot gesture generation.

langue originaleAnglais
titre16th IEEE International Conference on Control, Automation, Robotics and Vision, ICARCV 2020
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages759-766
Nombre de pages8
ISBN (Electronique)9781728177090
Les DOIs
étatPublié - 13 déc. 2020
Modification externeOui
Evénement16th IEEE International Conference on Control, Automation, Robotics and Vision, ICARCV 2020 - Virtual, Shenzhen, Chine
Durée: 13 déc. 202015 déc. 2020

Série de publications

Nom16th IEEE International Conference on Control, Automation, Robotics and Vision, ICARCV 2020

Une conférence

Une conférence16th IEEE International Conference on Control, Automation, Robotics and Vision, ICARCV 2020
Pays/TerritoireChine
La villeVirtual, Shenzhen
période13/12/2015/12/20

Empreinte digitale

Examiner les sujets de recherche de « SRG3: Speech-driven Robot Gesture Generation with GAN ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation