TY - JOUR
T1 - Imitation in relative terms using ReGAIL
T2 - Making motion controllers agile and transferable
AU - Boursin, Paul
AU - Kedadry, Yannis
AU - Chevalier, Tony
AU - Zordan, Victor
AU - Kry, Paul
AU - Grégoire, Sophie
AU - Cani, Marie Paule
N1 - Publisher Copyright:
© 2025 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license. http://creativecommons.org/licenses/by/4.0/
PY - 2025/12/1
Y1 - 2025/12/1
N2 - We present an approach for training “agile” character control policies, able to produce a wide variety of motor skills from a single reference motion cycle. Our technique builds off of generative adversarial imitation learning (GAIL), with a key novelty of our approach being to provide modification to the observation map in order to improve agility and robustness. Namely, to support more agile behavior, we adjust the value measurements of the training discriminator through relative features - hence the name ReGAIL. Our state observations include both task relevant relative velocities and poses, as well as relative goal deviation information. In addition, to increase robustness of the resulting gaits, servo gains and damping values are included as part of the policy action to let the controller learn how to best combine tension and relaxation during motion. From a policy informed by a single reference motion, our resulting agent is able to maneuver as needed, at runtime, from walking forward to walking backward or sideways, turning and stepping nimbly. Moreover, thanks to the use of observations in relative frames, the trained controllers are robust to morphological changes of the simulated character, which makes adaptation to new morphologies straightforward. We demonstrate our approach for a humanoid and a quadruped, on both flat and sloped terrains, as well as provide ablation studies to validate the design choices of our framework. In addition, we present an application to prehistoric research, where being able to simulate hominids of specific morphologies on rough terrain is valuable with encouraging results.
AB - We present an approach for training “agile” character control policies, able to produce a wide variety of motor skills from a single reference motion cycle. Our technique builds off of generative adversarial imitation learning (GAIL), with a key novelty of our approach being to provide modification to the observation map in order to improve agility and robustness. Namely, to support more agile behavior, we adjust the value measurements of the training discriminator through relative features - hence the name ReGAIL. Our state observations include both task relevant relative velocities and poses, as well as relative goal deviation information. In addition, to increase robustness of the resulting gaits, servo gains and damping values are included as part of the policy action to let the controller learn how to best combine tension and relaxation during motion. From a policy informed by a single reference motion, our resulting agent is able to maneuver as needed, at runtime, from walking forward to walking backward or sideways, turning and stepping nimbly. Moreover, thanks to the use of observations in relative frames, the trained controllers are robust to morphological changes of the simulated character, which makes adaptation to new morphologies straightforward. We demonstrate our approach for a humanoid and a quadruped, on both flat and sloped terrains, as well as provide ablation studies to validate the design choices of our framework. In addition, we present an application to prehistoric research, where being able to simulate hominids of specific morphologies on rough terrain is valuable with encouraging results.
KW - Character animation
KW - Generative adversarial imitation learning
KW - Motion controllers
KW - Physically-based simulation
KW - Reinforcement learning
UR - https://www.scopus.com/pages/publications/105019943596
U2 - 10.1016/j.cag.2025.104457
DO - 10.1016/j.cag.2025.104457
M3 - Article
AN - SCOPUS:105019943596
SN - 0097-8493
VL - 133
JO - Computers and Graphics (Pergamon)
JF - Computers and Graphics (Pergamon)
M1 - 104457
ER -