TY - GEN
T1 - Strategic and interactive learning of a hierarchical set of tasks by the Poppy humanoid robot
AU - Duminy, Nicolas
AU - Nguyen, Sao Mai
AU - Duhaut, Dominique
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2017/2/7
Y1 - 2017/2/7
N2 - We present an active learning architecture that allows a robot to actively learn which data collection strategy is most efficient for acquiring motor skills to achieve multiple outcomes, and generalise over its experience to achieve new outcomes for cumulative learning. In the present work, we consider the learning of tasks that are hierarchically organised, interrelated and more and more difficult. This paper proposes an algorithmic architecture, called Socially Guided Intrinsic Motivation with Active Choice of Task and Strategy for Cumulative Learning (SGIM-ACTSCL). It relies on hierarchical active decisions of what and how to learn, driven by empirical evaluation of learning progress for each learning strategy. Our learning agent uses both interactive learning and autonomous goal-babbling. It actively decides at the same time, which tasks to focus on, when to explore autonomously, and when and what to request for social guidance. We present experimental results on the physical humanoid robot Poppy that learns different types of motor skills, encoded by Dynamic Movement Primitives, in order to use a tablet (Fig. 1). We show that SGIM-ACTSCL learns significantly more efficiently than other algorithms. Moreover, it automatically organises its learning process focusing on easy tasks first, and difficult tasks afterwards. It coherently selects the best strategy with respect to the chosen outcome, manages to learn to associate the teacher with his competence domain in order to actively request social guidance for the appropriate tasks.
AB - We present an active learning architecture that allows a robot to actively learn which data collection strategy is most efficient for acquiring motor skills to achieve multiple outcomes, and generalise over its experience to achieve new outcomes for cumulative learning. In the present work, we consider the learning of tasks that are hierarchically organised, interrelated and more and more difficult. This paper proposes an algorithmic architecture, called Socially Guided Intrinsic Motivation with Active Choice of Task and Strategy for Cumulative Learning (SGIM-ACTSCL). It relies on hierarchical active decisions of what and how to learn, driven by empirical evaluation of learning progress for each learning strategy. Our learning agent uses both interactive learning and autonomous goal-babbling. It actively decides at the same time, which tasks to focus on, when to explore autonomously, and when and what to request for social guidance. We present experimental results on the physical humanoid robot Poppy that learns different types of motor skills, encoded by Dynamic Movement Primitives, in order to use a tablet (Fig. 1). We show that SGIM-ACTSCL learns significantly more efficiently than other algorithms. Moreover, it automatically organises its learning process focusing on easy tasks first, and difficult tasks afterwards. It coherently selects the best strategy with respect to the chosen outcome, manages to learn to associate the teacher with his competence domain in order to actively request social guidance for the appropriate tasks.
UR - https://www.scopus.com/pages/publications/85015293157
U2 - 10.1109/DEVLRN.2016.7846820
DO - 10.1109/DEVLRN.2016.7846820
M3 - Conference contribution
AN - SCOPUS:85015293157
T3 - 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2016
SP - 204
EP - 209
BT - 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2016
Y2 - 19 September 2016 through 22 September 2016
ER -