Abstract
We aim for a robot capable to learn sequences of motor policies to achieve a field of complex tasks. In this paper, we consider a set of interrelated complex tasks hierarchically organized. To address this high-dimensional mapping between a continuous high-dimensional space of tasks and an infinite dimensional space of sequences of policies, we introduce a framework called 'procedure', which enables the creation of sequences of policies by combining previously learned skills. We propose an active learning algorithmic architecture, capable of organizing its learning process in order to achieve a field of complex tasks by learning sequences of primitive motor policies. Based on heuristics of goal-babbling, social guidance, strategic learning guided by intrinsic motivation, and the 'procedure' framework, our algorithm can actively decide on which outcome to focus and which exploration strategy to apply. We show that a simulation industrial robot can tackle the learning of complex motor policies and adapt this complexity to that of the task at hand. Owing to its exploration strategies, it can discover the levels of difficulty of the tasks, and learn the hierarchy between tasks so as to combine simple tasks to complete a complex task.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 3755-3760 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781538666500 |
| DOIs | |
| Publication status | Published - 2 Jul 2018 |
| Event | 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 - Miyazaki, Japan Duration: 7 Oct 2018 → 10 Oct 2018 |
Publication series
| Name | Proceedings - 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 |
|---|
Conference
| Conference | 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 |
|---|---|
| Country/Territory | Japan |
| City | Miyazaki |
| Period | 7/10/18 → 10/10/18 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- active imitation learning
- continual learning
- curriculum learning
- hierarchical learning
- intrinsic motivation
- social guidance
Fingerprint
Dive into the research topics of 'Effects of Social Guidance on a Robot Learning Sequences of Policies in Hierarchical Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver