Résumé
We aim for a robot capable to learn sequences of motor policies to achieve a field of complex tasks. In this paper, we consider a set of interrelated complex tasks hierarchically organized. To address this high-dimensional mapping between a continuous high-dimensional space of tasks and an infinite dimensional space of sequences of policies, we introduce a framework called 'procedure', which enables the creation of sequences of policies by combining previously learned skills. We propose an active learning algorithmic architecture, capable of organizing its learning process in order to achieve a field of complex tasks by learning sequences of primitive motor policies. Based on heuristics of goal-babbling, social guidance, strategic learning guided by intrinsic motivation, and the 'procedure' framework, our algorithm can actively decide on which outcome to focus and which exploration strategy to apply. We show that a simulation industrial robot can tackle the learning of complex motor policies and adapt this complexity to that of the task at hand. Owing to its exploration strategies, it can discover the levels of difficulty of the tasks, and learn the hierarchy between tasks so as to combine simple tasks to complete a complex task.
| langue originale | Anglais |
|---|---|
| titre | Proceedings - 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 |
| Editeur | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 3755-3760 |
| Nombre de pages | 6 |
| ISBN (Electronique) | 9781538666500 |
| Les DOIs | |
| état | Publié - 2 juil. 2018 |
| Evénement | 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 - Miyazaki, Japon Durée: 7 oct. 2018 → 10 oct. 2018 |
Série de publications
| Nom | Proceedings - 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 |
|---|
Une conférence
| Une conférence | 2018 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2018 |
|---|---|
| Pays/Territoire | Japon |
| La ville | Miyazaki |
| période | 7/10/18 → 10/10/18 |
SDG des Nations Unies
Ce résultat contribue à ou aux Objectifs de développement durable suivants
-
SDG 3 Bonne santé et bien-être
Empreinte digitale
Examiner les sujets de recherche de « Effects of Social Guidance on a Robot Learning Sequences of Policies in Hierarchical Learning ». Ensemble, ils forment une empreinte digitale unique.Contient cette citation
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver