TY - JOUR
T1 - Open-ended learning
T2 - A conceptual framework based on representational redescription
AU - Doncieux, Stephane
AU - Filliat, David
AU - Díaz-Rodríguez, Natalia
AU - Hospedales, Timothy
AU - Duro, Richard
AU - Coninx, Alexandre
AU - Roijers, DIederik M.
AU - Girard, Benoît
AU - Perrin, Nicolas
AU - Sigaud, Olivier
N1 - Publisher Copyright:
© 2007 - 2018 Frontiers Media S.A. All Rights Reserved.
PY - 2018/9/25
Y1 - 2018/9/25
N2 - Reinforcement learning (RL) aims at building a policy that maximizes a task-related reward within a given domain. When the domain is known, i.e., when its states, actions and reward are defined, Markov Decision Processes (MDPs) provide a convenient theoretical framework to formalize RL. But in an open-ended learning process, an agent or robot must solve an unbounded sequence of tasks that are not known in advance and the corresponding MDPs cannot be built at design time. This defines the main challenges of open-ended learning: how can the agent learn how to behave appropriately when the adequate states, actions and rewards representations are not given? In this paper, we propose a conceptual framework to address this question. We assume an agent endowed with low-level perception and action capabilities. This agent receives an external reward when it faces a task. It must discover the state and action representations that will let it cast the tasks as MDPs in order to solve them by RL. The relevance of the action or state representation is critical for the agent to learn efficiently. Considering that the agent starts with a low level, task-agnostic state and action spaces based on its low-level perception and action capabilities, we describe open-ended learning as the challenge of building the adequate representation of states and actions, i.e., of redescribing available representations. We suggest an iterative approach to this problem based on several successive Representational Redescription processes, and highlight the corresponding challenges in which intrinsic motivations play a key role.
AB - Reinforcement learning (RL) aims at building a policy that maximizes a task-related reward within a given domain. When the domain is known, i.e., when its states, actions and reward are defined, Markov Decision Processes (MDPs) provide a convenient theoretical framework to formalize RL. But in an open-ended learning process, an agent or robot must solve an unbounded sequence of tasks that are not known in advance and the corresponding MDPs cannot be built at design time. This defines the main challenges of open-ended learning: how can the agent learn how to behave appropriately when the adequate states, actions and rewards representations are not given? In this paper, we propose a conceptual framework to address this question. We assume an agent endowed with low-level perception and action capabilities. This agent receives an external reward when it faces a task. It must discover the state and action representations that will let it cast the tasks as MDPs in order to solve them by RL. The relevance of the action or state representation is critical for the agent to learn efficiently. Considering that the agent starts with a low level, task-agnostic state and action spaces based on its low-level perception and action capabilities, we describe open-ended learning as the challenge of building the adequate representation of states and actions, i.e., of redescribing available representations. We suggest an iterative approach to this problem based on several successive Representational Redescription processes, and highlight the corresponding challenges in which intrinsic motivations play a key role.
KW - Actions and goals
KW - Developmental robotics
KW - Reinforcement learning
KW - Representational redescription
KW - Skills
KW - State representation learning
U2 - 10.3389/fnbot.2018.00059
DO - 10.3389/fnbot.2018.00059
M3 - Review article
AN - SCOPUS:85055086379
SN - 1662-5218
VL - 12
JO - Frontiers in Neurorobotics
JF - Frontiers in Neurorobotics
IS - SEP
M1 - 59
ER -