TY - GEN
T1 - Iterative Learning for Model Reactive Control
T2 - 2021 International Conference on Automation, Robotics and Applications, ICARA 2021
AU - Shrit, Omar
AU - Filliat, David
AU - Sebag, Michele
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/2/4
Y1 - 2021/2/4
N2 - In this paper, a decentralized autonomous controller aimed to control a fleet of quadrotors is designed, based on the iterative generation and exploitation of logged traces. The presented approach, inspired by model predictive control, aims to maintain the geometrical configuration for a set of quadrotors led by remotely controlled leaders. The novelty of this approach is to rely on inexpensive commercial off-the-shelf sensors (as opposed to positioning systems and/or cameras) that only measure the distance among quadrotors. In the first phase (trace generation) quadrotors are operated using randomized controllers based on domain knowledge, and their trajectories are registered. In the exploitation phase, a policy is learned from the traces generated in the previous phase, and the policy is iteratively refined, to achieve a robust reactive control of each quadrotor agent. Extensive experiments using RotorS, a Software In the Loop (SITL) framework in Gazebo simulator demonstrates the efficiency of the approach, and its ability to preserve the flocking structure of the quadrotors, following the (remotely and independently controlled) leaders.
AB - In this paper, a decentralized autonomous controller aimed to control a fleet of quadrotors is designed, based on the iterative generation and exploitation of logged traces. The presented approach, inspired by model predictive control, aims to maintain the geometrical configuration for a set of quadrotors led by remotely controlled leaders. The novelty of this approach is to rely on inexpensive commercial off-the-shelf sensors (as opposed to positioning systems and/or cameras) that only measure the distance among quadrotors. In the first phase (trace generation) quadrotors are operated using randomized controllers based on domain knowledge, and their trajectories are registered. In the exploitation phase, a policy is learned from the traces generated in the previous phase, and the policy is iteratively refined, to achieve a robust reactive control of each quadrotor agent. Extensive experiments using RotorS, a Software In the Loop (SITL) framework in Gazebo simulator demonstrates the efficiency of the approach, and its ability to preserve the flocking structure of the quadrotors, following the (remotely and independently controlled) leaders.
KW - Quadrotors
KW - iterative learning
KW - leader-follower
KW - machine learning
KW - model predictive control
KW - neural networks
U2 - 10.1109/ICARA51699.2021.9376454
DO - 10.1109/ICARA51699.2021.9376454
M3 - Conference contribution
AN - SCOPUS:85103738219
T3 - 2021 International Conference on Automation, Robotics and Applications, ICARA 2021
SP - 140
EP - 146
BT - 2021 International Conference on Automation, Robotics and Applications, ICARA 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 4 February 2021 through 6 February 2021
ER -