TY - GEN
T1 - Infinite-Dimensional Sums-of-Squares for Optimal Control
AU - Berthier, Eloise
AU - Carpentier, Justin
AU - Rudi, Alessandro
AU - Bach, Francis
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - In this paper, we introduce an approximation method to solve an optimal control problem via the Lagrange dual of its weak formulation, which applies to problems with an unknown, non-necessarily polynomial, dynamics accessed through samples, akin to model-free reinforcement learning. It is based on a sum-of-squares representation of the Hamiltonian, and extends a previous method from polynomial optimization to the generic case of smooth problems. Such a representation is infinite-dimensional and relies on a particular space of functions - a reproducing kernel Hilbert space - chosen to fit the structure of the control problem. After subsampling, it leads to a practical method that amounts to solving a semi-definite program. We illustrate our approach numerically on a low-dimensional control problem.
AB - In this paper, we introduce an approximation method to solve an optimal control problem via the Lagrange dual of its weak formulation, which applies to problems with an unknown, non-necessarily polynomial, dynamics accessed through samples, akin to model-free reinforcement learning. It is based on a sum-of-squares representation of the Hamiltonian, and extends a previous method from polynomial optimization to the generic case of smooth problems. Such a representation is infinite-dimensional and relies on a particular space of functions - a reproducing kernel Hilbert space - chosen to fit the structure of the control problem. After subsampling, it leads to a practical method that amounts to solving a semi-definite program. We illustrate our approach numerically on a low-dimensional control problem.
UR - https://www.scopus.com/pages/publications/85147036576
U2 - 10.1109/CDC51059.2022.9992396
DO - 10.1109/CDC51059.2022.9992396
M3 - Conference contribution
AN - SCOPUS:85147036576
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 577
EP - 582
BT - 2022 IEEE 61st Conference on Decision and Control, CDC 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 61st IEEE Conference on Decision and Control, CDC 2022
Y2 - 6 December 2022 through 9 December 2022
ER -