Passer à la navigation principale Passer à la recherche Passer au contenu principal

Progressive State Space Disaggregation for Infinite Horizon Dynamic Programming

  • Telecom Sudparis
  • Sorbonne Université
  • Université Paris-Nanterre

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

High dimensionality of model-based Reinforcement Learning and Markov Decision Processes can be reduced using abstractions of the state and action spaces. Although hierarchical learning and state abstraction methods have been explored over the past decades, explicit methods to build useful abstractions of models are rarely provided. In this work, we provide a new state abstraction method for solving infinite horizon problems in the discounted and total settings. Our approach is to progressively disaggregate abstract regions by iteratively slicing aggregations of states relatively to a value function. The distinguishing feature of our method, in contrast to previous approximations of the Bellman operator, is the disaggregation of regions during value function iterations (or policy evaluation steps). The objective is to find a more efficient aggregation that reduces the error on each piece of the partition. We provide a proof of convergence for this algorithm without making any assumptions about the structure of the problem. We also show that this process decreases the computational complexity of the Bellman operator iteration and provides useful abstractions. We then plug this state space disaggregation process in classical Dynamic Programming algorithms, namely Approximate Value Iteration, Q-Value Iteration and Policy Iteration. Finally, we conduct a numerical comparison, which shows that our algorithm is faster than both traditional dynamic programming approach and recent aggregative methods that use a fixed number of partitions.

langue originaleAnglais
titreProceedings of the 34th International Conference on Automated Planning and Scheduling, ICAPS 2024
rédacteurs en chefSara Bernardini, Christian Muise
EditeurAssociation for the Advancement of Artificial Intelligence
Pages221-229
Nombre de pages9
ISBN (Electronique)9781577358893
Les DOIs
étatPublié - 30 mai 2024
Evénement34th International Conference on Automated Planning and Scheduling, ICAPS 2024 - Banaff, Canada
Durée: 1 juin 20246 juin 2024

Série de publications

NomProceedings International Conference on Automated Planning and Scheduling, ICAPS
Volume34
ISSN (imprimé)2334-0835
ISSN (Electronique)2334-0843

Une conférence

Une conférence34th International Conference on Automated Planning and Scheduling, ICAPS 2024
Pays/TerritoireCanada
La villeBanaff
période1/06/246/06/24

Empreinte digitale

Examiner les sujets de recherche de « Progressive State Space Disaggregation for Infinite Horizon Dynamic Programming ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation