Skip to main navigation Skip to search Skip to main content

Progressive State Space Disaggregation for Infinite Horizon Dynamic Programming

  • Telecom Sudparis
  • Sorbonne Université
  • Université Paris-Nanterre

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

High dimensionality of model-based Reinforcement Learning and Markov Decision Processes can be reduced using abstractions of the state and action spaces. Although hierarchical learning and state abstraction methods have been explored over the past decades, explicit methods to build useful abstractions of models are rarely provided. In this work, we provide a new state abstraction method for solving infinite horizon problems in the discounted and total settings. Our approach is to progressively disaggregate abstract regions by iteratively slicing aggregations of states relatively to a value function. The distinguishing feature of our method, in contrast to previous approximations of the Bellman operator, is the disaggregation of regions during value function iterations (or policy evaluation steps). The objective is to find a more efficient aggregation that reduces the error on each piece of the partition. We provide a proof of convergence for this algorithm without making any assumptions about the structure of the problem. We also show that this process decreases the computational complexity of the Bellman operator iteration and provides useful abstractions. We then plug this state space disaggregation process in classical Dynamic Programming algorithms, namely Approximate Value Iteration, Q-Value Iteration and Policy Iteration. Finally, we conduct a numerical comparison, which shows that our algorithm is faster than both traditional dynamic programming approach and recent aggregative methods that use a fixed number of partitions.

Original languageEnglish
Title of host publicationProceedings of the 34th International Conference on Automated Planning and Scheduling, ICAPS 2024
EditorsSara Bernardini, Christian Muise
PublisherAssociation for the Advancement of Artificial Intelligence
Pages221-229
Number of pages9
ISBN (Electronic)9781577358893
DOIs
Publication statusPublished - 30 May 2024
Event34th International Conference on Automated Planning and Scheduling, ICAPS 2024 - Banaff, Canada
Duration: 1 Jun 20246 Jun 2024

Publication series

NameProceedings International Conference on Automated Planning and Scheduling, ICAPS
Volume34
ISSN (Print)2334-0835
ISSN (Electronic)2334-0843

Conference

Conference34th International Conference on Automated Planning and Scheduling, ICAPS 2024
Country/TerritoryCanada
CityBanaff
Period1/06/246/06/24

Fingerprint

Dive into the research topics of 'Progressive State Space Disaggregation for Infinite Horizon Dynamic Programming'. Together they form a unique fingerprint.

Cite this