How to solve large scale deterministic games with mean payoff by policy iteration

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Min-max functions are dynamic programming operators of zero-sum deterministic games with finite state and action spaces. The problem of computing the linear growth rate of the orbits (cycle-time) of a min-max function, which is equivalent to computing the value of a deterministic game with mean payoff, arises in the performance analysis of discrete event systems. We present here an improved version of the policy iteration algorithm given by Gaubert and Gunawardena in 1998 to compute the cycle-time of a min-max functions. The improvement consists of a fast evaluation of the spectral projector which is adapted to the case of large sparse graphs. We present detailed numerical experiments, both on randomly generated instances, and on concrete examples, indicating that the algorithm is experimentally fast.

Original languageEnglish
Title of host publicationProceedings of VALUETOOLS
Subtitle of host publication1st International Conference on Performance Evaluation Methodologies and Tools
DOIs
Publication statusPublished - 1 Dec 2006
EventVALUETOOLS: 1st International Conference on Performance Evaluation Methodologies and Tools - Pisa, Italy
Duration: 11 Oct 200613 Oct 2006

Publication series

NameACM International Conference Proceeding Series
Volume180

Conference

ConferenceVALUETOOLS: 1st International Conference on Performance Evaluation Methodologies and Tools
Country/TerritoryItaly
CityPisa
Period11/10/0613/10/06

Keywords

  • Graph algorithms
  • Maxplus algebra
  • Nonlinear harmonic functions
  • Policy iteration
  • Repeated games

Fingerprint

Dive into the research topics of 'How to solve large scale deterministic games with mean payoff by policy iteration'. Together they form a unique fingerprint.

Cite this