Passer à la navigation principale Passer à la recherche Passer au contenu principal

Solving multichain stochastic games with mean payoff by policy iteration

  • CGA

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Zero-sum stochastic games with finite state and action spaces, perfect information, and mean payoff criteria arise in particular from the monotone discretization of mean-payoff pursuit-evasion deterministic differential games. In that case no irreducibility assumption on the Markov chains associated to strategies are satisfied (multichain games). The value of such a game can be characterized by a system of nonlinear equations, involving the mean payoff vector and an auxiliary vector (relative value or bias). Cochet-Terrasson and Gaubert proposed in (C. R. Math. Acad. Sci. Paris, 2006) a policy iteration algorithm relying on a notion of nonlinear spectral projection (Akian and Gaubert, Nonlinear Analysis TMA, 2003), which allows one to avoid cycling in degenerate iterations. We give here a complete presentation of the algorithm, with details of implementation in particular of the nonlinear projection. This has led to the software PIGAMES and allowed us to present numerical results on pursuit-evasion games.

langue originaleAnglais
titre2013 IEEE 52nd Annual Conference on Decision and Control, CDC 2013
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages1834-1841
Nombre de pages8
ISBN (imprimé)9781467357173
Les DOIs
étatPublié - 1 janv. 2013
Evénement52nd IEEE Conference on Decision and Control, CDC 2013 - Florence, Italie
Durée: 10 déc. 201313 déc. 2013

Série de publications

NomProceedings of the IEEE Conference on Decision and Control
ISSN (imprimé)0743-1546
ISSN (Electronique)2576-2370

Une conférence

Une conférence52nd IEEE Conference on Decision and Control, CDC 2013
Pays/TerritoireItalie
La villeFlorence
période10/12/1313/12/13

Empreinte digitale

Examiner les sujets de recherche de « Solving multichain stochastic games with mean payoff by policy iteration ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation