A policy iteration algorithm for zero-sum stochastic games with mean payoff

Research output: Contribution to journalArticlepeer-review

Abstract

We give a policy iteration algorithm to solve zero-sum stochastic games with finite state and action spaces and perfect information, when the value is defined in terms of the mean payoff per turn. This algorithm does not require any irreducibility assumption on the Markov chains determined by the strategies of the players. It is based on a discrete nonlinear analogue of the notion of reduction of a super-harmonic function. To cite this article: J. Cochet-Terrasson, S. Gaubert, C. R. Acad. Sci. Paris, Ser. I 343 (2006).

Original languageEnglish
Pages (from-to)377-382
Number of pages6
JournalComptes Rendus Mathematique
Volume343
Issue number5
DOIs
Publication statusPublished - 1 Sept 2006

Fingerprint

Dive into the research topics of 'A policy iteration algorithm for zero-sum stochastic games with mean payoff'. Together they form a unique fingerprint.

Cite this