Abstract
We give a policy iteration algorithm to solve zero-sum stochastic games with finite state and action spaces and perfect information, when the value is defined in terms of the mean payoff per turn. This algorithm does not require any irreducibility assumption on the Markov chains determined by the strategies of the players. It is based on a discrete nonlinear analogue of the notion of reduction of a super-harmonic function. To cite this article: J. Cochet-Terrasson, S. Gaubert, C. R. Acad. Sci. Paris, Ser. I 343 (2006).
| Original language | English |
|---|---|
| Pages (from-to) | 377-382 |
| Number of pages | 6 |
| Journal | Comptes Rendus Mathematique |
| Volume | 343 |
| Issue number | 5 |
| DOIs | |
| Publication status | Published - 1 Sept 2006 |
Fingerprint
Dive into the research topics of 'A policy iteration algorithm for zero-sum stochastic games with mean payoff'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver