Résumé
We give a policy iteration algorithm to solve zero-sum stochastic games with finite state and action spaces and perfect information, when the value is defined in terms of the mean payoff per turn. This algorithm does not require any irreducibility assumption on the Markov chains determined by the strategies of the players. It is based on a discrete nonlinear analogue of the notion of reduction of a super-harmonic function. To cite this article: J. Cochet-Terrasson, S. Gaubert, C. R. Acad. Sci. Paris, Ser. I 343 (2006).
| langue originale | Anglais |
|---|---|
| Pages (de - à) | 377-382 |
| Nombre de pages | 6 |
| journal | Comptes Rendus Mathematique |
| Volume | 343 |
| Numéro de publication | 5 |
| Les DOIs | |
| état | Publié - 1 sept. 2006 |
Empreinte digitale
Examiner les sujets de recherche de « A policy iteration algorithm for zero-sum stochastic games with mean payoff ». Ensemble, ils forment une empreinte digitale unique.Contient cette citation
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver