Passer à la navigation principale Passer à la recherche Passer au contenu principal

Adapting to game trees in zero-sum imperfect information games

  • Côme Fiegel
  • , Pierre Ménard
  • , Tadashi Kozuno
  • , Rémi Munos
  • , Vianney Perchet
  • , Michal Valko
  • Ecole Normale Supérieure de Lyon
  • Omron Sinic X
  • DeepMind Technologies Limited
  • ENSAE & Criteo AI Lab.

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Imperfect information games (IIG) are games in which each player only partially observes the current game state. We study how to learn ε-optimal strategies in a zero-sum IIG through self-play with trajectory feedback. We give a problem-independent lower bound Õ(H(AX + BY)/ε2) on the required number of realizations to learn these strategies with high probability, where H is the length of the game, AX and BY are the total number of actions for the two players. We also propose two Follow the Regularized leader (FTRL) algorithms for this setting: Balanced FTRL which matches this lower bound, but requires the knowledge of the information set structure beforehand to define the regularization; and Adaptive FTRL which needs Õ(H2(AX + BY)/ε2) realizations without this requirement by progressively adapting the regularization to the observations.

langue originaleAnglais
Pages (de - à)10093-10135
Nombre de pages43
journalProceedings of Machine Learning Research
Volume202
étatPublié - 1 janv. 2023
Evénement40th International Conference on Machine Learning, ICML 2023 - Honolulu, États-Unis
Durée: 23 juil. 202329 juil. 2023

Empreinte digitale

Examiner les sujets de recherche de « Adapting to game trees in zero-sum imperfect information games ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation