Skip to main navigation Skip to search Skip to main content

Adapting to game trees in zero-sum imperfect information games

  • Côme Fiegel
  • , Pierre Ménard
  • , Tadashi Kozuno
  • , Rémi Munos
  • , Vianney Perchet
  • , Michal Valko
  • Ecole Normale Supérieure de Lyon
  • Omron Sinic X
  • DeepMind Technologies Limited
  • ENSAE & Criteo AI Lab.

Research output: Contribution to journalConference articlepeer-review

Abstract

Imperfect information games (IIG) are games in which each player only partially observes the current game state. We study how to learn ε-optimal strategies in a zero-sum IIG through self-play with trajectory feedback. We give a problem-independent lower bound Õ(H(AX + BY)/ε2) on the required number of realizations to learn these strategies with high probability, where H is the length of the game, AX and BY are the total number of actions for the two players. We also propose two Follow the Regularized leader (FTRL) algorithms for this setting: Balanced FTRL which matches this lower bound, but requires the knowledge of the information set structure beforehand to define the regularization; and Adaptive FTRL which needs Õ(H2(AX + BY)/ε2) realizations without this requirement by progressively adapting the regularization to the observations.

Original languageEnglish
Pages (from-to)10093-10135
Number of pages43
JournalProceedings of Machine Learning Research
Volume202
Publication statusPublished - 1 Jan 2023
Event40th International Conference on Machine Learning, ICML 2023 - Honolulu, United States
Duration: 23 Jul 202329 Jul 2023

Fingerprint

Dive into the research topics of 'Adapting to game trees in zero-sum imperfect information games'. Together they form a unique fingerprint.

Cite this