Passer à la navigation principale Passer à la recherche Passer au contenu principal

Approachability in unknown games: Online learning meets multi-objective optimization

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

In the standard setting of approachability there are two players and a target set. The players play a repeated vector-valued game where one of them wants to have the average vector-valued payoff converge to the target set which the other player tries to exclude. We revisit the classical setting and consider the setting where the player has a preference relation between target sets: she wishes to approach the smallest ("best") set possible given the observed average payoffs in hindsight. Moreover, as opposed to previous works on approachability, and in the spirit of online learning, we do not assume that there is a known game structure with actions for two players. Rather, the player receives an arbitrary vector-valued reward vector at every round. We show that it is impossible, in general, to approach the best target set in hindsight. We further propose a concrete strategy that approaches a non-trivial relaxation of the best-in-hindsight given the actual rewards. Our approach does not require projection onto a target set and amounts to switching between scalar regret minimization algorithms that are performed in episodes.

langue originaleAnglais
Pages (de - à)339-355
Nombre de pages17
journalJournal of Machine Learning Research
Volume35
étatPublié - 1 janv. 2014
Modification externeOui
Evénement27th Conference on Learning Theory, COLT 2014 - Barcelona, Espagne
Durée: 13 juin 201415 juin 2014

Empreinte digitale

Examiner les sujets de recherche de « Approachability in unknown games: Online learning meets multi-objective optimization ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation