Passer à la navigation principale Passer à la recherche Passer au contenu principal

Exploiting Structure of Uncertainty for Efficient Matroid Semi-Bandits

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

We improve the efficiency of algorithms for stochastic combinatorial semi-bandits. In most interesting problems, state-of-the-art algorithms take advantage of structural properties of rewards, such as independence. However, while being optimal in terms of asymptotic regret, these al-gorithms are inefficient. In our paper, we first reduce their implementation to a specific submod-ular maximization. Then, in case of matroid con-straints, we design adapted approximation rou-tines, thereby providing the first efficient algo-rithms that rely on reward structure to improve regret bound. In particular, we improve the state-of-the-art efficient gap-free regret bound by a fac-torm/ log m, where m is the maximum action size. Finally, we show how our improvement translates to more general budgeted combinato-rial semi-bandits.

langue originaleAnglais
Pages (de - à)5123-5132
Nombre de pages10
journalProceedings of Machine Learning Research
Volume97
étatPublié - 1 janv. 2019
Modification externeOui
Evénement36th International Conference on Machine Learning, ICML 2019 - Long Beach, États-Unis
Durée: 9 juin 201915 juin 2019

Empreinte digitale

Examiner les sujets de recherche de « Exploiting Structure of Uncertainty for Efficient Matroid Semi-Bandits ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation