Passer à la navigation principale Passer à la recherche Passer au contenu principal

ROI Maximization in Stochastic Online Decision-Making

  • University of Milano
  • Toulouse School of Economics
  • Tel Aviv University
  • ENSAE

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

We introduce a novel theoretical framework for Return On Investment (ROI) maximization in repeated decision-making. Our setting is motivated by the use case of companies that regularly receive proposals for technological innovations and want to quickly decide whether they are worth implementing. We design an algorithm for learning ROI-maximizing decision-making policies over a sequence of innovation proposals. Our algorithm provably converges to an optimal policy in class Π at a rate of order min {1/(N ∆2), N−1/3}, where N is the number of innovations and ∆ is the suboptimality gap in Π. A significant hurdle of our formulation, which sets it aside from other online learning problems such as bandits, is that running a policy does not provide an unbiased estimate of its performance.

langue originaleAnglais
titreAdvances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
rédacteurs en chefMarc'Aurelio Ranzato, Alina Beygelzimer, Yann Dauphin, Percy S. Liang, Jenn Wortman Vaughan
EditeurNeural information processing systems foundation
Pages9152-9166
Nombre de pages15
ISBN (Electronique)9781713845393
étatPublié - 1 janv. 2021
Modification externeOui
Evénement35th Conference on Neural Information Processing Systems, NeurIPS 2021 - Virtual, Online
Durée: 6 déc. 202114 déc. 2021

Série de publications

NomAdvances in Neural Information Processing Systems
Volume11
ISSN (imprimé)1049-5258

Une conférence

Une conférence35th Conference on Neural Information Processing Systems, NeurIPS 2021
La villeVirtual, Online
période6/12/2114/12/21

Empreinte digitale

Examiner les sujets de recherche de « ROI Maximization in Stochastic Online Decision-Making ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation