Passer à la navigation principale Passer à la recherche Passer au contenu principal

Local Differential Privacy for Regret Minimization in Reinforcement Learning

  • ENSAE
  • Imperial College London
  • Meta

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Reinforcement learning algorithms are widely used in domains where it is desirable to provide a personalized service. In these domains it is common that user data contains sensitive information that needs to be protected from third parties. Motivated by this, we study privacy in the context of finite-horizon Markov Decision Processes (MDPs) by requiring information to be obfuscated on the user side. We formulate this notion of privacy for RL by leveraging the local differential privacy (LDP) framework. We establish a lower bound for regret minimization in finite-horizon MDPs with LDP guarantees which shows that guaranteeing privacy has a multiplicative effect on the regret. This result shows that while LDP is an appealing notion of privacy, it makes the learning problem significantly more complex. Finally, we present an optimistic algorithm that simultaneously satisfies ε-LDP requirements, and achieves K/ε regret in any finite-horizon MDP after K episodes, matching the lower bound dependency on the number of episodes K.

langue originaleAnglais
titreAdvances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
rédacteurs en chefMarc'Aurelio Ranzato, Alina Beygelzimer, Yann Dauphin, Percy S. Liang, Jenn Wortman Vaughan
EditeurNeural information processing systems foundation
Pages10561-10573
Nombre de pages13
ISBN (Electronique)9781713845393
étatPublié - 1 janv. 2021
Modification externeOui
Evénement35th Conference on Neural Information Processing Systems, NeurIPS 2021 - Virtual, Online
Durée: 6 déc. 202114 déc. 2021

Série de publications

NomAdvances in Neural Information Processing Systems
Volume13
ISSN (imprimé)1049-5258

Une conférence

Une conférence35th Conference on Neural Information Processing Systems, NeurIPS 2021
La villeVirtual, Online
période6/12/2114/12/21

Empreinte digitale

Examiner les sujets de recherche de « Local Differential Privacy for Regret Minimization in Reinforcement Learning ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation