TY - GEN
T1 - On the Utility Gain of Iterative Bayesian Update for Locally Differentially Private Mechanisms
AU - Arcolezi, Héber H.
AU - Cerna, Selene
AU - Palamidessi, Catuscia
N1 - Publisher Copyright:
© 2023, IFIP International Federation for Information Processing.
PY - 2023/1/1
Y1 - 2023/1/1
N2 - This paper investigates the utility gain of using Iterative Bayesian Update (IBU) for private discrete distribution estimation using data obfuscated with Locally Differentially Private (LDP) mechanisms. We compare the performance of IBU to Matrix Inversion (MI), a standard estimation technique, for seven LDP mechanisms designed for one-time data collection and for other seven LDP mechanisms designed for multiple data collections (e.g., RAPPOR). To broaden the scope of our study, we also varied the utility metric, the number of users n, the domain size k, and the privacy parameter ϵ, using both synthetic and real-world data. Our results suggest that IBU can be a useful post-processing tool for improving the utility of LDP mechanisms in different scenarios without any additional privacy cost. For instance, our experiments show that IBU can provide better utility than MI, especially in high privacy regimes (i.e., when ϵ is small). Our paper provides insights for practitioners to use IBU in conjunction with existing LDP mechanisms for more accurate and privacy-preserving data analysis. Finally, we implemented IBU for all fourteen LDP mechanisms into the state-of-the-art multi-freq-ldpy Python package (https://pypi.org/project/multi-freq-ldpy/ ) and open-sourced all our code used for the experiments as tutorials.
AB - This paper investigates the utility gain of using Iterative Bayesian Update (IBU) for private discrete distribution estimation using data obfuscated with Locally Differentially Private (LDP) mechanisms. We compare the performance of IBU to Matrix Inversion (MI), a standard estimation technique, for seven LDP mechanisms designed for one-time data collection and for other seven LDP mechanisms designed for multiple data collections (e.g., RAPPOR). To broaden the scope of our study, we also varied the utility metric, the number of users n, the domain size k, and the privacy parameter ϵ, using both synthetic and real-world data. Our results suggest that IBU can be a useful post-processing tool for improving the utility of LDP mechanisms in different scenarios without any additional privacy cost. For instance, our experiments show that IBU can provide better utility than MI, especially in high privacy regimes (i.e., when ϵ is small). Our paper provides insights for practitioners to use IBU in conjunction with existing LDP mechanisms for more accurate and privacy-preserving data analysis. Finally, we implemented IBU for all fourteen LDP mechanisms into the state-of-the-art multi-freq-ldpy Python package (https://pypi.org/project/multi-freq-ldpy/ ) and open-sourced all our code used for the experiments as tutorials.
KW - Distribution Estimation
KW - Expectation-Maximization
KW - Iterative Bayesian Update
KW - Local Differential Privacy
U2 - 10.1007/978-3-031-37586-6_11
DO - 10.1007/978-3-031-37586-6_11
M3 - Conference contribution
AN - SCOPUS:85169023563
SN - 9783031375859
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 165
EP - 183
BT - Data and Applications Security and Privacy XXXVII - 37th Annual IFIP WG 11.3 Conference, DBSec 2023, Proceedings
A2 - Atluri, Vijayalakshmi
A2 - Ferrara, Anna Lisa
PB - Springer Science and Business Media Deutschland GmbH
T2 - 37th Annual IFIP WG 11.3 Conference on Data and Applications Security and Privacy, DBSec 2023
Y2 - 19 July 2023 through 21 July 2023
ER -