TY - GEN
T1 - Tight Differential Privacy Guarantees for the Shuffle Model with k-Randomized Response
AU - Biswas, Sayan
AU - Jung, Kangsoo
AU - Palamidessi, Catuscia
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024/1/1
Y1 - 2024/1/1
N2 - Most differentially private algorithms assume a central model in which a reliable third party inserts noise to queries made on datasets, or a local model where the data owners directly perturb their data. However, the central model is vulnerable via a single point of failure, and the local model has the disadvantage that the utility of the data deteriorates significantly. The recently proposed shuffle model is an intermediate framework between the central and local paradigms. In the shuffle model, data owners send their locally privatized data to a server where messages are shuffled randomly, making it impossible to trace the link between a privatized message and the corresponding sender. In this paper, we theoretically derive the tightest known differential privacy guarantee for the shuffle models with k-Randomized Response (k-RR) local randomizers, under histogram queries, and we denoise the histogram produced by the shuffle model using the matrix inversion method to evaluate the utility of the privacy mechanism. We perform experiments on both synthetic and real data to compare the privacy-utility trade-off of the shuffle model with that of the central one privatized by adding the state-of-the-art Gaussian noise to each bin. We see that the difference in statistical utilities between the central and the shuffle models shows that they are almost comparable under the same level of differential privacy protection.
AB - Most differentially private algorithms assume a central model in which a reliable third party inserts noise to queries made on datasets, or a local model where the data owners directly perturb their data. However, the central model is vulnerable via a single point of failure, and the local model has the disadvantage that the utility of the data deteriorates significantly. The recently proposed shuffle model is an intermediate framework between the central and local paradigms. In the shuffle model, data owners send their locally privatized data to a server where messages are shuffled randomly, making it impossible to trace the link between a privatized message and the corresponding sender. In this paper, we theoretically derive the tightest known differential privacy guarantee for the shuffle models with k-Randomized Response (k-RR) local randomizers, under histogram queries, and we denoise the histogram produced by the shuffle model using the matrix inversion method to evaluate the utility of the privacy mechanism. We perform experiments on both synthetic and real data to compare the privacy-utility trade-off of the shuffle model with that of the central one privatized by adding the state-of-the-art Gaussian noise to each bin. We see that the difference in statistical utilities between the central and the shuffle models shows that they are almost comparable under the same level of differential privacy protection.
KW - Differential privacy
KW - Privacy-utility optimization
KW - Shuffle model
U2 - 10.1007/978-3-031-57537-2_27
DO - 10.1007/978-3-031-57537-2_27
M3 - Conference contribution
AN - SCOPUS:85192518898
SN - 9783031575365
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 440
EP - 458
BT - Foundations and Practice of Security - 16th International Symposium, FPS 2023, Revised Selected Papers
A2 - Mosbah, Mohamed
A2 - Ahmed, Toufik
A2 - Sèdes, Florence
A2 - Tawbi, Nadia
A2 - Boulahia-Cuppens, Nora
A2 - Garcia-Alfaro, Joaquin
PB - Springer Science and Business Media Deutschland GmbH
T2 - 16th International Symposium on Foundations and Practice of Security, FPS 2023
Y2 - 11 December 2023 through 13 December 2023
ER -