TY - GEN
T1 - Locally differentially private estimation of nonlinear functionals of discrete distributions
AU - Butucea, Cristina
AU - Issartel, Yann
N1 - Publisher Copyright:
© 2021 Neural information processing systems foundation. All rights reserved.
PY - 2021/1/1
Y1 - 2021/1/1
N2 - We study the problem of estimating non-linear functionals of discrete distributions in the context of local differential privacy. The initial data x1, . . ., xn ∈ [K] are supposed i.i.d. and distributed according to an unknown discrete distribution p = (p1, . . ., pK). Only α-locally differentially private (LDP) samples z1, ..., zn are publicly available, where the term’local’ means that each zi is produced using one individual attribute xi. We exhibit privacy mechanisms (PM) that are sequentially interactive (i.e. they are allowed to use already published confidential data) or non-interactive. We describe the behavior of the quadratic risk for estimating the power sum functional Fγ =PKk=1 pγk, γ > 0 as a function of K, n and α. In the non-interactive case, we study two plug-in type estimators of Fγ, for all γ > 0, that are similar to the MLE analyzed by Jiao et al. [18] in the multinomial model. However, due to the privacy constraint the rates we attain are slower and similar to those obtained in the Gaussian model by Collier et al. [9]. In the sequentially interactive case, we introduce for all γ > 1 a two-step procedure which attains the parametric rate (nα2)−1/2 when γ ≥ 2. We give lower bounds results over all α-LDP mechanisms and all estimators using the private samples.
AB - We study the problem of estimating non-linear functionals of discrete distributions in the context of local differential privacy. The initial data x1, . . ., xn ∈ [K] are supposed i.i.d. and distributed according to an unknown discrete distribution p = (p1, . . ., pK). Only α-locally differentially private (LDP) samples z1, ..., zn are publicly available, where the term’local’ means that each zi is produced using one individual attribute xi. We exhibit privacy mechanisms (PM) that are sequentially interactive (i.e. they are allowed to use already published confidential data) or non-interactive. We describe the behavior of the quadratic risk for estimating the power sum functional Fγ =PKk=1 pγk, γ > 0 as a function of K, n and α. In the non-interactive case, we study two plug-in type estimators of Fγ, for all γ > 0, that are similar to the MLE analyzed by Jiao et al. [18] in the multinomial model. However, due to the privacy constraint the rates we attain are slower and similar to those obtained in the Gaussian model by Collier et al. [9]. In the sequentially interactive case, we introduce for all γ > 1 a two-step procedure which attains the parametric rate (nα2)−1/2 when γ ≥ 2. We give lower bounds results over all α-LDP mechanisms and all estimators using the private samples.
M3 - Conference contribution
AN - SCOPUS:85131358048
T3 - Advances in Neural Information Processing Systems
SP - 24753
EP - 24764
BT - Advances in Neural Information Processing Systems 34 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
A2 - Ranzato, Marc'Aurelio
A2 - Beygelzimer, Alina
A2 - Dauphin, Yann
A2 - Liang, Percy S.
A2 - Wortman Vaughan, Jenn
PB - Neural information processing systems foundation
T2 - 35th Conference on Neural Information Processing Systems, NeurIPS 2021
Y2 - 6 December 2021 through 14 December 2021
ER -