Abstract
We study the problem of online conditional distribution estimation with unbounded label sets under local differential privacy. The problem may be succinctly stated as follows. Let F be a distribution-valued function class with an unbounded label set. Our aim is to estimate an unknown function f ∈ F in an online fashion. More precisely, at time t, given a sample xt, we generate an estimate of f(xt) using only a privatized version of the true labels sampled from f(xt). The objective is to minimize the cumulative KL-risk of a finite horizon T. We show that under (ϵ, 0)-local differential privacy for the labels, the KL-risk equals Θ̃(1ϵ√KT), up to poly-logarithmic factors, where K = |F|. This result significantly differs from the Θ̃(√T log K) bound derived in Wu et al. (2023a) for bounded label sets. As a side-result, our approach recovers a nearly tight upper bound for the hypothesis selection problem of Gopi et al. (2020), which has only been established for the batch setting.
Original language | English (US) |
---|---|
Pages (from-to) | 460-468 |
Number of pages | 9 |
Journal | Proceedings of Machine Learning Research |
Volume | 238 |
State | Published - 2024 |
Event | 27th International Conference on Artificial Intelligence and Statistics, AISTATS 2024 - Valencia, Spain Duration: May 2 2024 → May 4 2024 |
ASJC Scopus subject areas
- Artificial Intelligence
- Software
- Control and Systems Engineering
- Statistics and Probability