Abstract
Estimating the quantiles of a large dataset is a fundamental problem in both the streaming algorithms literature and the differential privacy literature. However, all existing private mechanisms for distribution-independent quantile computation require space at least linear in the input size n. In this work, we devise a differentially private algorithm for the quantile estimation problem, with strongly sublinear space complexity, in the one-shot and continual observation settings. Our basic mechanism estimates any α-approximate quantile (log(|X |/β) log n αɛ) of a length-n stream over a data universe X with probability 1 − β using O space while satisfying ɛ-differential privacy at a single time point. Our approach builds upon deterministic streaming algorithms for non-private quantile estimation instantiating the exponential mechanism using a utility function defined on sketch items, while (privately) sampling from intervals defined by the sketch. We also present another algorithm based on histograms that is especially well-suited to the multiple quantiles case. We implement our algorithms and experimentally evaluate them on synthetic and real-world datasets.
| Original language | English (US) |
|---|---|
| Journal | Transactions on Machine Learning Research |
| Volume | 2023 |
| State | Published - Jun 1 2023 |
| Externally published | Yes |
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Vision and Pattern Recognition
Fingerprint
Dive into the research topics of 'Bounded Space Differentially Private Quantiles'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS