TY - GEN
T1 - Application-driven privacy-preserving data publishing with correlated attributes
AU - Rezaei, Aria
AU - Xiao, Chaowei
AU - Gao, Jie
AU - Li, Bo
AU - Munir, Sirajum
N1 - Publisher Copyright:
© 2021 the authors.
PY - 2021
Y1 - 2021
N2 - Recent advances in computing have allowed for the possibility to collect large amounts of data on personal activities and private living spaces. To address the privacy concerns of users in this environment, we propose a novel framework called PR-GAN that offers privacy-preserving mechanism using generative adversarial networks. Given a target application, PR-GAN automatically modifies the data to hide sensitive attributes – which may be hidden and can be inferred by machine learning algorithms – while preserving the data utility in the target application. Unlike prior works, the pub-lic’s possible knowledge of the correlation between the target application and sensitive attributes is built into our modeling. We formulate our problem as an optimization problem, show that an optimal solution exists and use generative adversarial networks (GAN) to create perturbations. We further show that our method provides privacy guarantees under the Pufferfish framework, an elegant generalization of the differential privacy that allows for the modeling of prior knowledge on data and correlations. Through experiments, we show that our method outperforms conventional methods in effectively hiding the sensitive attributes while guaranteeing high performance in the target application, for both property inference and training purposes. Finally, we demonstrate through further experiments that once our model learns a privacy-preserving task, such as hiding subjects’ identity, on a group of individuals, it can perform the same task on a separate group with minimal performance drops.
AB - Recent advances in computing have allowed for the possibility to collect large amounts of data on personal activities and private living spaces. To address the privacy concerns of users in this environment, we propose a novel framework called PR-GAN that offers privacy-preserving mechanism using generative adversarial networks. Given a target application, PR-GAN automatically modifies the data to hide sensitive attributes – which may be hidden and can be inferred by machine learning algorithms – while preserving the data utility in the target application. Unlike prior works, the pub-lic’s possible knowledge of the correlation between the target application and sensitive attributes is built into our modeling. We formulate our problem as an optimization problem, show that an optimal solution exists and use generative adversarial networks (GAN) to create perturbations. We further show that our method provides privacy guarantees under the Pufferfish framework, an elegant generalization of the differential privacy that allows for the modeling of prior knowledge on data and correlations. Through experiments, we show that our method outperforms conventional methods in effectively hiding the sensitive attributes while guaranteeing high performance in the target application, for both property inference and training purposes. Finally, we demonstrate through further experiments that once our model learns a privacy-preserving task, such as hiding subjects’ identity, on a group of individuals, it can perform the same task on a separate group with minimal performance drops.
UR - http://www.scopus.com/inward/record.url?scp=85120678982&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85120678982&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85120678982
SN - 9780994988652
T3 - International Conference on Embedded Wireless Systems and Networks
BT - International Conference on Embedded Wireless Systems and Networks, EWSN 2021
A2 - Huang, Polly
A2 - Zuniga, Marco
PB - Junction Publishing
T2 - International Conference on Embedded Wireless Systems and Networks, EWSN 2021
Y2 - 17 February 2021 through 19 February 2021
ER -