TY - JOUR
T1 - Assessing the Impact of Differential Privacy on Population Uniques in Geographically Aggregated Data
T2 - The Case of the 2020 U.S. Census
AU - Lin, Yue
AU - Xiao, Ningchuan
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer Nature B.V.
PY - 2023/10
Y1 - 2023/10
N2 - Geographically aggregated demographic, social, and economic data are valuable for research and practical applications, but their use and sharing often compromise individual privacy. The U.S. Census Bureau has responded to this issue by introducing a new privacy protection method, the TopDown Algorithm (TDA), in the 2020 Census. The TDA is based on a privacy definition known as differential privacy and is primarily designed to reduce the risk of reconstruction-abetted disclosure, a type of privacy violation where individual identities can be revealed by reconstructing confidential census responses and linking them to publicly available data. However, there still lacks a systematic exploration of the impact of the TDA on direct disclosure, another common type of privacy violation where individuals can be directly distinguished from public census tables to reveal their identities. To address this gap, this paper examines the effectiveness of the TDA in protecting against direct disclosure by focusing on how information from public census tables can be used to distinguish population uniques, the individuals that can be uniquely distinguished from census tables. Our study reveals that while the TDA provides a reasonable level of differential privacy, it does not necessarily prevent the direct identification of population uniques using public census tables. Our finding is crucial for policymakers to consider when making informed decisions regarding parameter selection for the TDA during its implementation.
AB - Geographically aggregated demographic, social, and economic data are valuable for research and practical applications, but their use and sharing often compromise individual privacy. The U.S. Census Bureau has responded to this issue by introducing a new privacy protection method, the TopDown Algorithm (TDA), in the 2020 Census. The TDA is based on a privacy definition known as differential privacy and is primarily designed to reduce the risk of reconstruction-abetted disclosure, a type of privacy violation where individual identities can be revealed by reconstructing confidential census responses and linking them to publicly available data. However, there still lacks a systematic exploration of the impact of the TDA on direct disclosure, another common type of privacy violation where individuals can be directly distinguished from public census tables to reveal their identities. To address this gap, this paper examines the effectiveness of the TDA in protecting against direct disclosure by focusing on how information from public census tables can be used to distinguish population uniques, the individuals that can be uniquely distinguished from census tables. Our study reveals that while the TDA provides a reasonable level of differential privacy, it does not necessarily prevent the direct identification of population uniques using public census tables. Our finding is crucial for policymakers to consider when making informed decisions regarding parameter selection for the TDA during its implementation.
KW - Census
KW - Differential privacy
KW - Disclosure avoidance
KW - TopDown algorithm
UR - http://www.scopus.com/inward/record.url?scp=85173757370&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85173757370&partnerID=8YFLogxK
U2 - 10.1007/s11113-023-09829-4
DO - 10.1007/s11113-023-09829-4
M3 - Article
AN - SCOPUS:85173757370
SN - 0167-5923
VL - 42
JO - Population Research and Policy Review
JF - Population Research and Policy Review
IS - 5
M1 - 81
ER -