TY - GEN
T1 - PrivacyChat
T2 - 19th International Conference on Wisdom, Well-Being, Win-Win, iConference 2024
AU - Salvi, Rohan Charudatt
AU - Blake, Catherine
AU - Bahir, Masooda
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024
Y1 - 2024
N2 - Privacy policies play a crucial role in upholding the privacy rights of users and fostering trust between organizations and their users. By clearly understanding the terms and conditions of a privacy policy, individuals can make well-informed choices about disclosing their personal information and understand how the concerned entity will manage their data. Following the introduction of the General Data Protection Regulation, these policies have become more extensive and intricate. This creates a challenge for users in terms of understanding and finding specific information in the policy. Today, through prompt-based methods, we can extract specific data from extensive text documents using large language models (LLMs), thus eliminating the need for training or fine-tuning models. In this study, we explore a prompt-based approach to extract information concerning personal data from privacy policies using a large language model, GPT-3.5. In this preliminary study, we assess the performance of GPT-3.5 on such a fine-grained extraction task through varied metrics and its capability to address previous computational challenges. The prompt structure can be adapted for other LLMs, and a similar approach can be employed for various information extraction tasks over privacy policies. The data and code are available at our GitHub repository. .
AB - Privacy policies play a crucial role in upholding the privacy rights of users and fostering trust between organizations and their users. By clearly understanding the terms and conditions of a privacy policy, individuals can make well-informed choices about disclosing their personal information and understand how the concerned entity will manage their data. Following the introduction of the General Data Protection Regulation, these policies have become more extensive and intricate. This creates a challenge for users in terms of understanding and finding specific information in the policy. Today, through prompt-based methods, we can extract specific data from extensive text documents using large language models (LLMs), thus eliminating the need for training or fine-tuning models. In this study, we explore a prompt-based approach to extract information concerning personal data from privacy policies using a large language model, GPT-3.5. In this preliminary study, we assess the performance of GPT-3.5 on such a fine-grained extraction task through varied metrics and its capability to address previous computational challenges. The prompt structure can be adapted for other LLMs, and a similar approach can be employed for various information extraction tasks over privacy policies. The data and code are available at our GitHub repository. .
KW - Information Extraction
KW - Large Language Models
KW - Privacy Policy
UR - http://www.scopus.com/inward/record.url?scp=85192144213&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85192144213&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-57850-2_17
DO - 10.1007/978-3-031-57850-2_17
M3 - Conference contribution
AN - SCOPUS:85192144213
SN - 9783031578496
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 223
EP - 231
BT - Wisdom, Well-Being, Win-Win - 19th International Conference, iConference 2024, Proceedings
A2 - Sserwanga, Isaac
A2 - Joho, Hideo
A2 - Ma, Jie
A2 - Hansen, Preben
A2 - Wu, Dan
A2 - Koizumi, Masanori
A2 - Gilliland, Anne J.
PB - Springer
Y2 - 15 April 2024 through 26 April 2024
ER -