GPT Attack for Adversarial Privacy Policies

Isaac Joy, Jun Wu, Jingrui He

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

While Machine Learning (ML) is widely used in many industries, one sector that is just beginning to leverage this technology is the legal field. Efforts like the construction of the OPP-115 Privacy Policy corpus make it possible to train Natural Language Processing (NLP) and ML models for legal use, and tools such as Polisis, Claudette, etc., which utilize these models can now provide users with descriptive annotations of various types of contracts, including Privacy Policies. However, these tools remain vulnerable to adversarial attacks, specifically adversarial legal documents, which differ from typical text based adversarial attacks due to the necessity of preserving the continuity of a legal document. Therefore, focusing on the OPP-115 corpus and Polisis specifically, we propose a framework that consists of a Doc2Vec model and a variety of shallow and deep learning methods for classification, followed by adversarial attacks on Privacy Policy segments which utilize GPT 3.5, a state-of-the-art LLM, in order to create an adversarial Privacy Policy. We evaluate the effectiveness of our method by comparing attack results to those produced by current methods, including Bert Attack and Text Fooler. Our procedure reveals not just that an attack using GPT 3.5 is an effective means of producing an adversarial policy, but that it is surprisingly easy to prompt a widely used state-of-the-art LLM to perform such a task.

Original languageEnglish (US)
Title of host publicationProceedings - 2024 10th International Conference on Big Data Computing and Communications, BIGCOM 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages173-180
Number of pages8
Edition2024
ISBN (Electronic)9798331509538
DOIs
StatePublished - 2024
Event10th International Conference on Big Data Computing and Communications, BIGCOM 2024 - Dalian, China
Duration: Aug 9 2024Aug 11 2024

Conference

Conference10th International Conference on Big Data Computing and Communications, BIGCOM 2024
Country/TerritoryChina
CityDalian
Period8/9/248/11/24

Keywords

  • Adversarial Attacks
  • LLM
  • Privacy Policy

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Computer Vision and Pattern Recognition
  • Information Systems
  • Information Systems and Management
  • Statistics, Probability and Uncertainty
  • Health Informatics

Fingerprint

Dive into the research topics of 'GPT Attack for Adversarial Privacy Policies'. Together they form a unique fingerprint.

Cite this