“Slow Service” → “Great Food”: Enhancing Content Preservation in Unsupervised Text Style Transfer

Wanzheng Zhu, Suma Bhat

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Text style transfer aims to change the style (e.g., sentiment, politeness) of a sentence while preserving its content. A common solution is the prototype editing approach, where stylistic tokens are deleted in the “mask” stage and then the masked sentences are infilled with the target style tokens in the “infill” stage. Despite their success, these approaches still suffer from the content preservation problem. By closely inspecting the results of existing approaches, we identify two common types of errors: 1) many content-related tokens are masked and 2) irrelevant words associated with the target style are infilled. Our paper aims to enhance content preservation by tackling each of them. In the “mask” stage, we utilize a BERT-based keyword extraction model that incorporates syntactic information to prevent content-related tokens from being masked. In the “infill” stage, we create a pseudo-parallel dataset and train a T5 model to infill the masked sentences without introducing irrelevant content. Empirical results show that our method outperforms the state-of-the-art baselines in terms of content preservation, while maintaining comparable transfer effectiveness and language quality.

Original languageEnglish (US)
Title of host publication15th International Natural Language Generation Conference, INLG 2022
EditorsSamira Shaikh, Thiago Castro Ferreira, Amanda Stent
PublisherAssociation for Computational Linguistics (ACL)
Pages29-39
Number of pages11
ISBN (Electronic)9781955917575
StatePublished - 2022
Event15th International Natural Language Generation Conference, INLG 2022 - Hybrid, Waterville, United States
Duration: Jul 18 2022Jul 22 2022

Publication series

Name15th International Natural Language Generation Conference, INLG 2022

Conference

Conference15th International Natural Language Generation Conference, INLG 2022
Country/TerritoryUnited States
CityHybrid, Waterville
Period7/18/227/22/22

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Information Systems
  • Software

Fingerprint

Dive into the research topics of '“Slow Service” → “Great Food”: Enhancing Content Preservation in Unsupervised Text Style Transfer'. Together they form a unique fingerprint.

Cite this