FineSum: Target-Oriented, Fine-Grained Opinion Summarization

Suyu Ge, Jiaxin Huang, Yu Meng, Jiawei Han

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Target-oriented opinion summarization profiles a target by extracting user opinions from multiple related documents. Instead of simply mining opinion ratings on a target (e.g., a restaurant) or on multiple aspects (e.g., food, service) of a target, it is desirable to go deeper and mine opinions on fine-grained sub-aspects (e.g., fish). However, it is expensive to obtain high-quality annotations at such a fine-grained scale. This leads to our proposal of a new framework, FineSum, which advances the frontier of opinion analysis in three respects: (1) minimal supervision, where no document-summary pairs are provided and only aspect names and a few aspect/sentiment keywords are available; (2) fine-grained opinion analysis, where sentiment analysis drills down to a specific subject or characteristic within each general aspect; and (3) phrase-based summarization, where short phrases are taken as the basic units for summarization, and semantically coherent phrases are gathered to improve the consistency and comprehensiveness of the summary. Given a large corpus with no annotation, FineSum first automatically identifies potential spans of opinion phrases, and then reduces the noise in the identification results using aspect and sentiment classifiers. It then constructs multiple fine-grained opinion clusters under each aspect and sentiment. Each cluster expresses uniform opinions towards certain sub-aspects (e.g., "fish" in the "food" aspect) or characteristics (e.g., "Mexican" in the "food" aspect). To accomplish this, we train a spherical word embedding space to explicitly represent different aspects and sentiments. We then distill the knowledge from the embedding space into a contextualized phrase classifier, and perform clustering using the contextualized opinion-aware phrase embeddings. Both automatic evaluations on the benchmark and quantitative human evaluation validate the effectiveness of our approach.
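The final clustering step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name a clustering algorithm, so spherical k-means (k-means under cosine similarity on unit-normalized vectors) is used here as a stand-in. The phrases, the three-dimensional vectors, and the function names `normalize_rows` and `spherical_kmeans` are all hypothetical and chosen only for illustration.

```python
import numpy as np


def normalize_rows(x):
    """Project each row onto the unit sphere (guarding against zero norms)."""
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.clip(norms, 1e-12, None)


def spherical_kmeans(embeddings, k, n_iter=20):
    """Cluster vectors by cosine similarity; returns (labels, centroids)."""
    x = normalize_rows(np.asarray(embeddings, dtype=float))

    # Deterministic farthest-point initialization: start from the first
    # vector, then repeatedly add the vector least similar to those chosen.
    chosen = [0]
    while len(chosen) < k:
        sims = (x @ x[chosen].T).max(axis=1)
        chosen.append(int(np.argmin(sims)))
    centroids = x[chosen].copy()

    labels = np.zeros(len(x), dtype=int)
    for _ in range(n_iter):
        # Assign each vector to the centroid with the highest cosine similarity.
        labels = np.argmax(x @ centroids.T, axis=1)
        # Recompute each centroid as the re-normalized sum of its members;
        # an empty cluster keeps its previous centroid.
        for j in range(k):
            members = x[labels == j]
            if len(members) > 0:
                centroids[j] = members.sum(axis=0)
        centroids = normalize_rows(centroids)
    return labels, centroids


# Hypothetical opinion phrases under a "food" aspect with made-up embeddings.
phrases = ["fresh fish", "grilled salmon", "tasty tacos", "spicy burrito"]
vectors = [
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.1],
    [0.1, 0.9, 0.1],
    [0.0, 0.8, 0.2],
]

labels, centroids = spherical_kmeans(vectors, k=2)
for phrase, label in zip(phrases, labels):
    print(f"cluster {label}: {phrase}")
```

In this toy setup the two seafood phrases fall into one cluster and the two Mexican-food phrases into the other, mirroring the sub-aspect and characteristic groupings the abstract mentions. A real pipeline would replace the made-up vectors with contextualized phrase embeddings and cluster separately under each aspect and sentiment.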

Original language: English (US)
Title of host publication: WSDM 2023 - Proceedings of the 16th ACM International Conference on Web Search and Data Mining
Publisher: Association for Computing Machinery
Pages: 1093-1101
Number of pages: 9
ISBN (Electronic): 9781450394079
State: Published - Feb 27 2023
Event: 16th ACM International Conference on Web Search and Data Mining, WSDM 2023 - Singapore, Singapore
Duration: Feb 27 2023 to Mar 3 2023

Publication series

Name: WSDM 2023 - Proceedings of the 16th ACM International Conference on Web Search and Data Mining

Conference

Conference: 16th ACM International Conference on Web Search and Data Mining, WSDM 2023
Country/Territory: Singapore
City: Singapore
Period: 2/27/23 to 3/3/23

Keywords

  • aspect extraction
  • opinion summarization
  • sentiment analysis

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Software
