Topic sentiment mixture: Modeling facets and opinions in weblogs

Qiaozhu Mei, Xu Ling, Matthew Wondra, Hang Su, Chengxiang Zhai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we define the problem of topic-sentiment analysis on Weblogs and propose a novel probabilistic model to capture the mixture of topics and sentiments simultaneously. The proposed Topic-Sentiment Mixture (TSM) model can reveal the latent topical facets in a Weblog collection, the subtopics in the results of an ad hoc query, and their associated sentiments. It could also provide general sentiment models that are applicable to any ad hoc topics. With a specifically designed HMM structure, the sentiment models and topic models estimated with TSM can be utilized to extract topic life cycles and sentiment dynamics. Empirical experiments on different Weblog datasets show that this approach is effective for modeling the topic facets and sentiments and extracting their dynamics from Weblog collections. The TSM model is quite general; it can be applied to any text collections with a mixture of topics and sentiments, thus has many potential applications, such as search result summarization, opinion tracking, and user behavior prediction.

Original languageEnglish (US)
Title of host publication16th International World Wide Web Conference, WWW2007
Pages171-180
Number of pages10
DOIs
StatePublished - 2007
Event16th International World Wide Web Conference, WWW2007 - Banff, AB, Canada
Duration: May 8 2007May 12 2007

Publication series

Name16th International World Wide Web Conference, WWW2007

Other

Other16th International World Wide Web Conference, WWW2007
Country/TerritoryCanada
CityBanff, AB
Period5/8/075/12/07

Keywords

  • Mixture model
  • Sentiment analysis
  • Topic models
  • Topic-sentiment mixture
  • Weblogs

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'Topic sentiment mixture: Modeling facets and opinions in weblogs'. Together they form a unique fingerprint.

Cite this