PET: A statistical model for popular events tracking in social communities

Cindy Xide Lin, Bo Zhao, Qiaozhu Mei, Jiawei Han

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

User generated information in online communities has been characterized with the mixture of a text stream and a network structure both changing over time. A good example is a web-blogging community with the daily blog posts and a social network of bloggers. An important task of analyzing an online community is to observe and track the popular events, or topics that evolve over time in the community. Existing approaches usually focus on either the burstiness of topics or the evolution of networks, but ignoring the interplay between textual topics and network structures. In this paper, we formally define the problem of popular event tracking (PET) in online communities, focusing on the interplay between texts and networks. We propose a novel statistical method that models the popularity of events over time, taking into consideration the burstiness of user interest, information diffusion on the network structure, and the evolution of textual topics. Specifically, a Gibbs Random Field is defined to model the influence of historical status and the dependency relationships in the graph; thereafter a topic model generates the words in text content of the event, regularized by the Gibbs Random Field. We prove that two classic models in information diffusion and text burstiness are special cases of our model under certain situations. Empirical experiments with two different communities and datasets (i.e., Twitter and DBLP) show that our approach is effective and outperforms existing approaches.

Original languageEnglish (US)
Title of host publicationKDD'10 - Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data
Pages929-938
Number of pages10
DOIs
StatePublished - Sep 7 2010
Externally publishedYes
Event16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD-2010 - Washington, DC, United States
Duration: Jul 25 2010Jul 28 2010

Publication series

NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Other

Other16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD-2010
CountryUnited States
CityWashington, DC
Period7/25/107/28/10

Keywords

  • PET
  • Popular events tracking
  • Social communities
  • Topic modeling

ASJC Scopus subject areas

  • Software
  • Information Systems

Fingerprint Dive into the research topics of 'PET: A statistical model for popular events tracking in social communities'. Together they form a unique fingerprint.

Cite this