TY - GEN
T1 - PET
T2 - 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD-2010
AU - Lin, Cindy Xide
AU - Zhao, Bo
AU - Mei, Qiaozhu
AU - Han, Jiawei
PY - 2010
Y1 - 2010
N2 - User generated information in online communities has been characterized with the mixture of a text stream and a network structure both changing over time. A good example is a web-blogging community with the daily blog posts and a social network of bloggers. An important task of analyzing an online community is to observe and track the popular events, or topics that evolve over time in the community. Existing approaches usually focus on either the burstiness of topics or the evolution of networks, but ignoring the interplay between textual topics and network structures. In this paper, we formally define the problem of popular event tracking (PET) in online communities, focusing on the interplay between texts and networks. We propose a novel statistical method that models the popularity of events over time, taking into consideration the burstiness of user interest, information diffusion on the network structure, and the evolution of textual topics. Specifically, a Gibbs Random Field is defined to model the influence of historical status and the dependency relationships in the graph; thereafter a topic model generates the words in text content of the event, regularized by the Gibbs Random Field. We prove that two classic models in information diffusion and text burstiness are special cases of our model under certain situations. Empirical experiments with two different communities and datasets (i.e., Twitter and DBLP) show that our approach is effective and outperforms existing approaches.
AB - User generated information in online communities has been characterized with the mixture of a text stream and a network structure both changing over time. A good example is a web-blogging community with the daily blog posts and a social network of bloggers. An important task of analyzing an online community is to observe and track the popular events, or topics that evolve over time in the community. Existing approaches usually focus on either the burstiness of topics or the evolution of networks, but ignoring the interplay between textual topics and network structures. In this paper, we formally define the problem of popular event tracking (PET) in online communities, focusing on the interplay between texts and networks. We propose a novel statistical method that models the popularity of events over time, taking into consideration the burstiness of user interest, information diffusion on the network structure, and the evolution of textual topics. Specifically, a Gibbs Random Field is defined to model the influence of historical status and the dependency relationships in the graph; thereafter a topic model generates the words in text content of the event, regularized by the Gibbs Random Field. We prove that two classic models in information diffusion and text burstiness are special cases of our model under certain situations. Empirical experiments with two different communities and datasets (i.e., Twitter and DBLP) show that our approach is effective and outperforms existing approaches.
KW - PET
KW - Popular events tracking
KW - Social communities
KW - Topic modeling
UR - http://www.scopus.com/inward/record.url?scp=77956200065&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77956200065&partnerID=8YFLogxK
U2 - 10.1145/1835804.1835922
DO - 10.1145/1835804.1835922
M3 - Conference contribution
AN - SCOPUS:77956200065
SN - 9781450300551
T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
SP - 929
EP - 938
BT - KDD'10 - Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data
Y2 - 25 July 2010 through 28 July 2010
ER -