Torpedo: Topic periodicity discovery from text data

Jingjing Wang, Hongbo Deng, Jiawei Han

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Although history may not repeat itself, many human activities are inherently periodic, recurring daily, weekly, monthly, yearly or following some other periods. Such recurring activities may not repeat the same set of keywords, but they do share similar topics. Thus it is interesting to mine topic periodicity from text data instead of just looking at the temporal behavior of a single keyword/phrase. Some previous preliminary studies in this direction prespecify a periodic temporal template for each topic. In this paper, we remove this restriction and propose a simple yet effective framework Torpedo to mine periodic/recurrent patterns from text, such as news articles, search query logs, research papers, and web blogs. We first transform text data into topic-specific time series by a time dependent topic modeling module, where each of the time series characterizes the temporal behavior of a topic. Then we use time series techniques to detect periodicity. Hence we both obtain a clear view of how topics distribute over time and enable the automatic discovery of periods that are inherent in each topic. Theoretical and experimental analyses demonstrate the advantage of Torpedo over existing work.

Original languageEnglish (US)
Title of host publicationNext-Generation Analyst III
EditorsTimothy P. Hanratty, James Llinas, Barbara D. Broome, David L. Hall
PublisherSPIE
ISBN (Electronic)9781628416152
DOIs
StatePublished - 2015
EventNext-Generation Analyst III - Baltimore, United States
Duration: Apr 20 2015Apr 21 2015

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume9499
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Other

OtherNext-Generation Analyst III
Country/TerritoryUnited States
CityBaltimore
Period4/20/154/21/15

Keywords

  • Text Data
  • Time dependent topic modeling
  • Topic periodicity

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Torpedo: Topic periodicity discovery from text data'. Together they form a unique fingerprint.

Cite this