Probabilistic models for text mining

Yizhou Sun, Hongbo Deng, Jiawei Han

Research output: Chapter in Book/Report/Conference proceedingChapter


A number of probabilistic methods such as LDA, hidden Markov models, Markov random fields have arisen in recent years for probabilistic analysis of text data. This chapter provides an overview of a variety of probabilistic models for text mining. The chapter focuses more on the fundamental probabilistic techniques, and also covers their various applications to different text mining problems. Some examples of such applications include topic modeling, language modeling, document classification, document clustering, and information extraction.

Original languageEnglish (US)
Title of host publicationMining Text Data
Number of pages37
ISBN (Electronic)9781461432234
ISBN (Print)1461432227, 9781461432227
StatePublished - Aug 1 2012


  • Graphical model
  • Mixture model
  • Probabilistic models
  • Stochastic process

ASJC Scopus subject areas

  • General Computer Science


Dive into the research topics of 'Probabilistic models for text mining'. Together they form a unique fingerprint.

Cite this