Integrating prosodic and lexical cues for automatic topic segmentation

Gökhan Tür, Andreas Stolcke, Dilek Hakkani-Tür, Elizabeth Shriberg

Research output: Contribution to journalArticlepeer-review

Abstract

We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmentation of speech into topically coherent units. We propose two methods for combining lexical and prosodic information using hidden Markov models and decision trees. Lexical information is obtained from a speech recognizer, and prosodic features are extracted automatically from speech waveforms. We evaluate our approach on the Broadcast News corpus, using the DARPA-TDT evaluation metrics. Results show that the prosodic model alone is competitive with word-based segmentation methods. Furthermore, we achieve a significant reduction in error by combining the prosodic and word-based knowledge sources.

Original languageEnglish (US)
Pages (from-to)31-57
Number of pages27
JournalComputational Linguistics
Volume27
Issue number1
DOIs
StatePublished - Mar 2001
Externally publishedYes

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
  • Computer Science Applications
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Integrating prosodic and lexical cues for automatic topic segmentation'. Together they form a unique fingerprint.

Cite this