An active approach to spoken language processing

Dilek Hakkani-Tür, Giuseppe Riccardi, Gokhan Tur

Research output: Contribution to journalArticlepeer-review

Abstract

State of the art data-driven speech and language processing systems require a large amount of human intervention ranging from data annotation to system prototyping. In the traditional supervised passive approach, the system is trained on a given number of annotated data samples and evaluated using a separate test set. Then more data is collected arbitrarily, annotated, and the whole cycle is repeated. In this article, we propose the active approach where the system itself selects its own training data, evaluates itself and re-trains when necessary. We first employ active learning which aims to automatically select the examples that are likely to be the most informative for a given task. We use active learning for both selecting the examples to label and the examples to re-label in order to correct labeling errors. Furthermore, the system automatically evaluates itself using active evaluation to keep track of the unexpected events and decides on-demand to label more examples. The active approach enables dynamic adaptation of spoken language processing systems to unseen or unexpected events for nonstationary input while reducing the manual annotation effort significantly. We have evaluated the active approach with the AT&T spoken dialog system used for customer care applications. In this article, we present our results for both automatic speech recognition and spoken language understanding.

Original languageEnglish (US)
Pages (from-to)1-31
Number of pages31
JournalACM Transactions on Speech and Language Processing
Volume3
Issue number3
DOIs
StatePublished - 2006
Externally publishedYes

Keywords

  • Active evaluation
  • Active learning
  • Adaptive learning
  • Automatic speech recognition
  • Passive learning
  • Speech and language processing
  • Spoken dialog systems
  • Spoken language understanding
  • Unsupervised learning

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computational Mathematics

Fingerprint

Dive into the research topics of 'An active approach to spoken language processing'. Together they form a unique fingerprint.

Cite this