Extending Boosting for call classification using word confusion networks

Gokhan Tur, Dilek Hakkani-Tür, Giuseppe Riccardi

Research output: Contribution to journalConference articlepeer-review

Abstract

We are interested in the problem of robust understanding from noisy spontaneous speech input. In goal driven humanmachine dialog, utterance classification is a key component of the understanding process to determine the intent of the speaker. In this paper we propose a novel algorithm for exploiting ASR word confidence scores for better utterance classification of spoken utterances. Word confidence scores for automatic speech recognition (ASR) provide estimates for word error rates. While previous work has focused on straightforward combination of word confidence scores into Bayesian classifiers, in this paper we extend the mathematical formulation for Boosting classifiers. This extension of die algorithm allows to exploit confidence scores from a 1-best ASR output or from word confusion networks (WCNs). We present methods for on-line and off-line score combinations. The results we show are for a large database of utterances collected using the AT&T VoiceTone SM spoken dialog system. Our experiments show between 5%-10% reduction in error (1-precision) for a given recall using WCNs compared to ASR output.

Original languageEnglish (US)
Pages (from-to)I437-I440
JournalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
StatePublished - 2004
Externally publishedYes
EventProceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing - Montreal, Que, Canada
Duration: May 17 2004May 21 2004

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Extending Boosting for call classification using word confusion networks'. Together they form a unique fingerprint.

Cite this