Abstract
State of the art data-driven speech and language processing systems require a large amount of human intervention ranging from data annotation to system prototyping. In the traditional supervised passive approach, the system is trained on a given number of annotated data samples and evaluated using a separate test set. Then more data is collected arbitrarily, annotated, and the whole cycle is repeated. In this article, we propose the active approach where the system itself selects its own training data, evaluates itself and re-trains when necessary. We first employ active learning which aims to automatically select the examples that are likely to be the most informative for a given task. We use active learning for both selecting the examples to label and the examples to re-label in order to correct labeling errors. Furthermore, the system automatically evaluates itself using active evaluation to keep track of the unexpected events and decides on-demand to label more examples. The active approach enables dynamic adaptation of spoken language processing systems to unseen or unexpected events for nonstationary input while reducing the manual annotation effort significantly. We have evaluated the active approach with the AT&T spoken dialog system used for customer care applications. In this article, we present our results for both automatic speech recognition and spoken language understanding.
Original language | English (US) |
---|---|
Pages (from-to) | 1-31 |
Number of pages | 31 |
Journal | ACM Transactions on Speech and Language Processing |
Volume | 3 |
Issue number | 3 |
DOIs | |
State | Published - 2006 |
Externally published | Yes |
Keywords
- Active evaluation
- Active learning
- Adaptive learning
- Automatic speech recognition
- Passive learning
- Speech and language processing
- Spoken dialog systems
- Spoken language understanding
- Unsupervised learning
ASJC Scopus subject areas
- Computer Science (miscellaneous)
- Computational Mathematics