A weakly-supervised approach for discovering new user intents from search query logs

Dilek Hakkani-Tür, Asli Celikyilmaz, Larry Heck, Gokhan Tur

Research output: Contribution to journalConference articlepeer-review

Abstract

State-of-the art spoken language understanding models that automatically capture user intents in human to machine dialogs are trained with manually annotated data, which is cumbersome and time-consuming to prepare. For bootstrapping the learning algorithm that detects relations in natural language queries to a conversational system, one can rely on publicly available knowledge graphs, such as Freebase, and mine corresponding data from the web. In this paper, we present an unsupervised approach to discover new user intents using a novel Bayesian hierarchical graphical model. Our model employs search query click logs to enrich the information extracted from bootstrapped models. We use the clicked URLs as implicit supervision and extend the knowledge graph based on the relational information discovered from this model. The posteriors from the graphical model relate the newly discovered intents with the search queries. These queries are then used as additional training examples to complement the bootstrapped relation detection models. The experimental results demonstrate the effectiveness of this approach, showing extended coverage to new intents without impacting the known intents.

Original languageEnglish (US)
Pages (from-to)3780-3784
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
StatePublished - 2013
Externally publishedYes
Event14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 - Lyon, France
Duration: Aug 25 2013Aug 29 2013

Keywords

  • Graphical models
  • Intent discovery
  • Search query click logs
  • Spoken language understanding

ASJC Scopus subject areas

  • Signal Processing
  • Software
  • Modeling and Simulation
  • Language and Linguistics
  • Human-Computer Interaction

Fingerprint

Dive into the research topics of 'A weakly-supervised approach for discovering new user intents from search query logs'. Together they form a unique fingerprint.

Cite this