Towards unsupervised spoken language understanding: Exploiting query click logs for slot filling

Gokhan Tur, Dilek Hakkani-Tür, Dustin Hillard, Asli Celikyilmaz

Research output: Contribution to journal › Conference article › peer-review

Abstract

In this paper, we present a novel approach to exploit user queries mined from search engine query click logs to bootstrap or improve slot filling models for spoken language understanding. We propose extending the earlier gazetteer population techniques to mine unannotated training data for semantic parsing. The automatically annotated mined data can then be used to train slot-specific parsing models. We show that this method can be used to bootstrap slot filling models and can be combined with any available annotated data to improve performance. Furthermore, this approach may eliminate the need for populating and maintaining in-domain gazetteers, in addition to providing complementary information if they are already available.
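
As a rough illustration of the automatic annotation step mentioned in the abstract, the sketch below tags query tokens with IOB slot labels by longest-match lookup against a small set of entity strings. It is not the authors' implementation: the function name `auto_annotate`, the slot names, and the example entities and query are all hypothetical, and the abstract does not specify how mined queries are actually matched or labeled.

```python
from typing import Dict, List, Tuple


def auto_annotate(query: str, entities: Dict[str, str]) -> List[Tuple[str, str]]:
    """Tag each token of `query` with an IOB slot label.

    `entities` maps a lowercase entity phrase to a slot name. Matching is a
    greedy, left-to-right, longest-match lookup (an assumption made for this
    sketch, not a detail taken from the paper).
    """
    tokens = query.lower().split()
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate span starting at position i first.
        for j in range(len(tokens), i, -1):
            phrase = " ".join(tokens[i:j])
            if phrase in entities:
                slot = entities[phrase]
                tags[i] = "B-" + slot
                for k in range(i + 1, j):
                    tags[k] = "I-" + slot
                i = j
                matched = True
                break
        if not matched:
            i += 1
    return list(zip(tokens, tags))


# Hypothetical entity strings, standing in for entities mined from click logs.
mined_entities = {
    "the dark knight": "movie_name",
    "christopher nolan": "director",
}

annotated = auto_annotate(
    "show me the dark knight by christopher nolan", mined_entities
)
for token, tag in annotated:
    print(f"{token}\t{tag}")
```

Token and tag pairs produced this way could serve as training data for a sequence tagger, alone or together with manually annotated utterances, which is the kind of combination the abstract describes.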

Original language: English (US)
Pages (from-to): 1293-1296
Number of pages: 4
Journal: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
State: Published - 2011
Externally published: Yes
Event: 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011 - Florence, Italy
Duration: Aug 27 2011 to Aug 31 2011

Keywords

  • Data mining
  • Named entity extraction
  • Slot filling
  • Spoken language understanding
  • Unsupervised learning

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modeling and Simulation
