Unsupervised learning of edit parameters for matching name variants

Dan Gillick, Dilek Hakkani-Tür, Michael Levit

Research output: Contribution to journalConference articlepeer-review

Abstract

Since named entities are often written in different ways, question answering (QA) and other language processing tasks stand to benefit from entity matching. We address the problem of finding equivalent person names in unstructured text. Our approach is a generalization of spelling correction: We compare to candidate matches by applying a set of edits to an input name. We introduce a novel unsupervised method for learning spelling edit probabilities which improves overall F-Measure on our own name-matching task by 12%. Relevance is demonstrated by application to the GALE Distillation task.

Original languageEnglish (US)
Pages (from-to)467-470
Number of pages4
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
StatePublished - 2008
Externally publishedYes
EventINTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association - Brisbane, QLD, Australia
Duration: Sep 22 2008Sep 26 2008

Keywords

  • Entity matching
  • Equivalent names
  • Unsupervised learning

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Sensory Systems

Fingerprint

Dive into the research topics of 'Unsupervised learning of edit parameters for matching name variants'. Together they form a unique fingerprint.

Cite this