Design challenges and misconceptions in named entity recognition

Lev Ratinov, Dan Roth

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We analyze some of the fundamental design challenges and misconceptions that underlie the development of an efficient and robust NER system. In particular, we address issues such as the representation of text chunks, the inference approach needed to combine local NER decisions, the sources of prior knowledge and how to use them within an NER system. In the process of comparing several solutions to these challenges we reach some surprising conclusions, as well as develop an NER system that achieves 90.8 F1 score on the CoNLL-2003 NER shared task, the best reported result for this dataset.

Original languageEnglish (US)
Title of host publicationCoNLL 2009 - Proceedings of the Thirteenth Conference on Computational Natural Language Learning
PublisherAssociation for Computational Linguistics (ACL)
Pages147-155
Number of pages9
ISBN (Print)1932432299, 9781932432299
DOIs
StatePublished - 2009
Externally publishedYes
Event13th Conference on Computational Natural Language Learning, CoNLL 2009 - Boulder, CO, United States
Duration: Jun 4 2009Jun 5 2009

Publication series

NameCoNLL 2009 - Proceedings of the Thirteenth Conference on Computational Natural Language Learning

Other

Other13th Conference on Computational Natural Language Learning, CoNLL 2009
Country/TerritoryUnited States
CityBoulder, CO
Period6/4/096/5/09

ASJC Scopus subject areas

  • Artificial Intelligence
  • Human-Computer Interaction
  • Language and Linguistics
  • Computational Theory and Mathematics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Design challenges and misconceptions in named entity recognition'. Together they form a unique fingerprint.

Cite this