TY - GEN
T1 - Design challenges and misconceptions in named entity recognition
AU - Ratinov, Lev
AU - Roth, Dan
PY - 2009
Y1 - 2009
N2 - We analyze some of the fundamental design challenges and misconceptions that underlie the development of an efficient and robust NER system. In particular, we address issues such as the representation of text chunks, the inference approach needed to combine local NER decisions, the sources of prior knowledge and how to use them within an NER system. In the process of comparing several solutions to these challenges we reach some surprising conclusions, as well as develop an NER system that achieves 90.8 F1 score on the CoNLL-2003 NER shared task, the best reported result for this dataset.
UR - http://www.scopus.com/inward/record.url?scp=84862300668&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862300668&partnerID=8YFLogxK
U2 - 10.3115/1596374.1596399
DO - 10.3115/1596374.1596399
M3 - Conference contribution
AN - SCOPUS:84862300668
SN - 1932432299
SN - 9781932432299
T3 - CoNLL 2009 - Proceedings of the Thirteenth Conference on Computational Natural Language Learning
SP - 147
EP - 155
BT - CoNLL 2009 - Proceedings of the Thirteenth Conference on Computational Natural Language Learning
PB - Association for Computational Linguistics (ACL)
T2 - 13th Conference on Computational Natural Language Learning, CoNLL 2009
Y2 - 4 June 2009 through 5 June 2009
ER -