Abstract
In recent years there has been substantial work on the important problem of coreference resolution, most of which has concentrated on the development of new models and algorithmic techniques. These works often show that complex models improve over a weak pairwise baseline. However, less attention has been given to the importance of selecting strong features to support learning a coreference model. This paper describes a rather simple pairwise classification model for coreference resolution, developed with a well-designed set of features. We show that this produces a state-of-the-art system that outperforms systems built with complex models. We suggest that our system can be used as a baseline for the development of more complex models - which may have less impact when a more robust set of features is used. The paper also presents an ablation study and discusses the relative contributions of various features.
Original language | English (US) |
---|---|
Pages | 294-303 |
Number of pages | 10 |
DOIs | |
State | Published - 2008 |
Externally published | Yes |
Event | 2008 Conference on Empirical Methods in Natural Language Processing, EMNLP 2008, Co-located with AMTA 2008 and the International Workshop on Spoken Language Translation - Honolulu, HI, United States Duration: Oct 25 2008 → Oct 27 2008 |
Other
Other | 2008 Conference on Empirical Methods in Natural Language Processing, EMNLP 2008, Co-located with AMTA 2008 and the International Workshop on Spoken Language Translation |
---|---|
Country/Territory | United States |
City | Honolulu, HI |
Period | 10/25/08 → 10/27/08 |
ASJC Scopus subject areas
- Computational Theory and Mathematics
- Computer Science Applications
- Information Systems