TY - GEN
T1 - Unsupervised ranking of knowledge bases for named entity recognition
AU - Mrabet, Yassine
AU - Kilicoglu, Halil
AU - Demner-Fushman, Dina
N1 - Publisher Copyright:
© 2016 The Authors and IOS Press.
PY - 2016
Y1 - 2016
N2 - With the continuous growth of freely accessible knowledge bases and the heterogeneity of textual corpora, selecting the most adequate knowledge base for named entity recognition is becoming a challenge in itself. In this paper, we propose an unsupervised method to rank knowledge bases according to their adequacy for the recognition of named entities in a given corpus. Building on a state-of-the-art, unsupervised entity linking approach, we propose several evaluation metrics to measure the lexical and structural adequacy of a knowledge base for a given corpus. We study the correlation between these metrics and three standard performance measures: precision, recall and F1 score. Our multi-domain experiments on 9 different corpora with 6 knowledge bases show that three of the proposed metrics are strong performance predictors having 0.62 to 0.76 Pearson correlation with precision and 0.96 correlation with both recall and F1 score.
AB - With the continuous growth of freely accessible knowledge bases and the heterogeneity of textual corpora, selecting the most adequate knowledge base for named entity recognition is becoming a challenge in itself. In this paper, we propose an unsupervised method to rank knowledge bases according to their adequacy for the recognition of named entities in a given corpus. Building on a state-of-the-art, unsupervised entity linking approach, we propose several evaluation metrics to measure the lexical and structural adequacy of a knowledge base for a given corpus. We study the correlation between these metrics and three standard performance measures: precision, recall and F1 score. Our multi-domain experiments on 9 different corpora with 6 knowledge bases show that three of the proposed metrics are strong performance predictors having 0.62 to 0.76 Pearson correlation with precision and 0.96 correlation with both recall and F1 score.
UR - http://www.scopus.com/inward/record.url?scp=85013040562&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85013040562&partnerID=8YFLogxK
U2 - 10.3233/978-1-61499-672-9-1248
DO - 10.3233/978-1-61499-672-9-1248
M3 - Conference contribution
AN - SCOPUS:85013040562
T3 - Frontiers in Artificial Intelligence and Applications
SP - 1248
EP - 1255
BT - Frontiers in Artificial Intelligence and Applications
A2 - Kaminka, Gal A.
A2 - Dignum, Frank
A2 - Hullermeier, Eyke
A2 - Bouquet, Paolo
A2 - Dignum, Virginia
A2 - Fox, Maria
A2 - van Harmelen, Frank
PB - IOS Press
T2 - 22nd European Conference on Artificial Intelligence, ECAI 2016
Y2 - 29 August 2016 through 2 September 2016
ER -