TY - JOUR
T1 - A probabilistic similarity metric for Medline records
T2 - a model for author name disambiguation.
AU - Torvik, Vetle I.
AU - Weeber, Marc
AU - Swanson, Don R.
AU - Smalheiser, Neil R.
PY - 2003
Y1 - 2003
N2 - We present a model for automatically generating training sets and estimating the probability that a pair of Medline records sharing a last and first name initial are authored by the same individual, based on shared title words, journal name, co-authors, medical subject headings, language, and affiliation, as well as distinctive features of the name itself (i.e., presence of middle initial, suffix, and prevalence in Medline).
AB - We present a model for automatically generating training sets and estimating the probability that a pair of Medline records sharing a last and first name initial are authored by the same individual, based on shared title words, journal name, co-authors, medical subject headings, language, and affiliation, as well as distinctive features of the name itself (i.e., presence of middle initial, suffix, and prevalence in Medline).
UR - http://www.scopus.com/inward/record.url?scp=16544383397&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=16544383397&partnerID=8YFLogxK
M3 - Article
C2 - 14728536
AN - SCOPUS:16544383397
SN - 1559-4076
SP - 1033
JO - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
JF - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
ER -