A probabilistic similarity metric for Medline records: a model for author name disambiguation.

Vetle I. Torvik, Marc Weeber, Don R. Swanson, Neil R. Smalheiser

Research output: Contribution to journalArticlepeer-review

Abstract

We present a model for automatically generating training sets and estimating the probability that a pair of Medline records sharing a last and first name initial are authored by the same individual, based on shared title words, journal name, co-authors, medical subject headings, language, and affiliation, as well as distinctive features of the name itself (i.e., presence of middle initial, suffix, and prevalence in Medline).

Original languageEnglish (US)
Number of pages1
JournalAMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
StatePublished - 2003
Externally publishedYes

ASJC Scopus subject areas

  • Medicine(all)

Fingerprint

Dive into the research topics of 'A probabilistic similarity metric for Medline records: a model for author name disambiguation.'. Together they form a unique fingerprint.

Cite this