A probabilistic similarity metric for Medline records: a model for author name disambiguation.

Vetle I. Torvik, Marc Weeber, Don R. Swanson, Neil R. Smalheiser

Research output: Contribution to journalArticlepeer-review

Abstract

We present a model for automatically generating training sets and estimating the probability that a pair of Medline records sharing a last and first name initial are authored by the same individual, based on shared title words, journal name, co-authors, medical subject headings, language, and affiliation, as well as distinctive features of the name itself (i.e., presence of middle initial, suffix, and prevalence in Medline).

Original languageEnglish (US)
Pages (from-to)1033
Number of pages1
JournalAMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
StatePublished - 2003
Externally publishedYes

ASJC Scopus subject areas

  • General Medicine

Fingerprint

Dive into the research topics of 'A probabilistic similarity metric for Medline records: a model for author name disambiguation.'. Together they form a unique fingerprint.

Cite this