Ranking indirect connections in literature-based discovery: The role of medical subject headings

Don R. Swanson, Neil R. Smalheiser, Vetle I. Torvik

Research output: Contribution to journalArticle

Abstract

Arrowsmith, a computer-assisted process for literature-based discovery, takes as input two disjoint sets of records (A, C) from the Medline database. It produces a list of title words and phrases, B, that are common to A and C, and displays the title context In which each B-term occurs within A and within C. Subject experts then can try to find A-B and B-C title-pairs that together may suggest novel and plausible indirect A-C relationships (via B-terms) that are of particular interest in the absence of any known direct A-C relationship. The list of B-terms typically is so large that it is difficult to find the relatively few that contribute to scientifically interesting connections. The purpose of the present article is to propose and test several techniques for improving the quality of the B-list. These techniques exploit the Medical Subject Headings (MeSH) that are assigned to each input record. A MesH-based concept of literature cohesiveness is defined and plays a key role. The proposed techniques are tested on a published example of indirect connections between migraine and magnesium deficiency. The tests demonstrate how the earlier results can be replicated with a more efficient and more system-atic computer-aided process.

Original languageEnglish (US)
Pages (from-to)1427-1439
Number of pages13
JournalJournal of the American Society for Information Science and Technology
Volume57
Issue number11
DOIs
StatePublished - Sep 1 2006
Externally publishedYes

Fingerprint

Magnesium
ranking
Computer systems
expert
literature
Ranking
Literature-based discovery
Data base

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Human-Computer Interaction
  • Computer Networks and Communications
  • Artificial Intelligence

Cite this

Ranking indirect connections in literature-based discovery : The role of medical subject headings. / Swanson, Don R.; Smalheiser, Neil R.; Torvik, Vetle I.

In: Journal of the American Society for Information Science and Technology, Vol. 57, No. 11, 01.09.2006, p. 1427-1439.

Research output: Contribution to journalArticle

@article{013b65f1b6864e738469ff72f824d167,
title = "Ranking indirect connections in literature-based discovery: The role of medical subject headings",
abstract = "Arrowsmith, a computer-assisted process for literature-based discovery, takes as input two disjoint sets of records (A, C) from the Medline database. It produces a list of title words and phrases, B, that are common to A and C, and displays the title context In which each B-term occurs within A and within C. Subject experts then can try to find A-B and B-C title-pairs that together may suggest novel and plausible indirect A-C relationships (via B-terms) that are of particular interest in the absence of any known direct A-C relationship. The list of B-terms typically is so large that it is difficult to find the relatively few that contribute to scientifically interesting connections. The purpose of the present article is to propose and test several techniques for improving the quality of the B-list. These techniques exploit the Medical Subject Headings (MeSH) that are assigned to each input record. A MesH-based concept of literature cohesiveness is defined and plays a key role. The proposed techniques are tested on a published example of indirect connections between migraine and magnesium deficiency. The tests demonstrate how the earlier results can be replicated with a more efficient and more system-atic computer-aided process.",
author = "Swanson, {Don R.} and Smalheiser, {Neil R.} and Torvik, {Vetle I.}",
year = "2006",
month = "9",
day = "1",
doi = "10.1002/asi.20438",
language = "English (US)",
volume = "57",
pages = "1427--1439",
journal = "Journal of the Association for Information Science and Technology",
issn = "2330-1635",
publisher = "John Wiley and Sons Ltd",
number = "11",

}

TY - JOUR

T1 - Ranking indirect connections in literature-based discovery

T2 - The role of medical subject headings

AU - Swanson, Don R.

AU - Smalheiser, Neil R.

AU - Torvik, Vetle I.

PY - 2006/9/1

Y1 - 2006/9/1

N2 - Arrowsmith, a computer-assisted process for literature-based discovery, takes as input two disjoint sets of records (A, C) from the Medline database. It produces a list of title words and phrases, B, that are common to A and C, and displays the title context In which each B-term occurs within A and within C. Subject experts then can try to find A-B and B-C title-pairs that together may suggest novel and plausible indirect A-C relationships (via B-terms) that are of particular interest in the absence of any known direct A-C relationship. The list of B-terms typically is so large that it is difficult to find the relatively few that contribute to scientifically interesting connections. The purpose of the present article is to propose and test several techniques for improving the quality of the B-list. These techniques exploit the Medical Subject Headings (MeSH) that are assigned to each input record. A MesH-based concept of literature cohesiveness is defined and plays a key role. The proposed techniques are tested on a published example of indirect connections between migraine and magnesium deficiency. The tests demonstrate how the earlier results can be replicated with a more efficient and more system-atic computer-aided process.

AB - Arrowsmith, a computer-assisted process for literature-based discovery, takes as input two disjoint sets of records (A, C) from the Medline database. It produces a list of title words and phrases, B, that are common to A and C, and displays the title context In which each B-term occurs within A and within C. Subject experts then can try to find A-B and B-C title-pairs that together may suggest novel and plausible indirect A-C relationships (via B-terms) that are of particular interest in the absence of any known direct A-C relationship. The list of B-terms typically is so large that it is difficult to find the relatively few that contribute to scientifically interesting connections. The purpose of the present article is to propose and test several techniques for improving the quality of the B-list. These techniques exploit the Medical Subject Headings (MeSH) that are assigned to each input record. A MesH-based concept of literature cohesiveness is defined and plays a key role. The proposed techniques are tested on a published example of indirect connections between migraine and magnesium deficiency. The tests demonstrate how the earlier results can be replicated with a more efficient and more system-atic computer-aided process.

UR - http://www.scopus.com/inward/record.url?scp=33748461581&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33748461581&partnerID=8YFLogxK

U2 - 10.1002/asi.20438

DO - 10.1002/asi.20438

M3 - Article

AN - SCOPUS:33748461581

VL - 57

SP - 1427

EP - 1439

JO - Journal of the Association for Information Science and Technology

JF - Journal of the Association for Information Science and Technology

SN - 2330-1635

IS - 11

ER -