Predicting MeSH beyond MEDLINE

Adam K. Kehoe, Vetle I. Torvik, Neil R. Smalheiser, Matthew B. Ross

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Medical subject headings (MeSH) are a flexible and useful tool for describing biomedical concepts. Here, we present MeSHier, a tool for assigning MeSH terms to biomedical documents based on abstract similarity and references to MEDLINE records. When applied to PubMedCentral papers, NIH grants, and USPTO patents we find that these two sources of information produce largely disjoint sets of related MEDLINE records, albeit with some overlap in MeSH. When combined they provide an enriched topical annotation that would not have been possible with either alone. MeSHier is available as a demo tool that can take as input IDs of PubMed papers, USPTO patents, and NIH grants: http://abel.lis.illinois.edu/cgi-bin/meshier/search.py.

Original languageEnglish (US)
Title of host publicationProceedings of the 1st Workshop on Scholarly Web Mining, SWM 2017
PublisherAssociation for Computing Machinery
Pages49-56
Number of pages8
ISBN (Electronic)9781450352406
DOIs
StatePublished - Feb 10 2017
Event1st Workshop on Scholarly Web Mining, SWM 2017 - Cambridge, United Kingdom
Duration: Feb 10 2017 → …

Publication series

NameACM International Conference Proceeding Series
VolumePart F127853

Other

Other1st Workshop on Scholarly Web Mining, SWM 2017
CountryUnited Kingdom
CityCambridge
Period2/10/17 → …

Keywords

  • Controlled vocabularies
  • Medical subject headings

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Cite this

Kehoe, A. K., Torvik, V. I., Smalheiser, N. R., & Ross, M. B. (2017). Predicting MeSH beyond MEDLINE. In Proceedings of the 1st Workshop on Scholarly Web Mining, SWM 2017 (pp. 49-56). (ACM International Conference Proceeding Series; Vol. Part F127853). Association for Computing Machinery. https://doi.org/10.1145/3057148.3057155