PaReCat: Patient record subcategorization for precision Traditional Chinese medicine

Edward W. Huang, Baoyan Liu, Sheng Wang, Xuezhong Zhou, Runshun Zhang, Chengxiang Zhai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Traditional Chinese medicine (TCM), a style of medicine widely used in China for thousands of years, can complement modern western medicine by taking personalization as the core principle of clinical practice. A fundamental task in TCM, particularly important for achieving efective precision medicine, is to subcategorize patients with a general disease into groups corresponding to variations of that disease. In this paper, we conduct the frst study of the problem of subcategorizing electronic patient records in TCM. While the general problem of subcategorization can be solved using basic clustering algorithms, accommodating variations in symptoms and herb prescriptions of TCM patient records when computing patient similarity is a major technical challenge that has yet to be addressed. To tackle this problem, we propose to learn inexact matchings of both symptoms and herbs from a TCM dictionary of herb functions by using an embedding algorithm. Our hypothesis is that the prior knowledge of herb-symptom associations in the TCM dictionary can be used to discover latent relationships among comorbid symptoms and functionally similar herbs, thereby improving the quality of subcategorization. We performed extensive experiments on large-scale real-world datasets. As expected, our approach leads to more accurate matchings between patient records than baseline approaches, and thus better subcategorization results. We also show that the proposed algorithm can be used immediately in multiple clinical applications, such as retrieving similar patients as well as discovering two special TCM cases: similar symptoms treated by diferent herbs and diferent symptoms treated by similar herbs.

Original languageEnglish (US)
Title of host publicationACM-BCB 2016 - 7th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
PublisherAssociation for Computing Machinery
Pages443-452
Number of pages10
ISBN (Electronic)9781450342254
DOIs
StatePublished - Oct 2 2016
Event7th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM-BCB 2016 - Seattle, United States
Duration: Oct 2 2016Oct 5 2016

Publication series

NameACM-BCB 2016 - 7th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics

Other

Other7th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM-BCB 2016
Country/TerritoryUnited States
CitySeattle
Period10/2/1610/5/16

Keywords

  • Network embedding
  • Patient record subcategorization
  • Traditional Chinese medicine

ASJC Scopus subject areas

  • Software
  • Health Informatics
  • Biomedical Engineering
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'PaReCat: Patient record subcategorization for precision Traditional Chinese medicine'. Together they form a unique fingerprint.

Cite this