TY - GEN
T1 - PaReCat
T2 - 7th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM-BCB 2016
AU - Huang, Edward W.
AU - Liu, Baoyan
AU - Wang, Sheng
AU - Zhou, Xuezhong
AU - Zhang, Runshun
AU - Zhai, Chengxiang
N1 - Publisher Copyright:
© 2016 ACM.
PY - 2016/10/2
Y1 - 2016/10/2
N2 - Traditional Chinese medicine (TCM), a style of medicine widely used in China for thousands of years, can complement modern western medicine by taking personalization as the core principle of clinical practice. A fundamental task in TCM, particularly important for achieving efective precision medicine, is to subcategorize patients with a general disease into groups corresponding to variations of that disease. In this paper, we conduct the frst study of the problem of subcategorizing electronic patient records in TCM. While the general problem of subcategorization can be solved using basic clustering algorithms, accommodating variations in symptoms and herb prescriptions of TCM patient records when computing patient similarity is a major technical challenge that has yet to be addressed. To tackle this problem, we propose to learn inexact matchings of both symptoms and herbs from a TCM dictionary of herb functions by using an embedding algorithm. Our hypothesis is that the prior knowledge of herb-symptom associations in the TCM dictionary can be used to discover latent relationships among comorbid symptoms and functionally similar herbs, thereby improving the quality of subcategorization. We performed extensive experiments on large-scale real-world datasets. As expected, our approach leads to more accurate matchings between patient records than baseline approaches, and thus better subcategorization results. We also show that the proposed algorithm can be used immediately in multiple clinical applications, such as retrieving similar patients as well as discovering two special TCM cases: similar symptoms treated by diferent herbs and diferent symptoms treated by similar herbs.
AB - Traditional Chinese medicine (TCM), a style of medicine widely used in China for thousands of years, can complement modern western medicine by taking personalization as the core principle of clinical practice. A fundamental task in TCM, particularly important for achieving efective precision medicine, is to subcategorize patients with a general disease into groups corresponding to variations of that disease. In this paper, we conduct the frst study of the problem of subcategorizing electronic patient records in TCM. While the general problem of subcategorization can be solved using basic clustering algorithms, accommodating variations in symptoms and herb prescriptions of TCM patient records when computing patient similarity is a major technical challenge that has yet to be addressed. To tackle this problem, we propose to learn inexact matchings of both symptoms and herbs from a TCM dictionary of herb functions by using an embedding algorithm. Our hypothesis is that the prior knowledge of herb-symptom associations in the TCM dictionary can be used to discover latent relationships among comorbid symptoms and functionally similar herbs, thereby improving the quality of subcategorization. We performed extensive experiments on large-scale real-world datasets. As expected, our approach leads to more accurate matchings between patient records than baseline approaches, and thus better subcategorization results. We also show that the proposed algorithm can be used immediately in multiple clinical applications, such as retrieving similar patients as well as discovering two special TCM cases: similar symptoms treated by diferent herbs and diferent symptoms treated by similar herbs.
KW - Network embedding
KW - Patient record subcategorization
KW - Traditional Chinese medicine
UR - http://www.scopus.com/inward/record.url?scp=85009792744&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85009792744&partnerID=8YFLogxK
U2 - 10.1145/2975167.2975213
DO - 10.1145/2975167.2975213
M3 - Conference contribution
AN - SCOPUS:85009792744
T3 - ACM-BCB 2016 - 7th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
SP - 443
EP - 452
BT - ACM-BCB 2016 - 7th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
PB - Association for Computing Machinery
Y2 - 2 October 2016 through 5 October 2016
ER -