TY - GEN
T1 - Side effect PTM
T2 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, ACM BCB 2014
AU - Wang, Sheng
AU - Li, Yanen
AU - Ferguson, Duncan
AU - Zhai, Chengxiang
N1 - Publisher Copyright:
Copyright © 2014 ACM.
PY - 2014/9/20
Y1 - 2014/9/20
N2 - Automatic discovery of medical knowledge using data mining has great potential benefit in improving population health and reducing healthcare cost. Discovering adverse drug reaction (ADR) is especially important because of the significant morbidity of ADRs to patients. Recently, more and more patients describe the ADRs they experienced and seek for help through online health forums, creating great opportunities for these forums to discover previously unknown ADRs. In this paper, we propose a novel unsupervised approach to tap into the increasingly available health forums to mine the side effect symptoms of drugs mentioned by forum users. Our approach is based on a novel probabilistic mixture model of symptoms, where the side effect symptoms and disease symptoms are explicitly modeled with two separate component models, and discovery of side effect symptoms can be achieved in an unsupervised way through fit- Ting the mixture model to the forum data. Extensive experiments on online health forums demonstrate that our proposed model is effective for discovering the reported ADRs on forums in a completely unsupervised way. The mined knowledge using our model is directly useful for increasing our understanding of more challenging ADRs, such as long-term side effects, drug-drug interactions, and rare side effects. Since our approach is unsupervised, it can be applied to mining large amounts of growing forum data to discover new knowledge about ADRs, helping many patients become aware of possible ADRs.
AB - Automatic discovery of medical knowledge using data mining has great potential benefit in improving population health and reducing healthcare cost. Discovering adverse drug reaction (ADR) is especially important because of the significant morbidity of ADRs to patients. Recently, more and more patients describe the ADRs they experienced and seek for help through online health forums, creating great opportunities for these forums to discover previously unknown ADRs. In this paper, we propose a novel unsupervised approach to tap into the increasingly available health forums to mine the side effect symptoms of drugs mentioned by forum users. Our approach is based on a novel probabilistic mixture model of symptoms, where the side effect symptoms and disease symptoms are explicitly modeled with two separate component models, and discovery of side effect symptoms can be achieved in an unsupervised way through fit- Ting the mixture model to the forum data. Extensive experiments on online health forums demonstrate that our proposed model is effective for discovering the reported ADRs on forums in a completely unsupervised way. The mined knowledge using our model is directly useful for increasing our understanding of more challenging ADRs, such as long-term side effects, drug-drug interactions, and rare side effects. Since our approach is unsupervised, it can be applied to mining large amounts of growing forum data to discover new knowledge about ADRs, helping many patients become aware of possible ADRs.
KW - Adverse drug reaction
KW - Health forum
KW - Probabilistic topic model
UR - http://www.scopus.com/inward/record.url?scp=84920748198&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84920748198&partnerID=8YFLogxK
U2 - 10.1145/2649387.2649398
DO - 10.1145/2649387.2649398
M3 - Conference contribution
AN - SCOPUS:84920748198
T3 - ACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
SP - 321
EP - 330
BT - ACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
PB - Association for Computing Machinery
Y2 - 20 September 2014 through 23 September 2014
ER -