TY - JOUR
T1 - Non-Negative Matrix Factorization for Drug Repositioning
T2 - Experiments with the repoDB Dataset
AU - Bakal, Gokhan
AU - Kilicoglu, Halil
AU - Kavuluru, Ramakanth
N1 - Publisher Copyright:
©2019 AMIA - All rights reserved.
PY - 2019
Y1 - 2019
N2 - Computational methods for drug repositioning are gaining mainstream attention with the availability of experimental gene expression datasets and manually curated relational information in knowledge bases. When building repurpos-ing tools, a fundamental limitation is the lack of gold standard datasets that contain realistic true negative examples of drug-disease pairs that were shown to be non-indications. To address this gap, the repoDB dataset was created in 2017 as a first of its kind realistic resource to benchmark drug repositioning methods - its positive examples are drawn from FDA approved indications and negatives examples are derivedfrom failed clinical trials. In this paper, we present the first effort for repositioning that directly tests against repoDB instances. By using hand-curated drug-disease indications from the UMLS Metathesaurus and automatically extracted relations from the SemMedDB database, we employ non-negative matrix factorization (NMF) methods to recover repoDB positive indications. Among recoverable approved indications, our NMF methods achieve 96% recall with 80% precision providing further evidence that hand-curated knowledge and matrix completion methods can be exploited for hypothesis generation.
AB - Computational methods for drug repositioning are gaining mainstream attention with the availability of experimental gene expression datasets and manually curated relational information in knowledge bases. When building repurpos-ing tools, a fundamental limitation is the lack of gold standard datasets that contain realistic true negative examples of drug-disease pairs that were shown to be non-indications. To address this gap, the repoDB dataset was created in 2017 as a first of its kind realistic resource to benchmark drug repositioning methods - its positive examples are drawn from FDA approved indications and negatives examples are derivedfrom failed clinical trials. In this paper, we present the first effort for repositioning that directly tests against repoDB instances. By using hand-curated drug-disease indications from the UMLS Metathesaurus and automatically extracted relations from the SemMedDB database, we employ non-negative matrix factorization (NMF) methods to recover repoDB positive indications. Among recoverable approved indications, our NMF methods achieve 96% recall with 80% precision providing further evidence that hand-curated knowledge and matrix completion methods can be exploited for hypothesis generation.
UR - http://www.scopus.com/inward/record.url?scp=85083755469&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85083755469&partnerID=8YFLogxK
M3 - Article
C2 - 32308816
AN - SCOPUS:85083755469
SN - 1559-4076
VL - 2019
SP - 238
EP - 247
JO - AMIA Annual Symposium Proceedings
JF - AMIA Annual Symposium Proceedings
ER -