TY - GEN
T1 - Learning Structural Genetic Information via Graph Neural Embedding
AU - Xie, Yuan
AU - Pei, Yulong
AU - Lu, Yun
AU - Tang, Haixu
AU - Zhou, Yuan
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2020
Y1 - 2020
N2 - Learning continuous vector representations of genes has been proved to be conducive for many bioinformatics tasks as it can incorporate information of various sources including gene interactions and gene-disease interactions. However, most of the existing approaches, following a paradigm stemmed from the natural language processing community, treat the embedding context in a flat fashion such as a sequence, and tend to overlook the fact that proteins are more likely to function together. In this study, we propose an unsupervised gene embedding algorithm which utilizes graph convolutional network to learn structural information of genes from their neighborhoods in genetic interaction networks. We also propose a neighborhood sampling strategy to generate training samples. Our approach does not assume conditional independence of the node neighborhood and focuses on learning structural information. We compare our method against state-of-the-art baselines and experimental results demonstrate the effectiveness of our approach.
AB - Learning continuous vector representations of genes has been proved to be conducive for many bioinformatics tasks as it can incorporate information of various sources including gene interactions and gene-disease interactions. However, most of the existing approaches, following a paradigm stemmed from the natural language processing community, treat the embedding context in a flat fashion such as a sequence, and tend to overlook the fact that proteins are more likely to function together. In this study, we propose an unsupervised gene embedding algorithm which utilizes graph convolutional network to learn structural information of genes from their neighborhoods in genetic interaction networks. We also propose a neighborhood sampling strategy to generate training samples. Our approach does not assume conditional independence of the node neighborhood and focuses on learning structural information. We compare our method against state-of-the-art baselines and experimental results demonstrate the effectiveness of our approach.
KW - Essential gene identification
KW - Gene embedding
KW - Graph convolutional network
KW - Protein-protein interaction
UR - http://www.scopus.com/inward/record.url?scp=85090098741&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85090098741&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-57821-3_22
DO - 10.1007/978-3-030-57821-3_22
M3 - Conference contribution
AN - SCOPUS:85090098741
SN - 9783030578206
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 250
EP - 261
BT - Bioinformatics Research and Applications - 16th International Symposium, ISBRA 2020, Proceedings
A2 - Cai, Zhipeng
A2 - Mandoiu, Ion
A2 - Narasimhan, Giri
A2 - Skums, Pavel
A2 - Guo, Xuan
PB - Springer
T2 - 16th International Symposium on Bioinformatics Research and Applications, ISBRA 2020
Y2 - 1 December 2020 through 4 December 2020
ER -