TY - JOUR
T1 - Identification of essential proteins using induced stars in protein–protein interaction networks
AU - Vogiatzis, Chrysafis
AU - Camur, Mustafa Can
N1 - C. Vogiatzis acknowledges support from the National Science Foundation Office of Experimental Program to Stimulate Competitive Research [Grant ND EPSCoR NSF 1355466].
History: Accepted by Allen Holder, Area Editor for Applications in Biology, Medicine, and Healthcare. Funding: C. Vogiatzis acknowledges support from the National Science Foundation Office of Exper-imental Program to Stimulate Competitive Research [Grant ND EPSCoR NSF 1355466]. Supplemental Material: The online supplement is available at https://doi.org/10.1287/ijoc.2018.0872.
PY - 2019/9
Y1 - 2019/9
N2 - In this work, we propose a novel centrality metric, referred to as star centrality, which incorporates information from the closed neighborhood of a node, rather than solely from the node itself, when calculating its topological importance. More specifically, we focus on degree centrality and show that in the complex protein–protein interaction networks, it is a naive metric that can lead to misclassifying protein importance. For our extension of degree centrality when considering stars, we derive its computational complexity, provide a mathematical formulation, and propose two approximation algorithms that are shown to be efficient in practice. We portray the success of this new metric in protein–protein interaction networks when predicting protein essentiality in several organisms, including the well-studied Saccharomyces cerevisiae, Helicobacter pylori, and Caenorhabditis elegans, where star centrality is shown to significantly outperform other nodal centrality metrics at detecting essential proteins. We also analyze the average and worst-case performance of the two approximation algorithms in practice and show that they are viable options for computing star centrality in very large-scale protein–protein interaction networks, such as the human proteome, where exact methodologies are bound to be time and memory intensive.
AB - In this work, we propose a novel centrality metric, referred to as star centrality, which incorporates information from the closed neighborhood of a node, rather than solely from the node itself, when calculating its topological importance. More specifically, we focus on degree centrality and show that in the complex protein–protein interaction networks, it is a naive metric that can lead to misclassifying protein importance. For our extension of degree centrality when considering stars, we derive its computational complexity, provide a mathematical formulation, and propose two approximation algorithms that are shown to be efficient in practice. We portray the success of this new metric in protein–protein interaction networks when predicting protein essentiality in several organisms, including the well-studied Saccharomyces cerevisiae, Helicobacter pylori, and Caenorhabditis elegans, where star centrality is shown to significantly outperform other nodal centrality metrics at detecting essential proteins. We also analyze the average and worst-case performance of the two approximation algorithms in practice and show that they are viable options for computing star centrality in very large-scale protein–protein interaction networks, such as the human proteome, where exact methodologies are bound to be time and memory intensive.
KW - Centrality
KW - Complex network analysis
KW - Protein-protein interaction networks
UR - http://www.scopus.com/inward/record.url?scp=85079071583&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85079071583&partnerID=8YFLogxK
U2 - 10.1287/IJOC.2018.0872
DO - 10.1287/IJOC.2018.0872
M3 - Article
AN - SCOPUS:85079071583
SN - 1091-9856
VL - 31
SP - 703
EP - 718
JO - INFORMS Journal on Computing
JF - INFORMS Journal on Computing
IS - 4
ER -