TY - JOUR
T1 - FingerprintContacts
T2 - Predicting Alternative Conformations of Proteins from Coevolution
AU - Feng, Jiangyan
AU - Shukla, Diwakar
N1 - We thank Jiming Chen and Matthew Chan from Shukla research group for the critical reading of this manuscript and for creating the cover image for this article, respectively. We also thank the Blue Waters sustained-petascale computing project, which is supported by the National Science Foundation (Awards OCI-0725070 and ACI-1238993) and the state of Illinois. D.S. acknowledges support from the New Innovator Award from the Foundation for Food and Agriculture Research and NSF Early Career Award (MCB 1845606). J.F. was partially supported by Chia-chen Chu Fellowship and Harry G. Drickamer Graduate Research Fellowship from University of Illinois at Urbana–Champaign, IL.
PY - 2020/5/7
Y1 - 2020/5/7
N2 - Proteins are dynamic molecules which perform diverse molecular functions by adopting different three-dimensional structures. Recent progress in residue-residue contacts prediction opens up new avenues for the de novo protein structure prediction from sequence information. However, it is still difficult to predict more than one conformation from residue-residue contacts alone. This is due to the inability to deconvolve the complex signals of residue-residue contacts, i.e., spatial contacts relevant for protein folding, conformational diversity, and ligand binding. Here, we introduce a machine learning based method, called FingerprintContacts, for extending the capabilities of residue-residue contacts. This algorithm leverages the features of residue-residue contacts, that is, (1) a single conformation outperforms the others in the structural prediction using all the top ranking residue-residue contacts as structural constraints and (2) conformation specific contacts rank lower and constitute a small fraction of residue-residue contacts. We demonstrate the capabilities of FingerprintContacts on eight ligand binding proteins with varying conformational motions. Furthermore, FingerprintContacts identifies small clusters of residue-residue contacts which are preferentially located in the dynamically fluctuating regions. With the rapid growth in protein sequence information, we expect FingerprintContacts to be a powerful first step in structural understanding of protein functional mechanisms.
AB - Proteins are dynamic molecules which perform diverse molecular functions by adopting different three-dimensional structures. Recent progress in residue-residue contacts prediction opens up new avenues for the de novo protein structure prediction from sequence information. However, it is still difficult to predict more than one conformation from residue-residue contacts alone. This is due to the inability to deconvolve the complex signals of residue-residue contacts, i.e., spatial contacts relevant for protein folding, conformational diversity, and ligand binding. Here, we introduce a machine learning based method, called FingerprintContacts, for extending the capabilities of residue-residue contacts. This algorithm leverages the features of residue-residue contacts, that is, (1) a single conformation outperforms the others in the structural prediction using all the top ranking residue-residue contacts as structural constraints and (2) conformation specific contacts rank lower and constitute a small fraction of residue-residue contacts. We demonstrate the capabilities of FingerprintContacts on eight ligand binding proteins with varying conformational motions. Furthermore, FingerprintContacts identifies small clusters of residue-residue contacts which are preferentially located in the dynamically fluctuating regions. With the rapid growth in protein sequence information, we expect FingerprintContacts to be a powerful first step in structural understanding of protein functional mechanisms.
UR - https://www.scopus.com/pages/publications/85084379792
UR - https://www.scopus.com/pages/publications/85084379792#tab=citedBy
U2 - 10.1021/acs.jpcb.9b11869
DO - 10.1021/acs.jpcb.9b11869
M3 - Article
C2 - 32283936
AN - SCOPUS:85084379792
SN - 1520-6106
VL - 124
SP - 3605
EP - 3615
JO - Journal of Physical Chemistry B
JF - Journal of Physical Chemistry B
IS - 18
ER -