TY - JOUR
T1 - Understanding the "horizontal dimension" of Molecular evolution to annotate, classify, and discover proteins with functional domains
AU - Rendon, Gloria
AU - Ger, Mao Feng
AU - Kantorovitz, Ruth
AU - Natarajan, Shreedhar
AU - Tilson, Jeffrey
AU - Jakobsson, Eric
N1 - Funding Information:
Survey This work is supported by NSF of USA under Grant Nos. 0835718 and 0235792, NIH under Grant Nos. 5PN2EY016570-06 and 5R01NS063405-02, the Beckman Institute for Advanced Science and Technology, the National Center for Supercomputing Applications, and the Renaissance Computing Institute. 2010 Springer Science + Business Media, LLC & Science Press, China
PY - 2010/1
Y1 - 2010/1
N2 - Protein evolution proceeds by two distinct processes: 1) individual mutation and selection for adaptive mutations and 2) rearrangement of entire domains within proteins into novel combinations, producing new protein families that combine functional properties in ways that previously did not exist. Domain rearrangement poses a challenge to sequence alignment-based search methods, such as BLAST, in predicting homology since the methodology implicitly assumes that related proteins primarily differ from each other by individual mutations. Moreover, there is ample evidence that the evolutionary process has used (and continues to use) domains as building blocks, therefore, it seems fit to utilize computational, domain-based methods to reconstruct that process. A challenge and opportunity for computational biology is how to use knowledge of evolutionary domain recombination to characterize families of proteins whose evolutionary history includes such recombination, to discover novel proteins, and to infer protein-protein interactions. In this paper we review techniques and databases that exploit our growing knowledge of "horizontal" protein evolution, and suggest possible areas of future development. We illustrate the power of the domain-based methods and the possible directions of future development by a case history in progress aiming at facilitating a particular approach to understanding microbial pathogenicity.
AB - Protein evolution proceeds by two distinct processes: 1) individual mutation and selection for adaptive mutations and 2) rearrangement of entire domains within proteins into novel combinations, producing new protein families that combine functional properties in ways that previously did not exist. Domain rearrangement poses a challenge to sequence alignment-based search methods, such as BLAST, in predicting homology since the methodology implicitly assumes that related proteins primarily differ from each other by individual mutations. Moreover, there is ample evidence that the evolutionary process has used (and continues to use) domains as building blocks, therefore, it seems fit to utilize computational, domain-based methods to reconstruct that process. A challenge and opportunity for computational biology is how to use knowledge of evolutionary domain recombination to characterize families of proteins whose evolutionary history includes such recombination, to discover novel proteins, and to infer protein-protein interactions. In this paper we review techniques and databases that exploit our growing knowledge of "horizontal" protein evolution, and suggest possible areas of future development. We illustrate the power of the domain-based methods and the possible directions of future development by a case history in progress aiming at facilitating a particular approach to understanding microbial pathogenicity.
KW - Domain interactions
KW - Domain-based discovery
KW - Domain-based orthology
KW - Interpro
KW - Meme/mast
UR - http://www.scopus.com/inward/record.url?scp=75749109341&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=75749109341&partnerID=8YFLogxK
U2 - 10.1007/s11390-010-9307-3
DO - 10.1007/s11390-010-9307-3
M3 - Article
AN - SCOPUS:75749109341
SN - 1000-9000
VL - 25
SP - 82
EP - 94
JO - Journal of Computer Science and Technology
JF - Journal of Computer Science and Technology
IS - 1
ER -