TY - JOUR
T1 - An approach of orthology detection from homologous sequences under minimum evolution.
AU - Kim, Kyung Mo
AU - Sung, Samsun
AU - Caetano-Anollés, Gustavo
AU - Han, Jae Yong
AU - Kim, Heebal
N1 - Funding Information:
We are sincerely grateful to Mr Sunjin Moon for providing valuable comments. This work is supported by a grant from BioGreen 21 Program of the Korean Rural Development Administration and by the Brain Korea 21 Project of the Ministry of Education. Funding to pay the Open Access publication charges for this article was provided by the Brain Korea 21 Project.
PY - 2008/10
Y1 - 2008/10
N2 - In the field of phylogenetics and comparative genomics, it is important to establish orthologous relationships when comparing homologous sequences. Due to the slight sequence dissimilarity between orthologs and paralogs, it is prone to regarding paralogs as orthologs. For this reason, several methods based on evolutionary distance, phylogeny and BLAST have tried to detect orthologs with more precision. Depending on their algorithmic implementations, each of these methods sometimes has increased false negative or false positive rates. Here, we developed a novel algorithm for orthology detection that uses a distance method based on the phylogenetic criterion of minimum evolution. Our algorithm assumes that sets of sequences exhibiting orthologous relationships are evolutionarily less costly than sets that include one or more paralogous relationships. Calculation of evolutionary cost requires the reconstruction of a neighbor-joining (NJ) tree, but calculations are unaffected by the topology of any given NJ tree. Unlike tree reconciliation, our algorithm appears free from the problem of incorrect topologies of species and gene trees. The reliability of the algorithm was tested in a comparative analysis with two other orthology detection methods using 95 manually curated KOG datasets and 21 experimentally verified EXProt datasets. Sensitivity and specificity estimates indicate that the concept of minimum evolution could be valuable for the detection of orthologs.
AB - In the field of phylogenetics and comparative genomics, it is important to establish orthologous relationships when comparing homologous sequences. Due to the slight sequence dissimilarity between orthologs and paralogs, it is prone to regarding paralogs as orthologs. For this reason, several methods based on evolutionary distance, phylogeny and BLAST have tried to detect orthologs with more precision. Depending on their algorithmic implementations, each of these methods sometimes has increased false negative or false positive rates. Here, we developed a novel algorithm for orthology detection that uses a distance method based on the phylogenetic criterion of minimum evolution. Our algorithm assumes that sets of sequences exhibiting orthologous relationships are evolutionarily less costly than sets that include one or more paralogous relationships. Calculation of evolutionary cost requires the reconstruction of a neighbor-joining (NJ) tree, but calculations are unaffected by the topology of any given NJ tree. Unlike tree reconciliation, our algorithm appears free from the problem of incorrect topologies of species and gene trees. The reliability of the algorithm was tested in a comparative analysis with two other orthology detection methods using 95 manually curated KOG datasets and 21 experimentally verified EXProt datasets. Sensitivity and specificity estimates indicate that the concept of minimum evolution could be valuable for the detection of orthologs.
UR - http://www.scopus.com/inward/record.url?scp=55249117540&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=55249117540&partnerID=8YFLogxK
U2 - 10.1093/nar/gkn485
DO - 10.1093/nar/gkn485
M3 - Article
C2 - 18676448
AN - SCOPUS:55249117540
SN - 0305-1048
VL - 36
SP - e110
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 17
ER -