An evaluation of information content as a metric for the inference of putative conserved noncoding regions in DNA sequences using a genetic algorithms approach

Clare Bates Congdon, Joseph C. Aman, Gerardo M. Nava, H. Rex Gaskins, Carolyn J. Mattingly

Research output: Contribution to journalArticlepeer-review

Abstract

In previous work, we presented GAMI [1], an approach to motif inference that uses a genetic algorithms search. GAMI is designed specifically to find putative conserved regulatory motifs in noncoding regions of divergent species and is designed to allow for analysis of long nucleotide sequences. In this work, we compare GAMI's performance when run with its original fitness function (a simple count of the number of matches) and when run with information content (IC), as well as several variations on these metrics. Results indicate that IC does not identify highly conserved regions and, thus, is not the appropriate metric for this task, whereas variations on IC, as well as the original metric, succeed in identifying putative conserved regions.

Original languageEnglish (US)
Pages (from-to)1-14
Number of pages14
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume5
Issue number1
DOIs
StatePublished - Jan 2008

Keywords

  • Biology
  • Evolutionary computing
  • Genetic algorithms
  • Genetics

ASJC Scopus subject areas

  • Biotechnology
  • Genetics
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'An evaluation of information content as a metric for the inference of putative conserved noncoding regions in DNA sequences using a genetic algorithms approach'. Together they form a unique fingerprint.

Cite this