Categorical spectral analysis of periodicity in human and viral genomes

Elizabeth D. Howe, Jun S. Song

Research output: Contribution to journalArticle

Abstract

Periodicity in nucleotide sequences arises from regular repeating patterns which may reflect important structure and function. Although a three-base periodicity in coding regions has been known for some time and has provided the basis for powerful gene prediction algorithms, its origins are still not fully understood. Here, we show that, contrary to common belief, amino acid (AA) bias and codon usage bias are insufficient to create base-3 periodicity. This article applies the rigorous method of spectral envelope to systematically characterize the contributions of codon bias, AA bias and protein structural motifs to the three-base periodicity of coding sequences. The method is also used to classify CpG islands in the human genome. In addition, we show how spectral envelope can be used to trace the evolution of viral genomes and monitor global sequence changes without having to align to previously known genomes. This approach also detects reassortment events, such as those that led to the 2009 pandemic H1N1 virus.

Original languageEnglish (US)
Pages (from-to)1395-1405
Number of pages11
JournalNucleic acids research
Volume41
Issue number3
DOIs
StatePublished - Feb 2013

ASJC Scopus subject areas

  • Genetics

Fingerprint Dive into the research topics of 'Categorical spectral analysis of periodicity in human and viral genomes'. Together they form a unique fingerprint.

  • Cite this