Modal codon usage: Assessing the typical codon usage of a genome

James J. Davis, Gary J. Olsen

Research output: Contribution to journal › Review article

Abstract

Most genomes are heterogeneous in codon usage, so a codon usage study should start by defining the codon usage that is typical to the genome. Although this is commonly taken to be the genomewide average, we propose that the mode (the codon usage that matches the most genes) provides a more useful approximation of the typical codon usage of a genome. We provide a method for estimating the modal codon usage, which utilizes a continuous approximation to the number of matching genes and a simplex optimization. In a survey of bacterial and archaeal genomes, as many as 20% more of the genes in a given genome match the modal codon usage than the average codon usage. We use the mode to examine the evolution of the multireplicon genomes of Agrobacterium tumefaciens C58 and Borrelia burgdorferi B31. In A. tumefaciens, the circular and linear chromosomes are characterized by a common "chromosome-like" codon usage, whereas both plasmids share a distinct "plasmid-like" codon usage. In B. burgdorferi, in addition to different codon-usage biases on the leading and lagging strands of DNA replication found by McInerney (McInerney JO. 1998. Replicational and transcriptional selection on codon usage in Borrelia burgdorferi. Proc Natl Acad Sci USA. 95:10698-10703), we also detect a codon-usage similarity between linear plasmid lp38 and the leading strand of the chromosome and a high similarity among the cp32 family of plasmids.
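The difference between the average and the mode can be illustrated with a toy sketch. It is not the authors' implementation: the abstract describes a continuous approximation to the number of matching genes and a simplex optimization, whereas this sketch assumes a single amino acid with two synonymous codons, a hypothetical Gaussian-shaped match score with an arbitrary width, and a plain grid search in place of the simplex search. The function names and the gene frequencies are invented for illustration.

```python
import math


def smooth_match_count(p, gene_freqs, width=0.05):
    """Continuous stand-in for 'number of genes matching usage p'.

    Each gene contributes a value near 1 when its codon frequency is close
    to p and near 0 when it is far away. The Gaussian kernel and its width
    are assumptions of this sketch, not taken from the paper.
    """
    return sum(math.exp(-((g - p) / width) ** 2) for g in gene_freqs)


def average_usage(gene_freqs):
    """Plain mean of the per-gene codon frequencies."""
    return sum(gene_freqs) / len(gene_freqs)


def modal_usage(gene_freqs, steps=1000, width=0.05):
    """Grid search over p in [0, 1] for the usage with the highest score.

    A grid search is used here only because the toy problem is
    one-dimensional; it replaces the simplex optimization named in the
    abstract.
    """
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=lambda p: smooth_match_count(p, gene_freqs, width))


# Hypothetical genome: fraction of one synonymous codon in each gene.
# Most genes cluster near 0.30; a minority of atypical genes sit near 0.80.
genes = [0.28, 0.29, 0.30, 0.30, 0.31, 0.32, 0.33, 0.78, 0.80, 0.82]

avg = average_usage(genes)
mode = modal_usage(genes)
print(f"average usage: {avg:.3f}, score {smooth_match_count(avg, genes):.2f}")
print(f"modal usage:   {mode:.3f}, score {smooth_match_count(mode, genes):.2f}")
```

In this toy genome the average falls between the two clusters of genes and matches almost none of them, while the mode sits inside the larger cluster.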

Original language: English (US)
Pages (from-to): 800-810
Number of pages: 11
Journal: Molecular Biology and Evolution
Volume: 27
Issue number: 4
DOI: 10.1093/molbev/msp281
State: Published - Apr 1 2010

Fingerprint

codons
genome
plasmids
chromosomes
genes
Borrelia burgdorferi
Agrobacterium tumefaciens
Agrobacterium radiobacter
Archaeal Genome
Bacterial Genomes
DNA
Initiator Codon

Keywords

  • Codon adaptation index
  • Correspondence analysis
  • Horizontal gene transfer

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Genetics

Cite this

Modal codon usage: Assessing the typical codon usage of a genome. / Davis, James J.; Olsen, Gary J.

In: Molecular biology and evolution, Vol. 27, No. 4, 01.04.2010, p. 800-810.


@article{f164f3eb1c38480dae8c803697b5218b,
title = "Modal codon usage: Assessing the typical codon usage of a genome",
abstract = "Most genomes are heterogeneous in codon usage, so a codon usage study should start by defining the codon usage that is typical to the genome. Although this is commonly taken to be the genomewide average, we propose that the mode (the codon usage that matches the most genes) provides a more useful approximation of the typical codon usage of a genome. We provide a method for estimating the modal codon usage, which utilizes a continuous approximation to the number of matching genes and a simplex optimization. In a survey of bacterial and archaeal genomes, as many as 20{\%} more of the genes in a given genome match the modal codon usage than the average codon usage. We use the mode to examine the evolution of the multireplicon genomes of Agrobacterium tumefaciens C58 and Borrelia burgdorferi B31. In A. tumefaciens, the circular and linear chromosomes are characterized by a common {"}chromosome-like{"} codon usage, whereas both plasmids share a distinct {"}plasmid-like{"} codon usage. In B. burgdorferi, in addition to different codon-usage biases on the leading and lagging strands of DNA replication found by McInerney (McInerney JO. 1998. Replicational and transcriptional selection on codon usage in Borrelia burgdorferi. Proc Natl Acad Sci USA. 95:10698-10703), we also detect a codon-usage similarity between linear plasmid lp38 and the leading strand of the chromosome and a high similarity among the cp32 family of plasmids.",
keywords = "Codon adaptation index, Correspondence analysis, Horizontal gene transfer",
author = "Davis, {James J.} and Olsen, {Gary J.}",
year = "2010",
month = "4",
day = "1",
doi = "10.1093/molbev/msp281",
language = "English (US)",
volume = "27",
pages = "800--810",
journal = "Molecular Biology and Evolution",
issn = "0737-4038",
publisher = "Oxford University Press",
number = "4",

}

TY  - JOUR
T1  - Modal codon usage
T2  - Assessing the typical codon usage of a genome
AU  - Davis, James J.
AU  - Olsen, Gary J.
PY  - 2010/4/1
Y1  - 2010/4/1
N2  - Most genomes are heterogeneous in codon usage, so a codon usage study should start by defining the codon usage that is typical to the genome. Although this is commonly taken to be the genomewide average, we propose that the mode (the codon usage that matches the most genes) provides a more useful approximation of the typical codon usage of a genome. We provide a method for estimating the modal codon usage, which utilizes a continuous approximation to the number of matching genes and a simplex optimization. In a survey of bacterial and archaeal genomes, as many as 20% more of the genes in a given genome match the modal codon usage than the average codon usage. We use the mode to examine the evolution of the multireplicon genomes of Agrobacterium tumefaciens C58 and Borrelia burgdorferi B31. In A. tumefaciens, the circular and linear chromosomes are characterized by a common "chromosome-like" codon usage, whereas both plasmids share a distinct "plasmid-like" codon usage. In B. burgdorferi, in addition to different codon-usage biases on the leading and lagging strands of DNA replication found by McInerney (McInerney JO. 1998. Replicational and transcriptional selection on codon usage in Borrelia burgdorferi. Proc Natl Acad Sci USA. 95:10698-10703), we also detect a codon-usage similarity between linear plasmid lp38 and the leading strand of the chromosome and a high similarity among the cp32 family of plasmids.
AB  - Most genomes are heterogeneous in codon usage, so a codon usage study should start by defining the codon usage that is typical to the genome. Although this is commonly taken to be the genomewide average, we propose that the mode (the codon usage that matches the most genes) provides a more useful approximation of the typical codon usage of a genome. We provide a method for estimating the modal codon usage, which utilizes a continuous approximation to the number of matching genes and a simplex optimization. In a survey of bacterial and archaeal genomes, as many as 20% more of the genes in a given genome match the modal codon usage than the average codon usage. We use the mode to examine the evolution of the multireplicon genomes of Agrobacterium tumefaciens C58 and Borrelia burgdorferi B31. In A. tumefaciens, the circular and linear chromosomes are characterized by a common "chromosome-like" codon usage, whereas both plasmids share a distinct "plasmid-like" codon usage. In B. burgdorferi, in addition to different codon-usage biases on the leading and lagging strands of DNA replication found by McInerney (McInerney JO. 1998. Replicational and transcriptional selection on codon usage in Borrelia burgdorferi. Proc Natl Acad Sci USA. 95:10698-10703), we also detect a codon-usage similarity between linear plasmid lp38 and the leading strand of the chromosome and a high similarity among the cp32 family of plasmids.
KW  - Codon adaptation index
KW  - Correspondence analysis
KW  - Horizontal gene transfer
UR  - http://www.scopus.com/inward/record.url?scp=77952314202&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=77952314202&partnerID=8YFLogxK
U2  - 10.1093/molbev/msp281
DO  - 10.1093/molbev/msp281
M3  - Review article
C2  - 20018979
AN  - SCOPUS:77952314202
VL  - 27
SP  - 800
EP  - 810
JO  - Molecular Biology and Evolution
JF  - Molecular Biology and Evolution
SN  - 0737-4038
IS  - 4
ER  - 