Characterization of an EST database for the perennial weed leafy spurge: An important resource for weed biology research

James V. Anderson, David P. Horvath, Wun S. Chao, Michael E. Foley, Alvaro G. Hernandez, Jyothi Thimmapuram, Lie Liu, George L. Gong, Mark Band, Ryan Kim, Mark A. Mikel

Research output: Contribution to journalArticlepeer-review


Genomics programs in the weed science community have not developed as rapidly as that of other crop, horticultural, forestry, and model plant systems. Development of genomic resources for selected model weeds are expected to enhance our understanding of weed biology, just as they have in other plant systems. In this report, we describe the development, characteristics, and information gained from an expressed sequence tag (EST) database for the perennial weed leafy spurge. ESTs were obtained using a normalized cDNA library prepared from a comprehensive collection of tissues. During the EST characterization process, redundancy was minimized by periodic subtractions of the normalized cDNA library. A sequencing success rate of 88% yielded 45,314 ESTs with an average read length of 671 nucleotides. Using bioinformatic analysis, the leafy spurge EST database was assembled into 23,472 unique sequences representing 19,015 unigenes (10,293 clusters and 8,722 singletons). Blast similarity searches to the GenBank nonredundant protein database identified 18,186 total matches, of which 14,205 were nonredundant. These data indicate that 77.4% of the 23,472 unique sequences and 74.7% of the 19,015 unigenes are similar to other known proteins. Further bioinformatics analysis indicated that 2,950, or 15.5%, of the unigenes have previously not been identified suggesting that some may be novel to leafy spurge. Functional classifications assigned to leafy spurge unique sequences using Munich Information Center for Protein or Gene Ontology were proportional to functional classifications for genes of arabidopsis, with the exception of unclassified or unknowns and transposable elements which were significantly reduced in leafy spurge. Although these EST resources have been developed for the purpose of constructing high-density leafy spurge microarrays, they are already providing valuable information related to sugar metabolism, cell cycle regulation, dormancy, terpenoid secondary metabolism, and flowering.

Original languageEnglish (US)
Pages (from-to)193-203
Number of pages11
JournalWeed Science
Issue number3
StatePublished - May 1 2007


  • Expressed sequence tag
  • Genomics
  • Leafy spurge
  • Perennial weeds

ASJC Scopus subject areas

  • Agronomy and Crop Science
  • Plant Science

Fingerprint Dive into the research topics of 'Characterization of an EST database for the perennial weed leafy spurge: An important resource for weed biology research'. Together they form a unique fingerprint.

Cite this