Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data

Xingtan Zhang, Shengcheng Zhang, Qian Zhao, Ray Ming, Haibao Tang

Research output: Contribution to journalArticle

Abstract

Construction of chromosome-level assembly is a vital step in achieving the goal of a ‘Platinum’ genome, but it remains a major challenge to assemble and anchor sequences to chromosomes in autopolyploid or highly heterozygous genomes. High-throughput chromosome conformation capture (Hi-C) technology serves as a robust tool to dramatically advance chromosome scaffolding; however, existing approaches are mostly designed for diploid genomes and often with the aim of reconstructing a haploid representation, thereby having limited power to reconstruct chromosomes for autopolyploid genomes. We developed a novel algorithm (ALLHiC) that is capable of building allele-aware, chromosomal-scale assembly for autopolyploid genomes using Hi-C paired-end reads with innovative ‘prune’ and ‘optimize’ steps. Application on simulated data showed that ALLHiC can phase allelic contigs and substantially improve ordering and orientation when compared to other mainstream Hi-C assemblers. We applied ALLHiC on an autotetraploid and an autooctoploid sugar-cane genome and successfully constructed the phased chromosomal-level assemblies, revealing allelic variations present in these two genomes. The ALLHiC pipeline enables de novo chromosome-level assembly of autopolyploid genomes, separating each allele. Haplotype chromosome-level assembly of allopolyploid and heterozygous diploid genomes can be achieved using ALLHiC, overcoming obstacles in assembling complex genomes.

Original languageEnglish (US)
Pages (from-to)833-845
Number of pages13
JournalNature plants
Volume5
Issue number8
DOIs
StatePublished - Aug 1 2019

ASJC Scopus subject areas

  • Plant Science

Fingerprint Dive into the research topics of 'Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data'. Together they form a unique fingerprint.

  • Cite this