Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data

Xingtan Zhang, Shengcheng Zhang, Qian Zhao, Ray R Ming, Haibao Tang

Research output: Contribution to journal (Article)

Abstract

Construction of chromosome-level assembly is a vital step in achieving the goal of a ‘Platinum’ genome, but it remains a major challenge to assemble and anchor sequences to chromosomes in autopolyploid or highly heterozygous genomes. High-throughput chromosome conformation capture (Hi-C) technology serves as a robust tool to dramatically advance chromosome scaffolding; however, existing approaches are mostly designed for diploid genomes and often with the aim of reconstructing a haploid representation, thereby having limited power to reconstruct chromosomes for autopolyploid genomes. We developed a novel algorithm (ALLHiC) that is capable of building allele-aware, chromosomal-scale assembly for autopolyploid genomes using Hi-C paired-end reads with innovative ‘prune’ and ‘optimize’ steps. Application on simulated data showed that ALLHiC can phase allelic contigs and substantially improve ordering and orientation when compared to other mainstream Hi-C assemblers. We applied ALLHiC on an autotetraploid and an autooctoploid sugar-cane genome and successfully constructed the phased chromosomal-level assemblies, revealing allelic variations present in these two genomes. The ALLHiC pipeline enables de novo chromosome-level assembly of autopolyploid genomes, separating each allele. Haplotype chromosome-level assembly of allopolyploid and heterozygous diploid genomes can be achieved using ALLHiC, overcoming obstacles in assembling complex genomes.
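The abstract names a 'prune' step but does not describe it. As a rough illustration only, the sketch below assumes that pruning means discarding Hi-C links between contigs already known to be alleles of one another, so that later clustering is not pulled together by allelic cross-signal. The function name `prune_allelic_links`, the link dictionary, and the allele groups are all hypothetical and are not taken from ALLHiC itself.

```python
def prune_allelic_links(links, allele_groups):
    """Toy sketch (assumption, not the ALLHiC implementation).

    links: dict mapping (contig_a, contig_b) -> Hi-C read-pair count
    allele_groups: iterable of sets of contig names assumed to be allelic
    Returns a new dict without links that join two contigs of one group.
    """
    # Map each contig to the index of its (hypothetical) allele group.
    group_of = {}
    for idx, group in enumerate(allele_groups):
        for contig in group:
            group_of[contig] = idx

    kept = {}
    for (a, b), count in links.items():
        ga, gb = group_of.get(a), group_of.get(b)
        # Skip links whose two ends fall in the same allele group.
        if ga is not None and ga == gb:
            continue
        kept[(a, b)] = count
    return kept


# Made-up example data: ctg1/ctg2 and ctg3/ctg4 are assumed allelic pairs.
links = {
    ("ctg1", "ctg2"): 40,
    ("ctg1", "ctg3"): 25,
    ("ctg2", "ctg4"): 30,
    ("ctg3", "ctg4"): 5,
}
allele_groups = [{"ctg1", "ctg2"}, {"ctg3", "ctg4"}]
pruned = prune_allelic_links(links, allele_groups)
```

In this toy input, only the links between non-allelic contigs survive. A real pipeline would derive the allele groups from sequence similarity and would work on mapped read pairs, not on a hand-written dictionary.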

Original language: English (US)
Pages (from-to): 833-845
Number of pages: 13
Journal: Nature Plants
Volume: 5
Issue number: 8
DOI: 10.1038/s41477-019-0487-8
State: Published - Aug 1 2019

Fingerprint

autopolyploidy
alleles
genome
chromosomes
diploidy
prunes
platinum
allopolyploidy
haploidy
sugarcane
haplotypes

ASJC Scopus subject areas

  • Plant Science

Cite this

Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data. / Zhang, Xingtan; Zhang, Shengcheng; Zhao, Qian; Ming, Ray R; Tang, Haibao.

In: Nature Plants, Vol. 5, No. 8, 01.08.2019, p. 833-845.

@article{914fc67bf2db4b1da586c45ef96afdd1,
title = "Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data",
abstract = "Construction of chromosome-level assembly is a vital step in achieving the goal of a ‘Platinum’ genome, but it remains a major challenge to assemble and anchor sequences to chromosomes in autopolyploid or highly heterozygous genomes. High-throughput chromosome conformation capture (Hi-C) technology serves as a robust tool to dramatically advance chromosome scaffolding; however, existing approaches are mostly designed for diploid genomes and often with the aim of reconstructing a haploid representation, thereby having limited power to reconstruct chromosomes for autopolyploid genomes. We developed a novel algorithm (ALLHiC) that is capable of building allele-aware, chromosomal-scale assembly for autopolyploid genomes using Hi-C paired-end reads with innovative ‘prune’ and ‘optimize’ steps. Application on simulated data showed that ALLHiC can phase allelic contigs and substantially improve ordering and orientation when compared to other mainstream Hi-C assemblers. We applied ALLHiC on an autotetraploid and an autooctoploid sugar-cane genome and successfully constructed the phased chromosomal-level assemblies, revealing allelic variations present in these two genomes. The ALLHiC pipeline enables de novo chromosome-level assembly of autopolyploid genomes, separating each allele. Haplotype chromosome-level assembly of allopolyploid and heterozygous diploid genomes can be achieved using ALLHiC, overcoming obstacles in assembling complex genomes.",
author = "Xingtan Zhang and Shengcheng Zhang and Qian Zhao and Ming, {Ray R} and Haibao Tang",
year = "2019",
month = "8",
day = "1",
doi = "10.1038/s41477-019-0487-8",
language = "English (US)",
volume = "5",
pages = "833--845",
journal = "Nature Plants",
issn = "2055-026X",
publisher = "Palgrave Macmillan Ltd.",
number = "8",

}

TY - JOUR

T1 - Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data

AU - Zhang, Xingtan

AU - Zhang, Shengcheng

AU - Zhao, Qian

AU - Ming, Ray R

AU - Tang, Haibao

PY - 2019/8/1

Y1 - 2019/8/1

N2 - Construction of chromosome-level assembly is a vital step in achieving the goal of a ‘Platinum’ genome, but it remains a major challenge to assemble and anchor sequences to chromosomes in autopolyploid or highly heterozygous genomes. High-throughput chromosome conformation capture (Hi-C) technology serves as a robust tool to dramatically advance chromosome scaffolding; however, existing approaches are mostly designed for diploid genomes and often with the aim of reconstructing a haploid representation, thereby having limited power to reconstruct chromosomes for autopolyploid genomes. We developed a novel algorithm (ALLHiC) that is capable of building allele-aware, chromosomal-scale assembly for autopolyploid genomes using Hi-C paired-end reads with innovative ‘prune’ and ‘optimize’ steps. Application on simulated data showed that ALLHiC can phase allelic contigs and substantially improve ordering and orientation when compared to other mainstream Hi-C assemblers. We applied ALLHiC on an autotetraploid and an autooctoploid sugar-cane genome and successfully constructed the phased chromosomal-level assemblies, revealing allelic variations present in these two genomes. The ALLHiC pipeline enables de novo chromosome-level assembly of autopolyploid genomes, separating each allele. Haplotype chromosome-level assembly of allopolyploid and heterozygous diploid genomes can be achieved using ALLHiC, overcoming obstacles in assembling complex genomes.

UR - http://www.scopus.com/inward/record.url?scp=85070278390&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85070278390&partnerID=8YFLogxK

U2 - 10.1038/s41477-019-0487-8

DO - 10.1038/s41477-019-0487-8

M3 - Article

C2 - 31383970

AN - SCOPUS:85070278390

VL - 5

SP - 833

EP - 845

JO - Nature Plants

JF - Nature Plants

SN - 2055-026X

IS - 8

ER -