Chromosome-level scaffolding of haplotype-resolved assemblies using Hi-C data without reference genomes.
Scaffolding is crucial for constructing most chromosome-level genomes. The high-throughput chromatin conformation capture (Hi-C) technology has become the primary scaffolding strategy due to its convenience and cost-effectiveness. As sequencing technologies and assembly algorithms advance, constructing haplotype-resolved genomes is increasingly preferred because haplotypes can provide additional genetic information on allelic and non-allelic variations. ALLHiC is a widely used allele-aware scaffolding tool designed for this purpose. However, its dependence on chromosome-level reference genomes and a higher chromosome misassignment rate still impede the unravelling of haplotype-resolved genomes. Here we present HapHiC, a reference-independent allele-aware scaffolding tool with superior performance on chromosome assignment as well as contig ordering and orientation. In addition, we provide new insights into the challenges in allele-aware scaffolding by conducting comprehensive analyses on various adverse factors. Finally, with the help of HapHiC, we constructed the haplotype-resolved allotriploid genome for Miscanthus × giganteus, an important lignocellulosic bioenergy crop.
Duke Scholars
Published In
DOI
EISSN
Publication Date
Volume
Issue
Start / End Page
Location
Related Subject Headings
- Poaceae
- High-Throughput Nucleotide Sequencing
- Haplotypes
- Genome, Plant
- Chromosomes, Plant
- Chromatin
- Alleles
- 3108 Plant biology
- 3103 Ecology
Citation
Published In
DOI
EISSN
Publication Date
Volume
Issue
Start / End Page
Location
Related Subject Headings
- Poaceae
- High-Throughput Nucleotide Sequencing
- Haplotypes
- Genome, Plant
- Chromosomes, Plant
- Chromatin
- Alleles
- 3108 Plant biology
- 3103 Ecology