Skip to main content
construction release_alert
The Scholars Team is working with OIT to resolve some issues with the Scholars search index
cancel

Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C.

Publication ,  Journal Article
Kronenberg, ZN; Rhie, A; Koren, S; Concepcion, GT; Peluso, P; Munson, KM; Porubsky, D; Kuhn, K; Mueller, KA; Low, WY; Hiendleder, S; Hall, RJ ...
Published in: Nat Commun
April 28, 2021

Haplotype-resolved genome assemblies are important for understanding how combinations of variants impact phenotypes. To date, these assemblies have been best created with complex protocols, such as cultured cells that contain a single-haplotype (haploid) genome, single cells where haplotypes are separated, or co-sequencing of parental genomes in a trio-based approach. These approaches are impractical in most situations. To address this issue, we present FALCON-Phase, a phasing tool that uses ultra-long-range Hi-C chromatin interaction data to extend phase blocks of partially-phased diploid assembles to chromosome or scaffold scale. FALCON-Phase uses the inherent phasing information in Hi-C reads, skipping variant calling, and reduces the computational complexity of phasing. Our method is validated on three benchmark datasets generated as part of the Vertebrate Genomes Project (VGP), including human, cow, and zebra finch, for which high-quality, fully haplotype-resolved assemblies are available using the trio-based approach. FALCON-Phase is accurate without having parental data and performance is better in samples with higher heterozygosity. For cow and zebra finch the accuracy is 97% compared to 80-91% for human. FALCON-Phase is applicable to any draft assembly that contains long primary contigs and phased associate contigs.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Nat Commun

DOI

EISSN

2041-1723

Publication Date

April 28, 2021

Volume

12

Issue

1

Start / End Page

1935

Location

England

Related Subject Headings

  • Zebrafish
  • Sequence Analysis, DNA
  • Polymorphism, Single Nucleotide
  • Humans
  • High-Throughput Nucleotide Sequencing
  • Haplotypes
  • Genome, Human
  • Contig Mapping
  • Cattle
  • Animals
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Kronenberg, Z. N., Rhie, A., Koren, S., Concepcion, G. T., Peluso, P., Munson, K. M., … Kingan, S. B. (2021). Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C. Nat Commun, 12(1), 1935. https://doi.org/10.1038/s41467-020-20536-y
Kronenberg, Zev N., Arang Rhie, Sergey Koren, Gregory T. Concepcion, Paul Peluso, Katherine M. Munson, David Porubsky, et al. “Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C.Nat Commun 12, no. 1 (April 28, 2021): 1935. https://doi.org/10.1038/s41467-020-20536-y.
Kronenberg ZN, Rhie A, Koren S, Concepcion GT, Peluso P, Munson KM, et al. Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C. Nat Commun. 2021 Apr 28;12(1):1935.
Kronenberg, Zev N., et al. “Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C.Nat Commun, vol. 12, no. 1, Apr. 2021, p. 1935. Pubmed, doi:10.1038/s41467-020-20536-y.
Kronenberg ZN, Rhie A, Koren S, Concepcion GT, Peluso P, Munson KM, Porubsky D, Kuhn K, Mueller KA, Low WY, Hiendleder S, Fedrigo O, Liachko I, Hall RJ, Phillippy AM, Eichler EE, Williams JL, Smith TPL, Jarvis ED, Sullivan ST, Kingan SB. Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C. Nat Commun. 2021 Apr 28;12(1):1935.

Published In

Nat Commun

DOI

EISSN

2041-1723

Publication Date

April 28, 2021

Volume

12

Issue

1

Start / End Page

1935

Location

England

Related Subject Headings

  • Zebrafish
  • Sequence Analysis, DNA
  • Polymorphism, Single Nucleotide
  • Humans
  • High-Throughput Nucleotide Sequencing
  • Haplotypes
  • Genome, Human
  • Contig Mapping
  • Cattle
  • Animals