Devider: Long-Read Reconstruction of Many Diverse Haplotypes

Jim Shaw, Christina Boucher, Yun William Yu, Noelle Noyes, Heng Li

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Reconstructing haplotypes through sequencing of a mixture of similar sequences is a fundamental problem. Long-read sequencing technologies can connect distant alleles to disentangle similar haplotypes, but handling elevated sequencing error rates requires specialized techniques. We present devider, an algorithm for haplotyping small sequences—such as viruses or genes—from long-read sequencing. devider uses a positional de Bruijn graph with sequence-to-graph alignment on an alphabet of informative alleles to provide a fast assembly-inspired approach compatible with various long-read sequencing technologies. Benchmarking on synthetic mixtures of antimicrobial resistance (AMR) genes showed that devider recovered 83% of haplotypes, 23% points higher than the next best method. On real PacBio and Nanopore datasets, devider recapitulates previously known results in seconds, disentangling a bacterial community with >10 strains and an HIV-1 co-infection dataset. We used devider to investigate the within-host diversity of a long-read bovine gut metagenome enriched for AMR genes, discovering a history of recombination for diverse AMR gene haplotypes and showcasing devider ’s ability to unveil ecological signals for heterogeneous mixtures.

Original languageEnglish (US)
Title of host publicationResearch in Computational Molecular Biology - 29th International Conference, RECOMB 2025, Proceedings
EditorsSriram Sankararaman
PublisherSpringer Science and Business Media Deutschland GmbH
Pages290-293
Number of pages4
ISBN (Print)9783031902512
DOIs
StatePublished - 2025
Event29th International Conference on Research in Computational Molecular Biology, RECOMB 2025 - Seoul, Korea, Republic of
Duration: Apr 26 2025Apr 29 2025

Publication series

NameLecture Notes in Computer Science
Volume15647 LNBI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference29th International Conference on Research in Computational Molecular Biology, RECOMB 2025
Country/TerritoryKorea, Republic of
CitySeoul
Period4/26/254/29/25

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords

  • de Bruijn graph
  • genes
  • haplotyping
  • long-reads
  • metagenome
  • viruses

Fingerprint

Dive into the research topics of 'Devider: Long-Read Reconstruction of Many Diverse Haplotypes'. Together they form a unique fingerprint.

Cite this