Abstract
Reconstructing haplotypes by sequencing a mixture of similar sequences is a fundamental problem. Long-read sequencing technologies can connect distant alleles to disentangle similar haplotypes, but handling elevated sequencing error rates requires specialized techniques. We present devider, an algorithm for haplotyping small sequences, such as viruses or genes, from long-read sequencing. devider uses a positional de Bruijn graph with sequence-to-graph alignment on an alphabet of informative alleles to provide a fast, assembly-inspired approach compatible with various long-read sequencing technologies. Benchmarking on synthetic mixtures of antimicrobial resistance (AMR) genes showed that devider recovered 83% of haplotypes, 23 percentage points higher than the next best method. On real PacBio and Nanopore datasets, devider recapitulates previously known results in seconds, disentangling a bacterial community with >10 strains and an HIV-1 co-infection dataset. We used devider to investigate the within-host diversity of a long-read bovine gut metagenome enriched for AMR genes, discovering a history of recombination for diverse AMR gene haplotypes and showcasing devider's ability to unveil ecological signals for heterogeneous mixtures.
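To make the idea of a positional de Bruijn graph over an alphabet of informative alleles concrete, the following is a minimal Python sketch. It is not devider's implementation. It assumes each read has already been reduced to a string of alleles at consecutive informative sites, and it omits the sequence-to-graph alignment and error handling mentioned in the abstract. The function name `positional_dbg`, the read encoding, and the toy data are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def positional_dbg(reads, k):
    """Build a toy positional de Bruijn graph over allele strings.

    Assumed input (hypothetical encoding): each read is a pair
    (start, alleles), where `start` is the index of the first
    informative site the read covers and `alleles` holds one allele
    character per consecutive site.

    Nodes are (site_index, k-mer) pairs, so identical k-mers at
    different sites stay separate. Edges link the k-mers a read
    visits at adjacent site indices. Values are read-support counts.
    """
    nodes = defaultdict(int)
    edges = defaultdict(int)
    for start, alleles in reads:
        prev = None
        for i in range(len(alleles) - k + 1):
            node = (start + i, alleles[i:i + k])
            nodes[node] += 1
            if prev is not None:
                edges[(prev, node)] += 1
            prev = node
    return dict(nodes), dict(edges)


# Toy mixture of two haplotypes, ACGTA and TCGAA, over five sites.
reads = [
    (0, "ACGTA"),
    (0, "ACGT"),
    (1, "CGTA"),
    (0, "TCGAA"),
    (1, "CGAA"),
]
nodes, edges = positional_dbg(reads, k=3)
```

In this toy input the two haplotypes share the alleles `CG` at sites 1 and 2, yet with `k=3` they form two disjoint paths, because each node carries both its site index and enough flanking alleles to tell the haplotypes apart.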
Original language | English (US) |
---|---|
Title of host publication | Research in Computational Molecular Biology - 29th International Conference, RECOMB 2025, Proceedings |
Editors | Sriram Sankararaman |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 290-293 |
Number of pages | 4 |
ISBN (Print) | 9783031902512 |
State | Published - 2025 |
Event | 29th International Conference on Research in Computational Molecular Biology, RECOMB 2025 - Seoul, Korea, Republic of; Duration: Apr 26 2025 → Apr 29 2025 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Volume | 15647 LNBI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 29th International Conference on Research in Computational Molecular Biology, RECOMB 2025 |
---|---|
Country/Territory | Korea, Republic of |
City | Seoul |
Period | 4/26/25 → 4/29/25 |
Bibliographical note
Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
Keywords
- de Bruijn graph
- genes
- haplotyping
- long-reads
- metagenome
- viruses