TY - JOUR
T1 - Efficient Sequencing, Assembly, and Annotation of Human KIR Haplotypes
AU - Roe, David
AU - Williams, Jonathan
AU - Ivery, Keyton
AU - Brouckaert, Jenny
AU - Downey, Nick
AU - Locklear, Chad
AU - Kuang, Rui
AU - Maiers, Martin
N1 - Publisher Copyright:
© Copyright © 2020 Roe, Williams, Ivery, Brouckaert, Downey, Locklear, Kuang and Maiers.
PY - 2020/10/9
Y1 - 2020/10/9
N2 - The homology, recombination, variation, and repetitive elements in the natural killer-cell immunoglobulin-like receptor (KIR) region has made full haplotype DNA interpretation impossible in a high-throughput workflow. Here, we present a new approach using long-read sequencing to efficiently capture, sequence, and assemble diploid human KIR haplotypes. Probes were designed to capture KIR fragments efficiently by leveraging the repeating homology of the region. IDT xGen® Lockdown probes were used to capture 2–8 kb of sheared DNA fragments followed by sequencing on a PacBio Sequel. The sequences were error corrected, binned, and then assembled using the Canu assembler. The location of genes and their exon/intron boundaries are included in the workflow. The assembly and annotation was evaluated on 16 individuals (8 African American and 8 Europeans) from whom ground truth was known via long-range sequencing with fosmid library preparation. Using only 18 capture probes, the results show that the assemblies cover 97% of the GenBank reference, are 99.97% concordant, and it takes only 1.8 haplotigs to cover 75% of the reference. We also report the first assembly of diploid KIR haplotypes from long-read WGS. Our targeted hybridization probe capture and sequencing approach is the first of its kind to fully sequence and phase all diploid human KIR haplotypes, and it is efficient enough for population-scale studies and clinical use. The open and free software is available at https://github.com/droeatumn/kass and supported by a environment at https://hub.docker.com/repository/docker/droeatumn/kass.
AB - The homology, recombination, variation, and repetitive elements in the natural killer-cell immunoglobulin-like receptor (KIR) region has made full haplotype DNA interpretation impossible in a high-throughput workflow. Here, we present a new approach using long-read sequencing to efficiently capture, sequence, and assemble diploid human KIR haplotypes. Probes were designed to capture KIR fragments efficiently by leveraging the repeating homology of the region. IDT xGen® Lockdown probes were used to capture 2–8 kb of sheared DNA fragments followed by sequencing on a PacBio Sequel. The sequences were error corrected, binned, and then assembled using the Canu assembler. The location of genes and their exon/intron boundaries are included in the workflow. The assembly and annotation was evaluated on 16 individuals (8 African American and 8 Europeans) from whom ground truth was known via long-range sequencing with fosmid library preparation. Using only 18 capture probes, the results show that the assemblies cover 97% of the GenBank reference, are 99.97% concordant, and it takes only 1.8 haplotigs to cover 75% of the reference. We also report the first assembly of diploid KIR haplotypes from long-read WGS. Our targeted hybridization probe capture and sequencing approach is the first of its kind to fully sequence and phase all diploid human KIR haplotypes, and it is efficient enough for population-scale studies and clinical use. The open and free software is available at https://github.com/droeatumn/kass and supported by a environment at https://hub.docker.com/repository/docker/droeatumn/kass.
KW - DNA
KW - annotation
KW - assembly
KW - haplotype
KW - killer-cell immunoglobulin-like receptor
KW - natural killer
UR - http://www.scopus.com/inward/record.url?scp=85094110671&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85094110671&partnerID=8YFLogxK
U2 - 10.3389/fimmu.2020.582927
DO - 10.3389/fimmu.2020.582927
M3 - Article
C2 - 33162997
AN - SCOPUS:85094110671
SN - 1664-3224
VL - 11
JO - Frontiers in immunology
JF - Frontiers in immunology
M1 - 582927
ER -