TY - JOUR
T1 - Validation of a comprehensive long-read sequencing platform for broad clinical genetic diagnosis
AU - Sen, Siddhartha
AU - Handler, Hillary P.
AU - Victorsen, Alec
AU - Flaten, Zach
AU - Ellison, Aidan
AU - Knutson, Todd P.
AU - Munro, Sarah A.
AU - Martinez, Ryan J.
AU - Billington, Charles John
AU - Laffin, Jennifer J.
AU - Bray, Sarah
AU - Mroz, Pawel
AU - Yohe, Sophia
AU - Nelson, Andrew C.
AU - Bower, Matthew
AU - Thyagarajan, Bharat
N1 - Publisher Copyright:
Copyright © 2025 Sen, Handler, Victorsen, Flaten, Ellison, Knutson, Munro, Martinez, Billington, Laffin, Bray, Mroz, Yohe, Nelson, Bower and Thyagarajan.
PY - 2025
Y1 - 2025
N2 - Though short read high-throughput sequencing, commonly known as Next-Generation Sequencing (NGS), has revolutionized genomics and genetic testing, there is no single genetic test that can accurately detect single nucleotide variants (SNVs), small insertions/deletions (indels), complex structural variants (SVs), repetitive genomic alterations, and variants in genes with highly homologous pseudogenes. The implementation of a unified comprehensive technique that can simultaneously detect a broad spectrum of genetic variation would substantially increase efficiency of the diagnostic process. The current study evaluated the clinical utility of long-read sequencing as a comprehensive genetic test for diagnosis of inherited conditions. Using Oxford Nanopore Technologies long read nanopore sequencing, we successfully developed and validated a clinically deployable integrated bioinformatics pipeline that utilizes a combination of eight publicly available variant callers. A concordance assessment comparing the known variant calls from a well-characterized, benchmarked sample called NA12878 from the National Institute of Standards and Technology (NIST) with the variants detected by our pipeline for this sample, determined that the analytical sensitivity of our pipeline was 98.87% and the analytical specificity exceeded 99.99%. We then evaluated our pipeline’s ability to detect 167 clinically relevant variants from 72 clinical samples. This set of variants consisted of 80 SNVs, 26 indels, 32 SVs, and 29 repeat expansions, including 14 variants in genes with highly homologous pseudogenes. The overall detection concordance for these clinically relevant variants was 99.4% (95% CI: 99.7%–99.9%). Importantly, in addition to detecting known clinically relevant variants, in four cases, our pipeline yielded valuable additional information in support of clinical diagnoses that could not have been established using short-read NGS alone. Our findings suggest that long-read sequencing is successful in identifying diverse genomic alterations and that our pipeline functions well as the basis for a single diagnostic test for patients with suspected genetic disease.
AB - Though short read high-throughput sequencing, commonly known as Next-Generation Sequencing (NGS), has revolutionized genomics and genetic testing, there is no single genetic test that can accurately detect single nucleotide variants (SNVs), small insertions/deletions (indels), complex structural variants (SVs), repetitive genomic alterations, and variants in genes with highly homologous pseudogenes. The implementation of a unified comprehensive technique that can simultaneously detect a broad spectrum of genetic variation would substantially increase efficiency of the diagnostic process. The current study evaluated the clinical utility of long-read sequencing as a comprehensive genetic test for diagnosis of inherited conditions. Using Oxford Nanopore Technologies long read nanopore sequencing, we successfully developed and validated a clinically deployable integrated bioinformatics pipeline that utilizes a combination of eight publicly available variant callers. A concordance assessment comparing the known variant calls from a well-characterized, benchmarked sample called NA12878 from the National Institute of Standards and Technology (NIST) with the variants detected by our pipeline for this sample, determined that the analytical sensitivity of our pipeline was 98.87% and the analytical specificity exceeded 99.99%. We then evaluated our pipeline’s ability to detect 167 clinically relevant variants from 72 clinical samples. This set of variants consisted of 80 SNVs, 26 indels, 32 SVs, and 29 repeat expansions, including 14 variants in genes with highly homologous pseudogenes. The overall detection concordance for these clinically relevant variants was 99.4% (95% CI: 99.7%–99.9%). Importantly, in addition to detecting known clinically relevant variants, in four cases, our pipeline yielded valuable additional information in support of clinical diagnoses that could not have been established using short-read NGS alone. Our findings suggest that long-read sequencing is successful in identifying diverse genomic alterations and that our pipeline functions well as the basis for a single diagnostic test for patients with suspected genetic disease.
KW - Oxford Nanopore Technologies
KW - Tandem repeat expansions
KW - clinical genomics
KW - complex structural variants
KW - long-read sequencing
KW - whole genome sequencing
UR - http://www.scopus.com/inward/record.url?scp=105005407611&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=105005407611&partnerID=8YFLogxK
U2 - 10.3389/fgene.2025.1499456
DO - 10.3389/fgene.2025.1499456
M3 - Article
C2 - 40385982
AN - SCOPUS:105005407611
SN - 1664-8021
VL - 16
JO - Frontiers in Genetics
JF - Frontiers in Genetics
M1 - 1499456
ER -