Accurate detection of complex structural variations using single molecule sequencing
Citations Over Time
Abstract
Abstract Structural variations (SVs) are the largest source of genetic variation, but remain poorly understood because of limited genomics technology. Single molecule long read sequencing from Pacific Biosciences and Oxford Nanopore has the potential to dramatically advance the field, although their high error rates challenge existing methods. Addressing this need, we introduce open-source methods for long read alignment (NGMLR, https://github.com/philres/ngmlr ) and SV identification (Sniffles, https://github.com/fritzsedlazeck/Sniffles ) that enable unprecedented SV sensitivity and precision, including within repeat-rich regions and of complex nested events that can have significant impact on human disorders. Examining several datasets, including healthy and cancerous human genomes, we discover thousands of novel variants using long reads and categorize systematic errors in short-read approaches. NGMLR and Sniffles are further able to automatically filter false events and operate on low amounts of coverage to address the cost factor that has hindered the application of long reads in clinical and research settings.
Related Papers
- → Mapping and phasing of structural variation in patient genomes using nanopore sequencing(2017)436 cited
- → NanoVar: accurate characterization of patients’ genomic structural variants using low-depth nanopore sequencing(2020)130 cited
- → Picky comprehensively detects high-resolution structural variants in nanopore long reads(2018)126 cited
- → Evaluation of Germline Structural Variant Calling Methods for Nanopore Sequencing Data(2021)30 cited
- → Initial Analysis of Structural Variation Detections in Cattle Using Long-Read Sequencing Methods(2022)8 cited