5 research outputs found

    Benchmarking long-read genome sequence alignment tools for human genomics applications

    Get PDF
    Background The utility of long-read genome sequencing platforms has been shown in many fields including whole genome assembly, metagenomics, and amplicon sequencing. Less clear is the applicability of long reads to reference-guided human genomics, which is the foundation of genomic medicine. Here, we benchmark available platform-agnostic alignment tools on datasets from nanopore and single-molecule real-time platforms to understand their suitability in producing a genome representation. Results For this study, we leveraged publicly-available data from sample NA12878 generated on Oxford Nanopore and sample NA24385 on Pacific Biosciences platforms. We employed state of the art sequence alignment tools including GraphMap2, long-read aligner (LRA), Minimap2, CoNvex Gap-cost alignMents for Long Reads (NGMLR), and Winnowmap2. Minimap2 and Winnowmap2 were computationally lightweight enough for use at scale, while GraphMap2 was not. NGMLR took a long time and required many resources, but produced alignments each time. LRA was fast, but only worked on Pacific Biosciences data. Each tool widely disagreed on which reads to leave unaligned, affecting the end genome coverage and the number of discoverable breakpoints. No alignment tool independently resolved all large structural variants (1,001–100,000 base pairs) present in the Database of Genome Variants (DGV) for sample NA12878 or the truthset for NA24385. Conclusions These results suggest a combined approach is needed for LRS alignments for human genomics. Specifically, leveraging alignments from three tools will be more effective in generating a complete picture of genomic variability. It should be best practice to use an analysis pipeline that generates alignments with both Minimap2 and Winnowmap2 as they are lightweight and yield different views of the genome. Depending on the question at hand, the data available, and the time constraints, NGMLR and LRA are good options for a third tool. If computational resources and time are not a factor for a given case or experiment, NGMLR will provide another view, and another chance to resolve a case. LRA, while fast, did not work on the nanopore data for our cluster, but PacBio results were promising in that those computations completed faster than Minimap2. Due to its significant burden on computational resources and slow run time, Graphmap2 is not an ideal tool for exploration of a whole human genome generated on a long-read sequencing platform

    New technologies to uncover the molecular basis of disorders of sex development

    No full text
    The elegant developmental biology experiments conducted in the 1940s by French physiologist Alfred Jost demonstrated that the sexual phenotype of a mammalian embryo depended whether the embryonic gonad develops into a testis or not. In humans, anomalies in the processes that regulate development of chromosomal, gonadal or anatomic sex result in a spectrum of conditions termed Disorders/Differences of Sex Development (DSD). Each of these conditions is rare, and understanding of their genetic etiology is still incomplete. Historically, DSD diagnoses have been difficult to establish due to the lack of standardization of anatomical and endocrine phenotyping procedures as well as genetic testing. Yet, a definitive diagnosis is critical for optimal management of the medical and psychosocial challenges associated with DSD conditions. The advent in the clinical realm of next-generation sequencing methods, with constantly decreasing price and turnaround time, has revolutionized the diagnostic process. Here we review the successes and limitations of the genetic methods currently available for DSD diagnosis, including Sanger sequencing, karyotyping, exome sequencing and chromosomal microarrays. While exome sequencing provides higher diagnostic rates, many patients still remain undiagnosed. Newer approaches, such as whole-genome sequencing and whole-genome mapping, along with gene expression studies, have the potential to identify novel DSD-causing genes and significantly increase total diagnostic yield, hopefully shortening the patients journey to an accurate diagnosis and enhancing health-related quality-of-life outcomes for patients and families

    Loss-of-Function Variants in PPP1R12A: From Isolated Sex Reversal to Holoprosencephaly Spectrum and Urogenital Malformations

    No full text
    In two independent ongoing next-generation sequencing projects for individuals with holoprosencephaly and individuals with disorders of sex development, and through international research collaboration, we identified twelve individuals with de novo loss-of-function (LoF) variants in protein phosphatase 1, regulatory subunit 12a (PPP1R12A), an important developmental gene involved in cell migration, adhesion, and morphogenesis. This gene has not been previously reported in association with human disease, and it has intolerance to LoF as illustrated by a very low observed-to-expected ratio of LoF variants in gnomAD. Of the twelve individuals, midline brain malformations were found in five, urogenital anomalies in nine, and a combination of both phenotypes in two. Other congenital anomalies identified included omphalocele, jejunal, and ileal atresia with aberrant mesenteric blood supply, and syndactyly. Six individuals had stop gain variants, five had a deletion or duplication resulting in a frameshift, and one had a canonical splice acceptor site loss. Murine and human in situ hybridization and immunostaining revealed PPP1R12A expression in the prosencephalic neural folds and protein localization in the lower urinary tract at critical periods for forebrain division and urogenital development. Based on these clinical and molecular findings, we propose the association of PPP1R12A pathogenic variants with a congenital malformations syndrome affecting the embryogenesis of the brain and genitourinary systems and including disorders of sex development

    Loss-of-Function Variants in PPP1R12A: From Isolated Sex Reversal to Holoprosencephaly Spectrum and Urogenital Malformations.

    No full text
    In two independent ongoing next-generation sequencing projects for individuals with holoprosencephaly and individuals with disorders of sex development, and through international research collaboration, we identified twelve individuals with de novo loss-of-function (LoF) variants in protein phosphatase 1, regulatory subunit 12a (PPP1R12A), an important developmental gene involved in cell migration, adhesion, and morphogenesis. This gene has not been previously reported in association with human disease, and it has intolerance to LoF as illustrated by a very low observed-to-expected ratio of LoF variants in gnomAD. Of the twelve individuals, midline brain malformations were found in five, urogenital anomalies in nine, and a combination of both phenotypes in two. Other congenital anomalies identified included omphalocele, jejunal, and ileal atresia with aberrant mesenteric blood supply, and syndactyly. Six individuals had stop gain variants, five had a deletion or duplication resulting in a frameshift, and one had a canonical splice acceptor site loss. Murine and human in situ hybridization and immunostaining revealed PPP1R12A expression in the prosencephalic neural folds and protein localization in the lower urinary tract at critical periods for forebrain division and urogenital development. Based on these clinical and molecular findings, we propose the association of PPP1R12A pathogenic variants with a congenital malformations syndrome affecting the embryogenesis of the brain and genitourinary systems and including disorders of sex development
    corecore