42 research outputs found

    Addressing NGS Data Challenges: Efficient High Throughput Processing and Sequencing Error Detection

    Get PDF
    Next generation sequencing (NGS) technologies have facilitated the identification of disease causing mutations, which has significantly improved patient’s diagnosis and treatment. Since its emergence, NGS has been used in many applications like genome sequencing, DNA resequencing, transcriptome sequencing and epigenomics, to unfold the various layers of genome biology. Because of this broad spectrum of applications and recent decrement in cost, usage of NGS has become a routine approach to address many research as well as medical questions. It is producing huge amounts of data, which necessitate highly efficient and accurate computational analysis as well as data management. This thesis addresses some of the challenges of NGS data analysis, mainly for targeted DNA sequencing data. It describes the various steps required for data analysis including their significance and potential negative effects on consecutive downstream analysis and so on the final variant lists. In order to make the analysis more accurate and efficient, an extensive testing of different bioinformatics tools and algorithms was preformed and a fully automated data analysis workflow was developed. This workflow is implemented and optimized on high performance computing (HPC) systems. I describe different design principles and parallelization strategies that enable proper exploitation of HPC resources to achieve high throughput of data analysis. Besides correcting for known sequencing errors by using existing tools, this work is also aimed at the detection of a new class of systematic sequencing errors called recurrent systematic sequencing errors. I present an approach for the exploration of this class of errors and describe the probable causes and patterns behind them. This includes some known and novel patterns observed during this work. Furthermore, I provide a tool to filter the false variants due to these errors from any variant list. Overall, the work performed during this thesis has been already used (and will be used in future as well), to provide accurate and efficient data analysis, which enables exploration of the genetic background of various diseases

    Correction to: A nonsynonymous mutation in PLCG2 reduces the risk of Alzheimer's disease, dementia with Lewy bodies and frontotemporal dementia, and increases the likelihood of longevity.

    Get PDF
    The IPDGC (The International Parkinson Disease Genomics Consortium) and EADB (Alzheimer Disease European DNA biobank) are listed correctly as an author to the article, however, they were incorrectly listed more than once

    Exome sequencing identifies rare damaging variants in ATP8B4 and ABCA1 as novel risk factors for Alzheimers Disease

    Get PDF
    The genetic component of Alzheimer’s disease (AD) has been mainly assessed using Genome Wide Association Studies (GWAS), which do not capture the risk contributed by rare variants. Here, we compared the gene-based burden of rare damaging variants in exome sequencing data from 32,558 individuals —16,036 AD cases and 16,522 controls— in a two-stage analysis. Next to known genes TREM2, SORL1 and ABCA7, we observed a significant association of rare, predicted damaging variants in ATP8B4 and ABCA1 with AD risk, and a suggestive signal in ADAM10. Next to these genes, the rare variant burden in RIN3, CLU, ZCWPW1 and ACE highlighted these genes as potential driver genes in AD-GWAS loci. Rare damaging variants in these genes, and in particular loss-of-function variants, have a large effect on AD-risk, and they are enriched in early onset AD cases. The newly identified AD-associated genes provide additional evidence for a major role for APP-processing, Aβ-aggregation, lipid metabolism and microglial function in AD

    Exome sequencing identifies rare damaging variants in ATP8B4 and ABCA1 as risk factors for Alzheimer’s disease

    Get PDF
    Alzheimer’s disease (AD), the leading cause of dementia, has an estimated heritability of approximately 70%1. The genetic component of AD has been mainly assessed using genome-wide association studies, which do not capture the risk contributed by rare variants2. Here, we compared the gene-based burden of rare damaging variants in exome sequencing data from 32,558 individuals—16,036 AD cases and 16,522 controls. Next to variants in TREM2, SORL1 and ABCA7, we observed a significant association of rare, predicted damaging variants in ATP8B4 and ABCA1 with AD risk, and a suggestive signal in ADAM10. Additionally, the rare-variant burden in RIN3, CLU, ZCWPW1 and ACE highlighted these genes as potential drivers of respective AD-genome-wide association study loci. Variants associated with the strongest effect on AD risk, in particular loss-of-function variants, are enriched in early-onset AD cases. Our results provide additional evidence for a major role for amyloid-β precursor protein processing, amyloid-β aggregation, lipid metabolism and microglial function in AD

    Late diagnosis of a truncating W1SP3 mutation entails a severe phenotype of progressive pseudorheumatoid dysplasia

    Get PDF
    Rare diseases are often misdiagnosed or receive a delayed diagnosis; thus, unfortunately, affected individuals may not receive optimal medical management. Here, we report a case of two siblings with a severe phenotype of progressive pseudorheumatoid dysplasia (PPD). Their onset of symptoms began at the age of 3 yr. Both were neglected in the past, and the patients presented with a very severe phenotype and unmitigated natural history. PPD is a rare autosomal recessive skeletal dysplasia characterized by progressive joint stiffness, swelling, and pain. Because of observed muscle wasting, weakness, and the lack of laboratory testing, the case had been initially misdiagnosed by the local physicians. We aimed to provide diagnostic support by a targeted next-generation sequencing gene panel (Illumina TruSight One) for Mendelian diseases (Mendeliome), and we identified a homozygous frameshift mutation in the gene WISP3 (c.868_869delAG, p.Ser290Leufs*12). Thus, early diagnosis and intervention may have decreased the severity and complication of the disease

    Mutational profiling in the peripheral blood leukocytes of patients with systemic mast cell activation syndrome using next-generation sequencing

    No full text
    Mast cell activation syndrome (MCAS) and systemic mastocytosis (SM) are two clinical systemic mast cell activation disease variants. Few studies to date have investigated the genetic basis of MCAS. The present study had two aims. First, to investigate whether peripheral blood leukocytes from MCAS patients also harbor somatic mutations in genes implicated in SM using next-generation sequencing (NGS) technology and a relatively large MCAS cohort. We also addressed the question, whether some of the previously as somatic reported mutations are indeed germline mutations. Second, to identify germline mutations of relevance to MCAS pathogenesis. Here, mutation frequency in the present MCAS cohort was compared to that in public- and in-house databases in the case of frequent variants, and co-segregation was investigated in multiply affected families in the case of rare variants (allele frequency < 1%). MCAS diagnoses were assigned according to current criteria. Twenty five candidate genes were selected on the basis of published findings for SM. NGS was performed using a 76kbp custom designed Agilent SureSelect Target Enrichment and an Illumina Hiseq2000 2x100bp sequencing run. NGS revealed 67 germline mutations. No somatic mutations were detected. None of the germline mutations showed unequivocal association with MCAS. Failure to detect somatic mutations was probably attributable to the dilution of mutated mast cell DNA in normal leukocyte DNA. The present exploratory association findings suggest that some of the detected germline mutations may be functionally relevant and explain familial aggregation. Independent replication studies are therefore warranted

    Recurrent and Prolonged Infections in a Child with a Homozygous IFIH1 Nonsense Mutation

    No full text
    In an Egyptian girl born to consanguineous parents, whole-exome sequencing (WES) identified a homozygous mutation in PHGDH, c. 1273G>A (p.Val425Met), indicating 3-phosphoglycerate dehydrogenase deficiency. This diagnosis was compatible with the patient's microcephaly, severe psychomotor retardation, seizures and cataracts. However, she additionally suffered from recurrent (at least monthly) episodes of prolonged and severe chest infections requiring hospitalization, suggesting a secondary, predisposing and potentially Mendelian, condition. A local reactivation of an EBV infection in the respiratory tract was detected after a recent chest infection, likely representing an opportunistic infection based on a compromised immune system. Further inspection of WES data revealed a homozygous nonsense mutation, c.2665A>T (p.Lys889*), in IFIH1, encoding MDA5. MDA5 detects long viral double-stranded RNA that is generated during replication of picorna viruses, and thereby activates the type I interferon signaling pathway. The results of Western blot analysis of protein from cultured fibroblasts of the patient indicates absence of wild type MDA5/ IFIH1, compatible with NMD. We propose that, analogous to the severe course of primary influenza infection due to biallelic deficiency of a downstream effector, IRF7, homozygous loss of IFIH1 defines a novel Mendelian immunodeficiency disorder that increases susceptibility to severe viral infections. This is contrasted to heterozygous gain-offunction IFIH1 mutations in autoimmune diseases. Our findings highlight the potential of comprehensive genomic investigations in patients from consanguineous families to identify monogenic predispositions to severe infections

    Novel mutations in SLC6A5 with benign course in hyperekplexia

    No full text
    Infants suffering from life-threatening apnea, stridor, cyanosis, and increased muscle tone may often be misdiagnosed with infantile seizures and inappropriately treated because of lack and delay in genetic diagnosis. Here, we report a patient with increased muscle tone after birth and hypertonic attacks with life-threatening apnea but no epileptiform patterns in EEG recordings. We identified novel compound heterozygous variants in SLC6A5 (NM_004211.4:c.[1429T> C];[1430delC]) by trio whole-exome sequencing, containing a base deletion inherited by the asymptomatic mother leading to a frameshift (c.1430delC, p.Ser477PhefsTer9) and a de novo base exchange leading to an amino acid change (c.1429T > C, p.Ser477Pro). To date, there are four known disease-associated genes for primary hyperekplexia, all of which are involved in the functioning of glycinergic synapses. SLC6A5 encodes the sodium- and chloride-dependent glycine transporter 2 (GlyT2), which recaptures glycine, a major inhibitory transmitter in the brainstem and spinal cord. The diagnosis altered the patient's medical care to his benefit because SLC6A5 mutations with rather benign courses of hyperekplexia may be spared of needless pharmacotherapy. Symptoms eventually decreased in frequency until about once in 2 mo at 2 yr age. We present the first report of halting hyperekplexia episodes by maternal soothing in multiple instances. We highlight the importance of clarifying the genetic diagnosis by rapid next-generation sequencing techniques in this group of infantile apneic attacks with hyperekplexia due to the broad differential diagnoses
    corecore