52 research outputs found

    Using the structure of genome data in the design of deep neural networks for predicting amyotrophic lateral sclerosis from genotype

    Motivation: Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease caused by aberrations in the genome. While several disease-causing variants have been identified, a major part of heritability remains unexplained. ALS is believed to have a complex genetic basis where non-additive combinations of variants constitute disease, which cannot be picked up using the linear models employed in classical genotype-phenotype association studies. Deep learning, on the other hand, is highly promising for identifying such complex relations. We therefore developed a deep-learning-based approach for the classification of ALS patients versus healthy individuals from the Dutch cohort of the Project MinE dataset. Based on recent insight that regulatory regions harbor the majority of disease-associated variants, we employ a two-step approach: first, promoter regions that are likely associated with ALS are identified, and second, individuals are classified based on their genotype in the selected genomic regions. Both steps employ a deep convolutional neural network. The network architecture accounts for the structure of genome data by applying convolution only to parts of the data where this makes sense from a genomics perspective. Results: Our approach identifies potentially ALS-associated promoter regions, and generally outperforms other classification methods. Test results support the hypothesis that non-additive combinations of variants contribute to ALS. Architectures and protocols developed are tailored toward processing population-scale, whole-genome data. We consider this a relevant first step toward deep-learning-assisted genotype-phenotype association in whole-genome-sized data.
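The idea of applying convolution only where it makes sense genomically can be illustrated with a minimal sketch. This is not the authors' architecture: it assumes, for illustration only, that each promoter region is a short list of allele counts, that one shared kernel is slid within each region separately, and that each region is then max-pooled to a single feature. The names `conv1d_valid`, `per_region_features`, `toy_regions`, and `toy_kernel` are hypothetical.

```python
from typing import List, Sequence


def conv1d_valid(signal: Sequence[float], kernel: Sequence[float]) -> List[float]:
    """Slide `kernel` over `signal` without padding ('valid' cross-correlation)."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]


def per_region_features(
    regions: Sequence[Sequence[float]], kernel: Sequence[float]
) -> List[float]:
    """Apply the same kernel inside each region separately, then max-pool.

    No window ever spans two regions, so variants from unrelated
    promoters are never mixed by the convolution.
    """
    features = []
    for genotypes in regions:
        activations = conv1d_valid(genotypes, kernel)
        # Regions shorter than the kernel yield no window; fall back to 0.0.
        features.append(max(activations) if activations else 0.0)
    return features


# Hypothetical toy input: allele counts (0, 1, 2) for three promoter regions.
toy_regions = [[0, 1, 2, 0], [2, 2, 0], [0, 0, 0, 1, 1]]
toy_kernel = [1.0, -1.0]
print(per_region_features(toy_regions, toy_kernel))  # [2.0, 2.0, 0.0]
```

Concatenating all regions into one long vector and convolving over it would instead create windows that straddle region boundaries, which is the mixing this per-region layout avoids.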

    Pollitt syndrome patients carry mutation in TTDN1

    Complete human genome sequencing was used to identify the causative mutation in a family with Pollitt syndrome (MIM #275550), comprising two non-consanguineous parents and their two affected children. The patients' symptoms were reminiscent of the non-photosensitive form of recessively inherited trichothiodystrophy (TTD). A mutation in the TTDN1/C7orf11 gene, a gene that is known to be involved in non-photosensitive TTD, had been excluded by others by Sanger sequencing. Unexpectedly, we did find a homozygous single-base pair deletion in the coding region of this gene, a mutation that is known to cause non-photosensitive TTD. The deleterious variant causing a frame shift at amino acid 93 (C326delA) followed the right mode of inheritance in the family and was independently validated using conventional DNA sequencing. We expect this novel DNA sequencing technology to help redefine phenotypic and genomic variation in patients with (mono)genetic disorders in an unprecedented manner.

    Telomere length analysis in amyotrophic lateral sclerosis using large-scale whole genome sequence data

    Background: Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease characterized by the loss of upper and lower motor neurons, leading to progressive weakness of voluntary muscles, with death following from neuromuscular respiratory failure, typically within 3 to 5 years. There is a strong genetic contribution to ALS risk. In 10% or more, a family history of ALS or frontotemporal dementia is obtained, and the Mendelian genes responsible for ALS in such families have now been identified in about 50% of cases. Only about 14% of apparently sporadic ALS is explained by known genetic variation, suggesting that other forms of genetic variation are important. Telomeres maintain DNA integrity during cellular replication, differ between sexes, and shorten naturally with age. Sex and age are risk factors for ALS and we therefore investigated telomere length in ALS. Methods: Samples were from Project MinE, an international ALS whole genome sequencing consortium that includes phenotype data. For validation we used donated brain samples from motor cortex from people with ALS and controls. Ancestry and relatedness were evaluated by principal components analysis and relationship matrices of DNA microarray data. Whole genome sequence data were from Illumina HiSeq platforms and aligned using the Isaac pipeline. TelSeq was used to quantify telomere length using whole genome sequence data. We tested the association of telomere length with ALS and ALS survival using Cox regression. Results: There were 6,580 whole genome sequences, reducing to 6,195 samples (4,315 from people with ALS and 1,880 controls) after quality control, and 159 brain samples (106 ALS, 53 controls). Accounting for age and sex, there was a 20% (95% CI 14%, 25%) increase of telomere length in people with ALS compared to controls (p = 1.1 × 10^-12), validated in the brain samples (p = 0.03). Those with shorter telomeres had a 10% increase in median survival (p = 5.0 × 10^-7). Although there was no difference in telomere length between sporadic ALS and familial ALS (p = 0.64), telomere length in 334 people with ALS due to expanded C9orf72 repeats was shorter than in those without expanded C9orf72 repeats (p = 5.0 × 10^-4). Discussion: Although telomeres shorten with age, longer telomeres are a risk factor for ALS and worsen prognosis. Longer telomeres are associated with ALS.

    Common and rare variant association analyses in amyotrophic lateral sclerosis identify 15 risk loci with distinct genetic architectures and neuron-specific biology

    Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease with a lifetime risk of one in 350 people and an unmet need for disease-modifying therapies. We conducted a cross-ancestry genome-wide association study (GWAS) including 29,612 patients with ALS and 122,656 controls, which identified 15 risk loci. When combined with 8,953 individuals with whole-genome sequencing (6,538 patients, 2,415 controls) and a large cortex-derived expression quantitative trait locus (eQTL) dataset (MetaBrain), analyses revealed locus-specific genetic architectures in which we prioritized genes either through rare variants, short tandem repeats or regulatory effects. ALS-associated risk loci were shared with multiple traits within the neurodegenerative spectrum but with distinct enrichment patterns across brain regions and cell types. Of the environmental and lifestyle risk factors obtained from the literature, Mendelian randomization analyses indicated a causal role for high cholesterol levels. The combination of all ALS-associated signals reveals a role for perturbations in vesicle-mediated transport and autophagy and provides evidence for cell-autonomous disease initiation in glutamatergic neurons.

    Genome-wide Analyses Identify KIF5A as a Novel ALS Gene

    To identify novel genes associated with ALS, we undertook two lines of investigation. We carried out a genome-wide association study comparing 20,806 ALS cases and 59,804 controls. Independently, we performed a rare variant burden analysis comparing 1,138 index familial ALS cases and 19,494 controls. Through both approaches, we identified kinesin family member 5A (KIF5A) as a novel gene associated with ALS. Interestingly, mutations predominantly in the N-terminal motor domain of KIF5A are causative for two neurodegenerative diseases: hereditary spastic paraplegia (SPG10) and Charcot-Marie-Tooth type 2 (CMT2). In contrast, ALS-associated mutations are primarily located at the C-terminal cargo-binding tail domain and patients harboring loss-of-function mutations displayed an extended survival relative to typical ALS cases. Taken together, these results broaden the phenotype spectrum resulting from mutations in KIF5A and strengthen the role of cytoskeletal defects in the pathogenesis of ALS.

    Association of NIPA1 repeat expansions with amyotrophic lateral sclerosis in a large international cohort

    NIPA1 (nonimprinted in Prader-Willi/Angelman syndrome 1) mutations are known to cause hereditary spastic paraplegia type 6, a neurodegenerative disease that phenotypically overlaps to some extent with amyotrophic lateral sclerosis (ALS). Previously, a genomewide screen for copy number variants found an association between rare deletions in NIPA1 and ALS, and subsequent genetic analyses revealed that long (or expanded) polyalanine repeats in NIPA1 convey increased ALS susceptibility. We set out to perform a large-scale replication study to further investigate the association of NIPA1 polyalanine expansions with ALS, in which we characterized NIPA1 repeat size in an independent international cohort of 3955 patients with ALS and 2276 unaffected controls and combined our results with previous reports. Meta-analysis on a total of 6245 patients with ALS and 5051 controls showed an overall increased risk of ALS in those with expanded (>8) GCG repeat length (odds ratio = 1.50, p = 3.8 × 10^-5). Together with previous reports, these findings provide evidence for an association between an expanded polyalanine repeat in NIPA1 and ALS.