8 research outputs found

    Using the structure of genome data in the design of deep neural networks for predicting amyotrophic lateral sclerosis from genotype

    Get PDF
    Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease caused by aberrations in the genome. While several disea

    Using the structure of genome data in the design of deep neural networks for predicting amyotrophic lateral sclerosis from genotype

    Get PDF
    Motivation: Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease caused by aberrations in the genome. While several disease-causing variants have been identified, a major part of heritability remains unexplained. ALS is believed to have a complex genetic basis where non-additive combinations of variants constitute disease, which cannot be picked up using the linear models employed in classical genotype-phenotype association studies. Deep learning on the other hand is highly promising for identifying such complex relations. We therefore developed a deep-learning based approach for the classification of ALS patients versus healthy individuals from the Dutch cohort of the Project MinE dataset. Based on recent insight that regulatory regions harbor the majority of disease-associated variants, we employ a two-step approach: first promoter regions that are likely associated to ALS are identified, and second individuals are classified based on their genotype in the selected genomic regions. Both steps employ a deep convolutional neural network. The network architecture accounts for the structure of genome data by applying convolution only to parts of the data where this makes sense from a genomics perspective. Results: Our approach identifies potentially ALS-associated promoter regions, and generally outperforms other classification methods. Test results support the hypothesis that non-additive combinations of variants contribute to ALS. Architectures and protocols developed are tailored toward processing population-scale, whole-genome data. We consider this a relevant first step toward deep learning assisted genotype-phenotype association in whole genome-sized data

    Project MinE: study design and pilot analyses of a large-scale whole-genome sequencing study in amyotrophic lateral sclerosis

    Get PDF
    The most recent genome-wide association study in amyotrophic lateral sclerosis (ALS) demonstrates a disproportionate contribution from low-frequency variants to genetic susceptibility to disease. We have therefore begun Project MinE, an international collaboration that seeks to analyze whole-genome sequence data of at least 15 000 ALS patients and 7500 controls. Here, we report on the design of Project MinE and pilot analyses of successfully sequenced 1169 ALS patients and 608 controls drawn from the Netherlands. As has become characteristic of sequencing studies, we find an abundance of rare genetic variation (minor allele frequency < 0.1%), the vast majority of which is absent in public datasets. Principal component analysis reveals local geographical clustering of these variants within The Netherlands. We use the whole-genome sequence data to explore the implications of poor geographical matching of cases and controls in a sequence-based disease study and to investigate how ancestry-matched, externally sequenced controls can induce false positive associations. Also, we have publicly released genome-wide minor allele counts in cases and controls, as well as results from genic burden tests

    Structural variation analysis of 6,500 whole genome sequences in amyotrophic lateral sclerosis

    Get PDF
    There is a strong genetic contribution to Amyotrophic lateral sclerosis (ALS) risk, with heritability estimates of up to 60%. Both Mendelian and small effect variants have been identified, but in common with other conditions, such variants only explain a little of the heritability. Genomic structural variation might account for some of this otherwise unexplained heritability. We therefore investigated association between structural variation in a set of 25 ALS genes, and ALS risk and phenotype. As expected, the repeat expansion in the C9orf72 gene was identified as associated with ALS. Two other ALS-associated structural variants were identified: inversion in the VCP gene and insertion in the ERBB4 gene. All three variants were associated both with increased risk of ALS and specific phenotypic patterns of disease expression. More than 70% of people with respiratory onset ALS harboured ERBB4 insertion compared with 25% of the general population, suggesting respiratory onset ALS may be a distinct genetic subtype

    Telomere length analysis in amyotrophic lateral sclerosis using large-scale whole genome sequence data

    Get PDF
    Background: Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease characterized by the loss of upper and lower motor neurons, leading to progressive weakness of voluntary muscles, with death following from neuromuscular respiratory failure, typically within 3 to 5 years. There is a strong genetic contribution to ALS risk. In 10% or more, a family history of ALS or frontotemporal dementia is obtained, and the Mendelian genes responsible for ALS in such families have now been identified in about 50% of cases. Only about 14% of apparently sporadic ALS is explained by known genetic variation, suggesting that other forms of genetic variation are important. Telomeres maintain DNA integrity during cellular replication, differ between sexes, and shorten naturally with age. Sex and age are risk factors for ALS and we therefore investigated telomere length in ALS. Methods: Samples were from Project MinE, an international ALS whole genome sequencing consortium that includes phenotype data. For validation we used donated brain samples from motor cortex from people with ALS and controls. Ancestry and relatedness were evaluated by principal components analysis and relationship matrices of DNA microarray data. Whole genome sequence data were from Illumina HiSeq platforms and aligned using the Isaac pipeline. TelSeq was used to quantify telomere length using whole genome sequence data. We tested the association of telomere length with ALS and ALS survival using Cox regression. Results: There were 6,580 whole genome sequences, reducing to 6,195 samples (4,315 from people with ALS and 1,880 controls) after quality control, and 159 brain samples (106 ALS, 53 controls). Accounting for age and sex, there was a 20% (95% CI 14%, 25%) increase of telomere length in people with ALS compared to controls (p = 1.1 × 10−12), validated in the brain samples (p = 0.03). Those with shorter telomeres had a 10% increase in median survival (p = 5.0×10−7). Although there was no difference in telomere length between sporadic ALS and familial ALS (p=0.64), telomere length in 334 people with ALS due to expanded C9orf72 repeats was shorter than in those without expanded C9orf72 repeats (p = 5.0×10−4). Discussion: Although telomeres shorten with age, longer telomeres are a risk factor for ALS and worsen prognosis. Longer telomeres are associated with ALS

    Pharmacogenetic interactions in amyotrophic lateral sclerosis: a step closer to a cure?

    No full text
    Genetic mutations related to amyotrophic lateral sclerosis (ALS) act through distinct pathophysiological pathways, which may lead to varying treatment responses. Here we assess the genetic interaction between C9orf72, UNC13A, and MOBP with creatine and valproic acid treatment in two clinical trials. Genotypic data was available for 309 of the 338 participants (91.4%). The UNC13A genotype affected mortality (p = 0.012), whereas C9orf72 repeat-expansion carriers exhibited a faster rate of decline in overall (p = 0.051) and bulbar functioning (p = 0.005). A dose-response pharmacogenetic interaction was identified between creatine and the A allele of the MOBP genotype (p = 0.027), suggesting a qualitative interaction in a recessive model (HR 3.96, p = 0.015). Not taking genetic information into account may mask evidence of response to treatment or be an unrecognized source of bias. Incorporating genetic data could help investigators to identify critical treatment clues in patients with ALS.Perioperative Medicine: Efficacy, Safety and Outcome (Anesthesiology/Intensive Care

    Whole-genome sequencing reveals that variants in the Interleukin 18 Receptor Accessory Protein 3'UTR protect against ALS

    No full text
    The noncoding genome is substantially larger than the protein-coding genome but has been largely unexplored by genetic association studies. Here, we performed region-based rare variant association analysis of >25,000 variants in untranslated regions of 6,139 amyotrophic lateral sclerosis (ALS) whole genomes and the whole genomes of 70,403 non-ALS controls. We identified interleukin-18 receptor accessory protein (IL18RAP) 3′ untranslated region (3′UTR) variants as significantly enriched in non-ALS genomes and associated with a fivefold reduced risk of developing ALS, and this was replicated in an independent cohort. These variants in the IL18RAP 3′UTR reduce mRNA stability and the binding of double-stranded RNA (dsRNA)-binding proteins. Finally, the variants of the IL18RAP 3′UTR confer a survival advantage for motor neurons because they dampen neurotoxicity of human induced pluripotent stem cell (iPSC)-derived microglia bearing an ALS-associated expansion in C9orf72, and this depends on NF-κB signaling. This study reveals genetic variants that protect against ALS by reducing neuroinflammation and emphasizes the importance of noncoding genetic association studies

    Author Correction: Common and rare variant association analyses in amyotrophic lateral sclerosis identify 15 risk loci with distinct genetic architectures and neuron-specific biology

    Get PDF
    Correction to: Nature Genetics https://doi.org/10.1038/s41588-021-00973-1, published online 6 December 2021. In the version of this article initially published, the affiliation for Nazli Başak appeared incorrectly. Nazli Başak is at Koç University, School of Medicine, KUTTAM-NDAL, Istanbul, Turkey, and not Bogazici University. The error has been corrected in the HTML and PDF versions of the article
    corecore