146 research outputs found
Dutch population structure across space, time and GWAS design
Previous genetic studies have identified local population structure within the Netherlands; however their resolution is limited by use of unlinked markers and absence of external reference data. Here we apply advanced haplotype sharing methods (ChromoPainter/fineSTRUCTURE) to study fine-grained population genetic structure and demographic change across the Netherlands using genome-wide single nucleotide polymorphism data (1,626 individuals) with associated geography (1,422 individuals). We identify 40 haplotypic clusters exhibiting strong north/south variation and fine-scale differentiation within provinces. Clustering is tied to country-wide ancestry gradients from neighbouring lands and to locally restricted gene flow across major Dutch rivers. North-south structure is temporally stable, with west-east differentiation more transient, potentially influenced by migrations during the middle ages. Despite superexponential population growth, regional demographic estimates reveal population crashes contemporaneous with the Black Death. Within Dutch and international data, GWAS incorporating fine-grained haplotypic covariates are less confounded than standard methods
The project MinE databrowser: bringing large-scale whole-genome sequencing in ALS to researchers and the public
Amyotrophic lateral sclerosis (ALS) is a rapidly progressive fatal neurodegenerative disease affecting one in 350 people. The aim of Project MinE is to elucidate the pathophysiology of ALS through whole-genome sequencing at least 15,000 ALS patients and 7500 controls at 30Ă— coverage. Here, we present the Project MinE data browser ( databrowser.projectmine.com ), a unique and intuitive one-stop, open-access server that provides detailed information on genetic variation analyzed in a new and still growing set of 4366 ALS cases and 1832 matched controls. Through its visual components and interactive design, the browser specifically aims to be a resource to those without a biostatistics background and allow clinicians and preclinical researchers to integrate Project MinE data into their own research. The browser allows users to query a transcript and immediately access a unique combination of detailed (meta)data, annotations and association statistics that would otherwise require analytic expertise and visits to scattered resources
Systematic rare variant analyses identify RAB32 as a susceptibility gene for familial Parkinson’s disease
Despite substantial progress, causal variants are identified only for a minority of familial Parkinson’s disease (PD) cases, leaving high-risk pathogenic variants unidentified1,2. To identify such variants, we uniformly processed exome sequencing data of 2,184 index familial PD cases and 69,775 controls. Exome-wide analyses converged on RAB32 as a novel PD gene identifying c.213C > G/p.S71R as a high-risk variant presenting in ~0.7% of familial PD cases while observed in only 0.004% of controls (odds ratio of 65.5). This variant was confirmed in all cases via Sanger sequencing and segregated with PD in three families. RAB32 encodes a small GTPase known to interact with LRRK2 (refs. 3,4). Functional analyses showed that RAB32 S71R increases LRRK2 kinase activity, as indicated by increased autophosphorylation of LRRK2 S1292. Here our results implicate mutant RAB32 in a key pathological mechanism in PD—LRRK2 kinase activity5–7—and thus provide novel insights into the mechanistic connections between RAB family biology, LRRK2 and PD risk
ATXN1 repeat expansions confer risk for amyotrophic lateral sclerosis and contribute to TDP-43 mislocalization
Increasingly, repeat expansions are being identified as part of the complex genetic architecture of amyotrophic lateral sclerosis. To date, several repeat expansions have been genetically associated with the disease: intronic repeat expansions in C9orf72, polyglutamine expansions in ATXN2 and polyalanine expansions in NIPA1. Together with previously published data, the identification of an amyotrophic lateral sclerosis patient with a family history of spinocerebellar ataxia type 1, caused by polyglutamine expansions in ATXN1, suggested a similar disease association for the repeat expansion in ATXN1. We, therefore, performed a large-scale international study in 11 700 individuals, in which we showed a significant association between intermediate ATXN1 repeat expansions and amyotrophic lateral sclerosis (P = 3.33 × 10-7). Subsequent functional experiments have shown that ATXN1 reduces the nucleocytoplasmic ratio of TDP-43 and enhances amyotrophic lateral sclerosis phenotypes in Drosophila, further emphasizing the role of polyglutamine repeat expansions in the pathophysiology of amyotrophic lateral sclerosis
Pathogenic Huntingtin Repeat Expansions in Patients with Frontotemporal Dementia and Amyotrophic Lateral Sclerosis
Hannu Laaksovirta konsortion jäsenenä.The Genomics England Research Consortium, The International ALS/FTD Genomics Consortium (iAFGC), The International FTD Genetics Consortium (IFGC), The International LBD Genomics Consortium (iLBDGC), The NYGC ALS Consortium, The PROSPECT Consortium,17 James B. Rowe,17 Luisa Benussi,18 Giuliano Binetti,18,19 Roberta Ghidoni,18 Edwin Jabbari,20,21 Coralie Viollet,22 Jonathan D. Glass,23 Andrew B. Singleton,24 Vincenzo Silani,25,26 Owen A. Ross,27 Mina Ryten,8,28,29 Ali Torkamani,30 Toshiko Tanaka,31 Luigi Ferrucci,31 Susan M. Resnick,32 We examined the role of repeat expansions in the pathogenesis of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS) by analyzing whole-genome sequence data from 2,442 FTD/ALS patients, 2,599 Lewy body dementia (LBD) patients, and 3,158 neurologically healthy subjects. Pathogenic expansions (range, 40?64 CAG repeats) in the huntingtin (HTT) gene were found in three (0.12%) patients diagnosed with pure FTD/ALS syndromes but were not present in the LBD or healthy cohorts. We replicated our findings in an independent collection of 3,674 FTD/ALS patients. Postmortem evaluations of two patients revealed the classical TDP-43 pathology of FTD/ALS, as well as huntingtin-positive, ubiquitin-positive aggregates in the frontal cortex. The neostriatal atrophy that pathologically defines Huntington?s disease was absent in both cases. Our findings reveal an etiological relationship between HTT repeat expansions and FTD/ALS syndromes and indicate that genetic screening of FTD/ALS patients for HTT repeat expansions should be considered. We examined the role of repeat expansions in the pathogenesis of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS) by analyzing whole-genome sequence data from 2,442 FTD/ALS patients, 2,599 Lewy body dementia (LBD) patients, and 3,158 neurologically healthy subjects. Pathogenic expansions (range, 40?64 CAG repeats) in the huntingtin (HTT) gene were found in three (0.12%) patients diagnosed with pure FTD/ALS syndromes but were not present in the LBD or healthy cohorts. We replicated our findings in an independent collection of 3,674 FTD/ALS patients. Postmortem evaluations of two patients revealed the classical TDP-43 pathology of FTD/ALS, as well as huntingtin-positive, ubiquitin-positive aggregates in the frontal cortex. The neostriatal atrophy that pathologically defines Huntington?s disease was absent in both cases. Our findings reveal an etiological relationship between HTT repeat expansions and FTD/ALS syndromes and indicate that genetic screening of FTD/ALS patients for HTT repeat expansions should be considered.Peer reviewe
Analysis of shared common genetic risk between amyotrophic lateral sclerosis and epilepsy
Because hyper-excitability has been shown to be a shared pathophysiological mechanism, we used the latest and largest genome-wide studies in amyotrophic lateral sclerosis (n = 36,052) and epilepsy (n = 38,349) to determine genetic overlap between these conditions. First, we showed no significant genetic correlation, also when binned on minor allele frequency. Second, we confirmed the absence of polygenic overlap using genomic risk score analysis. Finally, we did not identify pleiotropic variants in meta-analyses of the 2 diseases. Our findings indicate that amyotrophic lateral sclerosis and epilepsy do not share common genetic risk, showing that hyper-excitability in both disorders has distinct origins
Association of Variants in the SPTLC1 Gene With Juvenile Amyotrophic Lateral Sclerosis
Hannu Laaksovirta konsortion jäsenenä.IMPORTANCE Juvenile amyotrophic lateral sclerosis (ALS) is a rare form of ALS characterized by age of symptom onset less than 25 years and a variable presentation. OBJECTIVE To identify the genetic variants associated with juvenile ALS. DESIGN, SETTING, AND PARTICIPANTS In this multicenter family-based genetic study, trio whole-exome sequencing was performed to identify the disease-associated gene in a case series of unrelated patients diagnosed with juvenile ALS and severe growth retardation. The patients and their family members were enrolled at academic hospitals and a government research facility between March 1, 2016, and March 13, 2020, and were observed until October 1, 2020. Whole-exome sequencing was also performed in a series of patients with juvenile ALS. A total of 66 patients with juvenile ALS and 6258 adult patients with ALS participated in the study. Patients were selected for the study based on their diagnosis, and all eligible participants were enrolled in the study. None of the participants had a family history of neurological disorders, suggesting de novo variants as the underlying genetic mechanism. MAIN OUTCOMES AND MEASURES De novo variants present only in the index case and not in unaffected family members. RESULTS Trio whole-exome sequencing was performed in 3 patients diagnosed with juvenile ALS and their parents. An additional 63 patients with juvenile ALS and 6258 adult patients with ALS were subsequently screened for variants in the SPTLC1 gene. De novo variants in SPTLC1 (p. Ala20Ser in 2 patients and p.Ser331Tyr in 1 patient) were identified in 3 unrelated patients diagnosed with juvenile ALS and failure to thrive. A fourth variant (p.Leu39del) was identified in a patient with juvenile ALS where parental DNA was unavailable. Variants in this gene have been previously shown to be associated with autosomal-dominant hereditary sensory autonomic neuropathy, type 1A, by disrupting an essential enzyme complex in the sphingolipid synthesis pathway. CONCLUSIONS AND RELEVANCE These data broaden the phenotype associated with SPTLC1 and suggest that patients presenting with juvenile ALS should be screened for variants in this gene.Peer reviewe
Unexpected frequency of the pathogenic AR CAG repeat expansion in the general population
CAG repeat expansions in exon 1 of the AR gene on the X chromosome cause spinal and bulbar muscular atrophy, a male-specific progressive neuromuscular disorder associated with a variety of extra-neurological symptoms. The disease has a reported male prevalence of 1:30,303 or less, but the AR repeat expansion frequency is unknown. We established a pipeline, which combines the use of the ExpansionHunter tool and visual validation, to detect AR CAG expansion on whole-genome sequencing data, benchmarked it to fragment PCR sizing, and applied it to 74,277 unrelated individuals from four large cohorts. Our pipeline showed sensitivity of 100% (95% C.I. 90.8-100%), specificity of 99% (95% C.I. 94.2-99.7%), and positive predictive value of 97.4% (95% C.I. 84.4-99.6%). We found the mutation frequency to be 1:3,182 (95% C.I. 1:2,309-1:4,386, n=117,734) X chromosomes - ten times more frequent than the reported disease prevalence. Modelling using the novel mutation frequency led to estimate disease prevalence of 1:6,887 males, more than four times more frequent than the reported disease prevalence. This discrepancy is possibly due to underdiagnosis of this neuromuscular condition, reduced prevalence, and/or pleomorphic clinical manifestations
Investigating the Causal Relationship of C-Reactive Protein with 32 Complex Somatic and Psychiatric Outcomes: A Large-Scale Cross-Consortium Mendelian Randomization Study
BACKGROUND: C-reactive protein (CRP) is associated with immune, cardiometabolic, and psychiatric traits and diseases. Yet it is inconclusive whether these associations are causal. METHODS AND FINDINGS: We performed Mendelian randomization (MR) analyses using two genetic risk scores (GRSs) as instrumental variables (IVs). The first GRS consisted of four single nucleotide polymorphisms (SNPs) in the CRP gene (GRSCRP), and the second consisted of 18 SNPs that were significantly associated with CRP levels in the largest genome-wide association study (GWAS) to date (GRSGWAS). To optimize power, we used summary statistics from GWAS consortia and tested the association of these two GRSs with 32 complex somatic and psychiatric outcomes, with up to 123,865 participants per outcome from populations of European ancestry. We performed heterogeneity tests to disentangle the pleiotropic effect of IVs. A Bonferroni-corrected significance level of less than 0.0016 was considered statistically significant. An observed p-value equal to or less than 0.05 was considered nominally significant evidence for a potential causal association, yet to be confirmed. The strengths (F-statistics) of the IVs were 31.92-3,761.29 and 82.32-9,403.21 for GRSCRP and GRSGWAS, respectively. CRP GRSGWAS showed a statistically significant protective relationship of a 10% genetically elevated CRP level with the risk of schizophrenia (odds ratio [OR] 0.86 [95% CI 0.79-0.94]; p < 0.001). We validated this finding with individual-level genotype data from the schizophrenia GWAS (OR 0.96 [95% CI 0.94-0.98]; p < 1.72 Ă— 10-6). Further, we found that a standardized CRP polygenic risk score (CRPPRS) at p-value thresholds of 1 Ă— 10-4, 0.001, 0.01, 0.05, and 0.1 using individual-level data also showed a protective effect (OR < 1.00) against schizophrenia; the first CRPPRS (built of SNPs with p < 1 Ă— 10-4) showed a statistically significant (p < 2.45 Ă— 10-4) protective effect with an OR of 0.97 (95% CI 0.95-0.99). The CRP GRSGWAS showed that a 10% increase in genetically determined CRP level was significantly associated with coronary artery disease (OR 0.88 [95% CI 0.84-0.94]; p < 2.4 Ă— 10-5) and was nominally associated with the risk of inflammatory bowel disease (OR 0.85 [95% CI 0.74-0.98]; p < 0.03), Crohn disease (OR 0.81 [95% CI 0.70-0.94]; p < 0.005), psoriatic arthritis (OR 1.36 [95% CI 1.00-1.84]; p < 0.049), knee osteoarthritis (OR 1.17 [95% CI 1.01-1.36]; p < 0.04), and bipolar disorder (OR 1.21 [95% CI 1.05-1.40]; p < 0.007) and with an increase of 0.72 (95% CI 0.11-1.34; p < 0.02) mm Hg in systolic blood pressure, 0.45 (95% CI 0.06-0.84; p < 0.02) mm Hg in diastolic blood pressure, 0.01 ml/min/1.73 m2 (95% CI 0.003-0.02; p < 0.005) in estimated glomerular filtration rate from serum creatinine, 0.01 g/dl (95% CI 0.0004-0.02; p < 0.04) in serum albumin level, and 0.03 g/dl (95% CI 0.008-0.05; p < 0.009) in serum protein level. However, after adjustment for heterogeneity, neither GRS showed a significant effect of CRP level (at p < 0.0016) on any of these outcomes, including coronary artery disease, nor on the other 20 complex outcomes studied. Our study has two potential limitations: the limited variance explained by our genetic instruments modeling CRP levels in blood and the unobserved bias introduced by the use of summary statistics in our MR analyses. CONCLUSIONS: Genetically elevated CRP levels showed a significant potentially protective causal relationship with risk of schizophrenia. We observed nominal evidence at an observed p < 0.05 using either GRSCRP or GRSGWAS-with persistence after correction for heterogeneity-for a causal relationship of elevated CRP levels with psoriatic osteoarthritis, rheumatoid arthritis, knee osteoarthritis, systolic blood pressure, diastolic blood pressure, serum albumin, and bipolar disorder. These associations remain yet to be confirmed. We cannot verify any causal effect of CRP level on any of the other common somatic and neuropsychiatric outcomes investigated in the present study. This implies that interventions that lower CRP level are unlikely to result in decreased risk for the majority of common complex outcomes
Computing linkage disequilibrium aware genome embeddings using autoencoders
Motivation: The completion of the genome has paved the way for genome-wide association studies (GWAS), which explained certain proportions of heritability. GWAS are not optimally suited to detect non-linear effects in disease risk, possibly hidden in non-additive interactions (epistasis). Alternative methods for epistasis detection using, e.g. deep neural networks (DNNs) are currently under active development. However, DNNs are constrained by finite computational resources, which can be rapidly depleted due to increasing complexity with the sheer size of the genome. Besides, the curse of dimensionality complicates the task of capturing meaningful genetic patterns for DNNs; therefore necessitates dimensionality reduction. Results: We propose a method to compress single nucleotide polymorphism (SNP) data, while leveraging the linkage disequilibrium (LD) structure and preserving potential epistasis. This method involves clustering correlated SNPs into haplotype blocks and training per-block autoencoders to learn a compressed representation of the block’s genetic content. We provide an adjustable autoencoder design to accommodate diverse blocks and bypass extensive hyperparameter tuning. We applied this method to genotyping data from Project MinE, and achieved 99% average test reconstruction accuracy—i.e. minimal information loss—while compressing the input to nearly 10% of the original size. We demonstrate that haplotype-block based autoencoders outperform linear Principal Component Analysis (PCA) by approximately 3% chromosome-wide accuracy of reconstructed variants. To the extent of our knowledge, our approach is the first to simultaneously leverage haplotype structure and DNNs for dimensionality reduction of genetic data. Availability and implementation: Data are available for academic use through Project MinE at https://www.projectmine.com/research/data-sharing/, contingent upon terms and requirements specified by the source studies. Code is available at https://github.com/gizem-tas/haploblockautoencoders
- …