191 research outputs found

    Identification of gene-gene interactions for Alzheimer's disease using co-operative game theory

    Full text link
    Thesis (Ph.D.)--Boston UniversityThe multifactorial nature of Alzheimer's Disease suggests that complex gene-gene interactions are present in AD pathways. Contemporary approaches to detect such interactions in genome-wide data are mathematically and computationally challenging. We investigated gene-gene interactions for AD using a novel algorithm based on cooperative game theory in 15 genome-wide association study (GWAS) datasets comprising of a total of 11,840 AD cases and 10,931 cognitively normal elderly controls from the Alzheimer Disease Genetics Consortium (ADGC). We adapted this approach, which was developed originally for solving multi-dimensional problems in economics and social sciences, to compute a Shapely value statistic to identify genetic markers that contribute most to coalitions of SNPs in predicting AD risk. Treating each GWAS dataset as independent discovery, markers were ranked according to their contribution to coalitions formed with other markers. Using a backward elimination strategy, markers with low Shapley values were eliminated and the statistic was recalculated iteratively. We tested all two-way interactions between top Shapley markers in regression models which included the two SNPs (main effects) and a term for their interaction. Models yielding a p-value<0.05 for the interaction term were evaluated in each of the other datasets and the results from all datasets were combined by meta-analysis. Statistically significant interactions were observed with multiple marker combinations in the APOE regions. My analyses also revealed statistically strong interactions between markers in 6 regions; CTNNA3-ATP11A (p=4.1E-07), CSMD1-PRKCQ (p=3.5E-08), DCC-UNC5CL (p=5.9e-8), CNTNAP2-RFC3 (p=1.16e-07), AACS-TSHZ3 (p=2.64e-07) and CAMK4-MMD (p=3.3e-07). The Shapley value algorithm outperformed Chi-Square and ReliefF in detecting known interactions between APOE and GAB2 in a previously published GWAS dataset. It was also more accurate than competing filtering methods in identifying simulated epistastic SNPs that are additive in nature, but its accuracy was low in identifying non-linear interactions. The game theory algorithm revealed strong interactions between markers in novel genes with weak main effects, which would have been overlooked if only markers with strong marginal association with AD were tested. This method will be a valuable tool for identifying gene-gene interactions for complex diseases and other traits

    Polygenic risk associated with Alzheimerā€™s disease and other traits influences genes involved in T cell signaling and activation

    Get PDF
    IntroductionT cells, known for their ability to respond to an enormous variety of pathogens and other insults, are increasingly recognized as important mediators of pathology in neurodegeneration and other diseases. T cell gene expression phenotypes can be regulated by disease-associated genetic variants. Many complex diseases are better represented by polygenic risk than by individual variants.MethodsWe first compute a polygenic risk score (PRS) for Alzheimerā€™s disease (AD) using genomic sequencing data from a cohort of Alzheimerā€™s disease (AD) patients and age-matched controls, and validate the AD PRS against clinical metrics in our cohort. We then calculate the PRS for several autoimmune disease, neurological disorder, and immune function traits, and correlate these PRSs with T cell gene expression data from our cohort. We compare PRS-associated genes across traits and four T cell subtypes.ResultsSeveral genes and biological pathways associated with the PRS for these traits relate to key T cell functions. The PRS-associated gene signature generally correlates positively for traits within a particular category (autoimmune disease, neurological disease, immune function) with the exception of stroke. The trait-associated gene expression signature for autoimmune disease traits was polarized towards CD4+ T cell subtypes.DiscussionOur findings show that polygenic risk for complex disease and immune function traits can have varying effects on T cell gene expression trends. Several PRS-associated genes are potential candidates for therapeutic modulation in T cells, and could be tested in in vitro applications using cells from patients bearing high or low polygenic risk for AD or other conditions

    Behavioral Variant Frontotemporal Lobar Degeneration with Amyotrophic Lateral Sclerosis with a Chromosome 9p21 Hexanucleotide Repeat

    Get PDF
    To determine the genetic basis of familial frontotemporal lobar degeneration (FTLD) with amyotrophic lateral sclerosis (ALS) we performed a clinical and genetic analysis of an affected family. A 51-year-old man with behavioral variant FTLD with ALS had a family history of the disease suggestive of autosomal dominant inheritance with incomplete penetrance. Genetic studies in this patient demonstrated the presence of an amplified hexanucleotide repeat (>30) polymorphism in the chromosome 9 open reading frame 72 (C9ORF72) gene which was previously identified as a cause of FTLD. Five others unaffected from the family were negative (all had less than 11 repeats). Because of the clinical and pathological overlap between FTLD and AD we performed a larger genome-wide association study and did not find association of single nucleotide polymorphisms (SNPs) in the C9ORF72 gene with Alzheimerā€™s disease (AD) risk. Bioinformatic analysis of C9ORF72 using the Gene Expression Omnibus database showed expression differences in patients with muscular dystrophy, neural tube defects, and schizophrenia. We also report analysis of gene expression in brain regions using the Allen Human Brain Atlas. Defects in this recently reported gene are now believed to be the most common cause of inherited ALS and an important cause of inherited FTLD. Our work suggests that the gene may also be important in other neurological and psychiatric conditions

    LUSTR: a new customizable tool for calling genome-wide germline and somatic short tandem repeat variants

    Get PDF
    Background Short tandem repeats (STRs) are widely distributed across the human genome and are associated with numerous neurological disorders. However, the extent that STRs contribute to disease is likely under-estimated because of the challenges calling these variants in short read next generation sequencing data. Several computational tools have been developed for STR variant calling, but none fully address all of the complexities associated with this variant class.ResultsHere we introduce LUSTR which is designed to address some of the challenges associated with STR variant calling by enabling more flexibility in defining STR loci, allowing for customizable modules to tailor analyses, and expanding the capability to call somatic and multiallelic STR variants. LUSTR is a user-friendly and easily customizable tool for targeted or unbiased genome-wide STR variant screening that can use either predefined or novel genome builds. Using both simulated and real data sets, we demonstrated that LUSTR accurately infers germline and somatic STR expansions in individuals with and without diseases.ConclusionsLUSTR offers a powerful and user-friendly approach that allows for the identification of STR variants and can facilitate more comprehensive studies evaluating the role of pathogenic STR variants across human diseases

    Association of Rare Coding Mutations With Alzheimer Disease and Other Dementias Among Adults of European Ancestry

    Get PDF
    IMPORTANCE Some of the unexplained heritability of Alzheimer disease (AD) may be due to rare variants whose effects are not captured in genome-wide association studies because very large samples are needed to observe statistically significant associations. OBJECTIVE To identify genetic variants associated with AD risk using a nonstatistical approach. DESIGN, SETTING, AND PARTICIPANTS Genetic association study in which rare variants were identified by whole-exome sequencing in unrelated individuals of European ancestry from the Alzheimerā€™s Disease Sequencing Project (ADSP). Data were analyzed between March 2017 and September 2018. MAIN OUTCOMES AND MEASURES Minor alleles genome-wide and in 95 genes previously associated with AD, AD-related traits, or other dementias were tabulated and filtered for predicted functional impact and occurrence in participants with AD but not controls. Support for several findings was sought in a whole-exome sequencing data set comprising 19 affected relative pairs from Utah high-risk pedigrees and whole-genome sequencing data sets from the ADSP and Alzheimerā€™s Disease Neuroimaging Initiative. RESULTS Among 5617 participants with AD (3202 [57.0%] women; mean [SD] age, 76.4 [9.3] years) and 4594 controls (2719 [59.0%] women; mean [SD] age, 86.5 [4.5] years), a total of 24 variants with moderate or high functional impact from 19 genes were observed in 10 or more participants with AD but not in controls. These variants included a missense mutation (rs149307620 [p.A284T], n = 10) in NOTCH3, a gene in which coding mutations are associated with cerebral autosomal-dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL), that was also identified in 1 participant with AD and 1 participant with mild cognitive impairment in the whole genome sequencing data sets. Four participants with AD carried the TREM2 rs104894002 (p.Q33X) high-impact mutation that, in homozygous form, causes Nasu-Hakola disease, a rare disorder characterized by early-onset dementia and multifocal bone cysts, suggesting an intermediate inheritance model for the mutation. Compared with controls, participants with AD had a significantly higher burden of deleterious rare coding variants in dementia-associated genes (2314 vs 3354 cumulative variants, respectively; P = .006). CONCLUSIONS AND RELEVANCE Different mutations in the same gene or variable dose of a mutation may be associated with result in distinct dementias. These findings suggest that minor differences in the structure or amount of protein may be associated with in different clinical outcomes. Understanding these genotype-phenotype associations may provide further insight into the pathogenic nature of the mutations, as well as offer clues for developing new therapeutic targets

    Linkage analyses in Caribbean Hispanic families identify novel loci associated with familial late-onset Alzheimer's disease

    Get PDF
    INTRODUCTION: We performed linkage analyses in Caribbean Hispanic families with multiple late-onset Alzheimer's disease (LOAD) cases to identify regions that may contain disease causative variants. METHODS: We selected 67 LOAD families to perform genome-wide linkage scan. Analysis of the linked regions was repeated using the entire sample of 282 families. Validated chromosomal regions were analyzed using joint linkage and association. RESULTS: We identified 26 regions linked to LOAD (HLOD ā‰„3.6). We validated 13 of the regions (HLOD ā‰„2.5) using the entire family sample. The strongest signal was at 11q12.3 (rs2232932: HLODmax = 4.7, Pjoint = 6.6 Ɨ 10(-6)), a locus located āˆ¼2 Mb upstream of the membrane-spanning 4A gene cluster. We additionally identified a locus at 7p14.3 (rs10255835: HLODmax = 4.9, Pjoint = 1.2 Ɨ 10(-5)), a region harboring genes associated with the nervous system (GARS, GHRHR, and NEUROD6). DISCUSSION: Future sequencing efforts should focus on these regions because they may harbor familial LOAD causative mutations

    Rare Variants Imputation in Admixed Populations: Comparison Across Reference Panels and Bioinformatics Tools

    Get PDF
    BackgroundImputation has become a standard approach in genome-wide association studies (GWAS) to infer in silico untyped markers. Although feasibility for common variants imputation is well established, we aimed to assess rare and ultra-rare variantsā€™ imputation in an admixed Caribbean Hispanic population (CH).MethodsWe evaluated imputation accuracy in CH (N = 1,000), focusing on rare (0.1% ā‰¤ minor allele frequency (MAF) ā‰¤ 1%) and ultra-rare (MAF &lt; 0.1%) variants. We used two reference panels, the Haplotype Reference Consortium (HRC; N = 27,165) and 1000 Genome Project (1000G phase 3; N = 2,504) and multiple phasing (SHAPEIT, Eagle2) and imputation algorithms (IMPUTE2, MACH-Admix). To assess imputation quality, we reported: (a) high-quality variant counts according to imputation toolsā€™ internal indexes (e.g., IMPUTE2 ā€œInfoā€ ā‰„ 80%). (b) Wilcoxon Signed-Rank Test comparing imputation quality for genotyped variants that were masked and imputed; (c) Cohenā€™s kappa coefficient to test agreement between imputed and whole-exome sequencing (WES) variants; (d) imputation of G206A mutation in the PSEN1 (ultra-rare in the general population an more frequent in CH) followed by confirmation genotyping. We also tested ancestry proportion (European, African and Native American) against WES-imputation mismatches in a Poisson regression fashion.ResultsSHAPEIT2 retrieved higher percentage of imputed high-quality variants than Eagle2 (rare: 51.02% vs. 48.60%; ultra-rare 0.66% vs. 0.65%, Wilcoxon p-value &lt; 0.001). SHAPEIT-IMPUTE2 employing HRC outperformed 1000G (64.50% vs. 59.17%; 1.69% vs. 0.75% for high-quality rare and ultra-rare variants, respectively, Wilcoxon p-value &lt; 0.001). SHAPEIT-IMPUTE2 outperformed MaCH-Admix. Compared to 1000G, HRC-imputation retrieved a higher number of high-quality rare and ultra-rare variants, despite showing lower agreement between imputed and WES variants (e.g., rare: 98.86% for HRC vs. 99.02% for 1000G). High Kappa (K = 0.99) was observed for both reference panels. Twelve G206A mutation carriers were imputed and all validated by confirmation genotyping. African ancestry was associated with higher imputation errors for uncommon and rare variants (p-value &lt; 1e-05).ConclusionReference panels with larger numbers of haplotypes can improve imputation quality for rare and ultra-rare variants in admixed populations such as CH. Ethnic composition is an important predictor of imputation accuracy, with higher African ancestry associated with poorer imputation accuracy
    • ā€¦
    corecore