3 research outputs found

    Modeling islet enhancers using deep learning identifies candidate causal variants at loci associated with T2D and glycemic traits.

    Get PDF
    Genetic association studies have identified hundreds of independent signals associated with type 2 diabetes (T2D) and related traits. Despite these successes, the identification of specific causal variants underlying a genetic association signal remains challenging. In this study, we describe a deep learning (DL) method to analyze the impact of sequence variants on enhancers. Focusing on pancreatic islets, a T2D relevant tissue, we show that our model learns islet-specific transcription factor (TF) regulatory patterns and can be used to prioritize candidate causal variants. At 101 genetic signals associated with T2D and related glycemic traits where multiple variants occur in linkage disequilibrium, our method nominates a single causal variant for each association signal, including three variants previously shown to alter reporter activity in islet-relevant cell types. For another signal associated with blood glucose levels, we biochemically test all candidate causal variants from statistical fine-mapping using a pancreatic islet beta cell line and show biochemical evidence of allelic effects on TF binding for the model-prioritized variant. To aid in future research, we publicly distribute our model and islet enhancer perturbation scores across ~67 million genetic variants. We anticipate that DL methods like the one presented in this study will enhance the prioritization of candidate causal variants for functional studies

    Genetic discovery and risk characterization in type 2 diabetes across diverse populations

    Get PDF
    Genomic discovery and characterization of risk loci for type 2 diabetes (T2D) have been conducted primarily in individuals of European ancestry. We conducted a multiethnic genome-wide association study of T2D among 53,102 cases and 193,679 control subjects from African, Hispanic, Asian, Native Hawaiian, and European population groups in the Population Architecture Genomics and Epidemiology (PAGE) and Diabetes Genetics Replication and Meta-analysis (DIAGRAM) Consortia. In individuals of African ancestry, we discovered a risk variant in the TGFB1 gene (rs11466334, risk allele frequency (RAF) = 6.8%, odds ratio [OR] = 1.27, p = 2.06 × 10−8), which replicated in independent studies of African ancestry (p = 6.26 × 10−23). We identified a multiethnic risk variant in the BACE2 gene (rs13052926, RAF = 14.1%, OR = 1.08, p = 5.75 × 10−9), which also replicated in independent studies (p = 3.45 × 10−4). We also observed a significant difference in the performance of a multiethnic genetic risk score (GRS) across population groups (pheterogeneity = 3.85 × 10−20). Comparing individuals in the top GRS risk category (40%–60%), the OR was highest in Asians (OR = 3.08) and European (OR = 2.94) ancestry populations, followed by Hispanic (OR = 2.39), Native Hawaiian (OR = 2.02), and African ancestry (OR = 1.57) populations. These findings underscore the importance of genetic discovery and risk characterization in diverse populations and the urgent need to further increase representation of non-European ancestry individuals in genetics research to improve genetic-based risk prediction across populations
    corecore