378 research outputs found

    Finding genes and variants for lipid levels after genome-wide association analysis

    Get PDF
    We review the main findings from genome-wide association studies (GWAS) for levels of HDL-cholesterol, LDL-cholesterol and triglycerides, including approaches to identify the functional variant(s) or gene(s). We discuss study design and challenges related to whole genome or exome sequencing to identify novel genes and variants

    Genotype imputation

    Get PDF
    Genotype imputation is now an essential tool in the analysis of genome-wide association scans. This technique allows geneticists to accurately evaluate the evidence for association at genetic markers that are not directly genotyped. Genotype imputation is particularly useful for combining results across studies that rely on different genotyping platforms but also increases the power of individual scans. Here, we review the history and theoretical underpinnings of the technique. To illustrate performance of the approach, we summarize results from several gene mapping studies. Finally, we preview the role of genotype imputation in an era when whole genome resequencing is becoming increasingly common

    MaCH: Using sequence and genotype data to estimate haplotypes and unobserved genotypes

    Get PDF
    Genome‐wide association studies (GWAS) can identify common alleles that contribute to complex disease susceptibility. Despite the large number of SNPs assessed in each study, the effects of most common SNPs must be evaluated indirectly using either genotyped markers or haplotypes thereof as proxies. We have previously implemented a computationally efficient Markov Chain framework for genotype imputation and haplotyping in the freely available MaCH software package. The approach describes sampled chromosomes as mosaics of each other and uses available genotype and shotgun sequence data to estimate unobserved genotypes and haplotypes, together with useful measures of the quality of these estimates. Our approach is already widely used to facilitate comparison of results across studies as well as meta‐analyses of GWAS. Here, we use simulations and experimental genotypes to evaluate its accuracy and utility, considering choices of genotyping panels, reference panel configurations, and designs where genotyping is replaced with shotgun sequencing. Importantly, we show that genotype imputation not only facilitates cross study analyses but also increases power of genetic association studies. We show that genotype imputation of common variants using HapMap haplotypes as a reference is very accurate using either genome‐wide SNP data or smaller amounts of data typical in fine‐mapping studies. Furthermore, we show the approach is applicable in a variety of populations. Finally, we illustrate how association analyses of unobserved variants will benefit from ongoing advances such as larger HapMap reference panels and whole genome shotgun sequencing technologies

    WikiGWA: an open platform for collecting and using genome-wide association results

    Get PDF
    The number of discovered genetic variants from genome-wide association (GWA) studies (GWAS) has been growing rapidly. Centralized efforts such as the National Human Genome Research Institute's GWAS catalog provide regular updates and a convenient interface for quick lookup. However, the catalog entries are manually curated and rely on data from published articles. Other tools such as SNPedia (http://www.snpedia.com) collect published results regarding functional consequences of genetic variations. Here, we propose an approach that allows individual investigators to share their GWA results through an open platform. Unlike GWAS catalog or SNPedia, wikiGWA collects first-hand GWAS results and in a much larger scale. Investigators are not only able to post a much larger amount of results, but also post results from unpublished studies, which could alleviate publication bias and facilitate identification of weak signals. Our interface allows for flexible and fast queries, and the query results are formatted to work seamlessly with the LocusZoom program for visualization and annotation. We here describe wikiGWA, made publically available at http://www.wikiGWA.org

    GREGOR: evaluating global enrichment of trait-associated variants in epigenomic features using a systematic, data-driven approach

    Get PDF
    Motivation: The majority of variation identified by genome wide association studies falls in non-coding genomic regions and is hypothesized to impact regulatory elements that modulate gene expression. Here we present a statistically rigorous software tool GREGOR (Genomic Regulatory Elements and Gwas Overlap algoRithm) for evaluating enrichment of any set of genetic variants with any set of regulatory features. Using variants from five phenotypes, we describe a data-driven approach to determine the tissue and cell types most relevant to a trait of interest and to identify the subset of regulatory features likely impacted by these variants. Last, we experimentally evaluate six predicted functional variants at six lipid-associated loci and demonstrate significant evidence for allele-specific impact on expression levels. GREGOR systematically evaluates enrichment of genetic variation with the vast collection of regulatory data available to explore novel biological mechanisms of disease and guide us toward the functional variant at trait-associated loci

    Genetic associations with temporal shifts in obesity and severe obesity during the obesity epidemic in Norway:A longitudinal population-based cohort (the HUNT Study)

    Get PDF
    Background Obesity has tripled worldwide since 1975 as environments are becoming more obesogenic. Our study investigates how changes in population weight and obesity over time are associated with genetic predisposition in the context of an obesogenic environment over 6 decades and examines the robustness of the findings using sibling design. Methods and findings A total of 67,110 individuals aged 13–80 years in the Nord-Trøndelag region of Norway participated with repeated standardized body mass index (BMI) measurements from 1966 to 2019 and were genotyped in a longitudinal population-based health study, the Trøndelag Health Study (the HUNT Study). Genotyping required survival to and participation in the HUNT Study in the 1990s or 2000s. Linear mixed models with observations nested within individuals were used to model the association between a genome-wide polygenic score (GPS) for BMI and BMI, while generalized estimating equations were used for obesity (BMI ≥ 30 kg/m2) and severe obesity (BMI ≥ 35 kg/m2). The increase in the average BMI and prevalence of obesity was steeper among the genetically predisposed. Among 35-year-old men, the prevalence of obesity for the least predisposed tenth increased from 0.9% (95% confidence interval [CI] 0.6% to 1.2%) to 6.5% (95% CI 5.0% to 8.0%), while the most predisposed tenth increased from 14.2% (95% CI 12.6% to 15.7%) to 39.6% (95% CI 36.1% to 43.0%). Equivalently for women of the same age, the prevalence of obesity for the least predisposed tenth increased from 1.1% (95% CI 0.7% to1.5%) to 7.6% (95% CI 6.0% to 9.2%), while the most predisposed tenth increased from 15.4% (95% CI 13.7% to 17.2%) to 42.0% (95% CI 38.7% to 45.4%). Thus, for 35-year-old men and women, respectively, the absolute change in the prevalence of obesity from 1966 to 2019 was 19.8 percentage points (95% CI 16.2 to 23.5, p < 0.0001) and 20.0 percentage points (95% CI 16.4 to 23.7, p < 0.0001) greater for the most predisposed tenth compared with the least predisposed tenth, defined using the GPS for BMI. The corresponding absolute changes in the prevalence of severe obesity for men and women, respectively, were 8.5 percentage points (95% CI 6.3 to 10.7, p < 0.0001) and 12.6 percentage points (95% CI 9.6 to 15.6, p < 0.0001) greater for the most predisposed tenth. The greater increase in BMI in genetically predisposed individuals over time was apparent after adjustment for family-level confounding using a sibling design. Key limitations include a slightly lower survival to date of genetic testing for the older cohorts and that we apply a contemporary genetic score to past time periods. Future research should validate our findings using a polygenic risk score constructed from historical data. Conclusions In the context of increasingly obesogenic changes in our environment over 6 decades, our findings reveal a growing inequality in the risk for obesity and severe obesity across GPS tenths. Our results suggest that while obesity is a partially heritable trait, it is still modifiable by environmental factors. While it may be possible to identify those most susceptible to environmental change, who thus have the most to gain from preventive measures, efforts to reverse the obesogenic environment will benefit the whole population and help resolve the obesity epidemic

    Improving power of association tests using multiple sets of imputed genotypes from distributed reference panels

    Full text link
    The accuracy of genotype imputation depends upon two factors: the sample size of the reference panel and the genetic similarity between the reference panel and the target samples. When multiple reference panels are not consented to combine together, it is unclear how to combine the imputation results to optimize the power of genetic association studies. We compared the accuracy of 9,265 Norwegian genomes imputed from three reference panels—1000 Genomes phase 3 (1000G), Haplotype Reference Consortium (HRC), and a reference panel containing 2,201 Norwegian participants from the population‐based Nord Trøndelag Health Study (HUNT) from low‐pass genome sequencing. We observed that the population‐matched reference panel allowed for imputation of more population‐specific variants with lower frequency (minor allele frequency (MAF) between 0.05% and 0.5%). The overall imputation accuracy from the population‐specific panel was substantially higher than 1000G and was comparable with HRC, despite HRC being 15‐fold larger. These results recapitulate the value of population‐specific reference panels for genotype imputation. We also evaluated different strategies to utilize multiple sets of imputed genotypes to increase the power of association studies. We observed that testing association for all variants imputed from any panel results in higher power to detect association than the alternative strategy of including only one version of each genetic variant, selected for having the highest imputation quality metric. This was particularly true for lower frequency variants (MAF < 1%), even after adjusting for the additional multiple testing burden.Peer Reviewedhttps://deepblue.lib.umich.edu/bitstream/2027.42/139954/1/gepi22067_am.pdfhttps://deepblue.lib.umich.edu/bitstream/2027.42/139954/2/gepi22067.pd

    Causal relationships between NAFLD, T2D and obesity have implications for disease subphenotyping

    Get PDF
    Background & aims: Non-alcoholic fatty liver disease (NAFLD), type 2 diabetes (T2D) and obesity are epidemiologically correlated with each other but the causal inter-relationships between them remain incompletely understood. We aimed to explore the causal relationships between the 3 diseases. Methods: Using both UK Biobank and publicly available genome-wide association study data, we performed a 2-sample bidirectional Mendelian randomization analysis to test the causal inter-relationships between NAFLD, T2D, and obesity. Transgenic mice expressing the human PNPLA3-I148M isoforms (TghPNPLA3-I148M) were used as an example to validate causal effects and explore underlying mechanisms. Results: Genetically driven NAFLD significantly increased the risk of T2D and central obesity but not insulin resistance or generalized obesity, while genetically driven T2D, body mass index and WHRadjBMI causally increased NAFLD risk. The animal study focusing on PNPLA3 corroborated these causal effects: compared to the TghPNPLA3-I148I controls, the TghPNPLA3-I148M mice developed glucose intolerance and increased visceral fat, but maintained normal insulin sensitivity, reduced body weight, and decreased circulating total cholesterol. Mechanistically, the TghPNPLA3-I148M mice demonstrated decreased pancreatic insulin but increased glucagon secretion, which was associated with increased pancreatic inflammation. In addition, transcription of hepatic cholesterol biosynthesis pathway genes was significantly suppressed, while transcription of thermogenic pathway genes was activated in subcutaneous and brown adipose tissues but not in visceral fat in TghPNPLA3-I148M mice. Conclusions: Our study suggests that lifelong, genetically driven NAFLD causally promotes T2D with a late-onset type 1-like diabetic subphenotype and central obesity; while genetically driven T2D, obesity, and central obesity all causally increase the risk of NAFLD. This causal relationship revealed new insights into how nature and nurture drive these diseases, providing novel hypotheses for disease subphenotyping. Lay summary: Non-alcoholic fatty liver disease, type 2 diabetes and obesity are epidemiologically correlated with each other, but their causal relationships were incompletely understood. Herein, we identified causal relationships between these conditions, which suggest that each of these closely related diseases should be further stratified into subtypes. This is important for accurate diagnosis, prevention and treatment of these diseases

    Deep-coverage whole genome sequences and blood lipids among 16,324 individuals.

    Get PDF
    Large-scale deep-coverage whole-genome sequencing (WGS) is now feasible and offers potential advantages for locus discovery. We perform WGS in 16,324 participants from four ancestries at mean depth &gt;29X and analyze genotypes with four quantitative traits-plasma total cholesterol, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol, and triglycerides. Common variant association yields known loci except for few variants previously poorly imputed. Rare coding variant association yields known Mendelian dyslipidemia genes but rare non-coding variant association detects no signals. A high 2M-SNP LDL-C polygenic score (top 5th percentile) confers similar effect size to a monogenic mutation (~30 mg/dl higher for each); however, among those with severe hypercholesterolemia, 23% have a high polygenic score and only 2% carry a monogenic mutation. At these sample sizes and for these phenotypes, the incremental value of WGS for discovery is limited but WGS permits simultaneous assessment of monogenic and polygenic models to severe hypercholesterolemia
    corecore