30 research outputs found

    Statistical Methods and Models for Modern Genetic Analysis.

    Full text link
    The Genome-Wide Association Study (GWAS) is the predominant tool to search for genetic risk variants that contribute to complex human disease. Despite the large number of GWAS findings, variants implicated by GWAS are themselves unlikely to fully explain the heritability of many diseases. In this dissertation, we propose statistical methods to augment GWAS and further our understanding of the genetic causes of complex disease. In the first project, we consider the challenges of a gene-environment analysis performed as a follow-up to a significant initial GWAS result. It is known that effect estimates based on the same data that showed the significant GWAS result suffer from an upward bias called the “Winner's Curse." We show that the initial GWAS testing strategy can induce bias in both follow-up hypothesis testing and estimation for gene-environment interaction. We propose a novel bias-correction method based on a partial likelihood Markov Chain Monte Carlo algorithm. In the second project, we shift attention to rare genetic variants that have low power of being detected by GWAS. We propose the Cumulative Minor Allele Test (CMAT) to pool together multiple rare variants from the same gene and test for an excessive burden of rare variants in either cases or controls. We show the CMAT performs favorably across a range of study designs. Notably, the CMAT accommodates probabilistic genotypes, extending applicability to low-coverage and imputed sequence data. We use a simulation analysis to validate study designs that combine sequenced and imputed samples as a means to improve power to detect rare risk variants. Determining conditions that optimize imputation accuracy is important for successful application. In the final project, we propose a coalescent model of genotype imputation that allows fast, analytical estimates of imputation accuracy across complex population genetic models. We use our model to compare the performance of custom-made reference panels drawn from the same source population as imputation targets to publicly available reference panels (i.e. 1000 Genomes Project) that may differ in ancestry from the targets.Ph.D.BiostatisticsUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/89761/1/mattz_1.pd

    Meta-Analysis of Gene Level Tests for Rare Variant Association

    Get PDF
    The vast majority of connections between complex disease and common genetic variants were identified through meta-analysis, a powerful approach that enables large sample sizes while protecting against common artifacts due to population structure, repeated small sample analyses, and/or limitations with sharing individual level data. As the focus of genetic association studies shifts to rare variants, genes and other functional units are becoming the unit of analysis. Here, we propose and evaluate new approaches for performing meta-analysis of rare variant association tests, including burden tests, weighted burden tests, variable threshold tests and tests that allow variants with opposite effects to be grouped together. We show that our approach retains useful features of single variant meta-analytic approaches and demonstrate its utility in a study of blood lipid levels in ∼18,500 individuals genotyped with exome arrays

    GWAS of thyroid stimulating hormone highlights pleiotropic effects and inverse association with thyroid cancer

    Get PDF
    Correction: Volume12, Issue1 Article Number7354 DOI10.1038/s41467-021-27675-w PublishedDEC 16 2021Thyroid stimulating hormone (TSH) is critical for normal development and metabolism. To better understand the genetic contribution to TSH levels, we conduct a GWAS meta-analysis at 22.4 million genetic markers in up to 119,715 individuals and identify 74 genome-wide significant loci for TSH, of which 28 are previously unreported. Functional experiments show that the thyroglobulin protein-altering variants P118L and G67S impact thyroglobulin secretion. Phenome-wide association analysis in the UK Biobank demonstrates the pleiotropic effects of TSH-associated variants and a polygenic score for higher TSH levels is associated with a reduced risk of thyroid cancer in the UK Biobank and three other independent studies. Two-sample Mendelian randomization using TSH index variants as instrumental variables suggests a protective effect of higher TSH levels (indicating lower thyroid function) on risk of thyroid cancer and goiter. Our findings highlight the pleiotropic effects of TSH-associated variants on thyroid function and growth of malignant and benign thyroid tumors. Thyroid stimulating hormone (TSH) is critical for normal development and metabolism. Here, the authors conduct a GWAS and suggest protective effect of higher TSH on risk of thyroid cancer and goitre.Peer reviewe

    Author Correction:GWAS of thyroid stimulating hormone highlights the pleiotropic effects and inverse association with thyroid cancer

    Get PDF
    The original version of this article contained an error in the results, in the second paragraph of the subsection entitled “Fine-mapping for potentially causal variants among TSH loci”, in which effect sizes for two variants were incorrectly reported

    Evaluating the contribution of rare variants to type 2 diabetes and related traits using pedigrees

    Get PDF
    Significance Contributions of rare variants to common and complex traits such as type 2 diabetes (T2D) are difficult to measure. This paper describes our results from deep whole-genome analysis of large Mexican-American pedigrees to understand the role of rare-sequence variations in T2D and related traits through enriched allele counts in pedigrees. Our study design was well-powered to detect association of rare variants if rare variants with large effects collectively accounted for large portions of risk variability, but our results did not identify such variants in this sample. We further quantified the contributions of common and rare variants in gene expression profiles and concluded that rare expression quantitative trait loci explain a substantive, but minor, portion of expression heritability.</jats:p

    A new strategy for enhancing imputation quality of rare variants from next-generation sequencing data via combining SNP and exome chip data

    Get PDF
    Background: Rare variants have gathered increasing attention as a possible alternative source of missing heritability. Since next generation sequencing technology is not yet cost-effective for large-scale genomic studies, a widely used alternative approach is imputation. However, the imputation approach may be limited by the low accuracy of the imputed rare variants. To improve imputation accuracy of rare variants, various approaches have been suggested, including increasing the sample size of the reference panel, using sequencing data from study-specific samples (i.e., specific populations), and using local reference panels by genotyping or sequencing a subset of study samples. While these approaches mainly utilize reference panels, imputation accuracy of rare variants can also be increased by using exome chips containing rare variants. The exome chip contains 250 K rare variants selected from the discovered variants of about 12,000 sequenced samples. If exome chip data are available for previously genotyped samples, the combined approach using a genotype panel of merged data, including exome chips and SNP chips, should increase the imputation accuracy of rare variants. Results: In this study, we describe a combined imputation which uses both exome chip and SNP chip data simultaneously as a genotype panel. The effectiveness and performance of the combined approach was demonstrated using a reference panel of 848 samples constructed using exome sequencing data from the T2D-GENES consortium and 5,349 sample genotype panels consisting of an exome chip and SNP chip. As a result, the combined approach increased imputation quality up to 11 %, and genomic coverage for rare variants up to 117.7 % (MAF < 1 %), compared to imputation using the SNP chip alone. Also, we investigated the systematic effect of reference panels on imputation quality using five reference panels and three genotype panels. The best performing approach was the combination of the study specific reference panel and the genotype panel of combined data. Conclusions: Our study demonstrates that combined datasets, including SNP chips and exome chips, enhances both the imputation quality and genomic coverage of rare variants

    FOXA1 and adaptive response determinants to HER2 targeted therapy in TBCRC 036

    Get PDF
    Inhibition of the HER2/ERBB2 receptor is a keystone to treating HER2-positive malignancies, particularly breast cancer, but a significant fraction of HER2-positive (HER2+) breast cancers recur or fail to respond. Anti-HER2 monoclonal antibodies, like trastuzumab or pertuzumab, and ATP active site inhibitors like lapatinib, commonly lack durability because of adaptive changes in the tumor leading to resistance. HER2+ cell line responses to inhibition with lapatinib were analyzed by RNAseq and ChIPseq to characterize transcriptional and epigenetic changes. Motif analysis of lapatinib-responsive genomic regions implicated the pioneer transcription factor FOXA1 as a mediator of adaptive responses. Lapatinib in combination with FOXA1 depletion led to dysregulation of enhancers, impaired adaptive upregulation of HER3, and decreased proliferation. HER2-directed therapy using clinically relevant drugs (trastuzumab with or without lapatinib or pertuzumab) in a 7-day clinical trial designed to examine early pharmacodynamic response to antibody-based anti-HER2 therapy showed reduced FOXA1 expression was coincident with decreased HER2 and HER3 levels, decreased proliferation gene signatures, and increased immune gene signatures. This highlights the importance of the immune response to anti-HER2 antibodies and suggests that inhibiting FOXA1-mediated adaptive responses in combination with HER2 targeting is a potential therapeutic strategy

    Genetic Drivers of Heterogeneity in Type 2 Diabetes Pathophysiology

    Get PDF
    Type 2 diabetes (T2D) is a heterogeneous disease that develops through diverse pathophysiological processes1,2 and molecular mechanisms that are often specific to cell type3,4. Here, to characterize the genetic contribution to these processes across ancestry groups, we aggregate genome-wide association study data from 2,535,601 individuals (39.7% not of European ancestry), including 428,452 cases of T2D. We identify 1,289 independent association signals at genome-wide significance (P \u3c 5 × 10-8) that map to 611 loci, of which 145 loci are, to our knowledge, previously unreported. We define eight non-overlapping clusters of T2D signals that are characterized by distinct profiles of cardiometabolic trait associations. These clusters are differentially enriched for cell-type-specific regions of open chromatin, including pancreatic islets, adipocytes, endothelial cells and enteroendocrine cells. We build cluster-specific partitioned polygenic scores5 in a further 279,552 individuals of diverse ancestry, including 30,288 cases of T2D, and test their association with T2D-related vascular outcomes. Cluster-specific partitioned polygenic scores are associated with coronary artery disease, peripheral artery disease and end-stage diabetic nephropathy across ancestry groups, highlighting the importance of obesity-related processes in the development of vascular outcomes. Our findings show the value of integrating multi-ancestry genome-wide association study data with single-cell epigenomics to disentangle the aetiological heterogeneity that drives the development and progression of T2D. This might offer a route to optimize global access to genetically informed diabetes care

    Genetic drivers of heterogeneity in type 2 diabetes pathophysiology

    Get PDF
    Type 2 diabetes (T2D) is a heterogeneous disease that develops through diverse pathophysiological processes1,2 and molecular mechanisms that are often specific to cell type3,4. Here, to characterize the genetic contribution to these processes across ancestry groups, we aggregate genome-wide association study data from 2,535,601 individuals (39.7% not of European ancestry), including 428,452 cases of T2D. We identify 1,289 independent association signals at genome-wide significance (P &lt; 5 × 10-8) that map to 611 loci, of which 145 loci are, to our knowledge, previously unreported. We define eight non-overlapping clusters of T2D signals that are characterized by distinct profiles of cardiometabolic trait associations. These clusters are differentially enriched for cell-type-specific regions of open chromatin, including pancreatic islets, adipocytes, endothelial cells and enteroendocrine cells. We build cluster-specific partitioned polygenic scores5 in a further 279,552 individuals of diverse ancestry, including 30,288 cases of T2D, and test their association with T2D-related vascular outcomes. Cluster-specific partitioned polygenic scores are associated with coronary artery disease, peripheral artery disease and end-stage diabetic nephropathy across ancestry groups, highlighting the importance of obesity-related processes in the development of vascular outcomes. Our findings show the value of integrating multi-ancestry genome-wide association study data with single-cell epigenomics to disentangle the aetiological heterogeneity that drives the development and progression of T2D. This might offer a route to optimize global access to genetically informed diabetes care.</p

    GWAS of thyroid stimulating hormone highlights pleiotropic effects and inverse association with thyroid cancer

    Get PDF
    Thyroid stimulating hormone (TSH) is critical for normal development and metabolism. To better understand the genetic contribution to TSH levels, we conduct a GWAS meta-analysis at 22.4 million genetic markers in up to 119,715 individuals and identify 74 genome-wide significant loci for TSH, of which 28 are previously unreported. Functional experiments show that the thyroglobulin protein-altering variants P118L and G67S impact thyroglobulin secretion. Phenome-wide association analysis in the UK Biobank demonstrates the pleiotropic effects of TSH-associated variants and a polygenic score for higher TSH levels is associated with a reduced risk of thyroid cancer in the UK Biobank and three other independent studies. Two-sample Mendelian randomization using TSH index variants as instrumental variables suggests a protective effect of higher TSH levels (indicating lower thyroid function) on risk of thyroid cancer and goiter. Our findings highlight the pleiotropic effects of TSH-associated variants on thyroid function and growth of malignant and benign thyroid tumors
    corecore