359 research outputs found

    Comprehensive Survey of SNPs in the Affymetrix Exon Array Using the 1000 Genomes Dataset

    Get PDF
    Microarray gene expression data has been used in genome-wide association studies to allow researchers to study gene regulation as well as other complex phenotypes including disease risks and drug response. To reach scientifically sound conclusions from these studies, however, it is necessary to get reliable summarization of gene expression intensities. Among various factors that could affect expression profiling using a microarray platform, single nucleotide polymorphisms (SNPs) in target mRNA may lead to reduced signal intensity measurements and result in spurious results. The recently released 1000 Genomes Project dataset provides an opportunity to evaluate the distribution of both known and novel SNPs in the International HapMap Project lymphoblastoid cell lines (LCLs). We mapped the 1000 Genomes Project genotypic data to the Affymetrix GeneChip Human Exon 1.0ST array (exon array), which had been used in our previous studies and for which gene expression data had been made publicly available. We also evaluated the potential impact of these SNPs on the differentially spliced probesets we had identified previously. Though the 1000 Genomes Project data allowed a comprehensive survey of the SNPs in this particular array, the same approach can certainly be applied to other microarray platforms. Furthermore, we present a detailed catalogue of SNP-containing probesets (exon-level) and transcript clusters (gene-level), which can be considered in evaluating findings using the exon array as well as benefit the design of follow-up experiments and data re-analysis

    Campylobacter jejuni Type VI Secretion System: Roles in Adaptation to Deoxycholic Acid, Host Cell Adherence, Invasion, and In Vivo Colonization

    Get PDF
    The recently identified type VI secretion system (T6SS) of proteobacteria has been shown to promote pathogenicity, competitive advantage over competing microorganisms, and adaptation to environmental perturbation. By detailed phenotypic characterization of loss-of-function mutants, in silico, in vitro and in vivo analyses, we provide evidence that the enteric pathogen, Campylobacter jejuni, possesses a functional T6SS and that the secretion system exerts pleiotropic effects on two crucial processes – survival in a bile salt, deoxycholic acid (DCA), and host cell adherence and invasion. The expression of T6SS during initial exposure to the upper range of physiological levels of DCA (0.075%–0.2%) was detrimental to C. jejuni proliferation, whereas down-regulation or inactivation of T6SS enabled C. jejuni to resist this effect. The C. jejuni multidrug efflux transporter gene, cmeA, was significantly up-regulated during the initial exposure to DCA in the wild type C. jejuni relative to the T6SS-deficient strains, suggesting that inhibition of proliferation is the consequence of T6SS-mediated DCA influx. A sequential modulation of the efflux transporter activity and the T6SS represents, in part, an adaptive mechanism for C. jejuni to overcome this inhibitory effect, thereby ensuring its survival. C. jejuni T6SS plays important roles in host cell adhesion and invasion as T6SS inactivation resulted in a reduction of adherence to and invasion of in vitro cell lines, while over-expression of a hemolysin co-regulated protein, which encodes a secreted T6SS component, greatly enhanced these processes. When inoculated into B6.129P2-IL-10[superscript tm1Cgn] mice, the T6SS-deficient C. jejuni strains did not effectively establish persistent colonization, indicating that T6SS contributes to colonization in vivo. Taken together, our data demonstrate the importance of bacterial T6SS in host cell adhesion, invasion, colonization and, for the first time to our knowledge, adaptation to DCA, providing new insights into the role of T6SS in C. jejuni pathogenesis

    Imputing Gene Expression in Uncollected Tissues Within and Beyond GTEx

    Get PDF
    Gene expression and its regulation can vary substantially across tissue types. In order to generate knowledge about gene expression in human tissues, the Genotype-Tissue Expression (GTEx) program has collected transcriptome data in a wide variety of tissue types from post-mortem donors. However, many tissue types are difficult to access and are not collected in every GTEx individual. Furthermore, in non-GTEx studies, the accessibility of certain tissue types greatly limits the feasibility and scale of studies of multi-tissue expression. In this work, we developed multi-tissue imputation methods to impute gene expression in uncollected or inaccessible tissues. Via simulation studies, we showed that the proposed methods outperform existing imputation methods in multi-tissue expression imputation and that incorporating imputed expression data can improve power to detect phenotype-expression correlations. By analyzing data from nine selected tissue types in the GTEx pilot project, we demonstrated that harnessing expression quantitative trait loci (eQTLs) and tissue-tissue expression-level correlations can aid imputation of transcriptome data from uncollected GTEx tissues. More importantly, we showed that by using GTEx data as a reference, one can impute expression levels in inaccessible tissues in non-GTEx expression studies

    A Gene-Based Association Method for Mapping Traits Using Reference Transcriptome Data

    Get PDF
    Genome-wide association studies (GWAS) have identified thousands of variants robustly associated with complex traits. However, the biological mechanisms underlying these associations are, in general, not well understood. We propose a gene-based association method called PrediXcan that directly tests the molecular mechanisms through which genetic variation affects phenotype. The approach estimates the component of gene expression determined by an individual’s genetic profile and correlates ‘imputed’ gene expression with the phenotype under investigation to identify genes involved in the etiology of the phenotype. Genetically regulated gene expression is estimated using whole-genome tissue-dependent prediction models trained with reference transcriptome data sets. PrediXcan enjoys the benefits of gene-based approaches such as reduced multiple-testing burden and a principled approach to the design of follow-up experiments. Our results demonstrate that PrediXcan can detect known and new genes associated with disease traits and provide insights into the mechanism of these associations

    Genetic architecture of host proteins involved in SARS-CoV-2 infection

    Get PDF
    Understanding the genetic architecture of host proteins interacting with SARS-CoV-2 or mediating the maladaptive host response to COVID-19 can help to identify new or repurpose existing drugs targeting those proteins. We present a genetic discovery study of 179 such host proteins among 10,708 individuals using an aptamer-based technique. We identify 220 host DNA sequence variants acting in cis (MAF 0.01-49.9%) and explaining 0.3-70.9% of the variance of 97 of these proteins, including 45 with no previously known protein quantitative trait loci (pQTL) and 38 encoding current drug targets. Systematic characterization of pQTLs across the phenome identified protein-drug-disease links and evidence that putative viral interaction partners such as MARK3 affect immune response. Our results accelerate the evaluation and prioritization of new drug development programmes and repurposing of trials to prevent, treat or reduce adverse outcomes. Rapid sharing and detailed interrogation of results is facilitated through an interactive webserver (https://omicscience.org/apps/covidpgwas/).We further acknowledge support for genomics from the Medical Research Council (MC_PC_13046). Proteomic measurements were supported and governed by a collaboration agreement between the University of Cambridge and Somalogic. JCZ and VPWA are supported by a 4-year Wellcome Trust PhD Studentship and the Cambridge Trust, CL, EW, and NJW are funded by the Medical Research Council (MC_UU_12015/1). NJW and ADH are an NIHR Senior Investigator. GK is supported by grants from the National Institute on Aging (NIA): R01 AG057452, RF1 AG058942, RF1 AG059093, U01 AG061359, and U19 AG063744. MR acknowledges funding from the Francis Crick Institute, which receives its core funding from Cancer Research UK (FC001134), the UK Medical Research Council (FC001134), and the Wellcome Trust (FC001134). ERG is supported by the National Human Genome Research Institute of the National Institutes of Health under Award Numbers R35HG010718 and R01HG011138. JR is supported by the German Federal Ministry of Education and Research (BMBF) within the framework of the e:Med research and funding concept (grant no. 01ZX1912D). This work was supported by the UCL British Heart Foundation Research Accelerator Award (AA/18/6/34223), the National Institute for Health Research University College London Hospitals Biomedical Research Centre, and arises from one of the national "Covid-19 Cardiovascular Disease Flagship Projects" designated by the NIHR-BHF Cardiovascular Partnership

    Mapping the proteo-genomic convergence of human diseases

    Get PDF
    Characterization of the genetic regulation of proteins is essential for understanding disease etiology and developing therapies. We identified 10,674 genetic associations for 3892 plasma proteins to create a cis-anchored gene-protein-disease map of 1859 connections that highlights strong cross-disease biological convergence. This proteo-genomic map provides a framework to connect etiologically related diseases, to provide biological context for new or emerging disorders, and to integrate different biological domains to establish mechanisms for known gene-disease links. Our results identify proteo-genomic connections within and between diseases and establish the value of cis-protein variants for annotation of likely causal disease genes at loci identified in genome-wide association studies, thereby addressing a major barrier to experimental validation and clinical translation of genetic discoveries
    • …
    corecore