6 research outputs found

    Comparison between instrumental variable and mediation-based methods for reconstructing causal gene networks in yeast

    Get PDF
    Under embargo until: 2021-12-17Causal gene networks model the flow of information within a cell. Reconstructing causal networks from omics data is challenging because correlation does not imply causation. When genomics and transcriptomics data from a segregating population are combined, genomic variants can be used to orient the direction of causality between gene expression traits. Instrumental variable methods use a local expression quantitative trait locus (eQTL) as a randomized instrument for a gene's expression level, and assign target genes based on distal eQTL associations. Mediation-based methods additionally require that distal eQTL associations are mediated by the source gene. A detailed comparison between these methods has not yet been conducted, due to the lack of a standardized implementation of different methods, the limited sample size of most multi-omics datasets, and the absence of ground-truth networks for most organisms. Here we used Findr, a software package providing uniform implementations of instrumental variable, mediation, and coexpression-based methods, a recent dataset of 1012 segregants from a cross between two budding yeast strains, and the YEASTRACT database of known transcriptional interactions to compare causal gene network inference methods. We found that causal inference methods result in a significant overlap with the ground-truth, whereas coexpression did not perform better than random. A subsampling analysis revealed that the performance of mediation saturates at large sample sizes, due to a loss of sensitivity when residual correlations become significant. Instrumental variable methods on the other hand contain false positive predictions, due to genomic linkage between eQTL instruments. Instrumental variable and mediation-based methods also have complementary roles for identifying causal genes underlying transcriptional hotspots. Instrumental variable methods correctly predicted STB5 targets for a hotspot centred on the transcription factor STB5, whereas mediation failed due to Stb5p auto-regulating its own expression. Mediation suggests a new candidate gene, DNM1, for a hotspot on Chr XII, whereas instrumental variable methods could not distinguish between multiple genes located within the hotspot. In conclusion, causal inference from genomics and transcriptomics data is a powerful approach for reconstructing causal gene networks, which could be further improved by the development of methods to control for residual correlations in mediation analyses, and for genomic linkage and pleiotropic effects from transcriptional hotspots in instrumental variable analyses.acceptedVersio

    Changes in the gene expression profile during spontaneous migraine attacks

    Get PDF
    Migraine attacks are delimited, allowing investigation of changes during and outside attack. Gene expression fluctuates according to environmental and endogenous events and therefore, we hypothesized that changes in RNA expression during and outside a spontaneous migraine attack exist which are specific to migraine. Twenty-seven migraine patients were assessed during a spontaneous migraine attack, including headache characteristics and treatment effect. Blood samples were taken during attack, two hours after treatment, on a headache-free day and after a cold pressor test. RNA-Sequencing, genotyping, and steroid profiling were performed. RNA-Sequences were analyzed at gene level (differential expression analysis) and at network level, and genomic and transcriptomic data were integrated. We found 29 differentially expressed genes between ‘attack’ and ‘after treatment’, after subtracting non-migraine specific genes, that were functioning in fatty acid oxidation, signaling pathways and immune-related pathways. Network analysis revealed mechanisms affected by changes in gene interactions, e.g. ‘ion transmembrane transport’. Integration of genomic and transcriptomic data revealed pathways related to sumatriptan treatment, i.e. ‘5HT1 type receptor mediated signaling pathway’. In conclusion, we uniquely investigated intra-individual changes in gene expression during a migraine attack. We revealed both genes and pathways potentially involved in the pathophysiology of migraine and/or migraine treatment.publishedVersio

    High-Dimensional Bayesian Network Inference From Systems Genetics Data Using Genetic Node Ordering

    No full text
    Studying the impact of genetic variation on gene regulatory networks is essential to understand the biological mechanisms by which genetic variation causes variation in phenotypes. Bayesian networks provide an elegant statistical approach for multi-trait genetic mapping and modelling causal trait relationships. However, inferring Bayesian gene networks from high-dimensional genetics and genomics data is challenging, because the number of possible networks scales super-exponentially with the number of nodes, and the computational cost of conventional Bayesian network inference methods quickly becomes prohibitive. We propose an alternative method to infer high-quality Bayesian gene networks that easily scales to thousands of genes. Our method first reconstructs a node ordering by conducting pairwise causal inference tests between genes, which then allows to infer a Bayesian network via a series of independent variable selection problems, one for each gene. We demonstrate using simulated and real systems genetics data that this results in a Bayesian network with equal, and sometimes better, likelihood than the conventional methods, while having a significantly higher overlap with groundtruth networks and being orders of magnitude faster. Moreover our method allows for a unified false discovery rate control across genes and individual edges, and thus a rigorous and easily interpretable way for tuning the sparsity level of the inferred network. Bayesian network inference using pairwise node ordering is a highly efficient approach for reconstructing gene regulatory networks when prior information for the inclusion of edges exists or can be inferred from the available data

    Model-based clustering of multi-tissue gene expression data

    Get PDF
    Motivation Recently, it has become feasible to generate large-scale, multi-tissue gene expression data, where expression profiles are obtained from multiple tissues or organs sampled from dozens to hundreds of individuals. When traditional clustering methods are applied to this type of data, important information is lost, because they either require all tissues to be analyzed independently, ignoring dependencies and similarities between tissues, or to merge tissues in a single, monolithic dataset, ignoring individual characteristics of tissues. Results We developed a Bayesian model-based multi-tissue clustering algorithm, revamp, which can incorporate prior information on physiological tissue similarity, and which results in a set of clusters, each consisting of a core set of genes conserved across tissues as well as differential sets of genes specific to one or more subsets of tissues. Using data from seven vascular and metabolic tissues from over 100 individuals in the STockholm Atherosclerosis Gene Expression (STAGE) study, we demonstrate that multi-tissue clusters inferred by revamp are more enriched for tissue-dependent protein-protein interactions compared to alternative approaches. We further demonstrate that revamp results in easily interpretable multi-tissue gene expression associations to key coronary artery disease processes and clinical phenotypes in the STAGE individuals

    Changes in the gene expression profile during spontaneous migraine attacks

    No full text
    Migraine attacks are delimited, allowing investigation of changes during and outside attack. Gene expression fluctuates according to environmental and endogenous events and therefore, we hypothesized that changes in RNA expression during and outside a spontaneous migraine attack exist which are specific to migraine. Twenty-seven migraine patients were assessed during a spontaneous migraine attack, including headache characteristics and treatment effect. Blood samples were taken during attack, two hours after treatment, on a headache-free day and after a cold pressor test. RNA-Sequencing, genotyping, and steroid profiling were performed. RNA-Sequences were analyzed at gene level (differential expression analysis) and at network level, and genomic and transcriptomic data were integrated. We found 29 differentially expressed genes between ‘attack’ and ‘after treatment’, after subtracting non-migraine specific genes, that were functioning in fatty acid oxidation, signaling pathways and immune-related pathways. Network analysis revealed mechanisms affected by changes in gene interactions, e.g. ‘ion transmembrane transport’. Integration of genomic and transcriptomic data revealed pathways related to sumatriptan treatment, i.e. ‘5HT1 type receptor mediated signaling pathway’. In conclusion, we uniquely investigated intra-individual changes in gene expression during a migraine attack. We revealed both genes and pathways potentially involved in the pathophysiology of migraine and/or migraine treatment

    Variation in the SERPINA6/SERPINA1 locus alters morning plasma cortisol, hepatic corticosteroid binding globulin expression, gene expression in peripheral tissues, and risk of cardiovascular disease

    No full text
    The stress hormone cortisol modulates fuel metabolism, cardiovascular homoeostasis, mood, inflammation and cognition. The CORtisol NETwork (CORNET) consortium previously identified a single locus associated with morning plasma cortisol. Identifying additional genetic variants that explain more of the variance in cortisol could provide new insights into cortisol biology and provide statistical power to test the causative role of cortisol in common diseases. The CORNET consortium extended its genome-wide association meta-analysis for morning plasma cortisol from 12,597 to 25,314 subjects and from ~2.2 M to ~7 M SNPs, in 17 population-based cohorts of European ancestries. We confirmed the genetic association with SERPINA6/SERPINA1. This locus contains genes encoding corticosteroid binding globulin (CBG) and α1-antitrypsin. Expression quantitative trait loci (eQTL) analyses undertaken in the STARNET cohort of 600 individuals showed that specific genetic variants within the SERPINA6/SERPINA1 locus influence expression of SERPINA6 rather than SERPINA1 in the liver. Moreover, trans-eQTL analysis demonstrated effects on adipose tissue gene expression, suggesting that variations in CBG levels have an effect on delivery of cortisol to peripheral tissues. Two-sample Mendelian randomisation analyses provided evidence that each genetically-determined standard deviation (SD) increase in morning plasma cortisol was associated with increased odds of chronic ischaemic heart disease (0.32, 95% CI 0.06–0.59) and myocardial infarction (0.21, 95% CI 0.00–0.43) in UK Biobank and similarly in CARDIoGRAMplusC4D. These findings reveal a causative pathway for CBG in determining cortisol action in peripheral tissues and thereby contributing to the aetiology of cardiovascular disease
    corecore