Systems level analysis of non-model organisms: a tool for understanding environmental stress
Omics techniques are changing the focus of ecotoxicology. In addition to the challenges posed by large amounts of data, non-model species present further difficulties, from a lack of annotation to a limited number of additional databases for molecular interactions and functions. In this thesis, I demonstrate the use of systems biology to relate molecular measurements to physiological parameters in non-model species in the context of environmental stress. Firstly, I build a dynamical, data-driven model of how gene expression changes in relation to nerve conductance in the earthworm Eisenia fetida exposed to single chemicals in the laboratory. The model reveals that gene expression changes might reflect recovery from nerve damage. Using a similar approach, I use blue mussels (Mytilus edulis) sampled from their natural environment to model their annual cycle, integrating 1H-NMR metabolite levels with physiological and environmental parameters. I then challenge this model, which was built on data from a reference site, to detect site effects in mussels sampled from an industrial harbour. Finally, I use systems biology to relate changing chemical concentrations and traditional toxicity assays in an effluent remediation system to stickleback gene expression and morphology. I demonstrate that data-driven systems biology can help the interpretation of complex problems.
Supplementary data for this thesis can be found on the University of Birmingham eData repository at: https://doi.org/10.25500/edata.bham.0000065
Lacking mechanistic disease definitions and corresponding association data hamper progress in network medicine and beyond
A long-term objective of network medicine is to replace our current, mainly phenotype-based disease definitions with subtypes of health conditions corresponding to distinct pathomechanisms. For this, molecular and health data are modeled as networks and are mined for pathomechanisms. However, many such studies rely on large-scale disease association data where diseases are annotated using the very phenotype-based disease definitions the network medicine field aims to overcome. This raises the question of to what extent the biases that mechanistically inadequate disease annotations introduce into disease association data distort the results of studies which use such data for pathomechanism mining. We address this question using global- and local-scale analyses of networks constructed from disease association data of various types. Our results indicate that large-scale disease association data should be used with care for pathomechanism mining and that analyses of such data should be accompanied by close-up analyses of molecular data for well-characterized patient cohorts.
Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression
Trait-associated genetic variants affect complex phenotypes primarily via regulatory mechanisms on the transcriptome. To investigate the genetics of gene expression, we performed cis- and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We detected cis-eQTL for 88% of genes, and these were replicable in numerous tissues. Distal trans-eQTL (detected for 37% of 10,317 trait-associated variants tested) showed lower replication rates, partially due to low replication power and confounding by cell type composition. However, replication analyses in single-cell RNA-seq data prioritized intracellular trans-eQTL. Trans-eQTL exerted their effects via several mechanisms, primarily through regulation by transcription factors. Expression of 13% of the genes correlated with polygenic scores for 1,263 phenotypes, pinpointing potential drivers for those traits. In summary, this work represents a large eQTL resource, and its results serve as a starting point for in-depth interpretation of complex phenotypes
Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes
Type 2 diabetes (T2D) is a very common disease in humans. Here we conduct a meta-analysis of genome-wide association studies (GWAS) with ~16 million genetic variants in 62,892 T2D cases and 596,424 controls of European ancestry. We identify 139 common and 4 rare variants associated with T2D, 42 of which (39 common and 3 rare variants) are independent of the known variants. Integration of the gene expression data from blood (n = 14,115 and 2765) with the GWAS results identifies 33 putative functional genes for T2D, 3 of which were targeted by approved drugs. A further integration of DNA methylation (n = 1980) and epigenomic annotation data highlights 3 genes (CAMK1D, TP53INP1, and ATP5G1) with plausible regulatory mechanisms, whereby a genetic variant exerts an effect on T2D through epigenetic regulation of gene expression. Our study uncovers additional loci, proposes putative genetic regulatory mechanisms for T2D, and provides evidence of purifying selection for T2D-associated variants.
Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits
Genome-wide association studies (GWAS) have identified thousands of variants associated with complex traits, but their biological interpretation often remains unclear. Most of these variants overlap with expression QTLs, indicating their potential involvement in regulation of gene expression. Here, we propose a transcriptome-wide summary statistics-based Mendelian Randomization approach (TWMR) that uses multiple SNPs as instruments and multiple gene expression traits as exposures, simultaneously. Applied to 43 human phenotypes, it uncovers 3,913 putatively causal gene-trait associations, 36% of which have no genome-wide significant SNP nearby in previous GWAS. Using independent association summary statistics, we find that the majority of these loci were missed by GWAS due to power issues. Noteworthy among these links is educational attainment-associated BSCL2, known to carry mutations leading to a Mendelian form of encephalopathy. We also find pleiotropic causal effects suggestive of mechanistic connections. TWMR better accounts for pleiotropy and has the potential to identify biological mechanisms underlying complex traits
Refining Attention-Deficit/Hyperactivity Disorder and Autism Spectrum Disorder Genetic Loci by Integrating Summary Data From Genome-wide Association, Gene Expression, and DNA Methylation Studies
Background: Recent genome-wide association studies (GWASs) identified the first genetic loci associated with attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorder (ASD). The next step is to use these results to increase our understanding of the biological mechanisms involved. Most of the identified variants likely influence gene regulation. The aim of the current study is to shed light on the mechanisms underlying the genetic signals and prioritize genes by integrating GWAS results with gene expression and DNA methylation (DNAm) levels. Methods: We applied summary-data-based Mendelian randomization to integrate ADHD and ASD GWAS data with fetal brain expression and methylation quantitative trait loci, given the early onset of these disorders. We also analyzed expression and methylation quantitative trait loci datasets of adult brain and blood, as these provide increased statistical power. We subsequently used summary-data-based Mendelian randomization to investigate if the same variant influences both DNAm and gene expression levels. Results: We identified multiple gene expression and DNAm levels in fetal brain at chromosomes 1 and 17 that were associated with ADHD and ASD, respectively, through pleiotropy at shared genetic variants. The analyses in brain and blood showed additional associated gene expression and DNAm levels at the same and additional loci, likely because of increased statistical power. Several of the associated genes have not been identified in ADHD and ASD GWASs before. Conclusions: Our findings identified the genetic variants associated with ADHD and ASD that likely act through gene regulation. This facilitates prioritization of candidate genes for functional follow-up studies.
Research data supporting the thesis "Systems level analysis of non-model organisms: a tool for understanding environmental stress"
Data created as part of a PhD research project, containing test results that describe site effects for mussels sampled from an industrial harbour. The tests are based on challenging the model created from reference site data.