35 research outputs found
EPISPOT: An epigenome-driven approach for detecting and interpreting hotspots in molecular QTL studies.
We present EPISPOT, a fully joint framework which exploits large panels of epigenetic annotations as variant-level information to enhance molecular quantitative trait locus (QTL) mapping. Thanks to a purpose-built Bayesian inferential algorithm, EPISPOT accommodates functional information for both cis and trans actions, including QTL hotspot effects. It effectively couples simultaneous QTL analysis of thousands of genetic variants and molecular traits with hypothesis-free selection of biologically interpretable annotations which directly contribute to the QTL effects. This unified, epigenome-aided learning boosts statistical power and sheds light on the regulatory basis of the uncovered hits; EPISPOT therefore marks an essential step toward improving the challenging detection and functional interpretation of trans-acting genetic variants and hotspots. We illustrate the advantages of EPISPOT in simulations emulating real-data conditions and in a monocyte expression QTL study, which confirms known hotspots and finds other signals, as well as plausible mechanisms of action. In particular, by highlighting the role of monocyte DNase-I sensitivity sites from >150 epigenetic annotations, we clarify the mediation effects and cell-type specificity of major hotspots close to the lysozyme gene. Our approach forgoes the daunting and underpowered task of one-annotation-at-a-time enrichment analyses for prioritizing cis and trans QTL hits and is tailored to any transcriptomic, proteomic, or metabolomic QTL problem. By enabling principled epigenome-driven QTL mapping transcriptome-wide, EPISPOT helps progress toward a better functional understanding of genetic regulation
Systems view of adipogenesis via novel omics-driven and tissue-specific activity scoring of network functional modules
The investigation of the complex processes involved in cellular differentiation must be based on unbiased, high throughput data processing methods to identify relevant biological pathways. A number of bioinformatics tools are available that can generate lists of pathways ranked by statistical significance (i.e. by p-value), while ideally it would be desirable to functionally score the pathways relative to each other or to other interacting parts of the system or process. We describe a new computational method (Network Activity Score Finder - NASFinder) to identify tissue-specific, omicsdetermined sub-networks and the connections with their upstream regulator receptors to obtain a systems view of the differentiation of human adipocytes. Adipogenesis of human SBGS pre-adipocyte cells in vitro was monitored with a transcriptomic data set comprising six time points (0, 6, 48, 96, 192, 384 hours). To elucidate the mechanisms of adipogenesis, NASFinder was used to perform time-point analysis by comparing each time point against the control (0 h) and time-lapse analysis by comparing each time point with the previous one. NASFinder identified the coordinated activity of seemingly unrelated processes between each comparison, providing the first systems view of adipogenesis in culture. NASFinder has been implemented into a web-based, freely available resource associated with novel, easy to read visualization of omics data sets and network modules
Characterization of the genetic determinants of context-specific DNA methylation in primary monocytes
To better understand inter-individual variation in sensitivity of DNA methylation (DNAm) to immune activity, we characterized effects of inflammatory stimuli on primary monocyte DNAm (n = 190). We find that monocyte DNAm is site-dependently sensitive to lipopolysaccharide (LPS), with LPS-induced demethylation occurring following hydroxymethylation. We identify 7,359 high-confidence immune-modulated CpGs (imCpGs) that differ in genomic localization and transcription factor usage according to whether they represent a gain or loss in DNAm. Demethylated imCpGs are profoundly enriched for enhancers and colocalize to genes enriched for disease associations, especially cancer. DNAm is age associated, and we find that 24-h LPS exposure triggers approximately 6 months of gain in epigenetic age, directly linking epigenetic aging with innate immune activity. By integrating LPS-induced changes in DNAm with genetic variation, we identify 234 imCpGs under local genetic control. Exploring shared causal loci between LPS-induced DNAm responses and human disease traits highlights examples of disease-associated loci that modulate imCpG formation
Demultiplexing of single-cell RNA-sequencing data using interindividual variation in gene expression
Motivation: Pooled designs for single-cell RNA sequencing, where many cells from distinct samples are processed jointly, offer increased throughput and reduced batch variation. This study describes expression-aware demultiplexing (EAD), a computational method that employs differential co-expression patterns between individuals to demultiplex pooled samples without any extra experimental steps. Results: We use synthetic sample pools and show that the top interindividual differentially co-expressed genes provide a distinct cluster of cells per individual, significantly enriching the regulation of metabolism. Our application of EAD to samples of six isogenic inbred mice demonstrated that controlling genetic and environmental effects can solve interindividual variations related to metabolic pathways. We utilized 30 samples from both sepsis and healthy individuals in six batches to assess the performance of classification approaches. The results indicate that combining genetic and EAD results can enhance the accuracy of assignments (Min. 0.94, Mean 0.98, Max. 1). The results were enhanced by an average of 1.4% when EAD and barcoding techniques were combined (Min. 1.25%, Median 1.33%, Max. 1.74%). Furthermore, we demonstrate that interindividual differential co-expression analysis within the same cell type can be used to identify cells from the same donor in different activation states. By analysing single-nuclei transcriptome profiles from the brain, we demonstrate that our method can be applied to nonimmune cells. Availability and implementation: EAD workflow is available at https://isarnassiri.github.io/scDIV/ as an R package called scDIV (acronym for single-cell RNA-sequencing data demultiplexing using interindividual variations)
IL7 genetic variation and toxicity to immune checkpoint blockade in patients with melanoma
Treatment with immune checkpoint blockade (ICB) frequently triggers immune-related adverse events (irAEs), causing considerable morbidity. In 214 patients receiving ICB for melanoma, we observed increased severe irAE risk in minor allele carriers of rs16906115, intronic to IL7. We found that rs16906115 forms a B cell-specific expression quantitative trait locus (eQTL) to IL7 in patients. Patients carrying the risk allele demonstrate increased pre-treatment B cell IL7 expression, which independently associates with irAE risk, divergent immunoglobulin expression and more B cell receptor mutations. Consistent with the role of IL-7 in T cell development, risk allele carriers have distinct ICB-induced CD8+ T cell subset responses, skewing of T cell clonality and greater proportional repertoire occupancy by large clones. Finally, analysis of TCGA data suggests that risk allele carriers independently have improved melanoma survival. These observations highlight key roles for B cells and IL-7 in both ICB response and toxicity and clinical outcomes in melanoma
GWAS and meta-analysis identifies 49 genetic variants underlying critical COVID-19
Critical illness in COVID-19 is an extreme and clinically homogeneous disease phenotype that we have previously shown1 to be highly efficient for discovery of genetic associations2. Despite the advanced stage of illness at presentation, we have shown that host genetics in patients who are critically ill with COVID-19 can identify immunomodulatory therapies with strong beneficial effects in this group3. Here we analyse 24,202 cases of COVID-19 with critical illness comprising a combination of microarray genotype and whole-genome sequencing data from cases of critical illness in the international GenOMICC (11,440 cases) study, combined with other studies recruiting hospitalized patients with a strong focus on severe and critical disease: ISARIC4C (676 cases) and the SCOURGE consortium (5,934 cases). To put these results in the context of existing work, we conduct a meta-analysis of the new GenOMICC genome-wide association study (GWAS) results with previously published data. We find 49 genome-wide significant associations, of which 16 have not been reported previously. To investigate the therapeutic implications of these findings, we infer the structural consequences of protein-coding variants, and combine our GWAS results with gene expression data using a monocyte transcriptome-wide association study (TWAS) model, as well as gene and protein expression using Mendelian randomization. We identify potentially druggable targets in multiple systems, including inflammatory signalling (JAK1), monocyte-macrophage activation and endothelial permeability (PDE4A), immunometabolism (SLC2A5 and AK5), and host factors required for viral entry and replication (TMPRSS2 and RAB2A)
GWAS and Meta-Analysis Identifies 49 Genetic Variants Underlying Critical COVID-19
Critical illness in COVID-19 is an extreme and clinically homogeneous disease phenotype that we have previously shown1 to be highly efficient for discovery of genetic associations2. Despite the advanced stage of illness at presentation, we have shown that host genetics in patients who are critically ill with COVID-19 can identify immunomodulatory therapies with strong beneficial effects in this group3. Here we analyse 24,202 cases of COVID-19 with critical illness comprising a combination of microarray genotype and whole-genome sequencing data from cases of critical illness in the international GenOMICC (11,440 cases) study, combined with other studies recruiting hospitalized patients with a strong focus on severe and critical disease: ISARIC4C (676 cases) and the SCOURGE consortium (5,934 cases). To put these results in the context of existing work, we conduct a meta-analysis of the new GenOMICC genome-wide association study (GWAS) results with previously published data. We find 49 genome-wide significant associations, of which 16 have not been reported previously. To investigate the therapeutic implications of these findings, we infer the structural consequences of protein-coding variants, and combine our GWAS results with gene expression data using a monocyte transcriptome-wide association study (TWAS) model, as well as gene and protein expression using Mendelian randomization. We identify potentially druggable targets in multiple systems, including inflammatory signalling (JAK1), monocyte-macrophage activation and endothelial permeability (PDE4A), immunometabolism (SLC2A5 and AK5), and host factors required for viral entry and replication (TMPRSS2 and RAB2A)
Nonparametric Simulation of Signal Transduction Networks with Semi-Synchronized Update
Simulating signal transduction in cellular signaling networks provides predictions of network dynamics by quantifying the changes in concentration and activity-level of the individual proteins. Since numerical values of kinetic parameters might be difficult to obtain, it is imperative to develop non-parametric approaches that combine the connectivity of a network with the response of individual proteins to signals which travel through the network. The activity levels of signaling proteins computed through existing non-parametric modeling tools do not show significant correlations with the observed values in experimental results. In this work we developed a non-parametric computational framework to describe the profile of the evolving process and the time course of the proportion of active form of molecules in the signal transduction networks. The model is also capable of incorporating perturbations. The model was validated on four signaling networks showing that it can effectively uncover the activity levels and trends of response during signal transduction process
An immunodominant NP105-113-B*07:02 cytotoxic T cell response controls viral replication and is associated with less severe COVID-19 disease.
Funder: RCUK | Medical Research Council (MRC); doi: https://doi.org/10.13039/501100000265Funder: Chinese Academy of Medical Sciences (CAMS); doi: https://doi.org/10.13039/501100005150Funder: Wellcome Trust (Wellcome); doi: https://doi.org/10.13039/100004440NP105-113-B*07:02-specific CD8+ T cell responses are considered among the most dominant in SARS-CoV-2-infected individuals. We found strong association of this response with mild disease. Analysis of NP105-113-B*07:02-specific T cell clones and single-cell sequencing were performed concurrently, with functional avidity and antiviral efficacy assessed using an in vitro SARS-CoV-2 infection system, and were correlated with T cell receptor usage, transcriptome signature and disease severity (acute n = 77, convalescent n = 52). We demonstrated a beneficial association of NP105-113-B*07:02-specific T cells in COVID-19 disease progression, linked with expansion of T cell precursors, high functional avidity and antiviral effector function. Broad immune memory pools were narrowed postinfection but NP105-113-B*07:02-specific T cells were maintained 6 months after infection with preserved antiviral efficacy to the SARS-CoV-2 Victoria strain, as well as Alpha, Beta, Gamma and Delta variants. Our data show that NP105-113-B*07:02-specific T cell responses associate with mild disease and high antiviral efficacy, pointing to inclusion for future vaccine design