12 research outputs found
Single-Cell Epigenomics and Functional Fine-Mapping of Atherosclerosis GWAS Loci
Rationale: Genome-wide association studies have identified hundreds of loci associated with coronary artery disease (CAD). Many of these loci are enriched in cisregulatory elements but not linked to cardiometabolic risk factors nor to candidate causal genes, complicating their functional interpretation. Objective: Single-nucleus chromatin accessibility profiling of the human atherosclerotic lesions was used to investigate cell type-specific patterns of cisregulatory elements, to understand transcription factors establishing cell identity, and to interpret CAD-relevant, noncoding genetic variation. Methods and Results: We used single-nucleus ATAC-seq (assay for transposase-accessible chromatin with sequencing) to generate DNA accessibility maps in >7000 cells derived from human atherosclerotic lesions. We identified 5 major lesional cell types including endothelial cells, smooth muscle cells, monocyte/macrophages, natural killer/T cells, and B cells and further investigated subtype characteristics of macrophages and smooth muscle cells transitioning into fibromyocytes. We demonstrated that CAD-associated genetic variants are particularly enriched in endothelial and smooth muscle cell-specific open chromatin. Using single-cell coaccessibility and cis-expression quantitative trait loci information, we prioritized putative target genes and candidate regulatory elements for approximate to 30% of all known CAD loci. Finally, we performed genome-wide experimental fine-mapping of the CAD variants identified in genome-wide association studies using epigenetic quantitative trait loci analysis in primary human aortic endothelial cells and self-transcribing active regulatory region sequencing (STARR-Seq) massively parallel reporter assay in smooth muscle cells. This analysis identified potential causal single-nucleotide polymorphisms (SNPs) and the associated target gene for over 30 CAD loci. We present several examples where the chromatin accessibility and gene expression could be assigned to one cell type predicting the cell type of action for CAD loci. Conclusions: These findings highlight the potential of applying single-nucleus ATAC-seq to human tissues in revealing relative contributions of distinct cell types to diseases and in identifying genes likely to be influenced by noncoding genome-wide association study variants.</p
MGEx-Udb: A Mammalian Uterus Database for Expression-Based Cataloguing of Genes across Conditions, Including Endometriosis and Cervical Cancer
Gene expression profiling of uterus tissue has been performed in various contexts, but a significant amount of the data remains underutilized as it is not covered by the existing general resources.). The database can be queried with gene names/IDs, sub-tissue locations, as well as various conditions such as the cervical cancer, endometrial cycles and disorders, and experimental treatments. Accordingly, the output would be a) transcribed and dormant genes listed for the queried condition/location, or b) expression profile of the gene of interest in various uterine conditions. The results also include the reliability score for the expression status of each gene. MGEx-Udb also provides information related to Gene Ontology annotations, protein-protein interactions, transcripts, promoters, and expression status by other sequencing techniques, and facilitates various other types of analysis of the individual genes or co-expressed gene clusters.In brief, MGEx-Udb enables easy cataloguing of co-expressed genes and also facilitates bio-marker discovery for various uterine conditions
Human Ezrin: Prediction of the 3D structure and interactions with mTOR
The protein Ezrin, is a member of the ERM family (Ezrin, Radixin and Moesin) that links the F-actin to the plasma membrane. The protein is made of three domains namely the FERM domain, a central α-helical domain and the CERMAD domain. The residues in Ezrin such as Ser66, Tyr145, Tyr353 and Tyr477 regulate the function of the protein through phosphorylation. The protein is found in two distinct conformations of active and dormant (inactive) state. The initial step during the conformation change is the breakage of intramolecular interaction in dormant Ezrin by phosphorylation of residue Thr567. The dormant structure of human Ezrin was predicted computationally since only partial active form structure was available. The validation analysis showed that 99.7% residues were positioned in favored, allowed and generously allowed regions of the Ramachandran plot. The Z-score of Ezrin was â7.36, G-factor was 0.1, and the QMEAN score of the model was 0.61 indicating a good model for human Ezrin. The comparison of the conformations of the activated and dormant Ezrin showed a major shift in the F2 lobe (residues 142-149 and 161-177) while changes in the conformation induced mobility shifts in lobe F3 (residues 261 to 267). The 3D positions of the phosphorylation sites Tyr145, Tyr353, Tyr477, Tyr482 and Thr567 were also located. Using targeted molecular dynamic simulation, the molecular movements during conformational change from active to dormant were visualized. The dormant Ezrin auto-inhibits itself by a head-to-tail interaction of the N-terminal and C-terminal residues. The trajectory shows the breakage of the interactions and mobility of the CERMAD domain away from the FERM domain. Protein docking and clustering analysis were used to predict the residues involved in the interaction between dormant Ezrin and mTOR. Residues Tyr477 and Tyr482 were found to be involved in interaction with mTOR.Siirretty Doriast
Schematic representation of MGEx-Udb.
<p>The figure represents the data collection (top portion), architecture (central portion) and operation (bottom portion) of the database.</p
Datasets
<p>(<b>with gene count</b>) <b>collected from various sources.</b> In case of âPubMed & GEOâ and âPubMed & ArrayExpressâ, smaller gene lists came from validation experiments and were collected from PubMed, while raw/processed data were always collected from the repositories (GEO/ArrayExpress).</p
Example hierarchy of the conditions and sub-conditions.
<p>An example <i>(âstage IIA non-keratinizing squamous cell cervical carcinomaâ)</i> hierarchy of the conditions and sub-conditions, for which data have been collected, and drop-down options provided in the query and upload pages of MGEx-Udb. Currently the database allows up to four levels of the hierarchy to query.</p
Number of datasets
<p>(<b>and studies</b>) <b>in MGEx-Udb corresponding to various physiological and pathological uterine conditions.</b> âOthersâ represent post-parturition, genetic-ablation, artificial insemination and embryo implantation. Studies considering tissues that are used as controls but may not be absolutely ânormalâ have been grouped in <i>âmay be normalâ</i> category (examples: ânormal tissue adjacent to tumor/cancer tissueâ, âvehicle-treatedâ).</p
Source of data across various mammalian species in MGEx-Udb.
<p>Other species include cow and pig. Among the data collected from GEO or âPubMed & GEOâ, 85% of the studies were also present in ArrayExpress, even though this is not indicated in the figure.</p
A pan-cancer analysis shows immunoevasive characteristics in NRF2 hyperactive squamous malignancies
The NRF2 pathway is frequently activated in various cancer types, yet a comprehensive analysis of its effects across different malignancies is currently lacking. We developed a NRF2 activity metric and utilized it to conduct a pan-cancer analysis of oncogenic NRF2 signaling. We identified an immunoevasive phenotype where high NRF2 activity is associated with low interferon-gamma (IFNÎł), HLA-I expression and T cell and macrophage infiltration in squamous malignancies of the lung, head and neck area, cervix and esophagus. Squamous NRF2 overactive tumors comprise a molecular phenotype with SOX2/TP63 amplification, TP53 mutation and CDKN2A loss. These immune cold NRF2 hyperactive diseases are associated with upregulation of immunomodulatory NAMPT, WNT5A, SPP1, SLC7A11, SLC2A1 and PD-L1. Based on our functional genomics analyses, these genes represent candidate NRF2 targets, suggesting direct modulation of the tumor immune milieu. Single-cell mRNA data shows that cancer cells of this subtype exhibit decreased expression of IFNÎł responsive ligands, and increased expression of immunosuppressive ligands NAMPT, SPP1 and WNT5A that mediate signaling in intercellular crosstalk. In addition, we discovered that the negative relationship of NRF2 and immune cells are explained by stromal populations of lung squamous cell carcinoma, and this effect spans multiple squamous malignancies based on our molecular subtyping and deconvolution data
Recommended from our members
Cross-tissue omics analysis discovers ten adipose genes encoding secreted proteins in obesity-related non-alcoholic fatty liver disease.
BackgroundNon-alcoholic fatty liver disease (NAFLD) is a fast-growing, underdiagnosed, epidemic. We hypothesise that obesity-related inflammation compromises adipose tissue functions, preventing efficient fat storage, and thus driving ectopic fat accumulation into the liver.MethodsTo identify adipose-based mechanisms and potential serum biomarker candidates (SBCs) for NAFLD, we utilise dual-tissue RNA-sequencing (RNA-seq) data in adipose tissue and liver, paired with histology-based NAFLD diagnosis, from the same individuals in a cohort of obese individuals. We first scan for genes that are differentially expressed (DE) for NAFLD in obese individuals' subcutaneous adipose tissue but not in their liver; encode proteins secreted to serum; and show preferential adipose expression. Then the identified genes are filtered to key adipose-origin NAFLD genes by best subset analysis, knockdown experiments during human preadipocyte differentiation, recombinant protein treatment experiments in human liver HepG2 cells, and genetic analysis.FindingsWe discover a set of genes, including 10 SBCs, that may modulate NAFLD pathogenesis by impacting adipose tissue function. Based on best subset analysis, we further follow-up on two SBCs CCDC80 and SOD3 by knockdown in human preadipocytes and subsequent differentiation experiments, which show that they modulate crucial adipogenesis genes, LPL, SREBPF1, and LEP. We also show that treatment of the liver HepG2 cells with the CCDC80 and SOD3 recombinant proteins impacts genes related to steatosis and lipid processing, including PPARA, NFE2L2, and RNF128. Finally, utilizing the adipose NAFLD DE gene cis-regulatory variants associated with serum triglycerides (TGs) in extensive genome-wide association studies (GWASs), we demonstrate a unidirectional effect of serum TGs on NAFLD with Mendelian Randomization (MR) analysis. We also demonstrate that a single SNP regulating one of the SBC genes, rs2845885, produces a significant MR result by itself. This supports the conclusion that genetically regulated adipose expression of the NAFLD DE genes may contribute to NAFLD through changes in serum TG levels.InterpretationOur results from the dual-tissue transcriptomics screening improve the understanding of obesity-related NAFLD by providing a targeted set of 10 adipose tissue-active genes as new serum biomarker candidates for the currently grossly underdiagnosed fatty liver disease.FundingThe work was supported by NIH grants R01HG010505 and R01DK132775. The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health, and by NCI, NHGRI, NHLBI, NIDA, NIMH, and NINDS. The KOBS study (J. P.) was supported by the Finnish Diabetes Research Foundation, Kuopio University Hospital Project grant (EVO/VTR grants 2005-2019), and the Academy of Finland grant (Contract no. 138006). This study was funded by the European Research Council under the European Union's Horizon 2020 research and innovation program (Grant No. 802825 to M. U. K.). K. H. P. was funded by the Academy of Finland (grant numbers 272376, 266286, 314383, and 335443), the Finnish Medical Foundation, Gyllenberg Foundation, Novo Nordisk Foundation (grant numbers NNF10OC1013354, NNF17OC0027232, and NNF20OC0060547), Finnish Diabetes Research Foundation, Finnish Foundation for Cardiovascular Research, University of Helsinki, and Helsinki University Hospital and Government Research Funds. I. S. was funded by the Instrumentarium Science Foundation. Personal grants to U. T. A. were received from the Matti and Vappu Maukonen Foundation, Ella och Georg Ehrnrooths Stiftelse and the Finnish Foundation for Cardiovascular Research