1,381 research outputs found
Zipper plot : visualizing transcriptional activity of genomic regions
Background: Reconstructing transcript models from RNA-sequencing (RNA-seq) data and establishing these as independent transcriptional units can be a challenging task. Current state-of-the-art tools for long non-coding RNA (lncRNA) annotation are mainly based on evolutionary constraints, which may result in false negatives due to the overall limited conservation of lncRNAs.
Results: To tackle this problem we have developed the Zipper plot, a novel visualization and analysis method that enables users to simultaneously interrogate thousands of human putative transcription start sites (TSSs) in relation to various features that are indicative for transcriptional activity. These include publicly available CAGE-sequencing, ChIP-sequencing and DNase-sequencing datasets. Our method only requires three tab-separated fields (chromosome, genomic coordinate of the TSS and strand) as input and generates a report that includes a detailed summary table, a Zipper plot and several statistics derived from this plot.
Conclusion: Using the Zipper plot, we found evidence of transcription for a set of well-characterized lncRNAs and observed that fewer mono-exonic lncRNAs have CAGE peaks overlapping with their TSSs compared to multi-exonic lncRNAs. Using publicly available RNA-seq data, we found more than one hundred cases where junction reads connected protein-coding gene exons with a downstream mono-exonic lncRNA, revealing the need for a careful evaluation of lncRNA 5′-boundaries. Our method is implemented using the statistical programming language R and is freely available as a webtool
Recommended from our members
A human lung tumor microenvironment interactome identifies clinically relevant cell-type cross-talk.
BackgroundTumors comprise a complex microenvironment of interacting malignant and stromal cell types. Much of our understanding of the tumor microenvironment comes from in vitro studies isolating the interactions between malignant cells and a single stromal cell type, often along a single pathway.ResultTo develop a deeper understanding of the interactions between cells within human lung tumors, we perform RNA-seq profiling of flow-sorted malignant cells, endothelial cells, immune cells, fibroblasts, and bulk cells from freshly resected human primary non-small-cell lung tumors. We map the cell-specific differential expression of prognostically associated secreted factors and cell surface genes, and computationally reconstruct cross-talk between these cell types to generate a novel resource called the Lung Tumor Microenvironment Interactome (LTMI). Using this resource, we identify and validate a prognostically unfavorable influence of Gremlin-1 production by fibroblasts on proliferation of malignant lung adenocarcinoma cells. We also find a prognostically favorable association between infiltration of mast cells and less aggressive tumor cell behavior.ConclusionThese results illustrate the utility of the LTMI as a resource for generating hypotheses concerning tumor-microenvironment interactions that may have prognostic and therapeutic relevance
Selective Activation of Alternative MYC Core Promoters by Wnt-Responsive Enhancers.
In Metazoans, transcription of most genes is driven by the use of multiple alternative promoters. Although the precise regulation of alternative promoters is important for proper gene expression, the mechanisms that mediates their differential utilization remains unclear. Here, we investigate how the two alternative promoters (P1, P2) that drive MYC expression are regulated. We find that P1 and P2 can be differentially regulated across cell-types and that their selective usage is largely mediated by distal regulatory sequences. Moreover, we show that in colon carcinoma cells, Wnt-responsive enhancers preferentially upregulate transcription from the P1 promoter using reporter assays and in the context of the endogenous Wnt induction. In addition, multiple enhancer deletions using CRISPR/Cas9 corroborate the regulatory specificity of P1. Finally, we show that preferential activation between Wnt-responsive enhancers and the P1 promoter is influenced by the distinct core promoter elements that are present in the MYC promoters. Taken together, our results provide new insight into how enhancers can specifically target alternative promoters and suggest that formation of these selective interactions could allow more precise combinatorial regulation of transcription initiation
FANTOM5 CAGE profiles of human and mouse samples
In the FANTOM5 project, transcription initiation events across the human and
mouse genomes were mapped at a single base-pair resolution and their
frequencies were monitored by CAGE (Cap Analysis of Gene Expression) coupled
with single-molecule sequencing. Approximately three thousands of samples,
consisting of a variety of primary cells, tissues, cell lines, and time series
samples during cell activation and development, were subjected to a uniform
pipeline of CAGE data production. The analysis pipeline started by measuring
RNA extracts to assess their quality, and continued to CAGE library production
by using a robotic or a manual workflow, single molecule sequencing, and
computational processing to generate frequencies of transcription initiation.
Resulting data represents the consequence of transcriptional regulation in
each analyzed state of mammalian cells. Non-overlapping peaks over the CAGE
profiles, approximately 200,000 and 150,000 peaks for the human and mouse
genomes, were identified and annotated to provide precise location of known
promoters as well as novel ones, and to quantify their activities
Identification and annotation of conserved promoters and macrophage-expressed genes in the pig genome.
BACKGROUND: The FANTOM5 consortium used Cap Analysis of Gene Expression (CAGE) tag sequencing to produce a comprehensive atlas of promoters and enhancers within the human and mouse genomes. We reasoned that the mapping of these regulatory elements to the pig genome could provide useful annotation and evidence to support assignment of orthology. RESULTS: For human transcription start sites (TSS) associated with annotated human-mouse orthologs, 17% mapped to the pig genome but not to the mouse, 10% mapped only to the mouse, and 55% mapped to both pig and mouse. Around 17% did not map to either species. The mapping percentages were lower where there was not clear orthology relationship, but in every case, mapping to pig was greater than to mouse, and the degree of homology was also greater. Combined mapping of mouse and human CAGE-defined promoters identified at least one putative conserved TSS for >16,000 protein-coding genes. About 54% of the predicted locations of regulatory elements in the pig genome were supported by CAGE and/or RNA-Seq analysis from pig macrophages. CONCLUSIONS: Comparative mapping of promoters and enhancers from humans and mice can provide useful preliminary annotation of other animal genomes. The data also confirm extensive gain and loss of regulatory elements between species, and the likelihood that pigs provide a better model than mice for human gene regulation and function
Global Analysis of Transcription Start Sites and Enhancers in Endometrial Stromal Cells and Differences Associated with Endometriosis.
Identifying tissue-specific molecular signatures of active regulatory elements is critical to understanding gene regulatory mechanisms. In this study, transcription start sites (TSS) and enhancers were identified using Cap analysis of gene expression (CAGE) across endometrial stromal cell (ESC) samples obtained from women with (n = 4) and without endometriosis (n = 4). ESC TSSs and enhancers were compared to those reported in other tissue and cell types in FANTOM5 and were integrated with RNA-seq and ATAC-seq data from the same samples for regulatory activity and network analyses. CAGE tag count differences between women with and without endometriosis were statistically tested and tags within close proximity to genetic variants associated with endometriosis risk were identified. Over 90% of tag clusters mapping to promoters were observed in cells and tissues in FANTOM5. However, some potential cell-type-specific promoters and enhancers were also observed. Regions of open chromatin identified using ATAC-seq provided further evidence of the active transcriptional regions identified by CAGE. Despite the small sample number, there was evidence of differences associated with endometriosis at 210 consensus clusters, including IGFBP5, CALD1 and OXTR. ESC TSSs were also located within loci associated with endometriosis risk from genome-wide association studies. This study provides novel evidence of transcriptional differences in endometrial stromal cells associated with endometriosis and provides a valuable cell-type specific resource of active TSSs and enhancers in endometrial stromal cells
Ceruloplasmin is a novel adipokine which is overexpressed in adipose tissue of obese subjects and in obesity-associated cancer cells
Obesity confers an increased risk of developing specific cancer forms. Although the mechanisms are unclear, increased fat cell secretion of specific proteins (adipokines) may promote/facilitate development of malignant tumors in obesity via cross-talk between adipose tissue(s) and the tissues prone to develop cancer among obese. We searched for novel adipokines that were overexpressed in adipose tissue of obese subjects as well as in tumor cells derived from cancers commonly associated with obesity. For this purpose expression data from human adipose tissue of obese and non-obese as well as from a large panel of human cancer cell lines and corresponding primary cells and tissues were explored. We found expression of ceruloplasmin to be the most enriched in obesity-associated cancer cells. This gene was also significantly up-regulated in adipose tissue of obese subjects. Ceruloplasmin is the body's main copper carrier and is involved in angiogenesis. We demonstrate that ceruloplasmin is a novel adipokine, which is produced and secreted at increased rates in obesity. In the obese state, adipose tissue contributed markedly (up to 22%) to the total circulating protein level. In summary, we have through bioinformatic screening identified ceruloplasmin as a novel adipokine with increased expression in adipose tissue of obese subjects as well as in cells from obesity-associated cancers. Whether there is a causal relationship between adipose overexpression of ceruloplasmin and cancer development in obesity cannot be answered by these cross-sectional comparisons
- …