Implementing a transcription factor interaction prediction system using the genometric query language
Novel technologies and growing interest have resulted in a large increase in the amount of data available for genomics and transcriptomics studies, both in terms of volume and contents. Biology is relying more and more on computational methods to process, investigate, and extract knowledge from this huge amount of data. In this work, we present the TICA web server (available at http://www.gmql.eu/tica/), a fast and compact tool developed to support data-driven knowledge discovery in the realm of transcription factor interaction prediction. To infer physically interacting transcription factors, TICA leverages both the GenoMetric Query Language, a novel query tool (based on the Apache Hadoop and Spark technologies) specialized in the integration and management of heterogeneous, large genomic datasets, and a statistical method for robust detection of co-locations across interval-based data. Notably, TICA allows investigators to upload and analyze their own ChIP-seq experiment datasets, comparing them either against ENCODE data or against each other, with computation time that increases linearly with dataset size and density. Using ENCODE data from three well-studied cell lines as reference, we show that TICA predictions are supported by existing biological knowledge, making the web server a reliable and efficient tool for interaction screening and data-driven hypothesis generation.
Arterivirus Nsp1 Modulates the Accumulation of Minus-Strand Templates to Control the Relative Abundance of Viral mRNAs
The gene expression of plus-strand RNA viruses with a polycistronic genome depends on translation and replication of the genomic mRNA, as well as synthesis of subgenomic (sg) mRNAs. Arteriviruses and coronaviruses, distantly related members of the nidovirus order, employ a unique mechanism of discontinuous minus-strand RNA synthesis to generate subgenome-length templates for the synthesis of a nested set of sg mRNAs. Non-structural protein 1 (nsp1) of the arterivirus equine arteritis virus (EAV), a multifunctional regulator of viral RNA synthesis and virion biogenesis, was previously implicated in controlling the balance between genome replication and sg mRNA synthesis. Here, we employed reverse and forward genetics to gain insight into the multiple regulatory roles of nsp1. Our analysis revealed that the relative abundance of viral mRNAs is tightly controlled by an intricate network of interactions involving all nsp1 subdomains. Distinct nsp1 mutations affected the quantitative balance among viral mRNA species, and our data implicate nsp1 in controlling the accumulation of full-length and subgenome-length minus-strand templates for viral mRNA synthesis. The moderate differential changes in viral mRNA abundance of nsp1 mutants resulted in similarly altered viral protein levels, but progeny virus yields were greatly reduced. Pseudorevertant analysis provided compelling genetic evidence that balanced EAV mRNA accumulation is critical for efficient virus production. This first report on protein-mediated, mRNA-specific control of nidovirus RNA synthesis reveals the existence of an integral control mechanism to fine-tune replication, sg mRNA synthesis, and virus production, and establishes a major role for nsp1 in coordinating the arterivirus replicative cycle
An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding
Regulatory proteins can bind to different sets of genomic targets in various cell types or conditions. To reliably characterize such condition-specific regulatory binding we introduce MultiGPS, an integrated machine learning approach for the analysis of multiple related ChIP-seq experiments. MultiGPS is based on a generalized Expectation Maximization framework that shares information across multiple experiments for binding event discovery. We demonstrate that our framework enables the simultaneous modeling of sparse condition-specific binding changes, sequence dependence, and replicate-specific noise sources. MultiGPS encourages consistency in reported binding event locations across multiple-condition ChIP-seq datasets and provides accurate estimation of ChIP enrichment levels at each event. MultiGPS's multi-experiment modeling approach thus provides a reliable platform for detecting differential binding enrichment across experimental conditions. We demonstrate the advantages of MultiGPS with an analysis of Cdx2 binding in three distinct developmental contexts. By accurately characterizing condition-specific Cdx2 binding, MultiGPS enables novel insight into the mechanistic basis of Cdx2 site selectivity. Specifically, the condition-specific Cdx2 sites characterized by MultiGPS are highly associated with pre-existing genomic context, suggesting that such sites are pre-determined by cell-specific regulatory architecture. However, MultiGPS-defined condition-independent sites are not predicted by pre-existing regulatory signals, suggesting that Cdx2 can bind to a subset of locations regardless of genomic environment. A summary of this paper appears in the proceedings of the RECOMB 2014 conference, April 2–5. National Science Foundation (U.S.) (Graduate Research Fellowship under Grant 0645960); National Institutes of Health (U.S.) (grant P01 NS055923); Pennsylvania State University. Center for Eukaryotic Gene Regulatio
Genome-Wide Identification of Small RNAs in the Opportunistic Pathogen Enterococcus faecalis V583
Small RNA molecules (sRNAs) are key mediators of virulence and stress-inducible gene expression in some pathogens. In this work, we identify sRNAs in the Gram-positive opportunistic pathogen Enterococcus faecalis. We characterized 11 sRNAs by tiling microarray analysis, 5′ and 3′ RACE-PCR, and Northern blot analysis. Six sRNAs were specifically expressed at exponential phase, two sRNAs were observed at stationary phase, and three were detected during both phases. Searches for putative functions revealed that three of them (EFA0080_EFA0081 and EFB0062_EFB0063 on pTF1 and pTF2 plasmids, respectively, and EF0408_EF04092 located on the chromosome) are similar to antisense RNA involved in plasmid addiction modules. Moreover, EF1097_EF1098 shares strong homology with tmRNA (a bi-functional RNA acting as both a tRNA and an mRNA), and EF2205_EF2206 appears homologous to 4.5S RNA, a member of the Signal Recognition Particle (SRP) ribonucleoprotein complex. In addition, proteomic analysis of the ΔEF3314_EF3315 sRNA mutant suggests that it may be involved in the turnover of some abundant proteins. The expression patterns of these transcripts were evaluated by tiling array hybridizations performed with samples from cells grown under eleven different conditions, some of which may be encountered during infection. Finally, distribution of these sRNAs among genome sequences of 54 E. faecalis strains was assessed. This is the first experimental genome-wide identification of sRNAs in E. faecalis and provides impetus to the understanding of gene regulation in this important human pathogen.
Lineage-specific dynamic and pre-established enhancer–promoter contacts cooperate in terminal differentiation
Chromosome conformation is an important feature of metazoan gene regulation; however, enhancer–promoter contact remodeling during cellular differentiation remains poorly understood. To address this, genome-wide promoter capture Hi-C (CHi-C) was performed during epidermal differentiation. Two classes of enhancer–promoter contacts associated with differentiation-induced genes were identified. The first class ('gained') increased in contact strength during differentiation in concert with enhancer acquisition of the H3K27ac activation mark. The second class ('stable') were pre-established in undifferentiated cells, with enhancers constitutively marked by H3K27ac. The stable class was associated with the canonical conformation regulator cohesin, whereas the gained class was not, implying distinct mechanisms of contact formation and regulation. Analysis of stable enhancers identified a new, essential role for a constitutively expressed, lineage-restricted ETS-family transcription factor, EHF, in epidermal differentiation. Furthermore, neither class of contacts was observed in pluripotent cells, suggesting that lineage-specific chromatin structure is established in tissue progenitor cells and is further remodeled in terminal differentiation
Identification and functional characterization of small non-coding RNAs in Xanthomonas oryzae pathovar oryzae
Background: Small non-coding RNAs (sRNAs) are regarded as important regulators in prokaryotes and play essential roles in diverse cellular processes. Xanthomonas oryzae pathovar oryzae (Xoo) is an important plant pathogenic bacterium which causes serious bacterial blight of rice. However, little is known about the number, genomic distribution and biological functions of sRNAs in Xoo. Results: Here, we performed a systematic screen to identify sRNAs in the Xoo strain PXO99. A total of 850 putative non-coding RNA sequences originating from intergenic and gene antisense regions were identified by cloning, of which 63 were also identified as sRNA candidates by computational prediction and were thus considered Xoo sRNA candidates. Northern blot hybridization confirmed the size and expression of 6 sRNA candidates and 2 other cloned small RNA sequences, which were then added to the sRNA candidate list. We further examined the expression profiles of the eight sRNAs in an hfq deletion mutant and found that two of them showed drastically decreased expression levels, and another exhibited an Hfq-dependent transcript processing pattern. Deletion mutants were obtained for seven of the Northern-confirmed sRNAs, but none of them exhibited obvious phenotypes. Comparison of the proteomic differences between three of the ΔsRNA mutants and the wild-type strain by two-dimensional gel electrophoresis (2-DE) analysis showed that these sRNAs are involved in multiple physiological and biochemical processes. Conclusions: We experimentally verified eight sRNAs in a genome-wide screen and uncovered three Hfq-dependent sRNAs in Xoo. Proteomics analysis revealed that Xoo sRNAs may take part in various metabolic processes. Taken together, this work represents the first comprehensive screen and functional analysis of sRNAs in rice pathogenic bacteria and facilitates future studies on sRNA-mediated regulatory networks in this important phytopathogen.
A study of alterations in DNA epigenetic modifications (5mC and 5hmC) and gene expression influenced by simulated microgravity in human lymphoblastoid cells
Cells alter their gene expression in response to various environmental changes. Epigenetic mechanisms such as DNA methylation are believed to regulate the alterations in gene expression patterns. In vitro and in vivo studies have documented changes in cellular proliferation, cytoskeletal remodeling, signal transduction, bone mineralization and immune deficiency under the influence of microgravity conditions experienced in space. However, microgravity-induced changes in the epigenome have not been well characterized. In this study we have used Next-generation Sequencing (NGS) to profile ground-based “simulated” microgravity-induced changes in DNA methylation (5-methylcytosine or 5mC), hydroxymethylation (5-hydroxymethylcytosine or 5hmC), and simultaneous gene expression in cultured human lymphoblastoid cells. Our results indicate that simulated microgravity induced alterations in the methylome (~60% of the differentially methylated regions or DMRs are hypomethylated and ~92% of the differentially hydroxymethylated regions or DHMRs are hyperhydroxymethylated). Simulated microgravity also induced differential expression in 370 transcripts that were associated with crucial biological processes such as oxidative stress response, carbohydrate metabolism and regulation of transcription. While we were not able to obtain any global trend correlating the changes in methylation/hydroxymethylation with gene expression, we have been able to profile the simulated microgravity-induced changes of 5mC over some of the differentially expressed genes, including five genes undergoing differential methylation over their promoters and twenty-five genes undergoing differential methylation over their gene bodies. To the best of our knowledge, this is the first NGS-based study to profile epigenomic patterns induced by short-term exposure to simulated microgravity, and we believe that our findings can be a valuable resource for future explorations.