27 research outputs found
R2R - software to speed the depiction of aesthetic consensus RNA secondary structures
<p>Abstract</p> <p>Background</p> <p>With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams.</p> <p>Results</p> <p>We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes.</p> <p>Conclusions</p> <p>R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at <url>http://breaker.research.yale.edu/R2R</url> and as an Additional file.</p
RNIE: genome-wide prediction of bacterial intrinsic terminators
Bacterial Rho-independent terminators (RITs) are important genomic landmarks involved in gene regulation and terminating gene expression. In this investigation we present RNIE, a probabilistic approach for predicting RITs. The method is based upon covariance models which have been known for many years to be the most accurate computational tools for predicting homology in structural non-coding RNAs. We show that RNIE has superior performance in model species from a spectrum of bacterial phyla. Further analysis of species where a low number of RITs were predicted revealed a highly conserved structural sequence motif enriched near the genic termini of the pathogenic Actinobacteria, Mycobacterium tuberculosis. This motif, together with classical RITs, account for up to 90% of all the significantly structured regions from the termini of M. tuberculosis genic elements. The software, predictions and alignments described below are available from http://github.com/ppgardne/RNIE
Multi-objective dynamic population shuffled frog-leaping biclustering of microarray data
<p>Abstract</p> <p>The structure of RiboNucleic Acid (RNA) has the potential to be altered by a Single Nucleotide Polymorphism (SNP). Disease-associated SNPs mapping to non-coding regions of the genome that are transcribed into RiboNucleic Acid (RNA) can potentially affect cellular regulation (and cause disease) by altering the structure of the transcript. We performed a large-scale meta-analysis of Selective 2'-Hydroxyl Acylation analyzed by Primer Extension (SHAPE) data, which probes the structure of RNA. We found that several single point mutations exist that significantly disrupt RNA secondary structure in the five transcripts we analyzed. Thus, every RNA that is transcribed has the potential to be a “RiboSNitch;” where a SNP causes a large conformational change that alters regulatory function. Predicting the SNPs that will have the largest effect on RNA structure remains a contemporary computational challenge. We therefore benchmarked the most popular RNA structure prediction algorithms for their ability to identify mutations that maximally affect structure. We also evaluated metrics for rank ordering the extent of the structural change. Although no single algorithm/metric combination dramatically outperformed the others, small differences in AUC (Area Under the Curve) values reveal that certain approaches do provide better agreement with experiment. The experimental data we analyzed nonetheless show that multiple single point mutations exist in all RNA transcripts that significantly disrupt structure in agreement with the predictions.</p
Singlet Glycine Riboswitches Bind Ligand as Well as Tandem Riboswitches
The glycine riboswitch often occurs in a tandem architecture, with two ligand-binding domains (aptamers) followed by a single expression platform. Based on previous observations, we hypothesized that singlet versions of the glycine riboswitch, which contain only one aptamer domain, are able to bind glycine if appropriate structural contacts are maintained. An initial alignment of 17 putative singlet riboswitches indicated that the single consensus aptamer domain is flanked by a conserved peripheral stem-loop structure. These singlets were sorted into two subtypes based on whether the active aptamer domain precedes or follows the peripheral stem-loop, and an example of each subtype of singlet riboswitch was characterized biochemically. The singlets possess glycine-binding affinities comparable to those of previously published tandem examples, and the conserved peripheral domains form A-minor interactions with the single aptamer domain that are necessary for ligand-binding activity. Analysis of sequenced genomes identified a significant number of singlet glycine riboswitches. Based on these observations, we propose an expanded model for glycine riboswitch gene control that includes singlet and tandem architectures
jViz.RNA 4.0—Visualizing Pseudoknots and RNA Editing Employing Compressed Tree Graphs
Previously, we have introduced an improved version of jViz.RNA which enabled faster and more stable RNA visualization by employing compressed tree graphs. However, the new RNA representation and visualization method required a sophisticated mechanism of pseudoknot visualization. In this work, we present our novel pseudoknot classification and implementation of pseudoknot visualization in the context of the new RNA graph model. We then compare our approach with other RNA visualization software, and demonstrate jViz.RNA 4.0’s benefits compared to other software. Additionally, we introduce interactive editing functionality into jViz.RNA and demonstrate its benefits in exploring and building RNA structures. The results presented highlight the new high degree of utility jViz.RNA 4.0 now offers. Users are now able to visualize pseudoknotted RNA, manipulate the resulting automatic layouts to suit their individual needs, and change both positioning and connectivity of the RNA molecules examined. Care was taken to limit overlap between structural elements, particularly in the case of pseudoknots to ensure an intuitive and informative layout of the final RNA structure
Streptococcus suis TrpX is part of a tryptophan uptake system, and its expression is regulated by a T-box regulatory element
Streptococcus suis, a common member of the porcine respiratory microbiota, can cause life-threatening diseases in pigs as well as humans. A previous study identified the gene trpX as conditionally essential for in vivo survival by intrathecal infection of pigs with a transposon library of S. suis strain 10. Here, we characterized trpX, encoding a putative tryptophan/tyrosine transport system substrate-binding protein, in more detail. We compared growth capacities of the isogenic trpX-deficient mutant derivative strain 10∆trpX with its parent. Growth experiments in chemically defined media (CDM) revealed that growth of 10∆trpX depended on tryptophan concentration, suggesting TrpX involvement in tryptophan uptake. We demonstrated that trpX is part of an operon structure and co-transcribed with two additional genes encoding a putative permease and ATPase, respectively. Bioinformatics analysis identified a putative tryptophan T-box riboswitch in the 5′ untranslated region of this operon. Finally, qRT-PCR and a reporter activation assay revealed trpX mRNA induction under tryptophan-limited conditions. In conclusion, our study showed that TrpX is part of a putative tryptophan ABC transporter system regulated by a T-box riboswitch probably functioning as a substrate-binding protein. Due to the tryptophan auxotrophy of S. suis, TrpX plays a crucial role for metabolic adaptation and growth during infection
Experimental demonstration and pan-structurome prediction of climate-associated riboSNitches in Arabidopsis
BACKGROUND: Genome-wide association studies (GWAS) aim to correlate phenotypic changes with genotypic variation. Upon transcription, single nucleotide variants (SNVs) may alter mRNA structure, with potential impacts on transcript stability, macromolecular interactions, and translation. However, plant genomes have not been assessed for the presence of these structure-altering polymorphisms or "riboSNitches." RESULTS: We experimentally demonstrate the presence of riboSNitches in transcripts of two Arabidopsis genes, ZINC RIBBON 3 (ZR3) and COTTON GOLGI-RELATED 3 (CGR3), which are associated with continentality and temperature variation in the natural environment. These riboSNitches are also associated with differences in the abundance of their respective transcripts, implying a role in regulating the gene's expression in adaptation to local climate conditions. We then computationally predict riboSNitches transcriptome-wide in mRNAs of 879 naturally inbred Arabidopsis accessions. We characterize correlations between SNPs/riboSNitches in these accessions and 434 climate descriptors of their local environments, suggesting a role of these variants in local adaptation. We integrate this information in CLIMtools V2.0 and provide a new web resource, T-CLIM, that reveals associations between transcript abundance variation and local environmental variation. CONCLUSION: We functionally validate two plant riboSNitches and, for the first time, demonstrate riboSNitch conditionality dependent on temperature, coining the term "conditional riboSNitch." We provide the first pan-genome-wide prediction of riboSNitches in plants. We expand our previous CLIMtools web resource with riboSNitch information and with 1868 additional Arabidopsis genomes and 269 additional climate conditions, which will greatly facilitate in silico studies of natural genetic variation, its phenotypic consequences, and its role in local adaptation
Computational Discovery of Structured Non-coding RNA Motifs in Bacteria
This dissertation describes a range of computational efforts to discover novel structured non-coding RNA (ncRNA) motifs in bacteria and generate hypotheses regarding their potential functions. This includes an introductory description of key advances in comparative genomics and RNA structure prediction as well as some of the most commonly found ncRNA candidates. Beyond that, I describe efforts for the comprehensive discovery of ncRNA candidates in 25 bacterial genomes and a catalog of the various functions hypothesized for these new motifs. Finally, I describe the Discovery of Intergenic Motifs PipeLine (DIMPL) which is a new computational toolset that harnesses the power of support vector machine (SVM) classifiers to identify bacterial intergenic regions most likely to contain novel structured ncRNA and automates the bulk of the subsequent analysis steps required to predict function. In totality, the body of work will enable the large scale discovery of novel structured ncRNA motifs at a far greater pace than possible before