Search CORE

27 research outputs found

CorreLogo: an online server for 3D sequence logos of RNA and DNA alignments

Author: Bindewald Eckart
Schneider Thomas D.
Shapiro Bruce A.
Publication venue: Oxford University Press
Publication date: 01/01/2006
Field of study

We present an online server that generates a 3D representation of properties of user-submitted RNA or DNA alignments. The visualized properties are information of single alignment columns, mutual information of two alignment positions as well as the position-specific fraction of gaps. The nucleotide composition of both single columns and column pairs is visualized with the help of color-coded 3D bars labeled with letters. The server generates both VRML and JVX output that can be viewed with a VRML viewer or the JavaView applet, respectively. We show that combining these different features of an alignment into one 3D representation is helpful in identifying correlations between bases and potential RNA and DNA base pairs. Significant known correlations between the tRNA 3′ anticodon cardinal nucleotide and the extended anticodon were observed, as were correlations within the amino acid acceptor stem and between the cardinal nucleotide and the acceptor stem. The online server can be accessed using the URL

CiteSeerX

Crossref

PubMed Central

A benchmark of multiple sequence alignment programs upon structural RNAs

Author: Gardner Paul P.
Washietl Stefan
Wilm Andreas
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

To date, few attempts have been made to benchmark the alignment algorithms upon nucleic acid sequences. Frequently, sophisticated PAM or BLOSUM like models are used to align proteins, yet equivalents are not considered for nucleic acids; instead, rather ad hoc models are generally favoured. Here, we systematically test the performance of existing alignment algorithms on structural RNAs. This work was aimed at achieving the following goals: (i) to determine conditions where it is appropriate to apply common sequence alignment methods to the structural RNA alignment problem. This indicates where and when researchers should consider augmenting the alignment process with auxiliary information, such as secondary structure and (ii) to determine which sequence alignment algorithms perform well under the broadest range of conditions. We find that sequence alignment alone, using the current algorithms, is generally inappropriate <50–60% sequence identity. Second, we note that the probabilistic method ProAlign and the aging Clustal algorithms generally outperform other sequence-based algorithms, under the broadest range of applications

CiteSeerX

Crossref

PubMed Central

Copenhagen University Research Information System

Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences

Author: A Lescoute
AM Rosenblad
C Papanicolaou
C Woese
C Zwieb
C Zwieb
D Chiu
D Gautheret
D Mathews
D Matthews
D Sankoff
E Bindewald
F Rousset
F Tahi
F Tahi
Fariza Tahi
I Hofacker
J Brown
K Han
K Horimoto
L Vawter
M Szymanski
M Zuker
N Savill
O Perriquet
P Baldi
P Doty
P Higgs
PP Gardner
R Nussinov
RJ Klein
RR Gutell
S Freier
S Lindgreen
Stéfan Engelen
WC Curtis
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background The secondary structure of an RNA must be known before the relationship between its structure and function can be determined. One way to predict the secondary structure of an RNA is to identify covarying residues that maintain the pairings (Watson-Crick, Wobble and non-canonical pairings). This "comparative approach" consists of identifying mutations from homologous sequence alignments. The sequences must covary enough for compensatory mutations to be revealed, but comparison is difficult if they are too different. Thus the choice of homologous sequences is critical. While many possible combinations of homologous sequences may be used for prediction, only a few will give good structure predictions. This can be due to poor quality alignment in stems or to the variability of certain sequences. This problem of sequence selection is currently unsolved. Results This paper describes an algorithm, <it>SSCA</it>, which measures the suitability of sequences for the comparative approach. It is based on evolutionary models with structure constraints, particularly those on sequence variations and stem alignment. We propose three models, based on different constraints on sequence alignments. We show the results of the <it>SSCA </it>algorithm for predicting the secondary structure of several RNAs. <it>SSCA </it>enabled us to choose sets of homologous sequences that gave better predictions than arbitrarily chosen sets of homologous sequences. Conclusion <it>SSCA </it>is an algorithm for selecting combinations of RNA homologous sequences suitable for secondary structure predictions with the comparative approach.</p

HAL Evry

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ConStruct: Improved construction of RNA consensus structures

Author: Linnenbrink Kornelia
Steger Gerhard
Wilm Andreas
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Aligning homologous non-coding RNAs (ncRNAs) correctly in terms of sequence and structure is an unresolved problem, due to both mathematical complexity and imperfect scoring functions. High quality alignments, however, are a prerequisite for most consensus structure prediction approaches, homology searches, and tools for phylogeny inference. Automatically created ncRNA alignments often need manual corrections, yet this manual refinement is tedious and error-prone. Results We present an extended version of CONSTRUCT, a semi-automatic, graphical tool suitable for creating RNA alignments correct in terms of both consensus sequence and consensus structure. To this purpose CONSTRUCT combines sequence alignment, thermodynamic data and various measures of covariation. One important feature is that the user is guided during the alignment correction step by a consensus dotplot, which displays all thermodynamically optimal base pairs and the corresponding covariation. Once the initial alignment is corrected, optimal and suboptimal secondary structures as well as tertiary interaction can be predicted. We demonstrate CONSTRUCT's ability to guide the user in correcting an initial alignment, and show an example for optimal secondary consensus structure prediction on very hard to align SECIS elements. Moreover we use CONSTRUCT to predict tertiary interactions from sequences of the internal ribosome entry site of CrP-like viruses. In addition we show that alignments specifically designed for benchmarking can be easily be optimized using CONSTRUCT, although they share very little sequence identity. Conclusion CONSTRUCT's graphical interface allows for an easy alignment correction based on and guided by predicted and known structural constraints. It combines several algorithms for prediction of secondary consensus structure and even tertiary interactions. The CONSTRUCT package can be downloaded from the URL listed in the Availability and requirements section of this article.</p

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Direct-Coupling Analysis of nucleotide coevolution facilitates RNA secondary and tertiary structure prediction

Author: Cocco Simona
De Leonardis Eleonora
Lutz Benjamin
Monasson Remi
Ratz Sebastian
Schug Alexander
Weigt Martin
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

Despite the biological importance of non-coding RNA, their structural characterization remains challenging. Making use of the rapidly growing sequence databases, we analyze nucleotide coevolution across homologous sequences via Direct-Coupling Analysis to detect nucleotide-nucleotide contacts. For a representative set of riboswitches, we show that the results of Direct-Coupling Analysis in combination with a generalized Nussinov algorithm systematically improve the results of RNA secondary structure prediction beyond traditional covariance approaches based on mutual information. Even more importantly, we show that the results of Direct-Coupling Analysis are enriched in tertiary structure contacts. By integrating these predictions into molecular modeling tools, systematically improved tertiary structure predictions can be obtained, as compared to using secondary structure information alone.Comment: 22 pages, 8 figures, supplemental information available on the publisher's webpage (http://nar.oxfordjournals.org/content/early/2015/09/29/nar.gkv932.abstract

arXiv.org e-Print Archive

Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction

Author: Dowell Robin D
Eddy Sean R
Publication venue: BioMed Central
Publication date: 01/01/2004
Field of study

BACKGROUND: RNA secondary structure prediction methods based on probabilistic modeling can be developed using stochastic context-free grammars (SCFGs). Such methods can readily combine different sources of information that can be expressed probabilistically, such as an evolutionary model of comparative RNA sequence analysis and a biophysical model of structure plausibility. However, the number of free parameters in an integrated model for consensus RNA structure prediction can become untenable if the underlying SCFG design is too complex. Thus a key question is, what small, simple SCFG designs perform best for RNA secondary structure prediction? RESULTS: Nine different small SCFGs were implemented to explore the tradeoffs between model complexity and prediction accuracy. Each model was tested for single sequence structure prediction accuracy on a benchmark set of RNA secondary structures. CONCLUSIONS: Four SCFG designs had prediction accuracies near the performance of current energy minimization programs. One of these designs, introduced by Knudsen and Hein in their PFOLD algorithm, has only 21 free parameters and is significantly simpler than the others

Directory of Open Access Journals

PubMed Central

Digital Commons@Becker

Fragmentation of the large subunit ribosomal RNA gene in oyster mitochondrial genomes

Author: Cannone Jamie J
Gaffney Patrick M
Gutell Robin R
Lee Jung C
Milbury Coren A
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Discontinuous genes have been observed in bacteria, archaea, and eukaryotic nuclei, mitochondria and chloroplasts. Gene discontinuity occurs in multiple forms: the two most frequent forms result from introns that are spliced out of the RNA and the resulting exons are spliced together to form a single transcript, and fragmented gene transcripts that are not covalently attached post-transcriptionally. Within the past few years, fragmented ribosomal RNA (rRNA) genes have been discovered in bilateral metazoan mitochondria, all within a group of related oysters. Results In this study, we have characterized this fragmentation with comparative analysis and experimentation. We present secondary structures, modeled using comparative sequence analysis of the discontinuous mitochondrial large subunit rRNA genes of the cupped oysters <it>C. virginica, C. gigas</it>, and <it>C. hongkongensis</it>. Comparative structure models for the large subunit rRNA in each of the three oyster species are generally similar to those for other bilateral metazoans. We also used RT-PCR and analyzed ESTs to determine if the two fragmented LSU rRNAs are spliced together. The two segments are transcribed separately, and not spliced together although they still form functional rRNAs and ribosomes. Conclusions Although many examples of discontinuous ribosomal genes have been documented in bacteria and archaea, as well as the nuclei, chloroplasts, and mitochondria of eukaryotes, oysters are some of the first characterized examples of fragmented bilateral animal mitochondrial rRNA genes. The secondary structures of the oyster LSU rRNA fragments have been predicted on the basis of previous comparative metazoan mitochondrial LSU rRNA structure models.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks

From Structure Prediction to Genomic Screens for Novel Non-Coding RNAs

Author: A Ben-Hur
AF Bompfünewerer
AM Khalil
AO Harmanci
AR Gruber
AV Uzilov
AX Wang
B Knudsen
B Lewis
BW Matthews
C Warden
C Workman
D Guarnieri
D Mathews
D Sankoff
DH Mathews
DH Turner
DK Chiu
E Bonnet
E Nudler
E Rivas
E Rivas
E Rivas
E Torarinsson
E Torarinsson
EP Nawrocki
EP Nawrocki
ES Andersen
ES Andersen
F Sleutels
GardnerJPP Daub
H Jia
I Holmes
I Holmes
IL Hofacker
Ivo L. Hofacker
J Felsenstein
J Gorodkin
J Gorodkin
J Gorodkin
J Gorodkin
J Gorodkin
J Gorodkin
J Gorodkin
Jan Gorodkin
JC Ellis
JG Underwood
JH Havgaard
JM Watts
JP McCutcheon
JS Mattick
JS Pedersen
JW Brown
K Doshi
K Okamura
K Reiche
KC Wang
KE Deigan
KM Weeks
L Redrup
M Georges
M Guttman
M Kertesz
M Kertesz
M Lindow
M Xie
MB Gerstein
MC Tsai
Michael Levitt
MW Hentze
N Lau
P Anandam
P Clote
P Gardner
P Larsson
P Menzel
P Schattner
PG Hawkins
PN Seibel
PP Gardner
R Nussinov
RA Gupta
RD Dowell
RD Dowell
RJ Klein
RJ Klein
RM Kuhn
RR Gutell
RR Gutell
S Eddy
S Griffiths-Jones
S Siebert
S Washietl
S Washietl
S Washietl
S Will
SE Seemann
SF Altschul
SR Eddy
T Gesell
T Hung
T Lowe
T Nagano
TF Consortium
TJ Macke
UA Ørom
V Kim
V Tripathi
W Deng
W Filipowicz
W Fontana
Y Park
Y Sakakibara
Z Weinberg
Z Weinberg
Z Yao
Z Yao
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Non-coding RNAs (ncRNAs) are receiving more and more attention not only as an abundant class of genes, but also as regulatory structural elements (some located in mRNAs). A key feature of RNA function is its structure. Computational methods were developed early for folding and prediction of RNA structure with the aim of assisting in functional analysis. With the discovery of more and more ncRNAs, it has become clear that a large fraction of these are highly structured. Interestingly, a large part of the structure is comprised of regular Watson-Crick and GU wobble base pairs. This and the increased amount of available genomes have made it possible to employ structure-based methods for genomic screens. The field has moved from folding prediction of single sequences to computational screens for ncRNAs in genomic sequence using the RNA structure as the main characteristic feature. Whereas early methods focused on energy-directed folding of single sequences, comparative analysis based on structure preserving changes of base pairs has been efficient in improving accuracy, and today this constitutes a key component in genomic screens. Here, we cover the basic principles of RNA folding and touch upon some of the concepts in current methods that have been applied in genomic screens for de novo RNA structures in searches for novel ncRNA genes and regulatory RNA structure on mRNAs. We discuss the strengths and weaknesses of the different strategies and how they can complement each other

Crossref

Directory of Open Access Journals

PubMed Central

Permanent Hosting, Archiving and Indexing of Digital Resources and Assets

Copenhagen University Research Information System

Structural Constraints Identified with Covariation Analysis in Ribosomal RNA

Covariation analysis is used to identify those positions with similar patterns of sequence variation in an alignment of RNA sequences. These constraints on the evolution of two positions are usually associated with a base pair in a helix. While mutual information (MI) has been used to accurately predict an RNA secondary structure and a few of its tertiary interactions, early studies revealed that phylogenetic event counting methods are more sensitive and provide extra confidence in the prediction of base pairs. We developed a novel and powerful phylogenetic events counting method (PEC) for quantifying positional covariation with the Gutell lab’s new RNA Comparative Analysis Database (rCAD). The PEC and MI-based methods each identify unique base pairs, and jointly identify many other base pairs. In total, both methods in combination with an N-best and helix-extension strategy identify the maximal number of base pairs. While covariation methods have effectively and accurately predicted RNAs secondary structure, only a few tertiary structure base pairs have been identified. Analysis presented herein and at the Gutell lab’s Comparative RNA Web (CRW) Site reveal that the majority of these latter base pairs do not covary with one another. However, covariation analysis does reveal a weaker although significant covariation between sets of nucleotides that are in proximity in the three-dimensional RNA structure. This reveals that covariation analysis identifies other types of structural constraints beyond the two nucleotides that form a base pair

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks

FigShare

Incorporating phylogenetic-based covarying mutations into RNAalifold for RNA consensus structure prediction

Author: A Esquela-Kerscher
AO Harmanci
B Gulko
B Knudsen
B Knudsen
C Workman
CB Do
CM Croce
Consortium The ENCODE Project
CR Woese
D Sankoff
DKY Chiu
DL Swofford
E Rivas
F Xia
IL Hofacker
IL Hofacker
J Felsenstein
JA Jaeger
JH Havgaard
JP Huelsenbeck
JS Mattick
JS Pedersen
L He
M Mandal
M Zuker
M Zuker
MA Larkin
MS Nicoloso
MS Waterman
Ping Ge
PP Gardner
PP Gardner
R Lorenz
R Nussinov
RD Dowell
RJ Klein
RR Gutell
RR Gutell
RR Sokal
S Washietl
S Will
SE Seemann
SE Seemann
SH Bernhart
Shaojie Zhang
SR Eddy
SR Eddy
The FANTOM Consortium
TR Mercer
WM Fitch
Y Sakakibara
Z Yao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

BACKGROUND: RNAalifold, a popular computational method for RNA consensus structure prediction, incorporates covarying mutations into a thermodynamic model to fold the aligned RNA sequences. When quantifying covariance, it evaluates conserved signals of two aligned columns with base-pairing rules. This scoring scheme performs better than some other approaches, such as mutual information. However it ignores the phylogenetic history of the aligned sequences, which is an important criterion to evaluate the level of sequence covariance. RESULTS: In this article, in order to improve the accuracy of consensus structure folding, we propose a novel approach named PhyloRNAalifold. It incorporates the number of covarying mutations on the phylogenetic tree of the aligned sequences into the covariance scoring of RNAalifold. The benchmarking results show that the new scoring scheme of PhyloRNAalifold can improve the consensus structure detection of RNAalifold. CONCLUSION: Incorporating additional phylogenetic information of aligned sequences into the covariance scoring of RNAalifold can improve its performance of consensus structures folding. This improvement is correlated with alignment characteristics, such as pair-wise identity and the number of sequences in the alignment

Crossref

PubMed Central

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)