Efficient known ncRNA search including pseudoknots

Author: Cheng Yuan
Yanni Sun
Publication venue: Springer Nature
Publication date: 21/01/2013

BACKGROUND: Searching for members of characterized ncRNA families containing pseudoknots is an important component of genome-scale ncRNA annotation. However, state-of-the-art search for known ncRNAs is based on context-free grammars (CFGs), which cannot effectively model pseudoknots. Thus, existing CFG-based ncRNA identification tools usually ignore pseudoknots during search. As a result, these tools report dozens of sequences that do not contain the native pseudoknots. When pseudoknot structures are vital to the functions of the ncRNAs, these sequences may not be true members. RESULTS: In this work, we design a pseudoknot search tool using multiple simple sub-structures, which are derived from knot-free and bifurcation-free structural motifs in the underlying family. We test our tool on a contiguous 22-Mb region of the maize genome. The experimental results show that our work competes favorably with other pseudoknot search methods. CONCLUSIONS: Our sub-structure based tool can conduct genome-scale pseudoknot-containing ncRNA search effectively and efficiently. It provides a complementary pseudoknot search tool to Infernal. The source code is available at http://www.cse.msu.edu/~chengy/knotsearch
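
To make the pseudoknot notion in this abstract concrete, here is a minimal Python sketch (not the authors' tool) that checks whether a set of base pairs contains crossing pairs. It assumes the common definition that two pairs (i, j) and (k, l) cross when i < k < j < l; the function name `has_pseudoknot` and the example pair lists are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def has_pseudoknot(pairs):
    """Return True if any two base pairs cross, i.e. i < k < j < l.

    `pairs` is an iterable of (position, position) tuples. This is an
    illustrative check only, not the search method from the abstract.
    """
    # Normalize each pair so the smaller index comes first, then sort.
    normalized = sorted((min(a, b), max(a, b)) for a, b in pairs)
    for (i, j), (k, l) in combinations(normalized, 2):
        # After sorting, i <= k. The pairs cross when k opens inside
        # (i, j) but closes outside it.
        if i < k < j < l:
            return True
    return False


# Hypothetical examples: a nested stem and a structure with crossing pairs.
nested = [(0, 9), (1, 8), (2, 7)]
crossing = [(0, 6), (1, 5), (3, 9), (4, 8)]

print(has_pseudoknot(nested))    # False
print(has_pseudoknot(crossing))  # True
```

Properly nested pairs like those in `nested` can be generated by a context-free grammar, whereas crossing pairs like those in `crossing` are the case the abstract says CFG-based tools usually ignore.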

Springer - Publisher Connector

PubMed Central

Computational identification and analysis of noncoding RNAs - Unearthing the buried treasures in the genome

Author: Vaidyanathan P. P.
Yoon Byung-Jun
Publication date: 01/01/2007

The central dogma of molecular biology states that the genetic information flows from DNA to RNA to protein. This dogma has exerted a substantial influence on our understanding of the genetic activities in the cells. Under this influence, the prevailing assumption until the recent past was that genes are basically repositories for protein coding information, and proteins are responsible for most of the important biological functions in all cells. Meanwhile, the importance of RNAs has remained rather obscure, and RNA was mainly viewed as a passive intermediary that bridges the gap between DNA and protein. Except for classic examples such as tRNAs (transfer RNAs) and rRNAs (ribosomal RNAs), functional noncoding RNAs were considered to be rare. However, this view has experienced a dramatic change during the last decade, as systematic screening of various genomes identified myriads of noncoding RNAs (ncRNAs), which are RNA molecules that function without being translated into proteins [11], [40]. It has been realized that many ncRNAs play important roles in various biological processes. As RNAs can interact with other RNAs and DNAs in a sequence-specific manner, they are especially useful in tasks that require highly specific nucleotide recognition [11]. Good examples are the miRNAs (microRNAs) that regulate gene expression by targeting mRNAs (messenger RNAs) [4], [20], and the siRNAs (small interfering RNAs) that take part in the RNAi (RNA interference) pathways for gene silencing [29], [30]. Recent developments show that ncRNAs are extensively involved in many gene regulatory mechanisms [14], [17]. The roles of ncRNAs known to this day are truly diverse. These include transcription and translation control, chromosome replication, RNA processing and modification, and protein degradation and translocation [40], to name just a few.
These days, it is even claimed that ncRNAs dominate the genomic output of higher organisms such as mammals, and it is being suggested that the greater portion of their genome (which does not encode proteins) is dedicated to the control and regulation of cell development [27]. As more and more evidence piles up, greater attention is being paid to ncRNAs, which have been neglected for a long time. Researchers began to realize that the vast majority of the genome that was regarded as “junk,” mainly because it was not well understood, may indeed hold the key to the best kept secrets in life, such as the mechanism of alternative splicing, the control of epigenetic variations and so forth [27]. The complete range and extent of the role of ncRNAs are not so obvious at this point, but it is certain that a comprehensive understanding of cellular processes is not possible without understanding the functions of ncRNAs [47].

Caltech Authors

Structural Alignment of RNAs Using Profile-csHMMs and Its Application to RNA Homology Search: Overview and New Results

Author: Vaidyanathan P. P.
Yoon Byung-Jun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008

Systematic research on noncoding RNAs (ncRNAs) has revealed that many ncRNAs are actively involved in various biological networks. Therefore, in order to fully understand the mechanisms of these networks, it is crucial to understand the roles of ncRNAs. Unfortunately, the annotation of ncRNA genes that give rise to functional RNA molecules has begun only recently, and it is far from complete. Considering the huge amount of genome sequence data, we need efficient computational methods for finding ncRNA genes. One effective way of finding ncRNA genes is to look for regions that are similar to known ncRNA genes. As many ncRNAs have well-conserved secondary structures, we need statistical models that can represent such structures for this purpose. In this paper, we propose a new method for representing RNA sequence profiles and finding structural alignment of RNAs based on profile context-sensitive hidden Markov models (profile-csHMMs). Unlike existing models, the proposed approach can handle any kind of RNA secondary structure, including pseudoknots. We show that profile-csHMMs can provide an effective framework for the computational analysis of RNAs and the identification of ncRNA genes.

CiteSeerX

Caltech Authors

An Efficient Alignment Algorithm for Searching Simple Pseudoknots over Long Genomic Sequence

Author: Hon W
Lam TW
Ma CCC
Sadakane K
Wong KF
Yiu SM
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012

HKU Scholars Hub

LaRA 2: parallel and vectorized program for sequence–structure alignment of RNA sequences

Author: Ficarra Elisa
Reinert Knut
Urgese Gianvito
Winkler Jörg
Publication date: 01/01/2022

BACKGROUND: The function of non-coding RNA sequences is largely determined by their spatial conformation, namely the secondary structure of the molecule, formed by Watson–Crick interactions between nucleotides. Hence, modern RNA alignment algorithms routinely take structural information into account. In order to discover as yet unknown RNA families and infer their possible functions, the structural alignment of RNAs is an essential task. This task demands a lot of computational resources, especially for aligning many long sequences, and it therefore requires efficient algorithms that utilize modern hardware when available. A subset of the secondary structures contains overlapping interactions (called pseudoknots), which add complexity to the problem and are often ignored in available software. RESULTS: We present the SeqAn-based software LaRA 2 that is significantly faster than comparable software for accurate pairwise and multiple alignments of structured RNA sequences. In contrast to other programs, our approach can handle arbitrary pseudoknots. As an improved re-implementation of the LaRA tool for structural alignments, LaRA 2 uses multi-threading and vectorization for parallel execution and a new heuristic for computing a lower boundary of the solution. Our algorithmic improvements yield a program that is up to 130 times faster than the previous version. CONCLUSIONS: With LaRA 2 we provide a tool to analyse large sets of RNA secondary structures in relatively short time, based on structural alignment. The produced alignments can be used to derive structural motifs for the search in genomic databases.

Institutional Repository of the Freie Universität Berlin

Computational analysis of noncoding RNAs

Author: Akutsu
Alkan
Amaral
Ambros
Anders
Andronescu
Aravin
Au
Bartel
Bateman
Bentwich
Berezikov
Bernhart
Bon
Bonnet
Breaker
Busch
Cabili
Chen
Clark
Coventry
Deigan
del Val
Ding
Dinger
Dirks
Do
Dowell
ENCODE Project Consortium
Findeiß
Flamm
Freyhult
Fröhlich
Friedländer
Frith
Galperin
Gardner
Gautheret
Grad
Griffith
Grishok
Gruber
Guttman
Harmanci
Havgaard
He
Hendrix
Hertel
Hofacker
Jiang
Katz
Kertesz
Knudsen
Kozomara
Kruger
Lagesen
Lai
Laing
Laslett
Lau
Li
Lim
Lin
Lorenz
Lowe
Lu
Lucks
Lyngso
Macke
Markham
Mathelier
Mathews
Mattick
McCaskill
Menzel
Mituyama
Mohl
Mortazavi
Mourier
Mückstein
Nagel
Nam
Nawrocki
Noller
Nussinov
Ohler
Pang
Parisien
Pasquinelli
Pedersen
Pervouchine
Pfeffer
Reeder
Regalia
Ren
Reuter
Rivas
Roberts
Robertson
Robinson
Ruan
Ruby
Salari
Sankoff
Sato
Schattner
Schnall-Levin
Seemann
Seetin
Seitz
Sethupathy
Shi
Sperschneider
Stark
Tabaska
Tang
Torarinsson
Trapnell
Uemura
Underwood
van Bakel
Wang
Washietl
Weeks
Weinberg
Will
Wolfinger
Wu
Wuchty
Xayaphoummine
Xia
Xie
Xue
Yao
Zerbino
zu Siederdissen
Zuker
Publication venue: 'Wiley'
Publication date: 01/11/2012

Noncoding RNAs have emerged as key players in the cell. Understanding their surprisingly diverse range of functions is challenging for experimental and computational biology. Here, we review computational methods to analyze noncoding RNAs. The topics covered include basic and advanced techniques to predict RNA structures, annotation of noncoding RNAs in genomic data, mining RNA-seq data for novel transcripts and prediction of transcript structures, computational aspects of microRNAs, and database resources. Funding: Austrian Science Fund (Schrödinger Fellowship J2966-B12); German Research Foundation (grant WI 3628/1-1 to SW); National Institutes of Health (U.S.) (NIH award 1RC1CA147187)

DSpace@MIT

Crossref

PubMed Central

Design and implementation of a cyberinfrastructure for RNA motif search, prediction and analysis

Author: Wen Dongrong
Publication venue: Digital Commons @ NJIT
Publication date: 31/01/2012

RNA secondary and tertiary structure motifs play important roles in cells. However, very few web servers are available for RNA motif search and prediction. In this dissertation, a cyberinfrastructure, named RNAcyber, capable of performing RNA motif search and prediction, is proposed, designed and implemented. The first component of RNAcyber is a web-based search engine, named RmotifDB. This web-based tool integrates an RNA secondary structure comparison algorithm with the secondary structure motifs stored in the Rfam database. With a user-friendly interface, RmotifDB provides the ability to search for ncRNA structure motifs by both structure and sequence. The second component of RNAcyber is an enhanced version of RmotifDB. This enhanced version combines data from multiple sources, incorporates a variety of well-established structure-based search methods, and is integrated with the Gene Ontology. To display RmotifDB’s search results graphically, a software tool called RSview is developed. Finally, RNAcyber contains a web-based tool called Junction-Explorer, which employs a data mining method for predicting tertiary motifs in RNA junctions. Specifically, the tool is trained on solved RNA tertiary structures obtained from the Protein Data Bank, and is able to predict the configuration of coaxial helical stacks and families (topologies) in RNA junctions at the secondary structure level. Junction-Explorer employs several algorithms for motif prediction, including a random forest classification algorithm, a pseudoknot removal algorithm, and a feature ranking algorithm based on the Gini impurity measure. A series of experiments including 10-fold cross-validation has been conducted to evaluate the performance of the Junction-Explorer tool. Experimental results demonstrate the effectiveness of the proposed algorithms and the superiority of the tool over existing methods.
The RNAcyber infrastructure is fully operational, with all of its components accessible on the Internet.
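
The abstract mentions feature ranking based on the Gini impurity measure but does not give details. As a rough illustration of the measure itself, the following Python sketch computes Gini impurity (1 minus the sum of squared class proportions) and the impurity decrease of a single threshold split. The function names, toy labels, and feature values are hypothetical and are not taken from Junction-Explorer.

```python
from collections import Counter


def gini_impurity(labels):
    """Gini impurity: 1 - sum of squared class proportions."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def gini_decrease(feature_values, labels, threshold):
    """Impurity decrease obtained by splitting on feature <= threshold."""
    left = [y for x, y in zip(feature_values, labels) if x <= threshold]
    right = [y for x, y in zip(feature_values, labels) if x > threshold]
    n = len(labels)
    weighted = (len(left) / n) * gini_impurity(left) \
        + (len(right) / n) * gini_impurity(right)
    return gini_impurity(labels) - weighted


# Hypothetical toy data: two classes, one informative and one
# uninformative feature.
labels = ["A", "A", "B", "B"]
informative = [1, 2, 3, 4]
uninformative = [1, 3, 2, 4]

print(gini_decrease(informative, labels, threshold=2))    # 0.5
print(gini_decrease(uninformative, labels, threshold=2))  # 0.0
```

In a ranking of this kind, a feature whose splits produce a larger impurity decrease would be placed higher; how the dissertation aggregates such scores across the random forest is not stated in the abstract.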

Digital Commons @ New Jersey Institute of Technology (NJIT)