Search CORE

6,129 research outputs found

From Structure Prediction to Genomic Screens for Novel Non-Coding RNAs

Author: A Ben-Hur
AF Bompfünewerer
AM Khalil
AO Harmanci
AR Gruber
AV Uzilov
AX Wang
B Knudsen
B Lewis
BW Matthews
C Warden
C Workman
D Guarnieri
D Mathews
D Sankoff
DH Mathews
DH Turner
DK Chiu
E Bonnet
E Nudler
E Rivas
E Rivas
E Rivas
E Torarinsson
E Torarinsson
EP Nawrocki
EP Nawrocki
ES Andersen
ES Andersen
F Sleutels
GardnerJPP Daub
H Jia
I Holmes
I Holmes
IL Hofacker
Ivo L. Hofacker
J Felsenstein
J Gorodkin
J Gorodkin
J Gorodkin
J Gorodkin
J Gorodkin
J Gorodkin
J Gorodkin
Jan Gorodkin
JC Ellis
JG Underwood
JH Havgaard
JM Watts
JP McCutcheon
JS Mattick
JS Pedersen
JW Brown
K Doshi
K Okamura
K Reiche
KC Wang
KE Deigan
KM Weeks
L Redrup
M Georges
M Guttman
M Kertesz
M Kertesz
M Lindow
M Xie
MB Gerstein
MC Tsai
Michael Levitt
MW Hentze
N Lau
P Anandam
P Clote
P Gardner
P Larsson
P Menzel
P Schattner
PG Hawkins
PN Seibel
PP Gardner
R Nussinov
RA Gupta
RD Dowell
RD Dowell
RJ Klein
RJ Klein
RM Kuhn
RR Gutell
RR Gutell
S Eddy
S Griffiths-Jones
S Siebert
S Washietl
S Washietl
S Washietl
S Will
SE Seemann
SF Altschul
SR Eddy
T Gesell
T Hung
T Lowe
T Nagano
TF Consortium
TJ Macke
UA Ørom
V Kim
V Tripathi
W Deng
W Filipowicz
W Fontana
Y Park
Y Sakakibara
Z Weinberg
Z Weinberg
Z Yao
Z Yao
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Non-coding RNAs (ncRNAs) are receiving more and more attention not only as an abundant class of genes, but also as regulatory structural elements (some located in mRNAs). A key feature of RNA function is its structure. Computational methods were developed early for folding and prediction of RNA structure with the aim of assisting in functional analysis. With the discovery of more and more ncRNAs, it has become clear that a large fraction of these are highly structured. Interestingly, a large part of the structure is comprised of regular Watson-Crick and GU wobble base pairs. This and the increased amount of available genomes have made it possible to employ structure-based methods for genomic screens. The field has moved from folding prediction of single sequences to computational screens for ncRNAs in genomic sequence using the RNA structure as the main characteristic feature. Whereas early methods focused on energy-directed folding of single sequences, comparative analysis based on structure preserving changes of base pairs has been efficient in improving accuracy, and today this constitutes a key component in genomic screens. Here, we cover the basic principles of RNA folding and touch upon some of the concepts in current methods that have been applied in genomic screens for de novo RNA structures in searches for novel ncRNA genes and regulatory RNA structure on mRNAs. We discuss the strengths and weaknesses of the different strategies and how they can complement each other

Crossref

Directory of Open Access Journals

PubMed Central

Permanent Hosting, Archiving and Indexing of Digital Resources and Assets

Copenhagen University Research Information System

Evolutionary Modeling and Prediction of Non-Coding RNAs in Drosophila

Author: A Siepel
A Siepel
A Stark
A Varadarajan
AG Clark
Andrew V. Uzilov
B Knudsen
B Paten
CN Dewey
D Rose
D St Johnston
DP Bartel
DS Parker
E Boyle
E Lcuyer
E Nawrocki
E Rivas
E Rivas
E Torarinsson
G McGuire
Ian Holmes
IL Hofacker
J Brennecke
J Pedersen
J Ruby
JL Thorne
JP Bachellerie
JR Manak
JS Pedersen
JS Pedersen
JS Pedersen
KS Pollard
Lars Barquist
M Crosby
M Mandal
M Pheasant
M Sprinzl
Mitchell E. Skinner
N Bray
N Goldman
PD Rijk
PS Klosterman
RD Dowell
RD Dowell
RK Bradley
Robert Belshaw
Robert K. Bradley
S Griffiths-Jones
S Washietl
T Babak
T Elgavish
T Gesell
TM Lowe
V Ambros
WJ Bruno
YR Bendana
Yuri R. Bendaña
Z Wang
Z Yang
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eight different models of structural evolution and two different programs for genome alignment. We evaluated our models using alignments of twelve Drosophila genomes. We find that ncRNA prediction performance can vary greatly between different gene predictors and subfamilies of ncRNA gene. Our estimates for false positive rates are based on simulations which preserve local islands of conservation; using these simulations, we predict a higher rate of false positives than previous computational ncRNA screens have reported. Using one of the tested prediction grammars, we provide an updated set of ncRNA predictions for D. melanogaster and compare them to previously-published predictions and experimental data. Many of our predictions show correlations with protein-coding genes. We found significant depletion of intergenic predictions near the 3′ end of coding regions and furthermore depletion of predictions in the first intron of protein-coding genes. Some of our predictions are colocated with larger putative unannotated genes: for example, 17 of our predictions showing homology to the RFAM family snoR28 appear in a tandem array on the X chromosome; the 4.5 Kbp spanned by the predicted tandem array is contained within a FlyBase-annotated cDNA

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Computational RNomics of Drosophilids

Author: Findeiß Sven
Hackermüller Jörg
Hertel Jana
Prohaska Sonja J.
Reiche Kristin
Rose Dominic
Stadler Peter F.
Washietl Stefan
Publication venue
Publication date: 18/10/2018
Field of study

Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set of comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz

Qucosa - Publikationsserver der Universität Leipzig

Structure-based whole-genome realignment reveals many novel noncoding RNAs

Author: Berger Bonnie
Will Sebastian
Yu Michael
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 01/01/2012
Field of study

Recent genome-wide computational screens that search for conservation of RNA secondary structure in whole-genome alignments (WGAs) have predicted thousands of structural noncoding RNAs (ncRNAs). The sensitivity of such approaches, however, is limited, due to their reliance on sequence-based whole-genome aligners, which regularly misalign structural ncRNAs. This suggests that many more structural ncRNAs may remain undetected. Structure-based alignment, which could increase the sensitivity, has been prohibitive for genome-wide screens due to its extreme computational costs. Breaking this barrier, we present the pipeline REAPR (RE-Alignment for Prediction of structural ncRNA), which efficiently realigns whole genomes based on RNA sequence and structure, thus allowing us to boost the performance of de novo ncRNA predictors, such as RNAz. Key to the pipeline's efficiency is the development of a novel banding technique for multiple RNA alignment. REAPR significantly outperforms the widely used predictors RNAz and EvoFold in genome-wide screens; in direct comparison to the most recent RNAz screen on D. melanogaster, REAPR predicts twice as many high-confidence ncRNA candidates. Moreover, modENCODE RNA-seq experiments confirm a substantial number of its predictions as transcripts. REAPR's advancement of de novo structural characterization of ncRNAs complements the identification of transcripts from rapidly accumulating RNA-seq data.National Institutes of Health (U.S.) (Grant RO1GM081871

DSpace@MIT

Crossref

PubMed Central

Designing libraries for pooled CRISPR functional screens of long noncoding RNAs.

Author: Johnson Rory
Pulido-Quetglas Carlos
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

Human and other genomes encode tens of thousands of long noncoding RNAs (lncRNAs), the vast majority of which remain uncharacterised. High-throughput functional screening methods, notably those based on pooled CRISPR-Cas perturbations, promise to unlock the biological significance and biomedical potential of lncRNAs. Such screens are based on libraries of single guide RNAs (sgRNAs) whose design is critical for success. Few off-the-shelf libraries are presently available, and lncRNAs tend to have cell-type-specific expression profiles, meaning that library design remains in the hands of researchers. Here we introduce the topic of pooled CRISPR screens for lncRNAs and guide readers through the three key steps of library design: accurate annotation of transcript structures, curation of optimal candidate sets, and design of sgRNAs. This review is a starting point and reference for researchers seeking to design custom CRISPR screening libraries for lncRNAs

PubMed Central

Bern Open Repository and Information System (BORIS)

Non-coding RNA annotation of the genome of Trichoplax adhaerens

Author: A. Tanzer
Altschul
Atzorn
B. Schierwater
Bailey
Basu
Bernhart
Bernhart
Blanchette
D. de Jong
D. Rose
Dunn
Enright
Gotoh
Griffiths-Jones
Grimson
Gruber
H. Tafer
Hertel
Hofacker
J. Hertel
Jakob
Lee
Lowe
M. Marz
MARMIER-GOURRIER
Marzluff
Miller
Molnar
Nagai
Nazar
Nilsen
Niwa
Odorico
P. F. Stadler
Pearson
Piccinelli
Prochnik
Puente
Putnam
Ro
Rose
Rose
Roshan
Srivastava
Steigele
Sunkar
Tarn
Val'ekho-Roman
Valadkhan
Voigt
Wainright
Washietl
Washietl
Weber
Willkomm
Zhang
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

A detailed annotation of non-protein coding RNAs is typically missing in initial releases of newly sequenced genomes. Here we report on a comprehensive ncRNA annotation of the genome of Trichoplax adhaerens, the presumably most basal metazoan whose genome has been published to-date. Since blast identified only a small fraction of the best-conserved ncRNAs—in particular rRNAs, tRNAs and some snRNAs—we developed a semi-global dynamic programming tool, GotohScan, to increase the sensitivity of the homology search. It successfully identified the full complement of major and minor spliceosomal snRNAs, the genes for RNase P and MRP RNAs, the SRP RNA, as well as several small nucleolar RNAs. We did not find any microRNA candidates homologous to known eumetazoan sequences. Interestingly, most ncRNAs, including the pol-III transcripts, appear as single-copy genes or with very small copy numbers in the Trichoplax genome

Permanent Hosting, Archiving and Indexing of Digital Resources and Assets

Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change

Author: Keegan Joshua M
Mathews David H
Uzilov Andrew V
Publication venue: BioMed Central
Publication date: 01/03/2006
Field of study

BACKGROUND: Non-coding RNAs (ncRNAs) have a multitude of roles in the cell, many of which remain to be discovered. However, it is difficult to detect novel ncRNAs in biochemical screens. To advance biological knowledge, computational methods that can accurately detect ncRNAs in sequenced genomes are therefore desirable. The increasing number of genomic sequences provides a rich dataset for computational comparative sequence analysis and detection of novel ncRNAs. RESULTS: Here, Dynalign, a program for predicting secondary structures common to two RNA sequences on the basis of minimizing folding free energy change, is utilized as a computational ncRNA detection tool. The Dynalign-computed optimal total free energy change, which scores the structural alignment and the free energy change of folding into a common structure for two RNA sequences, is shown to be an effective measure for distinguishing ncRNA from randomized sequences. To make the classification as a ncRNA, the total free energy change of an input sequence pair can either be compared with the total free energy changes of a set of control sequence pairs, or be used in combination with sequence length and nucleotide frequencies as input to a classification support vector machine. The latter method is much faster, but slightly less sensitive at a given specificity. Additionally, the classification support vector machine method is shown to be sensitive and specific on genomic ncRNA screens of two different Escherichia coli and Salmonella typhi genome alignments, in which many ncRNAs are known. The Dynalign computational experiments are also compared with two other ncRNA detection programs, RNAz and QRNA. CONCLUSION: The Dynalign-based support vector machine method is more sensitive for known ncRNAs in the test genomic screens than RNAz and QRNA. Additionally, both Dynalign-based methods are more sensitive than RNAz and QRNA at low sequence pair identities. Dynalign can be used as a comparable or more accurate tool than RNAz or QRNA in genomic screens, especially for low-identity regions. Dynalign provides a method for discovering ncRNAs in sequenced genomes that other methods may not identify. Significant improvements in Dynalign runtime have also been achieved

Directory of Open Access Journals

PubMed Central