14 research outputs found

corRna: a web server for predicting multiple-point deleterious mutations in structural RNAs

Author: A. Kam
Afonin
Barash
Churkin
Churkin
E. Lam
Grabow
Guo
Halvorsen
Isaacs
J. Waldispuhl
Shu
Waldispuhl
Waldispuhl
You
Publication venue: Oxford University Press

RNA molecules can achieve a broad range of regulatory functions through specific structures that are in turn determined by their sequence. The prediction of mutations changing the structural properties of RNA sequences (a.k.a. deleterious mutations) is therefore useful for conducting mutagenesis experiments and synthetic biology applications. While brute-force approaches can be used to analyze single-point mutations, this strategy does not scale well to multiple mutations. In this article, we present corRna, a web server for predicting multiple-point deleterious mutations in structural RNAs. corRna uses our RNAmutants framework to efficiently explore the RNA mutational landscape. It also enables users to apply search heuristics to improve the quality of the predictions. We show that corRna predictions correlate with mutagenesis experiments on the hepatitis C virus cis-acting replication element and match the accuracy of previous approaches on a large test set in a much lower execution time. We illustrate the new perspectives offered by corRna by predicting five-point deleterious mutations, an insight that could not be achieved by previous methods. corRna is available at: http://corrna.cs.mcgill.ca
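
The scaling problem mentioned in the abstract can be illustrated with a short counting sketch. It is not part of corRna or RNAmutants; the function name `count_k_point_mutants` and the example length of 100 nucleotides are assumptions chosen for illustration. A sequence of length n has C(n, k) ways to choose k positions, and each chosen position can change to one of 3 other nucleotides.

```python
from math import comb


def count_k_point_mutants(n, k, alphabet_size=4):
    """Number of sequences at Hamming distance exactly k from a length-n sequence.

    Hypothetical helper for illustration: choose k positions, then pick one of
    the (alphabet_size - 1) other letters at each chosen position.
    """
    return comb(n, k) * (alphabet_size - 1) ** k


if __name__ == "__main__":
    # Assumed example length; any n shows the same combinatorial growth.
    n = 100
    for k in range(1, 6):
        print(k, count_k_point_mutants(n, k))
```

Folding every mutant separately is feasible for k = 1 but the count grows combinatorially with k, which is the motivation the abstract gives for a method that does not enumerate mutants one by one.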

Crossref

PubMed Central

Towards 3D structure prediction of large RNA molecules: an integer programming framework to insert local 3D motifs in RNA secondary structure

Author: Berman
Do
F. Major
Frellsen
Gendron
J. Waldispuhl
Laing
Lemieux
Markham
Martinez
Parisien
Reuter
Sarver
V. Reinharz
Wuchty
Publication venue: Oxford University Press

Motivation: The prediction of RNA 3D structures from sequence alone is a milestone toward RNA function analysis and prediction. In recent years, many methods have addressed this challenge, ranging from cycle decomposition and fragment assembly to molecular dynamics simulations. However, their predictions remain fragile and limited to small RNAs. To expand the range and accuracy of these techniques, we need to develop algorithms that enable us to use all the structural information available. In particular, the energetic contribution of secondary structure interactions is now well documented, but the quantification of non-canonical interactions (those shaping the tertiary structure) is poorly understood. Nonetheless, even if a complete RNA tertiary structure energy model is currently unavailable, we now have catalogues of local 3D structural motifs including non-canonical base pairings. A practical objective is thus to develop techniques enabling us to use this knowledge for robust RNA tertiary structure predictors.

Crossref

PubMed Central

Simultaneous alignment and folding of protein sequences

Author: A. Caprara
B.E. Shakhnovich
C.B. Do
C.B. Do
D. Frishman
D. Sankoff
D.H. Mathews
G. Raghava
I.L. Hofacker
J. Selbig
J. Waldispuhl
J. Waldispuhl
J.H. Havgaard
L.R. Forrest
M. Brudno
M. Cline
M. Lomize
M. Menke
P. Bradley
P. Fariselli
P. Rice
R. Backofen
R. Doolittle
R.A. Sutormin
R.C. Edgar
R.C. Edgar
R.C. Edgar
R.L.J. Dunbrack
S. Henikoff
S. Will
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2009

Developing accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially for sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm’s complexity is polynomial in time and space. Algorithmically, partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments, partiFold-Align significantly outperforms state-of-the-art pairwise sequence alignment tools in the most difficult low sequence homology case and improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families. partiFold-Align is available at http://partiFold.csail.mit.edu

CiteSeerX

DSpace@MIT

Crossref

Efficient Algorithms for Probing the RNA Mutation Landscape

Author: A Coventry
A Omer
A Serganov
AO Harmanci
B Baker
B Knudsen
Bonnie Berger
C Reidys
C Thurner
Consortium ENCODE Project
D Barash
D Mathews
DH Mathews
E Rivas
I Hofacker
I Hofacker
I Miklos
IL Hofacker
IM Meyer
J Waldispuhl
J Waldispuhl
JS McCaskill
JS Pedersen
JS Weinger
Jérôme Waldispühl
M Yanagi
M Yang
M Zuker
M Zuker
MC Cowperthwaite
MC Cowperthwaite
MT Cheah
NM Cuceanu
P Clote
P Schuster
P Schuster
Peter Clote
PP Gardner
R Nussinov
RA Dimitrov
RD Dowell
S Brown
S Griffiths-Jones
S Griffiths-Jones
S You
SH Bernhart
Srinivas Devadas
T Kulinski
T Xia
Uwe Ohler
V Ambros
W Fontana
W Grüner
W Shu
Y Ding
Y Ding
Y Ding
Y Ponty
Publication venue: Public Library of Science
Publication date: 08/08/2008

The diversity and importance of the role played by RNAs in the regulation and development of the cell are now well-known and well-documented. This broad range of functions is achieved through specific structures that have been (presumably) optimized through evolution. State-of-the-art methods, such as McCaskill's algorithm, use a statistical mechanics framework based on the computation of the partition function over the canonical ensemble of all possible secondary structures on a given sequence. Although secondary structure predictions from thermodynamics-based algorithms are not as accurate as methods employing comparative genomics, the former methods are the only available tools to investigate novel RNAs, such as the many RNAs of unknown function recently reported by the ENCODE consortium. In this paper, we generalize the McCaskill partition function algorithm to sum over the grand canonical ensemble of all secondary structures of all mutants of the given sequence. Specifically, our new program, RNAmutants, simultaneously computes for each integer k the minimum free energy structure MFE(k) and the partition function Z(k) over all secondary structures of all k-point mutants, even allowing the user to specify certain positions required not to mutate and certain positions required to base-pair or remain unpaired. This technically important extension allows us to study the resilience of an RNA molecule to pointwise mutations. By computing the mutation profile of a sequence, a novel graphical representation of the mutational tendency of nucleotide positions, we analyze the deleterious nature of mutating specific nucleotide positions or groups of positions. We have successfully applied RNAmutants to investigate deleterious mutations (mutations that radically modify the secondary structure) in the Hepatitis C virus cis-acting replication element and to evaluate the evolutionary pressure applied on different regions of the HIV trans-activation response element. 
In particular, we show qualitative agreement between published Hepatitis C and HIV experimental mutagenesis studies and our analysis of deleterious mutations using RNAmutants. Our work also predicts other deleterious mutations, which could be verified experimentally. Finally, we provide evidence that the 3′ UTR of the GB RNA virus C has been optimized to preserve evolutionarily conserved stem regions from a deleterious effect of pointwise mutations. We hope that there will be long-term potential applications of RNAmutants in de novo RNA design and drug design against RNA viruses. This work also suggests potential applications for large-scale exploration of the RNA sequence-structure network. Binary distributions are available at http://RNAmutants.csail.mit.edu/
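
The quantity Z(k) described in the abstract can be made concrete with a brute-force sketch. This is not the RNAmutants algorithm, which avoids enumerating mutants; it only spells out the definition on very short sequences. The energy model is an assumption made for illustration: every allowed base pair contributes the same Boltzmann weight `pair_weight`, hairpin loops need at least `min_loop` unpaired bases, and all function names are hypothetical.

```python
from itertools import combinations, product

# Watson-Crick and wobble pairs (toy pairing rule for this sketch).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def partition_function(seq, pair_weight=2.0, min_loop=3):
    """Sum of Boltzmann weights over all secondary structures of seq.

    Toy model: each base pair multiplies the weight by pair_weight.
    """
    n = len(seq)
    if n == 0:
        return 1.0
    Z = [[1.0] * n for _ in range(n)]

    def z(i, j):
        # Empty intervals contain only the empty structure.
        return Z[i][j] if i <= j else 1.0

    for span in range(1, n):
        for i in range(n - span):
            j = i + span
            total = z(i, j - 1)  # case: position j is unpaired
            for k in range(i, j - min_loop):  # case: j pairs with k
                if (seq[k], seq[j]) in PAIRS:
                    total += z(i, k - 1) * pair_weight * z(k + 1, j - 1)
            Z[i][j] = total
    return Z[0][n - 1]


def k_point_mutants(seq, k, alphabet="ACGU"):
    """Yield every sequence at Hamming distance exactly k from seq."""
    for positions in combinations(range(len(seq)), k):
        choices = [[b for b in alphabet if b != seq[p]] for p in positions]
        for subs in product(*choices):
            mutant = list(seq)
            for p, b in zip(positions, subs):
                mutant[p] = b
            yield "".join(mutant)


def ensemble_sum(seq, k, pair_weight=2.0, min_loop=3):
    """Brute-force Z(k): sum of partition functions over all k-point mutants."""
    return sum(partition_function(m, pair_weight, min_loop)
               for m in k_point_mutants(seq, k))


if __name__ == "__main__":
    for k in range(3):
        print(k, ensemble_sum("GAAAC", k))
```

Because the number of k-point mutants grows combinatorially, this enumeration is only practical for tiny inputs; the abstract's contribution is computing the same kind of sum for all k simultaneously by generalizing the McCaskill recursion.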

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Disease-Associated Mutations That Alter the RNA Structural Ensemble

Author: A Charlesworth
A Laederach
A Morgado
A Sgourou
AB Glinskii
Alain Laederach
AS Dimas
BE Stranger
BJ Tucker
C Jousse
C Kimchi-Sarfaty
CG Mathew
D Karolchik
D Wang
DH Mathews
E Beaudoing
E Bindewald
EA Doherty
EC Lai
EC Lai
EJ Benjamin
EL de Bruijne
ES Lein
F Ferrari
FW Leebeek
G Pesole
G Pesole
GV Glinsky
GV Glinsky
HF Noller
HH Kazazian Jr
HY Huang
I Inoue
IL Hofacker
J Reeder
J Treutlein
J Waldispuhl
J Wang
JK Cowell
Joshua S. Martin
JS Waye
JX Wang
K Darty
KP Burdon
L Bonafe
L Cremonesi
L Cremonesi
L Jankovic
LL Elnitski
M Kozak
M Macias
M Nuinoon
M Sanchez
Matthew Halvorsen
MB Boffa
NE Morton
PD Stenson
PJ Castaldi
PJ Ho
PJ Ho
PJ Ho
PJ Ho
RA George
S Chappell
S Ezzikouri
S Quarrier
SA Woodson
Sam Broadaway
SH Bernhart
SH Lee
SJ Child
ST Lee
T Ishigami
Takashi Gojobori
TE Baroni
TM Rana
V Iadevaia
WM Gommans
Y Ding
Y Ding
Publication venue: Public Library of Science
Publication date: 01/01/2010

Genome-wide association studies (GWAS) often identify disease-associated mutations in intergenic and non-coding regions of the genome. Given the high percentage of the human genome that is transcribed, we postulate that for some observed associations the disease phenotype is caused by a structural rearrangement in a regulatory region of the RNA transcript. To identify such mutations, we have performed a genome-wide analysis of all known disease-associated Single Nucleotide Polymorphisms (SNPs) from the Human Gene Mutation Database (HGMD) that map to the untranslated regions (UTRs) of a gene. Rather than using minimum free energy approaches (e.g. mFold), we use a partition function calculation that takes into consideration the ensemble of possible RNA conformations for a given sequence. We identified in the human genome disease-associated SNPs that significantly alter the global conformation of the UTR to which they map. For six disease states (Hyperferritinemia Cataract Syndrome, β-Thalassemia, Cartilage-Hair Hypoplasia, Retinoblastoma, Chronic Obstructive Pulmonary Disease (COPD), and Hypertension), we identified multiple SNPs in UTRs that alter the mRNA structural ensemble of the associated genes. Using a Boltzmann sampling procedure for sub-optimal RNA structures, we are able to characterize and visualize the nature of the conformational changes induced by the disease-associated mutations in the structural ensemble. We observe in several cases (specifically the 5′ UTRs of FTL and RB1) SNP-induced conformational changes analogous to those observed in bacterial regulatory riboswitches when specific ligands bind. We propose that the UTR and SNP combinations we identify constitute a “RiboSNitch,” that is, a regulatory RNA in which a specific SNP has a structural consequence that results in a disease phenotype. Our SNPfold algorithm can help identify RiboSNitches by leveraging GWAS data and an analysis of the mRNA structural ensemble.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Waddlia Genome: A Window into Chlamydial Biology

Growing evidence suggests that a novel member of the Chlamydiales order, Waddlia chondrophila, is a potential agent of miscarriage in humans and abortion in ruminants. Due to the lack of genetic tools to manipulate chlamydia, genomic analysis is proving to be the most incisive tool in stimulating investigations into the biology of these obligate intracellular bacteria. 454/Roche and Solexa/Illumina technologies were thus used to sequence and assemble de novo the full genome of the first representative of the Waddliaceae family, W. chondrophila. The bacterium possesses a 2,116,312 bp chromosome and a 15,593 bp low-copy-number plasmid that might integrate into the bacterial chromosome. The Waddlia genome displays numerous repeated sequences, indicating different genome dynamics from classical chlamydia, which almost completely lack repetitive elements. Moreover, W. chondrophila exhibits many virulence factors also present in classical chlamydia, including a functional type III secretion system, but also a large complement of specific factors for resistance to host or environmental stresses. Large families of outer membrane proteins were identified, indicating that these highly immunogenic proteins are not Chlamydiaceae specific and might have been present in their last common ancestor. Enhanced metabolic capability for the synthesis of nucleotides, amino acids, lipids and other co-factors suggests that the common ancestor of the modern Chlamydiales may have been less dependent on its eukaryotic host. The fine-detailed analysis of biosynthetic pathways brings us closer to possibly developing a synthetic medium to grow W. chondrophila, a critical step in the development of genetic tools. As a whole, the availability of the W. chondrophila genome opens new possibilities in Chlamydiales research, providing new insights into the evolution of members of the order Chlamydiales and the biology of the Waddliaceae.

Public Library of Science (PLOS)

Crossref

Serveur académique lausannois

Directory of Open Access Journals

PubMed Central

Queensland University of Technology ePrints Archive

ZORA

Publications at Bielefeld University

USC Research Bank - University of the Sunshine Coast

Asymptotic structural properties of quasi-random saturated structures of RNA

Author: A Banerjee
A Böck
A Omer
A Xayaphoummine
AR Woods
B Knudsen
BJ Tucker
C Flamm
Danny Krizanc
E Bornberg-Bauer
Evangelos Kranakis
FH Van Batenburg
G Zipf
HK Hwang
I Hofacker
J Waldispuhl
JS Weinger
L Devroye
L Devroye
L Lim
LV Danilova
M Bekaert
M Drmota
M Mandal
M Zuker
M Zuker
ME Nebel
MS Waterman
MT Cheah
NR Markham
P Clote
P Clote
P Clote
P Flajolet
Peter Clote
PR Stein
R Nussinov
R Sedgewick
S Chowdhury
SP Lalley
W Li
WA Lorenz
Publication venue: Springer Science and Business Media LLC

Crossref

Energy landscape of k-point mutants of an RNA molecule

Author: Altschul
B. Behzadi
Clote
CLOTE
Clote
Deutsch
Ding
DING
Griffiths-Jones
Gutell
J. Waldispuhl
J.-M. Steyaert
MATHEWS
Mathews
McCaskill
Nussinov
P. Clote
Schuster
Zuker
Zuker
Publication venue: Oxford University Press (OUP)

Crossref

Predicting weakly stable regions, oligomerization state, and protein-protein interfaces in transmembrane domains of outer membrane proteins

Author: Adamian
Bastolla
Bigelow
Bishop
Bishop
Bogdanov
Elofsson
Evanics
Gentle
H. Naveed
Haltia
Hessa
Ho
Hong
Huysmans
J. Liang
Jackups
Jackups
Levy
Li
Lukatsky
Marianayagam
Méndez
R. Jackups
Schulz
Stanley
Tamm
van den Berg
Van Gelder
Waldispuhl
Wimley
Wimley
Wouters
Publication venue: Proceedings of the National Academy of Sciences

Crossref

How Many 3D Structures Do We Need to Train a Predictor?

Author: Adamczak
Altschul
Anfinsen
Bagos
Bagos
Bagos
Bagos
Baldi
Baldi
Berman
Ceroni
Chandonia
Chandonia
Chou
Chou
Cuff
Cuff
Deleage
Eddy
Frishman
Garnier
Garnier
Gascuel
Georgios N. Tsaousis
Gibrat
Guo
Hobohm
Holley
Hua
Ito
Jones
Kihara
Kim
Kloczkowski
Krogh
Levin
Lin
Lin
Liu
Liu
Mamitsuka
Martin-Galiano
Nguyen
Nguyen
Oberai
Ouali
Pan
Pantelis G. Bagos
Petersen
Pollastri
Pollastri
Pollastri
Przybylski
Punta
Qian
Qin
Reczko
Riis
Rost
Rost
Rychlewski
Sadeghi
Salamov
Schmidler
Shestopalov
Stavros J. Hamodrakas
Thompson
Thompson
Tusnady
Tusnady
Viklund
von Bertalanffy
Vullo
Vullo
Waldispuhl
Wang
Ward
White
Wood
Wood
Yi
Zemla
Zhang
Zvelebil
Publication venue: Elsevier BV
Publication date: 01/01/2009

It has been shown that progress in the determination of membrane protein structures grows exponentially, with approximately the same growth rate as that of water-soluble proteins. In order to investigate the effect of this growth on the performance of prediction algorithms for both α-helical and β-barrel membrane proteins, we conducted a prospective study based on historical records. We trained separate hidden Markov models with different sized training sets and evaluated their performance on topology prediction for the two classes of transmembrane proteins. We show that the existing top-scoring algorithms for predicting the transmembrane segments of α-helical membrane proteins perform slightly better than those for β-barrel outer membrane proteins in all measures of accuracy. With the same rationale, a meta-analysis of the performance of the secondary structure prediction algorithms indicates that existing algorithmic techniques cannot be further improved by just adding more non-homologous sequences to the training sets. The upper limit for secondary structure prediction is estimated to be no more than 70% and 80% of correctly predicted residues for single-sequence-based methods and multiple-sequence-based ones, respectively. Therefore, we should concentrate our efforts on utilizing new techniques for the development of even better scoring predictors. © 2009 Beijing Genomics Institute

CiteSeerX

Elsevier - Publisher Connector

Crossref

PubMed Central

University of Thessaly Institutional Repository