Search CORE

Prediction of super-secondary structure in α-helical and β-barrel transmembrane proteins

Author: C Zhang
HR Bigelow
J Waldispühl
Jean-Marc Steyaert
P Martelli
Philippe Chassignet
Van Du Tran
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

International audienceA dynamic programming algorithm is proposed to predict the structure of different families of proteins and is tested with the b-barrel transmembrane proteins.Un algorithme est proposé qui permet, par programmation dynamique, de prédire la strucutre de différentes familles de protéines. Il est testé sur les proteeines transmembranaires (beta)

HAL-CentraleSupelec

Springer - Publisher Connector

INRIA a CCSD electronic archive server

arXiv.org e-Print Archive

HAL-Polytechnique

10 simple rules to create a serious game, illustrated with examples from structural biology

Author: A Kawrykow
Antoine Taly
AO O’Hagan
AW Woolley
B M Good
D Centola
D Djaouti
D Kwak
D Michael
E Law
F Khatib
G McGill
GG Graham
H Jenkins
H Sauermann
HM Bik
I Iacovides
J Alvarez
J Belanich
J Franco
J Himmelstein
J Lee
J Lorenz
J Moult
JA Evans
JP Gee
JS Kim
Jérôme Waldispühl
KN Laland
L Mazzanti
M Gilski
Marc Baaden
N Ferey
N Férey
N Prestopnik
Nicolas Ferey
Olivier Delalande
R Das
R Das
R Follett
R McDaniel
RJ Ellis
S Cooper
S Cooper
S Doutreligne
S Horowitz
Samuela Pasquali
Scott Markel
SI O'Donoghue
V Curtis
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/03/2018
Field of study

Serious scientific games are games whose purpose is not only fun. In the field of science, the serious goals include crucial activities for scientists: outreach, teaching and research. The number of serious games is increasing rapidly, in particular citizen science games, games that allow people to produce and/or analyze scientific data. Interestingly, it is possible to build a set of rules providing a guideline to create or improve serious games. We present arguments gathered from our own experience ( Phylo , DocMolecules , HiRE-RNA contest and Pangu) as well as examples from the growing literature on scientific serious games

arXiv.org e-Print Archive

HAL Descartes

Hal-Diderot

HAL-Rennes 1

A Combinatorial Framework for Designing (Pseudoknotted) RNA Algorithms

We extend an hypergraph representation, introduced by Finkelstein and Roytberg, to unify dynamic programming algorithms in the context of RNA folding with pseudoknots. Classic applications of RNA dynamic programming energy minimization, partition function, base-pair probabilities...) are reformulated within this framework, giving rise to very simple algorithms. This reformulation allows one to conceptually detach the conformation space/energy model -- captured by the hypergraph model -- from the specific application, assuming unambiguity of the decomposition. To ensure the latter property, we propose a new combinatorial methodology based on generating functions. We extend the set of generic applications by proposing an exact algorithm for extracting generalized moments in weighted distribution, generalizing a prior contribution by Miklos and al. Finally, we illustrate our full-fledged programme on three exemplary conformation spaces (secondary structures, Akutsu's simple type pseudoknots and kissing hairpins). This readily gives sets of algorithms that are either novel or have complexity comparable to classic implementations for minimization and Boltzmann ensemble applications of dynamic programming

HAL-CentraleSupelec

CiteSeerX

INRIA a CCSD electronic archive server

Hal-Diderot

HAL-Polytechnique

HAL-Rennes 1

Efficient Algorithms for Probing the RNA Mutation Landscape

Author: A Coventry
A Omer
A Serganov
AO Harmanci
B Baker
B Knudsen
Bonnie Berger
C Reidys
C Thurner
Consortium ENCODE Project
D Barash
D Mathews
DH Mathews
E Rivas
I Hofacker
I Hofacker
I Miklos
IL Hofacker
IM Meyer
J Waldispuhl
J Waldispuhl
JS McCaskill
JS Pedersen
JS Weinger
Jérôme Waldispühl
M Yanagi
M Yang
M Zuker
M Zuker
MC Cowperthwaite
MC Cowperthwaite
MT Cheah
NM Cuceanu
P Clote
P Schuster
P Schuster
Peter Clote
PP Gardner
R Nussinov
RA Dimitrov
RD Dowell
S Brown
S Griffiths-Jones
S Griffiths-Jones
S You
SH Bernhart
Srinivas Devadas
T Kulinski
T Xia
Uwe Ohler
V Ambros
W Fontana
W Grüner
W Shu
Y Ding
Y Ding
Y Ding
Y Ponty
Publication venue: Public Library of Science
Publication date: 08/08/2008
Field of study

The diversity and importance of the role played by RNAs in the regulation and development of the cell are now well-known and well-documented. This broad range of functions is achieved through specific structures that have been (presumably) optimized through evolution. State-of-the-art methods, such as McCaskill's algorithm, use a statistical mechanics framework based on the computation of the partition function over the canonical ensemble of all possible secondary structures on a given sequence. Although secondary structure predictions from thermodynamics-based algorithms are not as accurate as methods employing comparative genomics, the former methods are the only available tools to investigate novel RNAs, such as the many RNAs of unknown function recently reported by the ENCODE consortium. In this paper, we generalize the McCaskill partition function algorithm to sum over the grand canonical ensemble of all secondary structures of all mutants of the given sequence. Specifically, our new program, RNAmutants, simultaneously computes for each integer k the minimum free energy structure MFE(k) and the partition function Z(k) over all secondary structures of all k-point mutants, even allowing the user to specify certain positions required not to mutate and certain positions required to base-pair or remain unpaired. This technically important extension allows us to study the resilience of an RNA molecule to pointwise mutations. By computing the mutation profile of a sequence, a novel graphical representation of the mutational tendency of nucleotide positions, we analyze the deleterious nature of mutating specific nucleotide positions or groups of positions. We have successfully applied RNAmutants to investigate deleterious mutations (mutations that radically modify the secondary structure) in the Hepatitis C virus cis-acting replication element and to evaluate the evolutionary pressure applied on different regions of the HIV trans-activation response element. In particular, we show qualitative agreement between published Hepatitis C and HIV experimental mutagenesis studies and our analysis of deleterious mutations using RNAmutants. Our work also predicts other deleterious mutations, which could be verified experimentally. Finally, we provide evidence that the 3′ UTR of the GB RNA virus C has been optimized to preserve evolutionarily conserved stem regions from a deleterious effect of pointwise mutations. We hope that there will be long-term potential applications of RNAmutants in de novo RNA design and drug design against RNA viruses. This work also suggests potential applications for large-scale exploration of the RNA sequence-structure network. Binary distributions are available at http://RNAmutants.csail.mit.edu/

Public Library of Science (PLOS)

Public Library of Science (PLOS)

Phylo: A Citizen Science Approach for Improving Multiple Sequence Alignment

Author: A Löytynoja
A Löytynoja
A Siepel
AB Diallo
AJ Westphal
Alexander Kawrykow
Alfred Kam
AM Waterhouse
B Knudsen
B Paten
BN Chorley
C Notredame
Chu Wu
Clarence Leung
D Sankoff
Daniel Kwak
E Korpela
Eleyine Zarour
F Khatib
Gary Roumanis
GG Loots
J Amberger
JS Pedersen
Jérôme Waldispühl
K Land
K Lindblad-Toh
L Chindelevitch
L von Ahn
L von Ahn
L Wang
Luis Sarmenta
M Blanchette
M Blanchette
M Blanchette
M Brudno
M Gouy
M Kellis
M Shirts
Mathieu Blanchette
N Bray
PA Fujita
Pawel Michalak
PC Ng
S Cooper
S De
S Schwartz
SB Needleman
T Jiang
W Fletcher
W Miller
WM Fitch
Publication venue: Public Library of Science
Publication date: 07/03/2012
Field of study

BACKGROUND: Comparative genomics, or the study of the relationships of genome structure and function across different species, offers a powerful tool for studying evolution, annotating genomes, and understanding the causes of various genetic disorders. However, aligning multiple sequences of DNA, an essential intermediate step for most types of analyses, is a difficult computational task. In parallel, citizen science, an approach that takes advantage of the fact that the human brain is exquisitely tuned to solving specific types of problems, is becoming increasingly popular. There, instances of hard computational problems are dispatched to a crowd of non-expert human game players and solutions are sent back to a central server. METHODOLOGY/PRINCIPAL FINDINGS: We introduce Phylo, a human-based computing framework applying "crowd sourcing" techniques to solve the Multiple Sequence Alignment (MSA) problem. The key idea of Phylo is to convert the MSA problem into a casual game that can be played by ordinary web users with a minimal prior knowledge of the biological context. We applied this strategy to improve the alignment of the promoters of disease-related genes from up to 44 vertebrate species. Since the launch in November 2010, we received more than 350,000 solutions submitted from more than 12,000 registered users. Our results show that solutions submitted contributed to improving the accuracy of up to 70% of the alignment blocks considered. CONCLUSIONS/SIGNIFICANCE: We demonstrate that, combined with classical algorithms, crowd computing techniques can be successfully used to help improving the accuracy of MSA. More importantly, we show that an NP-hard computational problem can be embedded in casual game that can be easily played by people without significant scientific training. This suggests that citizen science approaches can be used to exploit the billions of "human-brain peta-flops" of computation that are spent every day playing games. Phylo is available at: http://phylo.cs.mcgill.ca

Reconstruction of ancestral RNA sequences under multiple structural constraints

Author: A Srivastava
AR Gruber
B Knudsen
B Paten
C Bustamante
CA Nasrallah
D Sankoff
DA Sorescu
E Rogers
EP Nawrocki
F Vogel
FK de Boer
J Stombaugh
JH Urban
JJ Gillespie
JL Knies
Jérôme Waldispühl
K Robert
KM Kutchko
L Pauling
LJ Jerome
M Blanchette
MP Hoeppner
O Tremblay-Savard
Olivier Tremblay-Savard
P Schuster
PG Higgs
PP Gardner
PS Klosterman
R Lorenz
RB Lyngsø
RK Bradley
RR Gutell
SR Eddy
T Biesen
V Reinharz
Vladimir Reinharz
WM Fitch
Y Chen
Z Yang
Z Yao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Accurate classification of RNA structures using topological fingerprints

While RNAs are well known to possess complex structures, functionally similar RNAs often have little sequence similarity. While the exact size and spacing of base-paired regions vary, functionally similar RNAs have pronounced similarity in the arrangement, or topology, of base-paired stems. Furthermore, predicted RNA structures often lack pseudoknots (a crucial aspect of biological activity), and are only partially correct, or incomplete. A topological approach addresses all of these difficulties. In this work we describe each RNA structure as a graph that can be converted to a topological spectrum (RNA fingerprint). The set of subgraphs in an RNA structure, its RNA fingerprint, can be compared with the fingerprints of other RNA structures to identify and correctly classify functionally related RNAs. Topologically similar RNAs can be identified even when a large fraction, up to 30%, of the stems are omitted, indicating that highly accurate structures are not necessary. We investigate the performance of the RNA fingerprint approach on a set of eight highly curated RNA families, with diverse sizes and functions, containing pseudoknots, and with little sequence similarity–an especially difficult test set. In spite of the difficult test set, the RNA fingerprint approach is very successful (ROC AUC \u3e 0.95). Due to the inclusion of pseudoknots, the RNA fingerprint approach both covers a wider range of possible structures than methods based only on secondary structure, and its tolerance for incomplete structures suggests that it can be applied even to predicted structures. Source code is freely available at https://github.rcac.purdue.edu/mgribsko/XIOS_RNA_fingerprint