Search CORE

166 research outputs found

Nucleic Acids Res

Author: Gautheret D. (D)
Jossinet F. (Fabrice)
Lehmann J. (J)
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2013
Field of study

The structure and function of conserved motifs constituting the apex of Stem I in T-box mRNA leaders are investigated. We point out that this apex shares striking similarities with the L1 stalk (helices 76-78) of the ribosome. A sequence and structure analysis of both elements shows that, similarly to the head of the L1 stalk, the function of the apex of Stem I lies in the docking of tRNA through a stacking interaction with the conserved G19:C56 base pair platform. The inferred structure in the apex of Stem I consists of a module of two T-loops bound together head to tail, a module that is also present in the head of the L1 stalk, but went unnoticed. Supporting the analysis, we show that a highly conserved structure in RNAse P formerly described as the J11/12-J12/11 module, which is precisely known to bind the elbow of tRNA, constitutes a third instance of this T-loop module. A structural analysis explains why six nucleotides constituting the core of this module are highly invariant among all three types of RNA. Our finding that major RNA partners of tRNA bind the elbow with a same RNA structure suggests an explanation for the origin of the tRNA L-shape

univOAK

Sequence determinants in human polyadenylation site selection

Author: A Moreira
C Burge
D Gautheret
D Zarkower
DF Colgan
E Beaudoing
E Beaudoing
F Chen
G Edwalds-Gilbert
G Pesole
J Zhao
JE Tabaska
N Proudfoot
RV Davuluri
S Brackenridge
Y Aissouni
ZF Chou
Publication venue: BioMed Central
Publication date: 01/01/2003
Field of study

BACKGROUND: Differential polyadenylation is a widespread mechanism in higher eukaryotes producing mRNAs with different 3' ends in different contexts. This involves several alternative polyadenylation sites in the 3' UTR, each with its specific strength. Here, we analyze the vicinity of human polyadenylation signals in search of patterns that would help discriminate strong and weak polyadenylation sites, or true sites from randomly occurring signals. RESULTS: We used human genomic sequences to retrieve the region downstream of polyadenylation signals, usually absent from cDNA or mRNA databases. Analyzing 4956 EST-validated polyadenylation sites and their -300/+300 nt flanking regions, we clearly visualized the upstream (USE) and downstream (DSE) sequence elements, both characterized by U-rich (not GU-rich) segments. The presence of a USE and a DSE is the main feature distinguishing true polyadenylation sites from randomly occurring A(A/U)UAAA hexamers. While USEs are indifferently associated with strong and weak poly(A) sites, DSEs are more conspicuous near strong poly(A) sites. We then used the region encompassing the hexamer and DSE as a training set for poly(A) site identification by the ERPIN program and achieved a prediction specificity of 69 to 85% for a sensitivity of 56%. CONCLUSION: The availability of complete genomes and large EST sequence databases now permit large-scale observation of polyadenylation sites. Both U-rich sequences flanking both sides of poly(A) signals contribute to the definition of "true" sites. However, the downstream U-rich sequences may also play an enhancing role. Based on this information, poly(A) site prediction accuracy was moderately but consistently improved compared to the best previously available algorithm

Crossref

HAL AMU

Springer - Publisher Connector

Directory of Open Access Journals

HAL-Inserm

PubMed Central

Predicting RNA secondary structure by the comparative approach: how to select the homologous sequences

Author: A Lescoute
AM Rosenblad
C Papanicolaou
C Woese
C Zwieb
C Zwieb
D Chiu
D Gautheret
D Mathews
D Matthews
D Sankoff
E Bindewald
F Rousset
F Tahi
F Tahi
Fariza Tahi
I Hofacker
J Brown
K Han
K Horimoto
L Vawter
M Szymanski
M Zuker
N Savill
O Perriquet
P Baldi
P Doty
P Higgs
PP Gardner
R Nussinov
RJ Klein
RR Gutell
S Freier
S Lindgreen
Stéfan Engelen
WC Curtis
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background The secondary structure of an RNA must be known before the relationship between its structure and function can be determined. One way to predict the secondary structure of an RNA is to identify covarying residues that maintain the pairings (Watson-Crick, Wobble and non-canonical pairings). This "comparative approach" consists of identifying mutations from homologous sequence alignments. The sequences must covary enough for compensatory mutations to be revealed, but comparison is difficult if they are too different. Thus the choice of homologous sequences is critical. While many possible combinations of homologous sequences may be used for prediction, only a few will give good structure predictions. This can be due to poor quality alignment in stems or to the variability of certain sequences. This problem of sequence selection is currently unsolved. Results This paper describes an algorithm, <it>SSCA</it>, which measures the suitability of sequences for the comparative approach. It is based on evolutionary models with structure constraints, particularly those on sequence variations and stem alignment. We propose three models, based on different constraints on sequence alignments. We show the results of the <it>SSCA </it>algorithm for predicting the secondary structure of several RNAs. <it>SSCA </it>enabled us to choose sets of homologous sequences that gave better predictions than arbitrarily chosen sets of homologous sequences. Conclusion <it>SSCA </it>is an algorithm for selecting combinations of RNA homologous sequences suitable for secondary structure predictions with the comparative approach.</p

HAL Evry

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Complete chloroplast genome sequence of Holoparasite Cistanche Deserticola (Orobanchaceae) reveals gene loss and horizontal gene transfer from Its host Haloxylon Ammodendron (Chenopodiaceae)

Author: AD Wolfe
AD Wolfe
AD Wolfe
B Bremer
B Heinze
CC Davis
D Gautheret
D Grivet
D Laslett
D Posada
DH Kim
E Delannoy
F Ronquist
GC Conant
H Akaike
H Shimda
HT Funk
J Carlsson
J Vogel
JD Thompson
JH Westwood
Jianqiang Li
Jiayuan Zhao
JM Park
JN Timmis
JP Mower
JR Bennett
JR McNeal
K Krause
K Krause
K Tobe
KD Pruitt
KH Wolfe
KH Wolfe
LE Olson
LW Lin
M Koulintchenko
M Martín
M Woloszynska
M. James C Crabbe
Masami Hasegawa
MD Logacheva
Meng-xiang Sun
MM Guisinger
MW Gray
MW Gray
NJ Wickett
P Librado
PJ Keeling
PR Haddrill
Qin Qiao
R Bock
R Zoschke
RC Haberle
RG Olmstead
SK Wyman
T Wakasugi
TA Hall
Takahiro Yonezawa
Ti-Cao Zhang
TJ Barkman
U Bergthorsson
V Quiñones
V Shedge
WR Hess
X Wang
Xi Li
Y Asakura
Yang Zhong
Z Cai
Zhumei Ren
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

The central function of chloroplasts is to carry out photosynthesis, and its gene content and structure are highly conserved across land plants. Parasitic plants, which have reduced photosynthetic ability, suffer gene losses from the chloroplast (cp) genome accompanied by the relaxation of selective constraints. Compared with the rapid rise in the number of cp genome sequences of photosynthetic organisms, there are limited data sets from parasitic plants. The authors report the complete sequence of the cp genome of Cistanche deserticola, a holoparasitic desert species belonging to the family Orobanchaceae

Public Library of Science (PLOS)

Crossref

PubMed Central

University of Bedfordshire Repository

Phylogenetic Analysis of the Complete Mitochondrial Genome of Madurella mycetomatis Confirms Its Taxonomic Position within the Order Sordariales

Author: A Ahmed
A Ahmed
AD van Diepeningen
AO Ahmed
AO Ahmed
AO Ahmed
AO Ahmed
AR Kubelik
B Paquin
BF Lang
CE Bullerwell
CW Basse
D Gautheret
D Laslett
D Laslett
DJ Cummings
DJ Cummings
DJ Jacobson
DR Edgell
E Prochazka
F Foury
F Michel
GS de Hoog
GS de Hoog
J Kleidon
J Sethuraman
JW Ballard
JW Taylor
K Tamura
M Nowrousian
M Paoletti
MA Cardoso
ME Silliker
MJ Laforest
N Zhang
P Schattner
PC Woo
PV Pramateftaki
PV Pramateftaki
RA Collins
S Amlacher
SF Torriani
T Sekito
TM Lowe
Vishnu Chaturvedi
VN Kouvelis
Wendy W. J. van de Sande
WWJ van de Sande
WWJ van de Sande
Y Wu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Background: Madurella mycetomatis is the most common cause of human eumycetoma. The genus Madurella has been characterized by overall sterility on mycological media. Due to this sterility and the absence of other reliable morphological and ultrastructural characters, the taxonomic classification of Madurella has long been a challenge. Mitochondria are of monophyletic origin and mitochondrial genomes have been proven to be useful in phylogenetic analyses. Results: The first complete mitochondrial DNA genome of a mycetoma-causative agent was sequenced using 454 sequencing. The mitochondrial genome of M. mycetomatis is a circular DNA molecule with a size of 45,590 bp, encoding for the small and the large subunit rRNAs, 27 tRNAs, 11 genes encoding subunits of respiratory chain complexes, 2 ATP synthase subunits, 5 hypothetical proteins, 6 intronic proteins including the ribosomal protein rps3. In phylogenetic analyses using amino acid sequences of the proteins involved in respiratory chain complexes and the 2 ATP synthases it appeared that M. mycetomatis clustered together with members of the order Sordariales and that it was most closely related to Chaetomium thermophilum. Analyses of the gene order showed that within the order Sordariales a similar gene order is found. Furthermore also the tRNA order seemed mostly conserved. Conclusion: Phylogenetic analyses of fungal mitochondrial genomes confirmed that M. mycetomatis belongs to the order of Sordariales and that it was most closely related to Chaetomium thermophilum, with which it also shared a comparable gene and tRNA order

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

EUR Research Repository

Erasmus University Digital Repository

RNAcentral: A vision for an international database of RNA sequences

Author: Agrawal Shipra
Bateman Alex
Birney Ewan
Bruford Elspeth A
Bujnicki Janusz M
Cochrane Guy
Cole James R
Dinger Marcel E
Enright Anton J
Gardner Paul P
Gautheret Daniel
Griffiths-Jones Sam
Harrow Jen
Herrero Javier
Holmes Ian H
Huang Hsien-Da
Kelly Krystyna A
Kersey Paul
Kozomara Ana
Lowe Todd M
Marz Manja
Moxon Simon
Pruitt Kim D
Samuelsson Tore
Stadler Peter F
Vilella Albert J
Vogel Jan-Hinnerk
Williams Kelly P
Wright Mathew W
Zwieb Christian
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 23/09/2011
Field of study

During the last decade there has been a great increase in the number of noncoding RNA genes identified, including new classes such as microRNAs and piRNAs. There is also a large growth in the amount of experimental characterization of these RNA components. Despite this growth in information, it is still difficult for researchers to access RNA data, because key data resources for noncoding RNAs have not yet been created. The most pressing omission is the lack of a comprehensive RNA sequence database, much like UniProt, which provides a comprehensive set of protein knowledge. In this article we propose the creation of a new open public resource that we term RNAcentral, which will contain a comprehensive collection of RNA sequences and fill an important gap in the provision of biomedical databases. We envision RNA researchers from all over the world joining a federated RNAcentral network, contributing specialized knowledge and databases. RNAcentral would centralize key data that are currently held across a variety of databases, allowing researchers instant access to a single, unified resource. This resource would facilitate the next generation of RNA research and help drive further discoveries, including those that improve food production and human and animal health. We encourage additional RNA database resources and research groups to join this effort. We aim to obtain international network funding to further this endeavor

Crossref

UCL Discovery

PubMed Central

The University of Manchester - Institutional Repository

University of East Anglia digital repository

Effects of Restrained Sampling Space and Nonplanar Amino Groups on Free-Energy Predictions for RNA with Imino and Sheared Tandem GA Base Pairs Flanked by GC, CG, iGiC or iCiG Base Pairs

Guanine-adenine (GA) base pairs play important roles in determining the structure, dynamics, and stability of RNA. In RNA internal loops, GA base pairs often occur in tandem arrangements and their structure is context and sequence dependent. Calculations reported here test the thermodynamic integration (TI) approach with the amber99 force field by comparing computational predictions of free energy differences with the free energy differences expected on the basis of NMR determined structures of the RNA motifs (5′-GCGGACGC-3′)2, (5′-GCiGGAiCGC-3′)2, (5′-GGCGAGCC-3′)2, and (5′-GGiCGAiGCC-3′)2. Here, iG and iC denote isoguanosine and isocytidine, which have amino and carbonyl groups transposed relative to guanosine and cytidine. The NMR structures show that the GA base pairs adopt either imino (cis Watson−Crick/Watson−Crick A-G) or sheared (trans Hoogsteen/Sugar edge A-G) conformations depending on the identity and orientation of the adjacent base pair. A new mixing function for the TI method is developed that allows alchemical transitions in which atoms can disappear in both the initial and final states. Unrestrained calculations gave ΔG° values 2−4 kcal/mol different from expectations based on NMR data. Restraining the structures with hydrogen bond restraints did not improve the predictions. Agreement with NMR data was improved by 0.7 to 1.5 kcal/mol, however, when structures were restrained with weak positional restraints to sample around the experimentally determined NMR structures. The amber99 force field was modified to partially include pyramidalization effects of the unpaired amino group of guanosine in imino GA base pairs. This provided little or no improvement in comparisons with experiment. The marginal improvement is observed when the structure has potential cross-strand out-of-plane hydrogen bonding with the G amino group. The calculations using positional restraints and a nonplanar amino group reproduce the signs of ΔG° from the experimental results and are, thus, capable of providing useful qualitative insights complementing the NMR experiments. Decomposition of the terms in the calculations reveals that the dominant terms are from electrostatic and interstrand interactions other than hydrogen bonds in the base pairs. The results suggest that a better description of the backbone is key to reproducing the experimental free energy results with computational free energy predictions

Crossref

PubMed Central

nocoRNAc: Characterization of non-coding RNAs in prokaryotes

Author: A Busch
A Hüttenhofer
A Muffler
A Rodríguez-García
A Sittka
A Zhang
AC Darling
Alexander Herbig
AR Gruber
AV Uzilov
B Tjaden
B Voss
B Xiao
C Barrandon
C Pichon
C Pichon
CJ Benham
CM Sharma
CM Sharma
D D'Alia
D Gautheret
DD Sledjeski
E Rivas
EP Nawrocki
F Battke
F Battke
F Repoila
G Storz
H Wang
H Wang
HH Tseng
I Irnov
J Bode
J Livny
J Pánek
J Schlüter
J Vogel
JP Swiercz
JS Pedersen
K Nieselt
Kay Nieselt
LF Abu-Qatouseh
M Albrecht
M Giangrossi
N Yachie
P Saetrom
R Development Core Team
R Gentleman
S Altuvia
S Brantl
S Washietl
SD Bentley
SR Eddy
T Geissmann
TM Lowe
TT Tran
X Wang
Z Polonskaya
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The interest in non-coding RNAs (ncRNAs) constantly rose during the past few years because of the wide spectrum of biological processes in which they are involved. This led to the discovery of numerous ncRNA genes across many species. However, for most organisms the non-coding transcriptome still remains unexplored to a great extent. Various experimental techniques for the identification of ncRNA transcripts are available, but as these methods are costly and time-consuming, there is a need for computational methods that allow the detection of functional RNAs in complete genomes in order to suggest elements for further experiments. Several programs for the genome-wide prediction of functional RNAs have been developed but most of them predict a genomic locus with no indication whether the element is transcribed or not. Results We present <smcaps>NOCO</smcaps>RNAc, a program for the genome-wide prediction of ncRNA transcripts in bacteria. <smcaps>NOCO</smcaps>RNAc incorporates various procedures for the detection of transcriptional features which are then integrated with functional ncRNA loci to determine the transcript coordinates. We applied RNAz and <smcaps>NOCO</smcaps>RNAc to the genome of <it>Streptomyces coelicolor </it>and detected more than 800 putative ncRNA transcripts most of them located antisense to protein-coding regions. Using a custom design microarray we profiled the expression of about 400 of these elements and found more than 300 to be transcribed, 38 of them are predicted novel ncRNA genes in intergenic regions. The expression patterns of many ncRNAs are similarly complex as those of the protein-coding genes, in particular many antisense ncRNAs show a high expression correlation with their protein-coding partner. Conclusions We have developed <smcaps>NOCO</smcaps>RNAc, a framework that facilitates the automated characterization of functional ncRNAs. <smcaps>NOCO</smcaps>RNAc increases the confidence of predicted ncRNA loci, especially if they contain transcribed ncRNAs. <smcaps>NOCO</smcaps>RNAc is not restricted to intergenic regions, but it is applicable to the prediction of ncRNA transcripts in whole microbial genomes. The software as well as a user guide and example data is available at <url>http://www.zbit.uni-tuebingen.de/pas/nocornac.htm</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Structural Constraints Identified with Covariation Analysis in Ribosomal RNA

Covariation analysis is used to identify those positions with similar patterns of sequence variation in an alignment of RNA sequences. These constraints on the evolution of two positions are usually associated with a base pair in a helix. While mutual information (MI) has been used to accurately predict an RNA secondary structure and a few of its tertiary interactions, early studies revealed that phylogenetic event counting methods are more sensitive and provide extra confidence in the prediction of base pairs. We developed a novel and powerful phylogenetic events counting method (PEC) for quantifying positional covariation with the Gutell lab’s new RNA Comparative Analysis Database (rCAD). The PEC and MI-based methods each identify unique base pairs, and jointly identify many other base pairs. In total, both methods in combination with an N-best and helix-extension strategy identify the maximal number of base pairs. While covariation methods have effectively and accurately predicted RNAs secondary structure, only a few tertiary structure base pairs have been identified. Analysis presented herein and at the Gutell lab’s Comparative RNA Web (CRW) Site reveal that the majority of these latter base pairs do not covary with one another. However, covariation analysis does reveal a weaker although significant covariation between sets of nucleotides that are in proximity in the three-dimensional RNA structure. This reveals that covariation analysis identifies other types of structural constraints beyond the two nucleotides that form a base pair

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks

FigShare