13 research outputs found

Correlation between nucleotide composition and folding energy of coding sequences with special attention to wobble bases

Author: AA Komar
C Kimchi-Sarfaty
C Soares
C Workman
DE Draper
DJ Patterson
EG Shaper
EG Shpaer
F Pagani
GD Stormo
IM Meyer
J Duan
J Konecny
Jan C Biro
JC Biro
JD Lieb
JR Powell
JV Chamary
K Mita
K Robison
KB Nielsen
L Cartegni
L Katz
M Jia
M Kellis
M Oresic
M Zama
M Zuker
MD Ermolaeva
NR Markham
P Cortazzo
P Mukhopadhyay
S Itzkovitz
SA Shabalina
W Gu
W Seffens
WC Winkler
ZE Sauna
Publication date: 01/07/2008

Background: The secondary structure and complexity of mRNA influence its accessibility to regulatory molecules (proteins, micro-RNAs), its stability, and its level of expression. The mobile elements of the RNA sequence, the wobble bases, are expected to regulate the formation of structures encompassing coding sequences. Results: The sequence/folding energy (FE) relationship was studied by statistical and bioinformatic methods in 90 CDS containing 26,370 codons. I found that the FE (dG) associated with coding sequences is significant and negative (407 kcal/1000 bases, mean +/- S.E.M.), indicating that these sequences are able to form structures. However, the FE has only a small free component, less than 10% of the total. The contribution of the 1st and 3rd codon bases to the FE is larger than the contribution of the 2nd (central) bases. It is possible to achieve a ~4-fold change in FE by altering the wobble bases in synonymous codons. The sequence/FE relationship can be described with a simple algorithm, and the total FE can be predicted solely from the sequence composition of the nucleic acid. The contributions of different synonymous codons to the FE are additive and one codon cannot replace another. The accumulated contribution of the synonymous codons of an amino acid to the total folding energy of an mRNA is strongly correlated with the relative amount of that amino acid in the translated protein. Conclusion: Synonymous codons are not interchangeable with regard to their role in determining the mRNA FE and the relative amounts of amino acids in the translated protein, even if they are indistinguishable with respect to amino acid coding. Comment: 14 pages including 6 figures and 1 table.

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

PubMed Central

Gnare: Automated System For High-Throughput Genome Analysis With Grid Computational Backend

Author: A Bateman
Alex Rodriguez
Altschul
AP Burgard
CH Wu
Dinanath Sulakhe
EG Shpaer
FM Pearl
I Foster
Ian Foster
L Lo Conte
Mark D'Souza
Michael Wilde
Natalia Maltsev
NJ Mulder
Overbeek Ross
S Henikoff
SF Altschul
T Ideker
Veronika Nefedova
W Allcock
WR Pearson
Publication venue: Springer Science and Business Media LLC

Crossref

A comparative genome-wide study of ncRNAs in trypanosomatids

Background: Recent studies have provided extensive evidence for multitudes of non-coding RNA (ncRNA) transcripts in a wide range of eukaryotic genomes. ncRNAs are emerging as key players in multiple layers of cellular regulation. With the availability of many whole genome sequences, comparative analysis has become a powerful tool to identify ncRNA molecules. In this study, we performed a systematic genome-wide in silico screen to search for novel small ncRNAs in the genome of Trypanosoma brucei using techniques of comparative genomics. Results: We identified by comparative genomics, and validated by experimental analysis, several novel ncRNAs that are conserved across multiple trypanosomatid genomes. When tested on known ncRNAs, our procedure was capable of finding almost half of the known repertoire through homology over six genomes, and about two-thirds of the known sequences were found in at least four genomes. After filtering, 72 conserved unannotated sequences in at least four genomes were found, 29 of which, ranging in size from 30 to 392 nts, were conserved in all six genomes. Fifty of the 72 candidates in the final set were chosen for experimental validation. Eighteen of the 50 (36%) were shown to be expressed, and for 11 of them a distinct expression product was detected, suggesting that they are short ncRNAs. Using functional experimental assays, five of the candidates were shown to be novel H/ACA and C/D snoRNAs; these included three sequences that appear as singletons in the genome, unlike previously identified snoRNA molecules that are found in clusters. The other candidates appear to be novel ncRNA molecules, and their function is, as yet, unknown. Conclusions: Using comparative genomic techniques, we predicted 72 sequences as ncRNA candidates in T. brucei. The expression of 50 candidates was tested in laboratory experiments. This resulted in the discovery of 11 novel short ncRNAs in procyclic stage T. brucei, which have homologues in the other trypanosomatids. A few of these molecules are snoRNAs, but most of them are novel ncRNA molecules. Based on this study, our analysis suggests that the total number of ncRNAs in trypanosomatids is in the range of several hundred.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Coalescent estimates of HIV-1 generation time in vivo.

Author: Brojatsch J
Delwart EL
Gallo MV
Hirsch MS
Iversen AK
Mullins JI
Rodrigo AG
Shpaer EG
Walker BD
Publication date: 01/01/1999

The generation time of HIV Type 1 (HIV-1) in vivo has previously been estimated using a mathematical model of viral dynamics and was found to be on the order of one to two days per generation. Here, we describe a new method based on coalescence theory that allows the estimate of generation times to be derived by using nucleotide sequence data and a reconstructed genealogy of sequences obtained over time. The method is applied to sequences obtained from a long-term nonprogressing individual at five sampling occasions. The estimate of viral generation time using the coalescent method is 1.2 days per generation and is close to that obtained by mathematical modeling (1.8 days per generation), thus strengthening confidence in estimates of a short viral generation time. Apart from the estimation of relevant parameters relating to viral dynamics, coalescent modeling also allows us to simulate the evolutionary behavior of samples of sequences obtained over time.

Oxford University Research Archive

SyntTax: a web server linking synteny to prokaryotic taxonomy

Author: A Dereeper
A Despalins
A Hecker
B El Yacoubi
B Snel
C Deutsch
C Fong
CE Martinez-Guerrero
D Szklarczyk
DY Mao
E Lerat
E Passarge
EG Shpaer
EM Marcotte
EW Sayers
I Grin
J Oberto
JA Kiel
Jacques Oberto
JH Renwick
JI Handford
K Isono
M Srinivasan
MY Galperin
S Federhen
SF Altschul
VR Pejaver
WR Pearson
YP Denielou
Publication venue: BMC
Publication date: 01/01/2013

Background: The study of the conservation of gene order, or synteny, constitutes a powerful methodology to assess the orthology of genomic regions and to predict functional relationships between genes. The exponential growth of microbial genomic databases is expected to improve synteny predictions significantly. Paradoxically, this plethora of genomic data, without information on the relatedness of organisms, could impair the performance of synteny analysis programs. Results: In this work, I present SyntTax, a synteny web service designed to take full advantage of the large number of archaeal and bacterial genomes by linking them through taxonomic relationships. SyntTax incorporates a full hierarchical taxonomic tree allowing intuitive access to all completely sequenced prokaryotes. Single or multiple organisms can be chosen on the basis of their lineage by selecting the corresponding rank nodes in the tree. The synteny methodology is built upon our previously described Absynte algorithm with several additional improvements. Conclusions: SyntTax aims to produce robust syntenies by providing prompt access to the taxonomic relationships connecting all completely sequenced microbial genomes. The reduction in redundancy offered by lineage selection increases accuracy while reducing computation time. This web tool was used to successfully resolve several conserved complex gene clusters described in the literature. In addition, particular features of SyntTax permit the confirmation of the involvement of the four components constituting the E. coli YgjD multiprotein complex responsible for tRNA modification. By analyzing the clustering evolution of alternative gene fusions, new proteins potentially interacting with this complex could be proposed. The web service is available at http://archaea.u-psud.fr/SyntTax.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals