
Springer - Publisher Connector

The reach of the genome signature in prokaryotes

Author: Bart Aldert
Boekhout Teun
Kuramae Eiko E
Luyf Angela CM
van Passel Mark WJ
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: With the increased availability of sequenced genomes there have been several initiatives to infer evolutionary relationships by whole genome characteristics. One of these studies suggested good congruence between genome synteny, shared gene content, 16S ribosomal DNA identity, codon usage and the genome signature in prokaryotes. Here we rigorously test the phylogenetic signal of the genome signature, which consists of the genome-specific relative frequencies of dinucleotides, on 334 sequenced prokaryotic genomes. RESULTS: Intrageneric comparisons show that in general the genomic dissimilarity scores are higher than in intraspecific comparisons, in accordance with the suggested phylogenetic signal of the genome signature. Exceptions to this trend (Bartonella spp., Bordetella spp., Salmonella spp. and Yersinia spp.), which have low average intrageneric genomic dissimilarity scores, suggest that members of these genera might be considered the same species. On the other hand, high genomic dissimilarity values for intraspecific analyses suggest that in some cases (e.g. Prochlorococcus marinus, Pseudomonas fluorescens, Buchnera aphidicola and Rhodopseudomonas palustris) different strains from the same species may actually represent different species. Comparing 16S rDNA identity with genomic dissimilarity values corroborates the previously suggested trend in phylogenetic signal, although the dissimilarity values provide only low resolution. CONCLUSION: The genome signature has a distinct phylogenetic signal, independent of individual genetic marker genes. A reliable phylogenetic clustering cannot be based on dissimilarity values alone, as bootstrapping is not possible for this parameter. It can, however, be used to support or refute a given phylogeny and resulting taxonomy.
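The abstract above describes the genome signature as the genome-specific relative frequencies of dinucleotides and compares genomes by a dissimilarity score. A minimal Python sketch of one common formulation follows. The abstract does not spell out the exact formulas, so the odds ratio rho_xy = f_xy / (f_x * f_y), the pooling of both strands, and the mean absolute difference over the 16 dinucleotides are assumptions made for illustration, and the function names are hypothetical.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement; characters other than A, C, G, T pass through unchanged."""
    return seq.translate(COMPLEMENT)[::-1]


def genome_signature(seq):
    """Return {dinucleotide: rho} with rho_xy = f_xy / (f_x * f_y).

    Assumption: frequencies are pooled over both strands, so the result
    does not depend on which strand was sequenced.
    """
    seq = seq.upper()
    mono, di = Counter(), Counter()
    for strand in (seq, reverse_complement(seq)):
        for base in strand:
            if base in BASES:
                mono[base] += 1
        for i in range(len(strand) - 1):
            pair = strand[i:i + 2]
            # Skip dinucleotides containing ambiguous symbols such as N.
            if pair[0] in BASES and pair[1] in BASES:
                di[pair] += 1
    n_mono, n_di = sum(mono.values()), sum(di.values())
    if n_mono == 0 or n_di == 0:
        raise ValueError("sequence contains no usable dinucleotides")
    signature = {}
    for x, y in product(BASES, repeat=2):
        fx, fy = mono[x] / n_mono, mono[y] / n_mono
        fxy = di[x + y] / n_di
        # A base absent from the sequence gives rho = 0 by convention here.
        signature[x + y] = fxy / (fx * fy) if fx and fy else 0.0
    return signature


def genomic_dissimilarity(seq_a, seq_b):
    """Mean absolute difference between two signatures (assumed score)."""
    sig_a, sig_b = genome_signature(seq_a), genome_signature(seq_b)
    return sum(abs(sig_a[k] - sig_b[k]) for k in sig_a) / 16
```

Under these assumptions a sequence compared with itself scores 0, and larger scores indicate more divergent dinucleotide usage. Real genomes would be read from FASTA files instead of short literal strings.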

The University of Arizona

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Sequence composition similarities with the 7SL RNA are highly predictive of functional genomic features

Author: Alan Anderson
Bohne
Bowen
Bush
Capy
Carninci
Cohen
Corvelo
Crawford
Crawford
Dean
Dehnert
Delabesse
Fertil
Feschotte
Follows
Follows
Gross
GuhaThakurta
Heintzman
Hughes
Humphries
Jordan
Kamal
Karlin
Karlin
Keich
Kohany
Kramerov
Lander
Levine
Mahajan
Makalowski
Marino-Ramirez
Marino-Ramirez
Matsutani
Merriam
Mikkelsen
Molete
Nobrega
Papachatzopoulou
Pennacchio
Pennacchio
Pevzner
Polavarapu
Sorek
Steinberg
Tang
Visel
Visel
Walser
Wheelan
Wong
Yang
Yanick Paquet
Publication venue: Oxford University Press

Transposable elements derived from the 7SL RNA gene, such as Alu elements in primates, have had remarkable success in several mammalian lineages. The results presented here show a broad spectrum of functions for genomic segments that display sequence composition similarities with the 7SL RNA gene. Using thoroughly documented loci, we report that DNaseI-hypersensitive sites can be singled out in large genomic sequences by an assessment of sequence composition similarities with the 7SL RNA gene. We apply a root word frequency approach to illustrate a distinctive relationship between the sequence of the 7SL RNA gene and several classes of functional genomic features that are not presumed to be of transposable origin. Transposable elements that show noticeable similarities with the 7SL sequence include Alu sequences, as expected, but also long terminal repeats and the 5′-untranslated regions of long interspersed repetitive elements. In sequences masked for repeated elements, we find, when using the 7SL RNA gene as query sequence, distinctive similarities with promoters, exons and distal gene regulatory regions. Because the latter are notoriously the most difficult to detect, this approach may be useful for finding genomic segments that have regulatory functions and that may have escaped detection by existing methods.

arXiv.org e-Print Archive

Generalized Whittle-Matérn random field as a model of correlated fluctuations

Author: Adler R J
Anh V V
Benassi A
Bochner S
Chilès J-P
Coffey W T
Cressie N
de Luna X
Falconer K J
Gelfand I M
Gradshteyn I S
Guttorp P
Hilfer R
Kilbas A A
Kotz S
Kutner R
L P Teo
Lim S C
Lim S C Teo L P
Matheron G
Matérn B
Matérn B
Mecke K
Metzler R
Nelder J A
Ostrovskii I V
Percival D B
Pitt L D
Pitt L D
Porcu E
S C Lim
Samko S
Samko S
Samorodnitsky G
Shkarofsky I P
Stein M L
Stein M L
Tatarski V I
von Kármán T
Wackernagel H
West B J
West B J
Whittle P
Whittle P
Publication venue: IOP Publishing
Publication date: 22/01/2009

This paper considers a generalization of Gaussian random field with covariance function of Whittle-Matérn family. Such a random field can be obtained as the solution to the fractional stochastic differential equation with two fractional orders. Asymptotic properties of the covariance functions belonging to this generalized Whittle-Matérn family are studied, which are used to deduce the sample path properties of the random field. The Whittle-Matérn field has been widely used in modeling geostatistical data such as sea beam data, wind speed, field temperature and soil data. In this article we show that generalized Whittle-Matérn field provides a more flexible model for wind speed data.

Comment: 22 pages, 10 figures, accepted by Journal of Physics
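For orientation, the standard (non-generalized) Whittle-Matérn covariance that the abstract builds on is usually written as below. This is the textbook form, not the paper's generalized family with two fractional orders, which the abstract does not state. The symbols (variance \sigma^2, smoothness \nu, inverse length scale \kappa, dimension n) are the conventional ones and are assumed here.

```latex
% Standard Whittle-Matern covariance at lag distance r > 0,
% with K_\nu the modified Bessel function of the second kind:
C(r) = \sigma^{2}\,\frac{2^{1-\nu}}{\Gamma(\nu)}\,(\kappa r)^{\nu} K_{\nu}(\kappa r),
\qquad \nu > 0,\ \kappa > 0 .

% Corresponding spectral density in n dimensions (up to a constant factor):
S(\mathbf{k}) \propto \left(\kappa^{2} + |\mathbf{k}|^{2}\right)^{-(\nu + n/2)} .

% Special case \nu = 1/2 reduces to the exponential covariance:
C(r) = \sigma^{2} e^{-\kappa r} .
```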

CiteSeerX

Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes

Author: Gan Hin Hark
Quarta Giulio
Schlick Tamar
Xin Yurong
Publication venue: Libertas Academica
Publication date: 01/01/2008

Recent studies of mammalian transcriptomes have identified numerous RNA transcripts that do not code for proteins; their identity, however, is largely unknown. Here we explore an approach based on sequence randomness patterns to discern different RNA classes. The relative z-score we use helps distinguish the known ncRNA class from the genome, intergene and intron classes. This leads us to a fractional ncRNA measure of putative ncRNA datasets which we model as a mixture of genuine ncRNAs and other transcripts derived from genomic, intergenic and intronic sequences. We use this model to analyze six representative datasets identified by the FANTOM3 project and two computational approaches based on comparative analysis (RNAz and EvoFold). Our analysis suggests fewer ncRNAs than estimated by DNA sequencing and comparative analysis, but the validity of our approach and its predictions requires more extensive experimental RNA data.

CiteSeerX

Public Library of Science (PLOS)

Organization of Excitable Dynamics in Hierarchical Biological Networks

Author: A Arenas
A Arenas
A Arenas
A Roxin
AL Barabási
AL Barabási
AL Barabási
B Drossel
C Song
C Zhou
CC Hilgetag
CJ Honey
Claus C. Hilgetag
DJ Watts
E Oh
E Ravasz
E Ravasz
EM Izhikevich
GA Burns
H Jeong
HW Hethcote
I Graham
J Karbowski
JDJ Han
JG White
JL Vincent
JW Scannell
JW Scannell
K Stephan
KI Goh
LC Freeman
LC Freeman
LK Gallos
M Dehnert
M Girvan
M Kaiser
M Kaiser
M Müller-Linow
M Reigl
Marc-Thorsten Hütt
Mark Müller-Linow
ME Raichle
MEJ Newman
MEJ Newman
MP Young
MP Young
N Kashtan
NTJ Bailey
O Sporns
O Sporns
O Sporns
Olaf Sporns
P Bak
R Albert
R Guimerà
R Kötter
R Milo
R Salvador
R Salvador
RM Anderson
S Achard
TB Achacoso
U Alon
U Brandes
Y Moreno
Y Zheng
Publication venue: Public Library of Science
Publication date: 26/09/2008

This study investigates the contributions of network topology features to the dynamic behavior of hierarchically organized excitable networks. Representatives of different types of hierarchical networks as well as two biological neural networks are explored with a three-state model of node activation for systematically varying levels of random background network stimulation. The results demonstrate that two principal topological aspects of hierarchical networks, node centrality and network modularity, correlate with the network activity patterns at different levels of spontaneous network activation. The approach also shows that the dynamic behavior of the cerebral cortical systems network in the cat is dominated by the network's modular organization, while the activation behavior of the cellular neuronal network of Caenorhabditis elegans is strongly influenced by hub nodes. These findings indicate the interaction of multiple topological features and dynamic states in the function of complex biological networks.
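The three-state model of node activation with random background stimulation mentioned above can be sketched as a simple cellular automaton on a graph. The abstract does not give the update rules, so the ones below are assumptions: a susceptible node becomes excited if any neighbor is excited or, spontaneously, with probability f; an excited node becomes refractory; a refractory node recovers with probability p. All names are hypothetical.

```python
import random

# Node states (labels assumed for illustration).
S, E, R = 0, 1, 2  # susceptible, excited, refractory


def step(states, neighbors, f, p, rng):
    """Advance every node by one synchronous time step."""
    new_states = []
    for node, state in enumerate(states):
        if state == S:
            has_excited_neighbor = any(states[n] == E for n in neighbors[node])
            # Excitation spreads from neighbors or arises spontaneously
            # with probability f (the background stimulation level).
            fires = has_excited_neighbor or rng.random() < f
            new_states.append(E if fires else S)
        elif state == E:
            new_states.append(R)
        else:
            # A refractory node recovers with probability p.
            new_states.append(S if rng.random() < p else R)
    return new_states


def simulate(neighbors, f, p, steps, seed=0, initial=None):
    """Return the list of state vectors over time, starting state included."""
    rng = random.Random(seed)
    states = list(initial) if initial is not None else [S] * len(neighbors)
    history = [states]
    for _ in range(steps):
        states = step(states, neighbors, f, p, rng)
        history.append(states)
    return history


# Usage: a ring of 6 nodes with one initially excited node and no
# background stimulation; two wavefronts travel around and annihilate.
ring = [[(i - 1) % 6, (i + 1) % 6] for i in range(6)]
history = simulate(ring, f=0.0, p=1.0, steps=5, initial=[E, S, S, S, S, S])
```

Sweeping f over a range of values and recording which nodes fire together is one way to relate activity patterns to topology, as the abstract describes. The adjacency lists of the cat cortical network or the C. elegans network would take the place of the toy ring.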