Search CORE

21,620 research outputs found

Characterization of DNA methylation as a function of biological complexity via dinucleotide inter-distances

Author: Castellani Gastone C.
Cristadoro Giampaolo
Esposti Mirko Degli
Lenci Marco
Monti Barbara
Paci Giulia
Remondini Daniel
Publication venue: 'The Royal Society'
Publication date: 26/11/2015
Field of study

We perform a statistical study of the distances between successive occurrencies of a given dinucleotide in the DNA sequence for a number of organisms of different complexity. Our analysis highlights peculiar features of the dinucleotide CG distribution in mammalian DNA, pointing towards a connection with the role of such dinucleotide in DNA methylation. While the CG distributions of mammals exhibit exponential tails with comparable parameters, the picture for the other organisms studied (e.g., fish, insects, bacteria and viruses) is more heterogeneous, possibly because in these organisms DNA methylation has different functional roles. Our analysis suggests that the distribution of the distances between dinucleotides CG provides useful insights in characterizing and classifying organisms in terms of methylation functionalities.Comment: 13 pages, 5 figures. To be published in the Philosophical Transactions A theme issue "DNA as information

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Bayesian modeling of recombination events in bacterial populations

Author: A Baldwin
A Baldwin
A Baldwin
A Rambaut
A Skalka
Adam Baldwin
C Fraser
Chris Dowson
CP Robert
CX Chan
D Falush
D Husmeier
D Posada
DJ Hand
E Mahenthiralingam
E Mahenthiralingam
EHL Aarts
Eshwar Mahenthiralingam
FM Cohan
J Corander
J Corander
J Corander
J Corander
J Felsenstein
J Hein
J Maynard Smith
JG Lawrence
JS Sinsheimer
Jukka Corander
JV Braun
M Arenas
M Hasegawa
MA Suchard
MJ Schervish
NC Grassly
P Marttinen
Pekka Marttinen
R Jain
RA Elton
S Sawyer
SA Sisson
VN Minin
VN Minin
William P Hanage
WJ Wiersinga
WP Hanage
X Didelot
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Background: We consider the discovery of recombinant segments jointly with their origins within multilocus DNA sequences from bacteria representing heterogeneous populations of fairly closely related species. The currently available methods for recombination detection capable of probabilistic characterization of uncertainty have a limited applicability in practice as the number of strains in a data set increases. Results: We introduce a Bayesian spatial structural model representing the continuum of origins over sites within the observed sequences, including a probabilistic characterization of uncertainty related to the origin of any particular site. To enable a statistically accurate and practically feasible approach to the analysis of large-scale data sets representing a single genus, we have developed a novel software tool (BRAT, Bayesian Recombination Tracker) implementing the model and the corresponding learning algorithm, which is capable of identifying the posterior optimal structure and to estimate the marginal posterior probabilities of putative origins over the sites. Conclusion: A multitude of challenging simulation scenarios and an analysis of real data from seven housekeeping genes of 120 strains of genus Burkholderia are used to illustrate the possibilities offered by our approach. The software is freely available for download at URL http://web.abo.fi/fak/ mnf//mate/jc/software/brat.html

Crossref

Online Research @ Cardiff

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Warwick Research Archives Portal Repository

BOOL-AN: A method for comparative sequence analysis and phylogenetic reconstruction

Author: Ari Eszter
Horváth Arnold
Ittzés Péter
Jakó Éena
Podani János
Publication venue: 'Elsevier BV'
Publication date: 01/01/2009
Field of study

A novel discrete mathematical approach is proposed as an additional tool for molecular systematics which does not require prior statistical assumptions concerning the evolutionary process. The method is based on algorithms generating mathematical representations directly from DNA/RNA or protein sequences, followed by the output of numerical (scalar or vector) and visual characteristics (graphs). The binary encoded sequence information is transformed into a compact analytical form, called the Iterative Canonical Form (or ICF) of Boolean functions, which can then be used as a generalized molecular descriptor. The method provides raw vector data for calculating different distance matrices, which in turn can be analyzed by neighbor-joining or UPGMA to derive a phylogenetic tree, or by principal coordinates analysis to get an ordination scattergram. The new method and the associated software for inferring phylogenetic trees are called the Boolean analysis or BOOL-AN

Crossref

Repository of the Academy's Library

Genomic Selective Constraints in Murid Noncoding DNA

Author: Altschul
Bejerano
Bejerano
Boissinot
Bray
Britten
Casane
Chamary
Chamary
Cooper
Cooper
Daniel J. Gaffney
Deininger
Dermitzakis
Dermitzakis
Dermitzakis
Eisenberg
Eyre-Walker
Fairbrother
Frazer
Gaffney
Gibbs
Hanawalt
Hubbard
Jaeger
Kamal
Keightley
Keightley
Keightley
Keightley
Kimura
Kondrashov
Kondrashov
Lander
Li
Margulies
Meunier
Mi
Mikkelsen
Nagylaki
Nelson
Parmley
Peter D. Keightley
Seoighe
Siepel
Sironi
Sorek
Tamura
Thomas
Thompson
Urrutia
Vinogradov
Vinogradov
Waterston
Webster
Yelin
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2006
Field of study

Recent work has suggested that there are many more selectively constrained, functional noncoding than coding sites in mammalian genomes. However, little is known about how selective constraint varies amongst different classes of noncoding DNA. We estimated the magnitude of selective constraint on a large dataset of mouse-rat gene orthologs and their surrounding noncoding DNA. Our analysis indicates that there are more than three times as many selectively constrained, nonrepetitive sites within noncoding DNA as in coding DNA in murids. The majority of these constrained noncoding sites appear to be located within intergenic regions, at distances greater than 5 kilobases from known genes. Our study also shows that in murids, intron length and mean intronic selective constraint are negatively correlated with intron ordinal number. Our results therefore suggest that functional intronic sites tend to accumulate toward the 5' end of murid genes. Our analysis also reveals that mean number of selectively constrained noncoding sites varies substantially with the function of the adjacent gene. We find that, among others, developmental and neuronal genes are associated with the greatest numbers of putatively functional noncoding sites compared with genes involved in electron transport and a variety of metabolic processes. Combining our estimates of the total number of constrained coding and noncoding bases we calculate that over twice as many deleterious mutations have occurred in intergenic regions as in known genic sequence and that the total genomic deleterious point mutation rate is 0.91 per diploid genome, per generation. This estimated rate is over twice as large as a previous estimate in murids

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Glassy transition in a disordered model for the RNA secondary structure

Author: A. Pagnani
A. S. Gliozzi
C. Branden
C. Micheletti
D. Cule
E. Marinari
F. Ricci-Tersenghi
G. Parisi
J. D. Bryngelson
J. Houdayer
J. N. Onuchic
K. F. Lau
M. Mézard
M. Zuker
P. G. Higgs
P. G. Higgs
R. Bundschuh
R. F. Gesteland
R. Nussinov
S. R. Morgan
W. Fontana
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2000
Field of study

We numerically study a disordered model for the RNA secondary structure and we find that it undergoes a phase transition, with a breaking of the replica symmetry in the low temperature region (like in spin glasses). Our results are based on the exact evaluation of the partition function.Comment: 4 pages, 3 figure

arXiv.org e-Print Archive

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Archivio della ricerca- Università di Roma La Sapienza

PORTO Publications Open Repository TOrino

Extending colonic mucosal microbiome analysis - Assessment of colonic lavage as a proxy for endoscopic colonic biopsies

Author: A Durban
A Jain
AC Ouwehand
AD Kostic
AD Kostic
B Willing
CL O’Brien
E Pruesse
EG Zoetendal
EH Simpson
F Backhed
F Chierico Del
G Li
GL Hold
HJ Flint
HL Cash
I Mukhopadhya
I Rangel
J Handelsman
J Jalanka
J Qin
JJ Kozich
JM Choo
JR Marchesi
L Chen
L Drago
L Harrell
M Morotomi
MG Langille
MH McLean
N Segata
NA Kennedy
P Lepage
P Louis
PB Eckburg
PD Schloss
PJ Turnbaugh
R Bibiloni
R Hansen
RE Ley
RL Warren
RM Shobar
S Delgado
SJ Salter
T Vatanen
Team RC
TZ DeSantis
V Mai
Y Momozawa
Y Xie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/11/2016
Field of study

This study was supported through GI Research funds and MRC Grant Ref: MR/M00533X/1 to GH.Peer reviewedPublisher PD

Aberdeen University Research

Crossref

Springer - Publisher Connector

PubMed Central

UNSWorks

FigShare

Ground state and glass transition of the RNA secondary structure

Author: Bundschuh
Bundschuh
Carpentier
Dill
Gennes
Halpin-Healy
Hartmann
Higgs
Hwa
Hwa
Isaacs
Jr
Jr
Karlin
Krzakala
L.-H. Tang
Lässig
Marinari
Mukhopadhyay
Mézard
Nussinov
Onuchic
Pagnani
S. Hui
Schueler-Furman
Shakhnovich
Snow
Tang
Tang
Yu
Zeng
Zuker
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/08/2006
Field of study

RNA molecules form a sequence-specific self-pairing pattern at low temperatures. We analyze this problem using a random pairing energy model as well as a random sequence model that includes a base stacking energy in favor of helix propagation. The free energy cost for separating a chain into two equal halves offers a quantitative measure of sequence specific pairing. In the low temperature glass phase, this quantity grows quadratically with the logarithm of the chain length, but it switches to a linear behavior of entropic origin in the high temperature molten phase. Transition between the two phases is continuous, with characteristics that resemble those of a disordered elastic manifold in two dimensions. For designed sequences, however, a power-law distribution of pairing energies on a coarse-grained level may be more appropriate. Extreme value statistics arguments then predict a power-law growth of the free energy cost to break a chain, in agreement with numerical simulations. Interestingly, the distribution of pairing distances in the ground state secondary structure follows a remarkable power-law with an exponent -4/3, independent of the specific assumptions for the base pairing energies

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Free energy landscape and characteristic forces for the initiation of DNA unzipping

Author: Andricioaei Ioan
Brunk Elizabeth
Florescu Ana Maria
Joyeux Marc
Mentes Ahmet
Wereszczynski Jeff
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

DNA unzipping, the separation of its double helix into single strands, is crucial in modulating a host of genetic processes. Although the large-scale separation of double-stranded DNA has been studied with a variety of theoretical and experimental techniques, the minute details of the very first steps of unzipping are still unclear. Here, we use atomistic molecular dynamics (MD) simulations, coarse-grained simulations and a statistical-mechanical model to study the initiation of DNA unzipping by an external force. The calculation of the potential of mean force profiles for the initial separation of the first few terminal base pairs in a DNA oligomer reveal that forces ranging between 130 and 230 pN are needed to disrupt the first base pair, values of an order of magnitude larger than those needed to disrupt base pairs in partially unzipped DNA. The force peak has an "echo," of approximately 50 pN, at the distance that unzips the second base pair. We show that the high peak needed to initiate unzipping derives from a free energy basin that is distinct from the basins of subsequent base pairs because of entropic contributions and we highlight the microscopic origin of the peak. Our results suggest a new window of exploration for single molecule experiments.Comment: 25 pages, 6 figures , Accepted for publication in Biophysical Journa

arXiv.org e-Print Archive

Elsevier - Publisher Connector

PubMed Central

eScholarship - University of California

MPG.PuRe