Search CORE

101,324 research outputs found

EERTREE: An Efficient Data Structure for Processing Palindromes in Strings

Author: Rubinchik Mikhail
Shur Arseny M.
Publication venue
Publication date: 17/08/2015
Field of study

We propose a new linear-size data structure which provides a fast access to all palindromic substrings of a string or a set of strings. This structure inherits some ideas from the construction of both the suffix trie and suffix tree. Using this structure, we present simple and efficient solutions for a number of problems involving palindromes.Comment: 21 pages, 2 figures. Accepted to IWOCA 201

arXiv.org e-Print Archive

Crossref

Institutional repository of Ural Federal University named after the first President of Russia B.N.Yeltsin

An Introduction to Programming for Bioscientists: A Python-based Primer

Author: Ekmekci Berk
McAnany Charles E.
Mura Cameron
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 17/05/2016
Field of study

Computing has revolutionized the biological sciences over the past several decades, such that virtually all contemporary research in the biosciences utilizes computer programs. The computational advances have come on many fronts, spurred by fundamental developments in hardware, software, and algorithms. These advances have influenced, and even engendered, a phenomenal array of bioscience fields, including molecular evolution and bioinformatics; genome-, proteome-, transcriptome- and metabolome-wide experimental studies; structural genomics; and atomistic simulations of cellular-scale molecular assemblies as large as ribosomes and intact viruses. In short, much of post-genomic biology is increasingly becoming a form of computational biology. The ability to design and write computer programs is among the most indispensable skills that a modern researcher can cultivate. Python has become a popular programming language in the biosciences, largely because (i) its straightforward semantics and clean syntax make it a readily accessible first language; (ii) it is expressive and well-suited to object-oriented programming, as well as other modern paradigms; and (iii) the many available libraries and third-party toolkits extend the functionality of the core language into virtually every biological domain (sequence and structure analyses, phylogenomics, workflow management systems, etc.). This primer offers a basic introduction to coding, via Python, and it includes concrete examples and exercises to illustrate the language's usage and capabilities; the main text culminates with a final project in structural bioinformatics. A suite of Supplemental Chapters is also provided. Starting with basic concepts, such as that of a 'variable', the Chapters methodically advance the reader to the point of writing a graphical user interface to compute the Hamming distance between two DNA sequences.Comment: 65 pages total, including 45 pages text, 3 figures, 4 tables, numerous exercises, and 19 pages of Supporting Information; currently in press at PLOS Computational Biolog

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

FigShare

Efficient Algorithms for the Closest Pair Problem and Applications

Author: Pathak Sudipta
Rajasekaran Sanguthevar
Publication venue
Publication date: 21/07/2014
Field of study

The closest pair problem (CPP) is one of the well studied and fundamental problems in computing. Given a set of points in a metric space, the problem is to identify the pair of closest points. Another closely related problem is the fixed radius nearest neighbors problem (FRNNP). Given a set of points and a radius

R

, the problem is, for every input point

p

, to identify all the other input points that are within a distance of

R

from

p

. A naive deterministic algorithm can solve these problems in quadratic time. CPP as well as FRNNP play a vital role in computational biology, computational finance, share market analysis, weather prediction, entomology, electro cardiograph, N-body simulations, molecular simulations, etc. As a result, any improvements made in solving CPP and FRNNP will have immediate implications for the solution of numerous problems in these domains. We live in an era of big data and processing these data take large amounts of time. Speeding up data processing algorithms is thus much more essential now than ever before. In this paper we present algorithms for CPP and FRNNP that improve (in theory and/or practice) the best-known algorithms reported in the literature for CPP and FRNNP. These algorithms also improve the best-known algorithms for related applications including time series motif mining and the two locus problem in Genome Wide Association Studies (GWAS)

arXiv.org e-Print Archive

CiteSeerX

Red Queen Coevolution on Fitness Landscapes

Author: A. Hastings
A. Hoffman
A. Ilachinsky
A. Kerr
A.A. Berryman
A.F. Agrawal
A.F. Agrawal
B. Drossel
C. Darwin
C.M. Lively
D.J. Mode
D.K. Clarke
D.M. Raup
D.M. Raup
E. Decaestecker
F. Dercole
F. Jacob
F. Jacob
G. Lafforgue
G. Nicolis
H. Freund
H. Haken
H.H. Flor
J. Jaenike
J. Quer
J. Sardanyés
J. Sardanyés
J. Sardanyés
J. Sardanyés
J. Sardanyés
J. Sardanyés
J. Sardanyés
J. Seger
J.A.G.M. Visser de
J.B.S. Haldane
J.M. Montoya
J.N. Thompson
J.N. Thompson
J.N. Thompson
J.N. Thompson
J.S. McCaskill
J.S. Weltz
K. Kaneko
K.C. King
K.Y. Yip
L. Valen Van
L. Valen Van
L. Valen Van
L.T. Morran
M. Eigen
M. Pascual
M. Tegmark
M.A. Parker
M.E.J. Newman
M.J. Benton
N.C. Stenseth
P. Bak
P.D. Roopnarine
P.R. Ehrlich
R. Ferrière
R.K. Grosberg
R.M. May
R.S. Howard
R.V. Solé
R.V. Solé
R.V. Solé
R.V. Solé
R.V. Solé
R.V. Solé
R.V. Solé
R.V. Solé
R.V. Solé
R.V. Solé
R.V. Solé
S. Wright
S. Wright
S.A. Kauffman
S.A. Kauffman
S.C. Manrubia
S.F. Elena
S.J. Gould
T. Dobzhansky
T. Ikegami
T.J. Case
U. Dieckmann
W.D. Hamilton
W.D. Hamilton
W.H. Li
Z.Q. Chen
Publication venue
Publication date: 22/03/2013
Field of study

Species do not merely evolve, they also coevolve with other organisms. Coevolution is a major force driving interacting species to continuously evolve ex- ploring their fitness landscapes. Coevolution involves the coupling of species fit- ness landscapes, linking species genetic changes with their inter-specific ecological interactions. Here we first introduce the Red Queen hypothesis of evolution com- menting on some theoretical aspects and empirical evidences. As an introduction to the fitness landscape concept, we review key issues on evolution on simple and rugged fitness landscapes. Then we present key modeling examples of coevolution on different fitness landscapes at different scales, from RNA viruses to complex ecosystems and macroevolution.Comment: 40 pages, 12 figures. To appear in "Recent Advances in the Theory and Application of Fitness Landscapes" (H. Richter and A. Engelbrecht, eds.). Springer Series in Emergence, Complexity, and Computation, 201

arXiv.org e-Print Archive

Crossref