Search CORE

866 research outputs found

Construction of minimal DFAs from biological motifs

Author: Marschall Tobias
Publication venue
Publication date: 01/01/2010
Field of study

Deterministic finite automata (DFAs) are constructed for various purposes in computational biology. Little attention, however, has been given to the efficient construction of minimal DFAs. In this article, we define simple non-deterministic finite automata (NFAs) and prove that the standard subset construction transforms NFAs of this type into minimal DFAs. Furthermore, we show how simple NFAs can be constructed from two types of patterns popular in bioinformatics, namely (sets of) generalized strings and (generalized) strings with a Hamming neighborhood

arXiv.org e-Print Archive

CiteSeerX

Sparse approaches for the exact distribution of patterns in long state sequences generated by a Markov source

Author: Aho
Allauzen
Antzoulakos
Beaudoing
Boeva
Boeva
Brazma
Chang
Cormen
Cowan
Crochemore
Crochemore
Denise
El~Karoui
Erhardsson
Fiduccia
Frith
Fu
Geske
Godbole
Gregory Nuel
Hampson
Hopcroft
Hopcroft
Jean-Guillaume Dumas
Kaltofen
Karlin
Kleffe
Knuth
Le~Maout
Lladser
Mariño-Ramírez
Nicodème
Nuel
Nuel
Nuel
Nuel
Nuel
Nuel
Nuel
Pevzner
Prum
Reinert
Ribeca
Régnier
Stefanov
Stefanov
Storjohann
van Helden
Publication venue: 'Elsevier BV'
Publication date: 01/01/2012
Field of study

We present two novel approaches for the computation of the exact distribution of a pattern in a long sequence. Both approaches take into account the sparse structure of the problem and are two-part algorithms. The first approach relies on a partial recursion after a fast computation of the second largest eigenvalue of the transition matrix of a Markov chain embedding. The second approach uses fast Taylor expansions of an exact bivariate rational reconstruction of the distribution. We illustrate the interest of both approaches on a simple toy-example and two biological applications: the transcription factors of the Human Chromosome 5 and the PROSITE signatures of functional motifs in proteins. On these example our methods demonstrate their complementarity and their hability to extend the domain of feasibility for exact computations in pattern problems to a new level

arXiv.org e-Print Archive

Crossref

Hal - Université Grenoble Alpes

HAL Descartes

Hal-Diderot

Significance Score of Motifs in Biological Sequences

Author: Grégory Nuel
Publication venue: 'IntechOpen'
Publication date: 01/01/2011
Field of study

Available from: http://www.intechopen.com/books/bioinformatics-trends-and-methodologies/significance-score-of-motifs-in-biological-sequence

IntechOpen

Crossref

General Iteration graphs and Boolean automata circuits

Author: Noual Mathilde
Publication venue
Publication date: 15/01/2010
Field of study

This article is set in the field of regulation networks modeled by discrete dynamical systems. It focuses on Boolean automata networks. In such networks, there are many ways to update the states of every element. When this is done deterministically, at each time step of a discretised time flow and according to a predefined order, we say that the network is updated according to block-sequential update schedule (blocks of elements are updated sequentially while, within each block, the elements are updated synchronously). Many studies, for the sake of simplicity and with some biologically motivated reasons, have concentrated on networks updated with one particular block-sequential update schedule (more often the synchronous/parallel update schedule or the sequential update schedules). The aim of this paper is to give an argument formally proven and inspired by biological considerations in favour of the fact that the choice of a particular update schedule does not matter so much in terms of the possible and likely dynamical behaviours that networks may display

arXiv.org e-Print Archive

HAL-ENS-LYON

CiteSeerX

INRIA a CCSD electronic archive server

Hal-Diderot

An Ansatz for undecidable computation in RNA-world automata

Author: Prokopenko Mikhail
Svahn Adam J.
Publication venue
Publication date: 30/09/2020
Field of study

In this Ansatz we consider theoretical constructions of RNA polymers into automata, a form of computational structure. The basis for transitions in our automata are plausible RNA-world enzymes that may perform ligation or cleavage. Limited to these operations, we construct RNA automata of increasing complexity; from the Finite Automaton (RNA-FA) to the Turing Machine equivalent 2-stack PDA (RNA-2PDA) and the universal RNA-UPDA. For each automaton we show how the enzymatic reactions match the logical operations of the RNA automaton, and describe how biological exploration of the corresponding evolutionary space is facilitated by the efficient arrangement of RNA polymers into a computational structure. A critical theme of the Ansatz is the self-reference in RNA automata configurations which exploits the program-data duality but results in undecidable computation. We describe how undecidable computation is exemplified in the self-referential Liar paradox that places a boundary on a logical system, and by construction, any RNA automata. We argue that an expansion of the evolutionary space for RNA-2PDA automata can be interpreted as a hierarchical resolution of the undecidable computation by a meta-system (akin to Turing's oracle), in a continual process analogous to Turing's ordinal logics and Post's extensible recursively generated logics. On this basis, we put forward the hypothesis that the resolution of undecidable configurations in RNA-world automata represents a mechanism for novelty generation in the evolutionary space, and propose avenues for future investigation of biological automata

arXiv.org e-Print Archive

A framework for the local information dynamics of distributed computation in complex systems

The nature of distributed computation has often been described in terms of the component operations of universal computation: information storage, transfer and modification. We review the first complete framework that quantifies each of these individual information dynamics on a local scale within a system, and describes the manner in which they interact to create non-trivial computation where "the whole is greater than the sum of the parts". We describe the application of the framework to cellular automata, a simple yet powerful model of distributed computation. This is an important application, because the framework is the first to provide quantitative evidence for several important conjectures about distributed computation in cellular automata: that blinkers embody information storage, particles are information transfer agents, and particle collisions are information modification events. The framework is also shown to contrast the computations conducted by several well-known cellular automata, highlighting the importance of information coherence in complex computation. The results reviewed here provide important quantitative insights into the fundamental nature of distributed computation and the dynamics of complex systems, as well as impetus for the framework to be applied to the analysis and design of other systems.Comment: 44 pages, 8 figure

arXiv.org e-Print Archive

Crossref

Macquarie University ResearchOnline

Boolean networks with reliable dynamics

Author: B. Drossel
Barbara Drossel
C. Gershenson
D. E. Knuth
S. Wolfram
Tiago P. Peixoto
U. Alon
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2009
Field of study

We investigated the properties of Boolean networks that follow a given reliable trajectory in state space. A reliable trajectory is defined as a sequence of states which is independent of the order in which the nodes are updated. We explored numerically the topology, the update functions, and the state space structure of these networks, which we constructed using a minimum number of links and the simplest update functions. We found that the clustering coefficient is larger than in random networks, and that the probability distribution of three-node motifs is similar to that found in gene regulation networks. Among the update functions, only a subset of all possible functions occur, and they can be classified according to their probability. More homogeneous functions occur more often, leading to a dominance of canalyzing functions. Finally, we studied the entire state space of the networks. We observed that with increasing systems size, fixed points become more dominant, moving the networks close to the frozen phase.Comment: 11 Pages, 15 figure

arXiv.org e-Print Archive

TUbiblio

Crossref