Search CORE

7 research outputs found

Finding Nested Common Intervals Efficiently

Author: A. Bergeron
A. Bergeron
A. Bergeron
A. Bergeron
G. Blin
G. Didier
G.M. Landau
M. Zhang
R. Hoberman
S. Böcker
S. Ohno
S. Rahmann
T. Schmidt
T. Uno
U. Kurzik-Dumke
X. He
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

International audienceIn this paper, we study the problem of effi ciently fi nding gene clusters formalized by nested common intervals between two genomes represented either as permutations or as sequences. Considering permutations, we give several algorithms whose running time depends on the size of the actual output rather than the output in the worst case. Indeed, we first provide a straightforward O(n^3) time algorithm for finding all nested common intervals. We reduce this complexity by providing an O(n^2) time algorithm computing an irredundant output. Finally, we show, by providing a third algorithm, that fi nding only the maximal nested common intervals can be done in linear time. Considering sequences, we provide solutions (modi cations of previously de ned algorithms and a new algorithm) for di fferent variants of the problem, depending on the treatment one wants to apply to duplicated genes

CiteSeerX

Crossref

Publications at Bielefeld University

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Easy identification of generalized common and conserved nested intervals

Author: de Montgolfier Fabien
Raffinot Mathieu
Rusu Irena
Publication venue
Publication date: 02/12/2013
Field of study

In this paper we explain how to easily compute gene clusters, formalized by classical or generalized nested common or conserved intervals, between a set of K genomes represented as K permutations. A b-nested common (resp. conserved) interval I of size |I| is either an interval of size 1 or a common (resp. conserved) interval that contains another b-nested common (resp. conserved) interval of size at least |I|-b. When b=1, this corresponds to the classical notion of nested interval. We exhibit two simple algorithms to output all b-nested common or conserved intervals between K permutations in O(Kn+nocc) time, where nocc is the total number of such intervals. We also explain how to count all b-nested intervals in O(Kn) time. New properties of the family of conserved intervals are proposed to do so

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

Hal-Diderot

On the distribution of the number of cycles in the breakpoint graph of a random signed permutation

Author: Grusea Simona
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/09/2011
Field of study

International audienceWe use the finite Markov chain embedding technique to obtain the distribution of the number of cycles in the breakpoint graph of a random uniform signed permutation. This further gives a very good approximation of the distribution of the reversal distance between two random genomes

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

HAL AMU

HAL-INSA Toulouse

MinMax-Profiles: A Unifying View of Common Intervals, Nested Common Intervals and Conserved Intervals of K Permutations

Author: Rusu Irena
Publication venue
Publication date: 01/01/2013
Field of study

Common intervals of K permutations over the same set of n elements were firstly investigated by T. Uno and M.Yagiura (Algorithmica, 26:290:309, 2000), who proposed an efficient algorithm to find common intervals when K=2. Several particular classes of intervals have been defined since then, e.g. conserved intervals and nested common intervals, with applications mainly in genome comparison. Each such class, including common intervals, led to the development of a specific algorithmic approach for K=2, and - except for nested common intervals - for its extension to an arbitrary K. In this paper, we propose a common and efficient algorithmic framework for finding different types of common intervals in a set P of K permutations, with arbitrary K. Our generic algorithm is based on a global representation of the information stored in P, called the MinMax-profile of P, and an efficient data structure, called an LR-stack, that we introduce here. We show that common intervals (and their subclasses of irreducible common intervals and same-sign common intervals), nested common intervals (and their subclass of maximal nested common intervals) as well as conserved intervals (and their subclass of irreducible conserved intervals) may be obtained by appropriately setting the parameters of our algorithm in each case. All the resulting algorithms run in O(Kn+N)-time and need O(n) additional space, where N is the number of solutions. The algorithms for nested common intervals and maximal nested common intervals are new for K>2, in the sense that no other algorithm has been given so far to solve the problem with the same complexity, or better. The other algorithms are as efficient as the best known algorithms.Comment: 25 pages, 2 figure

arXiv.org e-Print Archive

Finding Nested Common Intervals Efficiently

Author: Blin Guillaume
Stoye Jens
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2009
Field of study

Hal-Diderot

Finding Nested Common Intervals Efficiently

Author: Blin Guillaume
Faye David Célestin
Stoye Jens
Publication venue: 'Mary Ann Liebert Inc'
Publication date: 01/09/2010
Field of study

International audienceIn this paper, we study the problem of efficiently finding gene clusters formalized by nested common intervals between two genomes represented either as permutations or as sequences. Considering permutations, we give several algorithms whose running time depends on the size of the actual output rather than the output in the worst case. Indeed, we first provide a straightforward cubic time algorithm for finding all nested common intervals. We reduce this complexity by providing a quadratic time algorithm computing an irredundant output. We then show, by providing a third algorithm, that finding only the maximal nested common intervals can be done in linear time. Finally, we prove that finding approximate nested common intervals is fixed parameter tractable. Considering sequences, we provide solutions (modifications of previously defined algorithms and a new algorithm) for different variants of the problem, depending on the treatment one wants to apply to duplicated genes. This includes a polynomial-time algorithm for a variant implying a matching of the genes in the cluster, a setting that for other problems often leads to hardness

Hal-Diderot

Finding Nested Common Intervals Efficiently

Author: David Faye
Guillaume Blin
Jens Stoye
Rahmann S.
Publication venue: 'Mary Ann Liebert Inc'
Publication date: 01/01/2010
Field of study

Blin G, Faye D, Stoye J. Finding Nested Common Intervals Efficiently. Journal of Computational Biology. 2010;17(9):1183-1194

CiteSeerX

Crossref

Publications at Bielefeld University

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM