Search CORE

845 research outputs found

On Maximal Unbordered Factors

Author: A Ehrenfeucht
D Moore
F Franĕk
J-P Duval
J-P Duval
J-P Duval
L Ilie
P Gawrychowski
P Nielsen
R Assous
S Holub
T Kociumaka
Publication venue
Publication date: 28/04/2015
Field of study

Given a string

S

of length

n

, its maximal unbordered factor is the longest factor which does not have a border. In this work we investigate the relationship between

n

and the length of the maximal unbordered factor of

S

. We prove that for the alphabet of size

\sigma \ge 5

the expected length of the maximal unbordered factor of a string of length~

n

is at least

0.99 n

(for sufficiently large values of

n

). As an application of this result, we propose a new algorithm for computing the maximal unbordered factor of a string.Comment: Accepted to the 26th Annual Symposium on Combinatorial Pattern Matching (CPM 2015

arXiv.org e-Print Archive

Crossref

HAL Descartes

Hal-Diderot

HAL-Ecole des Ponts ParisTech

Explore Bristol Research

HAL - UPEC / UPEM

Average-case analysis of perfect sorting by reversals (Journal Version)

Author: Bouvel Mathilde
Chauve Cedric
Mishna Marni
Rossin Dominique
Publication venue
Publication date: 01/01/2011
Field of study

Perfect sorting by reversals, a problem originating in computational genomics, is the process of sorting a signed permutation to either the identity or to the reversed identity permutation, by a sequence of reversals that do not break any common interval. B\'erard et al. (2007) make use of strong interval trees to describe an algorithm for sorting signed permutations by reversals. Combinatorial properties of this family of trees are essential to the algorithm analysis. Here, we use the expected value of certain tree parameters to prove that the average run-time of the algorithm is at worst, polynomial, and additionally, for sufficiently long permutations, the sorting algorithm runs in polynomial time with probability one. Furthermore, our analysis of the subclass of commuting scenarios yields precise results on the average length of a reversal, and the average number of reversals.Comment: A preliminary version of this work appeared in the proceedings of Combinatorial Pattern Matching (CPM) 2009. See arXiv:0901.2847; Discrete Mathematics, Algorithms and Applications, vol. 3(3), 201

arXiv.org e-Print Archive

Crossref

Hal-Diderot

HAL-Polytechnique

Faster Longest Common Extension Queries in Strings over General Alphabets

Author: Gawrychowski Paweł
Kociumaka Tomasz
Rytter Wojciech
Waleń Tomasz
Publication venue
Publication date: 01/01/2016
Field of study

Longest common extension queries (often called longest common prefix queries) constitute a fundamental building block in multiple string algorithms, for example computing runs and approximate pattern matching. We show that a sequence of

q

LCE queries for a string of size

n

over a general ordered alphabet can be realized in

O(q \log \log n+n\log^*n)

time making only

O(q+n)

symbol comparisons. Consequently, all runs in a string over a general ordered alphabet can be computed in

O(n \log \log n)

time making

O(n)

symbol comparisons. Our results improve upon a solution by Kosolobov (Information Processing Letters, 2016), who gave an algorithm with

O(n \log^{2/3} n)

running time and conjectured that

O(n)

time is possible. We make a significant progress towards resolving this conjecture. Our techniques extend to the case of general unordered alphabets, when the time increases to