Search CORE

1,960 research outputs found

String attractors and combinatorics on words

Author: Mantaci S.
Restivo A.
Romana G.
Rosone G.
Sciortino M.
Publication venue: CEUR-WS
Publication date: 01/01/2019
Field of study

The notion of string attractor has recently been introduced in [Prezza, 2017] and studied in [Kempa and Prezza, 2018] to provide a unifying framework for known dictionary-based compressors. A string attractor for a word w = w[1]w[2] · · · w[n] is a subset Γ of the positions 1, . . ., n, such that all distinct factors of w have an occurrence crossing at least one of the elements of Γ. While finding the smallest string attractor for a word is a NP-complete problem, it has been proved in [Kempa and Prezza, 2018] that dictionary compressors can be interpreted as algorithms approximating the smallest string attractor for a given word. In this paper we explore the notion of string attractor from a combinatorial point of view, by focusing on several families of finite words. The results presented in the paper suggest that the notion of string attractor can be used to define new tools to investigate combinatorial properties of the words

Archivio della Ricerca - Università di Pisa

Archivio istituzionale della ricerca - Università di Palermo

Quasi-symmetric functions as polynomial functions on Young diagrams

Author: Aval Jean-Christophe
Féray Valentin
Novelli Jean-Christophe
Thibon Jean-Yves
Publication venue
Publication date: 01/01/2014
Field of study

We determine the most general form of a smooth function on Young diagrams, that is, a polynomial in the interlacing or multirectangular coordinates whose value depends only on the shape of the diagram. We prove that the algebra of such functions is isomorphic to quasi-symmetric functions, and give a noncommutative analog of this result.Comment: 34 pages, 4 figures, version including minor modifications suggested by referee

arXiv.org e-Print Archive

CiteSeerX

ZORA

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Faster Longest Common Extension Queries in Strings over General Alphabets

Author: Gawrychowski Paweł
Kociumaka Tomasz
Rytter Wojciech
Waleń Tomasz
Publication venue
Publication date: 01/01/2016
Field of study

Longest common extension queries (often called longest common prefix queries) constitute a fundamental building block in multiple string algorithms, for example computing runs and approximate pattern matching. We show that a sequence of

q

LCE queries for a string of size

n

over a general ordered alphabet can be realized in

O(q \log \log n+n\log^*n)

time making only

O(q+n)

symbol comparisons. Consequently, all runs in a string over a general ordered alphabet can be computed in

O(n \log \log n)

time making

O(n)

symbol comparisons. Our results improve upon a solution by Kosolobov (Information Processing Letters, 2016), who gave an algorithm with

O(n \log^{2/3} n)

running time and conjectured that

O(n)

time is possible. We make a significant progress towards resolving this conjecture. Our techniques extend to the case of general unordered alphabets, when the time increases to

O(q\log n + n\log^*n)

. The main tools are difference covers and the disjoint-sets data structure.Comment: Accepted to CPM 201

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

On the Parikh-de-Bruijn grid

Author: Burcsi Péter
Lipták Zsuzsanna
Smyth W. F.
Publication venue
Publication date: 01/01/2017
Field of study

We introduce the Parikh-de-Bruijn grid, a graph whose vertices are fixed-order Parikh vectors, and whose edges are given by a simple shift operation. This graph gives structural insight into the nature of sets of Parikh vectors as well as that of the Parikh set of a given string. We show its utility by proving some results on Parikh-de-Bruijn strings, the abelian analog of de-Bruijn sequences.Comment: 18 pages, 3 figures, 1 tabl

arXiv.org e-Print Archive

Catalogo dei prodotti della ricerca