Search CORE

41,553 research outputs found

Computing LZ77 in Run-Compressed Space

Author: Policriti Alberto
Prezza Nicola
Publication venue
Publication date: 21/10/2015
Field of study

In this paper, we show that the LZ77 factorization of a text T {\in\Sigma^n} can be computed in O(R log n) bits of working space and O(n log R) time, R being the number of runs in the Burrows-Wheeler transform of T reversed. For extremely repetitive inputs, the working space can be as low as O(log n) bits: exponentially smaller than the text itself. As a direct consequence of our result, we show that a class of repetition-aware self-indexes based on a combination of run-length encoded BWT and LZ77 can be built in asymptotically optimal O(R + z) words of working space, z being the size of the LZ77 parsing

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari

Dynamic Relative Compression, Dynamic Partial Sums, and Substring Concatenation

Author: Bille Philip
Cording Patrick Hagge
Gørtz Inge Li
Skjoldjensen Frederik Rye
Vildhøj Hjalte Wedel
Vind Søren
Publication venue
Publication date: 01/01/2016
Field of study

Given a static reference string

R

and a source string

S

, a relative compression of

S

with respect to

R

is an encoding of

S

as a sequence of references to substrings of

R

. Relative compression schemes are a classic model of compression and have recently proved very successful for compressing highly-repetitive massive data sets such as genomes and web-data. We initiate the study of relative compression in a dynamic setting where the compressed source string

S

is subject to edit operations. The goal is to maintain the compressed representation compactly, while supporting edits and allowing efficient random access to the (uncompressed) source string. We present new data structures that achieve optimal time for updates and queries while using space linear in the size of the optimal relative compression, for nearly all combinations of parameters. We also present solutions for restricted and extended sets of updates. To achieve these results, we revisit the dynamic partial sums problem and the substring concatenation problem. We present new optimal or near optimal bounds for these problems. Plugging in our new results we also immediately obtain new bounds for the string indexing for patterns with wildcards problem and the dynamic text and static pattern matching problem

arXiv.org e-Print Archive

Fine-Grained Complexity Analysis of Two Classic TSP Variants

Author: Buchin Kevin
de Berg Mark
Jansen Bart M. P.
Woeginger Gerhard
Publication venue
Publication date: 01/01/2016
Field of study

We analyze two classic variants of the Traveling Salesman Problem using the toolkit of fine-grained complexity. Our first set of results is motivated by the Bitonic TSP problem: given a set of

n

points in the plane, compute a shortest tour consisting of two monotone chains. It is a classic dynamic-programming exercise to solve this problem in

O(n^2)

time. While the near-quadratic dependency of similar dynamic programs for Longest Common Subsequence and Discrete Frechet Distance has recently been proven to be essentially optimal under the Strong Exponential Time Hypothesis, we show that bitonic tours can be found in subquadratic time. More precisely, we present an algorithm that solves bitonic TSP in

O(n \log^2 n)

time and its bottleneck version in

O(n \log^3 n)

time. Our second set of results concerns the popular

k

-OPT heuristic for TSP in the graph setting. More precisely, we study the

k

-OPT decision problem, which asks whether a given tour can be improved by a

k

-OPT move that replaces

k

edges in the tour by

k

new edges. A simple algorithm solves

k

-OPT in

O(n^k)

time for fixed

k

. For 2-OPT, this is easily seen to be optimal. For

k=3

we prove that an algorithm with a runtime of the form

\tilde{O}(n^{3-\epsilon})

exists if and only if All-Pairs Shortest Paths in weighted digraphs has such an algorithm. The results for

k=2,3

may suggest that the actual time complexity of

k

-OPT is

\Theta(n^k)

. We show that this is not the case, by presenting an algorithm that finds the best

k

-move in

O(n^{\lfloor 2k/3 \rfloor + 1})

time for fixed

k \geq 3

. This implies that 4-OPT can be solved in

O(n^3)

time, matching the best-known algorithm for 3-OPT. Finally, we show how to beat the quadratic barrier for

k=2

in two important settings, namely for points in the plane and when we want to solve 2-OPT repeatedly.Comment: Extended abstract appears in the Proceedings of the 43rd International Colloquium on Automata, Languages, and Programming (ICALP 2016

arXiv.org e-Print Archive

Repository TU/e

Dagstuhl Research Online Publication Server