Search CORE

150 research outputs found

Queries on LZ-Bounded Encodings

Author: Belazzougui Djamal
Gagie Travis
Gawrychowski Paweł
Kärkkäinen Juha
Ordóñez Alberto
Puglisi Simon J.
Tabei Yasuo
Publication venue
Publication date: 02/12/2014
Field of study

We describe a data structure that stores a string

S

in space similar to that of its Lempel-Ziv encoding and efficiently supports access, rank and select queries. These queries are fundamental for implementing succinct and compressed data structures, such as compressed trees and graphs. We show that our data structure can be built in a scalable manner and is both small and fast in practice compared to other data structures supporting such queries

arXiv.org e-Print Archive

Crossref

MPG.PuRe

Computing LZ77 in Run-Compressed Space

Author: Policriti Alberto
Prezza Nicola
Publication venue
Publication date: 21/10/2015
Field of study

In this paper, we show that the LZ77 factorization of a text T {\in\Sigma^n} can be computed in O(R log n) bits of working space and O(n log R) time, R being the number of runs in the Burrows-Wheeler transform of T reversed. For extremely repetitive inputs, the working space can be as low as O(log n) bits: exponentially smaller than the text itself. As a direct consequence of our result, we show that a class of repetition-aware self-indexes based on a combination of run-length encoded BWT and LZ77 can be built in asymptotically optimal O(R + z) words of working space, z being the size of the LZ77 parsing

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Udine

Archivio istituzionale della ricerca - Università degli Studi di Venezia Ca' Foscari

Archivio della ricerca- LUISS Libera Università Internazionale degli Studi Sociali Guido Carli di Roma

Hard Instances of the Constrained Discrete Logarithm Problem

Author: A. Naor
A.M. Odlyzko
B. Bollobás
C.-P. Schnorr
D.E. Knuth
D.R. Stinson
H. Haanpää
I.Z. Ruzsa
J. Hoffstein
J. Singer
J.-S. Coron
J.M. Pollard
J.M. Pollard
J.T. Schwartz
K. Zarankiewicz
M. Chateauneuf
O. Schirokauer
P. Erdös
P.C. Oorschot van
R. Heiman
R.C. Baker
R.C. Bose
R.K. Guy
R.L. Graham
V. Shoup
V.I. Nechaev
Y. Yacobi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

The discrete logarithm problem (DLP) generalizes to the constrained DLP, where the secret exponent

x

belongs to a set known to the attacker. The complexity of generic algorithms for solving the constrained DLP depends on the choice of the set. Motivated by cryptographic applications, we study sets with succinct representation for which the constrained DLP is hard. We draw on earlier results due to Erd\"os et al. and Schnorr, develop geometric tools such as generalized Menelaus' theorem for proving lower bounds on the complexity of the constrained DLP, and construct sets with succinct representation with provable non-trivial lower bounds

arXiv.org e-Print Archive

CiteSeerX

Crossref

Small space and streaming pattern matching with k edits

Author: Kociumaka Tomasz
Porat Ely
Starikovskaya Tatiana
Publication venue
Publication date: 10/06/2021
Field of study

In this work, we revisit the fundamental and well-studied problem of approximate pattern matching under edit distance. Given an integer

k

, a pattern

P

of length

m

, and a text

T

of length

n \ge m

, the task is to find substrings of

T

that are within edit distance

k

from

P

. Our main result is a streaming algorithm that solves the problem in

\tilde{O}(k^5)

space and

\tilde{O}(k^8)

amortised time per character of the text, providing answers correct with high probability. (Hereafter,

\tilde{O}(\cdot)

hides a

\mathrm{poly}(\log n)

factor.) This answers a decade-old question: since the discovery of a

\mathrm{poly}(k\log n)

-space streaming algorithm for pattern matching under Hamming distance by Porat and Porat [FOCS 2009], the existence of an analogous result for edit distance remained open. Up to this work, no

\mathrm{poly}(k\log n)

-space algorithm was known even in the simpler semi-streaming model, where

T

comes as a stream but

P

is available for read-only access. In this model, we give a deterministic algorithm that achieves slightly better complexity. In order to develop the fully streaming algorithm, we introduce a new edit distance sketch parametrised by integers