Search CORE

28 research outputs found

Finding the Leftmost Critical Factorization on Unordered Alphabet

Author: Kosolobov Dmitry
Publication venue
Publication date: 01/01/2016
Field of study

We present a linear time and space algorithm computing the leftmost critical factorization of a given string on an unordered alphabet.Comment: 13 pages, 13 figures (accepted to Theor. Comp. Sci.

arXiv.org e-Print Archive

Crossref

Institutional repository of Ural Federal University named after the first President of Russia B.N.Yeltsin

Dagstuhl Reports : Volume 1, Issue 2, February 2011

Author: Schloss Dagstuhl Leibniz-Zentrum für Informatik
Publication venue
Publication date: 09/09/2011
Field of study

Online Privacy: Towards Informational Self-Determination on the Internet (Dagstuhl Perspectives Workshop 11061) : Simone Fischer-Hübner, Chris Hoofnagle, Kai Rannenberg, Michael Waidner, Ioannis Krontiris and Michael Marhöfer Self-Repairing Programs (Dagstuhl Seminar 11062) : Mauro Pezzé, Martin C. Rinard, Westley Weimer and Andreas Zeller Theory and Applications of Graph Searching Problems (Dagstuhl Seminar 11071) : Fedor V. Fomin, Pierre Fraigniaud, Stephan Kreutzer and Dimitrios M. Thilikos Combinatorial and Algorithmic Aspects of Sequence Processing (Dagstuhl Seminar 11081) : Maxime Crochemore, Lila Kari, Mehryar Mohri and Dirk Nowotka Packing and Scheduling Algorithms for Information and Communication Services (Dagstuhl Seminar 11091) Klaus Jansen, Claire Mathieu, Hadas Shachnai and Neal E. Youn

Hochschulschriftenserver - Universität Frankfurt am Main

Full-fledged Real-Time Indexing for Constant Size Alphabets

Author: Kucherov Gregory
Nekrich Yakov
Publication venue
Publication date: 06/07/2013
Field of study

In this paper we describe a data structure that supports pattern matching queries on a dynamically arriving text over an alphabet ofconstant size. Each new symbol can be prepended to

T

in O(1) worst-case time. At any moment, we can report all occurrences of a pattern

P

in the current text in

O(|P|+k)

time, where

|P|

is the length of

P

and

k

is the number of occurrences. This resolves, under assumption of constant-size alphabet, a long-standing open problem of existence of a real-time indexing method for string matching (see \cite{AmirN08})

arXiv.org e-Print Archive

HAL Descartes

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Sublinear Space Algorithms for the Longest Common Substring Problem

Author: Kociumaka Tomasz
Starikovskaya Tatiana
Vildhøj Hjalte Wedel
Publication venue
Publication date: 01/01/2014
Field of study

Given

m

documents of total length

n

, we consider the problem of finding a longest string common to at least

d \geq 2

of the documents. This problem is known as the \emph{longest common substring (LCS) problem} and has a classic

O(n)

space and

O(n)

time solution (Weiner [FOCS'73], Hui [CPM'92]). However, the use of linear space is impractical in many applications. In this paper we show that for any trade-off parameter

1 \leq \tau \leq n

, the LCS problem can be solved in

O(\tau)

space and

O(n^2/\tau)

time, thus providing the first smooth deterministic time-space trade-off from constant to linear space. The result uses a new and very simple algorithm, which computes a

\tau

-additive approximation to the LCS in

O(n^2/\tau)

time and

O(1)

space. We also show a time-space trade-off lower bound for deterministic branching programs, which implies that any deterministic RAM algorithm solving the LCS problem on documents from a sufficiently large alphabet in

O(\tau)

space must use

\Omega(n\sqrt{\log(n/(\tau\log n))/\log\log(n/(\tau\log n)})

time.Comment: Accepted to 22nd European Symposium on Algorithm

arXiv.org e-Print Archive

Crossref

Online Research Database In Technology

Longest Common Extensions in Sublinear Space

Author: A Amir
D Gusfield
D Harel
EW Myers
G Manacher
GM Landau
GM Landau
GM Landau
MG Main
NJ Fine
P Bille
R Cole
R Kolpakov
RM Karp
Publication venue
Publication date: 01/01/2015
Field of study

The longest common extension problem (LCE problem) is to construct a data structure for an input string

T

of length

n

that supports LCE

(i,j)

queries. Such a query returns the length of the longest common prefix of the suffixes starting at positions

i

and

j

T

. This classic problem has a well-known solution that uses

O(n)

space and

O(1)

query time. In this paper we show that for any trade-off parameter

1 \leq \tau \leq n

, the problem can be solved in

O(\frac{n}{\tau})

space and

O(\tau)

query time. This significantly improves the previously best known time-space trade-offs, and almost matches the best known time-space product lower bound.Comment: An extended abstract of this paper has been accepted to CPM 201

arXiv.org e-Print Archive

Crossref

Copenhagen University Research Information System

Online Research Database In Technology

Dictionary matching in a stream

Author: A.V. Aho
A.Z. Broder
D. Breslauer
D. Breslauer
D.E. Knuth
M. Crochemore
M. Ružić
R. Clifford
R. Clifford
R. Clifford
R.M. Karp
Publication venue
Publication date: 01/01/2015
Field of study

We consider the problem of dictionary matching in a stream. Given a set of strings, known as a dictionary, and a stream of characters arriving one at a time, the task is to report each time some string in our dictionary occurs in the stream. We present a randomised algorithm which takes O(log log(k + m)) time per arriving character and uses O(k log m) words of space, where k is the number of strings in the dictionary and m is the length of the longest string in the dictionary

arXiv.org e-Print Archive

Crossref

Explore Bristol Research

Online Detection of Repetitions with Backtracking

Author: A Apostolico
D Breslauer
D Breslauer
H Leung
J Jansson
JJ Hong
M Crochemore
MG Main
Z Galil
Publication venue
Publication date: 01/01/2015
Field of study

In this paper we present two algorithms for the following problem: given a string and a rational

e > 1

, detect in the online fashion the earliest occurrence of a repetition of exponent

\ge e

in the string. 1. The first algorithm supports the backtrack operation removing the last letter of the input string. This solution runs in

O(n\log m)

time and

O(m)

space, where

m

is the maximal length of a string generated during the execution of a given sequence of

n

read and backtrack operations. 2. The second algorithm works in

O(n\log\sigma)

time and

O(n)

space, where

n

is the length of the input string and

\sigma

is the number of distinct letters. This algorithm is relatively simple and requires much less memory than the previously known solution with the same working time and space. a string generated during the execution of a given sequence of

n

read and backtrack operations.Comment: 12 pages, 5 figures, accepted to CPM 201

arXiv.org e-Print Archive

Crossref

Institutional repository of Ural Federal University named after the first President of Russia B.N.Yeltsin

Book announcements

Author: Sindicatura de Greuges de Barcelona
Publication venue: Published by Elsevier B.V.
Publication date: 15/05/1991
Field of study

Podeu consultar la versió en castellà a: http://hdl.handle.net/11703/10236

Elsevier - Publisher Connector

Repositori Obert de Coneixement de l'Ajuntament de Barcelona