23 research outputs found

Order preserving pattern matching on trees and DAGs

Authors: A Amir, I Simon, J Kim, K Park, M Dubiner, M Kubica, P Bose, RA Baeza-Yates, S Cho, S Faro, T Chhabra
Publication date: 25/07/2017

The order preserving pattern matching (OPPM) problem is, given a pattern string $p$ and a text string $t$, find all substrings of $t$ which have the same relative orders as $p$. In this paper, we consider two variants of the OPPM problem where a set of text strings is given as a tree or a DAG. We show that the OPPM problem for a single pattern $p$ of length $m$ and a text tree $T$ of size $N$ can be solved in $O(m+N)$ time if the characters of $p$ are drawn from an integer alphabet of polynomial size. The time complexity becomes $O(m \log m + N)$ if the pattern $p$ is over a general ordered alphabet. We then show that the OPPM problem for a single pattern and a text DAG is NP-complete.
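
To make the OPPM definition concrete, here is a minimal brute-force sketch for the plain string case only (not the tree or DAG variants, and not the linear-time algorithm from the paper). It assumes that "same relative orders" means equal dense-rank sequences, with ties preserved; the function names `dense_ranks` and `naive_oppm` are hypothetical and chosen for illustration.

```python
def dense_ranks(values):
    """Map each value to its rank among the distinct values (ties share a rank)."""
    order = {v: r for r, v in enumerate(sorted(set(values)))}
    return [order[v] for v in values]


def naive_oppm(text, pattern):
    """Return start indices of windows of `text` that are order-isomorphic to `pattern`.

    Brute force: re-ranks every window, so it costs O(n * m log m),
    far from the bounds quoted in the abstract.
    """
    m = len(pattern)
    target = dense_ranks(pattern)
    return [
        i
        for i in range(len(text) - m + 1)
        if dense_ranks(text[i:i + m]) == target
    ]


if __name__ == "__main__":
    # Pattern shape: low, high, middle.
    print(naive_oppm([10, 22, 15, 30, 20, 18, 27], [1, 3, 2]))
```

In the sample call, the windows `[10, 22, 15]` and `[15, 30, 20]` both follow the low, high, middle shape of the pattern, so the positions 0 and 2 are reported.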

arXiv.org e-Print Archive

Crossref

Duel and sweep algorithm for order-preserving pattern matching

Authors: A Amir, D Gusfield, DE Knuth, J Kim, M Crochemore, M Kubica, MM Hasan, R Cole, RN Horspool, RS Boyer, S Cho, S Faro, T Chhabra, U Vishkin
Publication date: 26/05/2017

Given a text $T$ and a pattern $P$ over alphabet $\Sigma$, the classic exact matching problem searches for all occurrences of pattern $P$ in text $T$. Unlike the exact matching problem, order-preserving pattern matching (OPPM) considers the relative order of elements, rather than their real values. In this paper, we propose an efficient algorithm for the OPPM problem using the "duel-and-sweep" paradigm. Our algorithm runs in $O(n + m\log m)$ time in general and $O(n + m)$ time under the assumption that the characters in a string can be sorted in linear time with respect to the string size. We also perform experiments and show that our algorithm is faster than the KMP-based algorithm. Last, we introduce two-dimensional order-preserving pattern matching and give a duel-and-sweep algorithm that runs in $O(n^2)$ time for the duel stage and $O(n^2 m)$ time for the sweep stage, with $O(m^3)$ preprocessing time.

Comment: 13 pages, 5 figures
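
The two-dimensional problem mentioned at the end of the abstract can be illustrated with a brute-force baseline. This is only a sketch under stated assumptions, not the duel-and-sweep algorithm: it assumes an $n \times n$ text and an $m \times m$ pattern (suggested by the $O(n^2)$ and $O(m^3)$ bounds), and it assumes that a window matches when its entries, read in row-major order, have the same dense ranks as the pattern. The names `dense_ranks` and `naive_oppm_2d` are hypothetical.

```python
def dense_ranks(values):
    """Map each value to its rank among the distinct values (ties share a rank)."""
    order = {v: r for r, v in enumerate(sorted(set(values)))}
    return [order[v] for v in values]


def naive_oppm_2d(text, pattern):
    """Return (row, col) corners of m x m windows order-isomorphic to `pattern`.

    `text` is a square list of lists (n x n) and `pattern` is m x m.
    Every window is flattened and re-ranked, so this is far slower than
    the bounds quoted in the abstract.
    """
    n, m = len(text), len(pattern)
    target = dense_ranks([x for row in pattern for x in row])
    hits = []
    for i in range(n - m + 1):
        for j in range(n - m + 1):
            window = [text[i + a][j + b] for a in range(m) for b in range(m)]
            if dense_ranks(window) == target:
                hits.append((i, j))
    return hits


if __name__ == "__main__":
    grid = [[1, 2, 3],
            [4, 5, 6],
            [9, 8, 7]]
    print(naive_oppm_2d(grid, [[10, 20], [30, 40]]))
```

In the sample grid, only the two windows in the top rows increase in row-major order like the pattern; the windows touching the reversed bottom row do not match.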

arXiv.org e-Print Archive

Crossref

Minimal Suffix and Rotation of a Substring in Optimal Time

Author: Kociumaka Tomasz
Publication date: 01/01/2016

For a text given in advance, the substring minimal suffix queries ask to determine the lexicographically minimal non-empty suffix of a substring specified by the location of its occurrence in the text. We develop a data structure answering such queries optimally: in constant time after linear-time preprocessing. This improves upon the results of Babenko et al. (CPM 2014), whose trade-off solution is characterized by $\Theta(n\log n)$ product of these time complexities. Next, we extend our queries to support concatenations of $O(1)$ substrings, for which the construction and query time is preserved. We apply these generalized queries to compute lexicographically minimal and maximal rotations of a given substring in constant time after linear-time preprocessing. Our data structures mainly rely on properties of Lyndon words and Lyndon factorizations. We combine them with further algorithmic and combinatorial tools, such as fusion trees and the notion of order isomorphism of strings.
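
As a rough illustration of why Lyndon factorizations are relevant to minimal suffix queries, the sketch below uses Duval's algorithm (a standard technique, swapped in here and not the data structure from the paper) to factorize a string, and relies on the known fact that the minimal non-empty suffix equals the last Lyndon factor. It recomputes everything per query in linear time, so it does not achieve the constant-time queries described above. The helper name `minimal_suffix_of_substring` and its half-open `(start, end)` convention are assumptions made for illustration.

```python
def lyndon_factorization(s):
    """Duval's algorithm: split `s` into a non-increasing sequence of Lyndon words."""
    n = len(s)
    i = 0
    factors = []
    while i < n:
        j, k = i + 1, i
        # Extend the current run while it stays a prefix of a power of a Lyndon word.
        while j < n and s[k] <= s[j]:
            k = i if s[k] < s[j] else k + 1
            j += 1
        # Emit every full copy of the Lyndon word of length j - k.
        while i <= k:
            factors.append(s[i:i + j - k])
            i += j - k
    return factors


def minimal_suffix_of_substring(text, start, end):
    """Minimal non-empty suffix of text[start:end], taken as its last Lyndon factor."""
    return lyndon_factorization(text[start:end])[-1]


if __name__ == "__main__":
    print(lyndon_factorization("banana"))
    print(minimal_suffix_of_substring("banana", 1, 5))
```

For `"banana"` the factorization is `b`, `an`, `an`, `a`, and the query on the substring `"anan"` returns `"an"`, which agrees with comparing all of its suffixes directly.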

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server