Search CORE

6 research outputs found

Cartesian Tree Matching and Indexing

Author: Amir Amihood
Landau Gad M.
Park Kunsoo
Park Sung Gwan
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 30th Annual Symposium on Combinatorial Pattern Matching (CPM 2019)
Publication date: 01/01/2019
Field of study

We introduce a new metric of match, called Cartesian tree matching, which means that two strings match if they have the same Cartesian trees. Based on Cartesian tree matching, we define single pattern matching for a text of length n and a pattern of length m, and multiple pattern matching for a text of length n and k patterns of total length m. We present an O(n+m) time algorithm for single pattern matching, and an O((n+m) log k) deterministic time or O(n+m) randomized time algorithm for multiple pattern matching. We also define an index data structure called Cartesian suffix tree, and present an O(n) randomized time algorithm to build the Cartesian suffix tree. Our efficient algorithms for Cartesian tree matching use a representation of the Cartesian tree, called the parent-distance representation

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Sufficient Conditions for Efficient Indexing Under Different Matchings

Author: Amir Amihood
Kondratovsky Eitan
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 30th Annual Symposium on Combinatorial Pattern Matching (CPM 2019)
Publication date: 01/01/2019
Field of study

The most important task derived from the massive digital data accumulation in the world, is efficient access to this data, hence the importance of indexing. In the last decade, many different types of matching relations were defined, each requiring an efficient indexing scheme. Cole and Hariharan in a ground breaking paper [Cole and Hariharan, SIAM J. Comput., 33(1):26-42, 2003], formulate sufficient conditions for building an efficient indexing for quasi-suffix collections, collections that behave as suffixes. It was shown that known matchings, including parameterized, 2-D array and order preserving matchings, fit their indexing settings. In this paper, we formulate more basic sufficient conditions based on the order relation derived from the matching relation itself, our conditions are more general than the previously known conditions

Dagstuhl Research Online Publication Server

On Indeterminate Strings Matching

Author: Gawrychowski Pawe?
Ghazawi Samah
Landau Gad M.
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 31st Annual Symposium on Combinatorial Pattern Matching (CPM 2020)
Publication date: 01/01/2020
Field of study

Dagstuhl Research Online Publication Server

Computing Covers under Substring Consistent Equivalence Relations

Author: A Amir
A Amir
A Amir
A Apostolico
A Apostolico
A Apostolico
BS Baker
C Iliopoulos
CS Iliopoulos
D Breslauer
D Moore
D Moore
DE Knuth
G Gourdel
GS Brodal
J Kim
M Christou
M Christou
M Kubica
T Ehlers
Y Li
Y Matsuoka
Publication venue
Publication date: 30/07/2020
Field of study

Covers are a kind of quasiperiodicity in strings. A string

C

is a cover of another string

T

if any position of

T

is inside some occurrence of

C

T

. The shortest and longest cover arrays of

T

have the lengths of the shortest and longest covers of each prefix of

T

, respectively. The literature has proposed linear-time algorithms computing longest and shortest cover arrays taking border arrays as input. An equivalence relation

\approx

over strings is called a substring consistent equivalence relation (SCER) iff

X \approx Y

implies (1)

|X| = |Y|

and (2)

X[i:j] \approx Y[i:j]

for all

1 \le i \le j \le |X|

. In this paper, we generalize the notion of covers for SCERs and prove that existing algorithms to compute the shortest cover array and the longest cover array of a string

T

under the identity relation will work for any SCERs taking the accordingly generalized border arrays.Comment: 16 page

arXiv.org e-Print Archive

Crossref