33,740 research outputs found

The complexity of the Multiple Pattern Matching Problem for random strings

Author: Bassino Frédérique
Rakotoarimalala Tsinjo
Sportiello Andrea
Publication date: 01/01/2017

We generalise a multiple string pattern matching algorithm, recently proposed by Fredriksson and Grabowski [J. Discr. Alg. 7, 2009], to deal with arbitrary dictionaries on an alphabet of size $s$. If $r_m$ is the number of words of length $m$ in the dictionary, and $\phi(r) = \max_m \ln(s\, m\, r_m)/m$, the complexity rate for the string characters to be read by this algorithm is at most $\kappa_{{}_\textrm{UB}}\, \phi(r)$ for some constant $\kappa_{{}_\textrm{UB}}$. On the other hand, we generalise the classical lower bound of Yao [SIAM J. Comput. 8, 1979], for the problem with a single pattern, to deal with arbitrary dictionaries, and determine it to be at least $\kappa_{{}_\textrm{LB}}\, \phi(r)$. This proves the optimality of the algorithm, improving and correcting previous claims.

Comment: 25 pages, 4 figures
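
To make the quantity $\phi(r)$ in the abstract above concrete, the following minimal sketch evaluates $\max_m \ln(s\, m\, r_m)/m$ for a small dictionary. The function name `phi`, the parameter `alphabet_size`, and the sample words are illustrative assumptions, not taken from the paper; the sketch only evaluates the formula and does not implement the matching algorithm itself.

```python
import math
from collections import Counter


def phi(dictionary, alphabet_size):
    """Evaluate max over word lengths m of ln(s * m * r_m) / m.

    r_m is the number of dictionary words of length m and s is the
    alphabet size. Assumes every word is non-empty (hypothetical helper).
    """
    # r_m for each word length m that occurs in the dictionary
    counts = Counter(len(word) for word in dictionary)
    return max(
        math.log(alphabet_size * m * r_m) / m
        for m, r_m in counts.items()
    )


# Illustrative dictionary over the binary alphabet {a, b}: r_2 = 2, r_4 = 3
words = ["ab", "ba", "abba", "baab", "abab"]
print(phi(words, alphabet_size=2))
```

In this example the length-2 words dominate the maximum, since $\ln(2 \cdot 2 \cdot 2)/2$ exceeds $\ln(2 \cdot 4 \cdot 3)/4$.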

arXiv.org e-Print Archive

The Complexity of the Approximate Multiple Pattern Matching Problem for Random Strings

Author: Rakotoarimalala Tsinjo
Sportiello Andrea
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 31st International Conference on Probabilistic, Combinatorial and Asymptotic Methods for the Analysis of Algorithms (AofA 2020)
Publication date: 01/01/2020

We describe a multiple string pattern matching algorithm which is well-suited for approximate search and dictionaries composed of words of different lengths. We prove that this algorithm has optimal complexity rate up to a multiplicative constant, for arbitrary dictionaries. This extends to arbitrary dictionaries the classical results of Yao [SIAM J. Comput. 8, 1979], and Chang and Marr [Proc. CPM94, 1994].

Dagstuhl Research Online Publication Server

HAL-Paris 13

Hal-Diderot

Oskar Bordeaux

String Matching: Communication, Circuits, and Learning

Author: Golovnev Alexander
Reichman Daniel
Shinkar Igor
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2019)
Publication date: 01/01/2019

String matching is the problem of deciding whether a given n-bit string contains a given k-bit pattern. We study the complexity of this problem in three settings.
- Communication complexity. For small k, we provide near-optimal upper and lower bounds on the communication complexity of string matching. For large k, our bounds leave open an exponential gap; we exhibit some evidence for the existence of a better protocol.
- Circuit complexity. We present several upper and lower bounds on the size of circuits with threshold and DeMorgan gates solving the string matching problem. Similarly to the above, our bounds are near-optimal for small k.
- Learning. We consider the problem of learning a hidden pattern of length at most k relative to the classifier that assigns 1 to every string that contains the pattern. We prove optimal bounds on the VC dimension and sample complexity of this problem.
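
The decision problem and the classifier described in the abstract above can be sketched as follows. This only illustrates the definitions; it says nothing about the communication, circuit, or learning bounds of the paper. The names `contains_pattern` and `make_classifier` and the sample bit strings are assumptions chosen for illustration.

```python
def contains_pattern(text: str, pattern: str) -> bool:
    """Decide whether the n-bit string `text` contains the k-bit `pattern`."""
    return pattern in text


def make_classifier(pattern: str):
    """Return the classifier that assigns 1 to every string containing
    the hidden pattern and 0 to every other string."""
    def classify(text: str) -> int:
        return 1 if pattern in text else 0
    return classify


# Illustrative use with a hidden 3-bit pattern
classify = make_classifier("101")
print(classify("0110100"))  # "101" occurs starting at index 2
print(classify("0000000"))
```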

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Quantum pattern matching fast on average

Author: Montanaro Ashley
Publication date: 26/08/2015

The $d$-dimensional pattern matching problem is to find an occurrence of a pattern of length $m \times \dots \times m$ within a text of length $n \times \dots \times n$, with $n \ge m$. This task models various problems in text and image processing, among other application areas. This work describes a quantum algorithm which solves the pattern matching problem for random patterns and texts in time $\widetilde{O}((n/m)^{d/2} 2^{O(d^{3/2}\sqrt{\log m})})$. For large $m$ this is super-polynomially faster than the best possible classical algorithm, which requires time $\widetilde{\Omega}( (n/m)^d + n^{d/2} )$. The algorithm is based on the use of a quantum subroutine for finding hidden shifts in $d$ dimensions, which is a variant of algorithms proposed by Kuperberg.

Comment: 22 pages, 2 figures; v3: further minor changes, essentially published version

arXiv.org e-Print Archive

CiteSeerX

Average-Case Optimal Approximate Circular String Matching

Author: CS Iliopoulos
E Ukkonen
F Fernandes
GM Landau
K Fredriksson
P-H Hsu
T Hirvola
T Lee
WI Chang
Publication date: 24/02/2015

Approximate string matching is the problem of finding all factors of a text t of length n that are at a distance at most k from a pattern x of length m. Approximate circular string matching is the problem of finding all factors of t that are at a distance at most k from x or from any of its rotations. In this article, we present a new algorithm for approximate circular string matching under the edit distance model with optimal average-case search time O(n(k + log m)/m). Optimal average-case search time can also be achieved by the algorithms for multiple approximate string matching (Fredriksson and Navarro, 2004) using x and its rotations as the set of multiple patterns. Here we reduce the preprocessing time and space requirements compared to that approach.
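
As a point of reference for the problem statement above, here is a naive baseline that runs the standard dynamic-programming recurrence for approximate matching under edit distance once per rotation of x. It is not the algorithm of the article: it takes O(n m^2) time rather than the average-case optimal O(n(k + log m)/m). The function names and the example strings are assumptions made for illustration.

```python
def approx_end_positions(text, pattern, k):
    """Return the set of 1-based end positions j such that some factor of
    `text` ending at j is within edit distance k of `pattern`."""
    m = len(pattern)
    prev = list(range(m + 1))  # column for the empty text prefix
    hits = set()
    for j, ch in enumerate(text, start=1):
        cur = [0] * (m + 1)  # cur[0] = 0: a match may start anywhere
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == ch else 1
            cur[i] = min(
                prev[i - 1] + cost,  # match or substitution
                prev[i] + 1,         # extra text character
                cur[i - 1] + 1,      # missing pattern character
            )
        if cur[m] <= k:
            hits.add(j)
        prev = cur
    return hits


def approx_circular_end_positions(text, pattern, k):
    """Union of the approximate matches of every rotation of `pattern`."""
    hits = set()
    for r in range(len(pattern)):
        rotation = pattern[r:] + pattern[:r]
        hits |= approx_end_positions(text, rotation, k)
    return sorted(hits)


# The rotation "bca" of "abc" occurs exactly in the text, ending at position 5
print(approx_circular_end_positions("xxbcaxx", "abc", 0))
```

With k = 0 the pattern "abc" itself has no occurrence in the example text, so the match is found only because rotations are considered.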

arXiv.org e-Print Archive

CiteSeerX

Crossref

King's Research Portal