Search CORE

5,062 research outputs found

Fast Algorithm for Partial Covers in Words

Author: A. Apostolico
A. Apostolico
A. Apostolico
A.S. Fraenkel
D. Breslauer
D. Gusfield
D. Moore
E. Ukkonen
G.S. Brodal
G.S. Brodal
J.S. Sim
M. Crochemore
M.R. Brown
Y. Li
Publication venue
Publication date: 01/01/2013
Field of study

A factor

u

of a word

w

is a cover of

w

if every position in

w

lies within some occurrence of

u

w

. A word

w

covered by

u

thus generalizes the idea of a repetition, that is, a word composed of exact concatenations of

u

. In this article we introduce a new notion of

\alpha

-partial cover, which can be viewed as a relaxed variant of cover, that is, a factor covering at least

\alpha

positions in

w

. We develop a data structure of

O(n)

size (where

n=|w|

) that can be constructed in

O(n\log n)

time which we apply to compute all shortest

\alpha

-partial covers for a given

\alpha

. We also employ it for an

O(n\log n)

-time algorithm computing a shortest

\alpha

-partial cover for each

\alpha=1,2,\ldots,n

arXiv.org e-Print Archive

Springer - Publisher Connector

King's Research Portal

Efficient Seeds Computation Revisited

Author: A. Apostolico
C.S. Iliopoulos
D. Breslauer
G.S. Brodal
J. Fischer
K. Sadakane
M. Crochemore
M. Crochemore
M. Crochemore
O. Berkman
Y. Li
Publication venue
Publication date: 01/01/2011
Field of study

The notion of the cover is a generalization of a period of a string, and there are linear time algorithms for finding the shortest cover. The seed is a more complicated generalization of periodicity, it is a cover of a superstring of a given string, and the shortest seed problem is of much higher algorithmic difficulty. The problem is not well understood, no linear time algorithm is known. In the paper we give linear time algorithms for some of its versions --- computing shortest left-seed array, longest left-seed array and checking for seeds of a given length. The algorithm for the last problem is used to compute the seed array of a string (i.e., the shortest seeds for all the prefixes of the string) in

O(n^2)

time. We describe also a simpler alternative algorithm computing efficiently the shortest seeds. As a by-product we obtain an

O(n\log{(n/m)})

time algorithm checking if the shortest seed has length at least

m

and finding the corresponding seed. We also correct some important details missing in the previously known shortest-seed algorithm (Iliopoulos et al., 1996).Comment: 14 pages, accepted to CPM 201

arXiv.org e-Print Archive

CiteSeerX

King's Research Portal

Hal-Diderot

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Internal Quasiperiod Queries

Author: Crochemore Maxime
Iliopoulos Costas
Radoszewski Jakub
Rytter Wojciech
Straszyński Juliusz
Waleń Tomasz
Zuba Wiktor
Publication venue
Publication date: 01/01/2020
Field of study

Internal pattern matching requires one to answer queries about factors of a given string. Many results are known on answering internal period queries, asking for the periods of a given factor. In this paper we investigate (for the first time) internal queries asking for covers (also known as quasiperiods) of a given factor. We propose a data structure that answers such queries in

O(\log n \log \log n)

time for the shortest cover and in

O(\log n (\log \log n)^2)

time for a representation of all the covers, after

O(n \log n)

time and space preprocessing.Comment: To appear in the SPIRE 2020 proceeding

arXiv.org e-Print Archive

King's Research Portal

Searching of gapped repeats and subrepetitions in a word

Author: D. Gusfield
G. Brodal
J. Storer
M. Crochemore
M. Crochemore
M. Crochemore
M. Crochemore
P. Emde Boas van
R. Kolpakov
R. Kolpakov
R. Kolpakov
T. Kociumaka
Z. Galil
Publication venue
Publication date: 29/09/2013
Field of study

A gapped repeat is a factor of the form

uvu

where

u

and

v

are nonempty words. The period of the gapped repeat is defined as

|u|+|v|

. The gapped repeat is maximal if it cannot be extended to the left or to the right by at least one letter with preserving its period. The gapped repeat is called

\alpha

-gapped if its period is not greater than

\alpha |v|

. A

\delta

-subrepetition is a factor which exponent is less than 2 but is not less than

1+\delta

(the exponent of the factor is the quotient of the length and the minimal period of the factor). The

\delta

-subrepetition is maximal if it cannot be extended to the left or to the right by at least one letter with preserving its minimal period. We reveal a close relation between maximal gapped repeats and maximal subrepetitions. Moreover, we show that in a word of length

n

the number of maximal

\alpha

-gapped repeats is bounded by

O(\alpha^2n)

and the number of maximal

\delta

-subrepetitions is bounded by

O(n/\delta^2)

. Using the obtained upper bounds, we propose algorithms for finding all maximal

\alpha

-gapped repeats and all maximal

\delta

-subrepetitions in a word of length

n

. The algorithm for finding all maximal

\alpha

-gapped repeats has

O(\alpha^2n)

time complexity for the case of constant alphabet size and

O(n\log n + \alpha^2n)

time complexity for the general case. For finding all maximal

\delta

-subrepetitions we propose two algorithms. The first algorithm has

O(\frac{n\log\log n}{\delta^2})

time complexity for the case of constant alphabet size and

O(n\log n +\frac{n\log\log n}{\delta^2})

time complexity for the general case. The second algorithm has

O(n\log n+\frac{n}{\delta^2}\log \frac{1}{\delta})

expected time complexity

arXiv.org e-Print Archive

A_{n-1} singularities and nKdV hierarchies

Author: Givental Alexander
Publication venue
Publication date: 01/01/2003
Field of study

According to a conjecture of E. Witten proved by M. Kontsevich, a certain generating function for intersection indices on the Deligne -- Mumford moduli spaces of Riemann surfaces coincides with a certain tau-function of the KdV hierarchy. The generating function is naturally generalized under the name the {\em total descendent potential} in the theory of Gromov -- Witten invariants of symplectic manifolds. The papers arXiv: math.AG/0108100 and arXive: math.DG/0108160 contain two equivalent constructions, motivated by some results in Gromov -- Witten theory, which associate a total descendent potential to any semisimple Frobenius structure. In this paper, we prove that in the case of K.Saito's Frobenius structure on the miniversal deformation of the

A_{n-1}

-singularity, the total descendent potential is a tau-function of the

n

KdV hierarchy. We derive this result from a more general construction for solutions of the

n

KdV hierarchy from

n-1

solutions of the KdV hierarchy.Comment: 29 pages, to appear in Moscow Mathematical Journa

arXiv.org e-Print Archive

Strings on Celestial Sphere

Author: Stieberger Stephan
Taylor Tomasz R.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018
Field of study

We transform superstring scattering amplitudes into the correlation functions of primary conformal fields on two-dimensional celestial sphere. The points on celestial sphere are associated to the asymptotic directions of (light-like) momenta of external particles, with the Lorentz group realized as the SL(2,C) conformal symmetry of the sphere. The energies are dualized through Mellin transforms into the parameters that determine dimensions of the primaries. We focus on four-point amplitudes involving gauge bosons and gravitons in type I open superstring theory and in closed heterotic superstring theory at the tree-level.Comment: 28 pages, harvmac; v2: added Appendix A, final version to appear in Nucl. Phys.

arXiv.org e-Print Archive

Directory of Open Access Journals

The Number of Repetitions in 2D-Strings

Author: Charalampopoulos Panagiotis
Radoszewski Jakub
Rytter Wojciech
Wale? Tomasz
Zuba Wiktor
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 28th Annual European Symposium on Algorithms (ESA 2020)
Publication date: 01/01/2020
Field of study

The notions of periodicity and repetitions in strings, and hence these of runs and squares, naturally extend to two-dimensional strings. We consider two types of repetitions in 2D-strings: 2D-runs and quartics (quartics are a 2D-version of squares in standard strings). Amir et al. introduced 2D-runs, showed that there are

O(n^3)

of them in an

n \times n

2D-string and presented a simple construction giving a lower bound of

\Omega(n^2)

for their number (TCS 2020). We make a significant step towards closing the gap between these bounds by showing that the number of 2D-runs in an

n \times n

2D-string is

O(n^2 \log^2 n)

. In particular, our bound implies that the

O(n^2\log n + \textsf{output})

run-time of the algorithm of Amir et al. for computing 2D-runs is also

O(n^2 \log^2 n)

. We expect this result to allow for exploiting 2D-runs algorithmically in the area of 2D pattern matching. A quartic is a 2D-string composed of

2 \times 2

identical blocks (2D-strings) that was introduced by Apostolico and Brimkov (TCS 2000), where by quartics they meant only primitively rooted quartics, i.e. built of a primitive block. Here our notion of quartics is more general and analogous to that of squares in 1D-strings. Apostolico and Brimkov showed that there are

O(n^2 \log^2 n)

occurrences of primitively rooted quartics in an

n \times n

2D-string and that this bound is attainable. Consequently the number of distinct primitively rooted quartics is

O(n^2 \log^2 n)

. Here, we prove that the number of distinct general quartics is also

O(n^2 \log^2 n)

. This extends the rich combinatorial study of the number of distinct squares in a 1D-string, that was initiated by Fraenkel and Simpson (J. Comb. Theory A 1998), to two dimensions. Finally, we show some algorithmic applications of 2D-runs. (Abstract shortened due to arXiv requirements.)Comment: To appear in the ESA 2020 proceeding

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server