Search CORE

4,203 research outputs found

Sublinear Space Algorithms for the Longest Common Substring Problem

Author: Kociumaka Tomasz
Starikovskaya Tatiana
Vildhøj Hjalte Wedel
Publication venue
Publication date: 01/01/2014
Field of study

Given

m

documents of total length

n

, we consider the problem of finding a longest string common to at least

d \geq 2

of the documents. This problem is known as the \emph{longest common substring (LCS) problem} and has a classic

O(n)

space and

O(n)

time solution (Weiner [FOCS'73], Hui [CPM'92]). However, the use of linear space is impractical in many applications. In this paper we show that for any trade-off parameter

1 \leq \tau \leq n

, the LCS problem can be solved in

O(\tau)

space and

O(n^2/\tau)

time, thus providing the first smooth deterministic time-space trade-off from constant to linear space. The result uses a new and very simple algorithm, which computes a

\tau

-additive approximation to the LCS in

O(n^2/\tau)

time and

O(1)

space. We also show a time-space trade-off lower bound for deterministic branching programs, which implies that any deterministic RAM algorithm solving the LCS problem on documents from a sufficiently large alphabet in

O(\tau)

space must use

\Omega(n\sqrt{\log(n/(\tau\log n))/\log\log(n/(\tau\log n)})

time.Comment: Accepted to 22nd European Symposium on Algorithm

arXiv.org e-Print Archive

Crossref

Online Research Database In Technology

An Algorithm for the Longest Common Subsequence and Substring Problem

Author: Deka J.
Deka K.
Li R.
Publication venue
Publication date: 01/08/2023
Field of study

In this note, we first introduce a new problem called the longest common subsequence and substring problem. Let

X

and

Y

be two strings over an alphabet

\Sigma

. The longest common subsequence and substring problem for

X

and

Y

is to find the longest string which is a subsequence of

X

and a substring of

Y

. We propose an algorithm to solve the problem

arXiv.org e-Print Archive

The substring inclusion constraint longest common subsequence problem can be solved in quadratic time

Author: Alam Muhammad Rashed
Sohel Rahman M.
Publication venue: Elsevier B.V.
Publication date: 31/12/2012
Field of study

AbstractIn this paper, we study some variants of the Constrained Longest Common Subsequence (CLCS) problem, namely, the substring inclusion CLCS (Substring-IC-CLCS) problem and a generalized version thereof. In the Substring-IC-CLCS problem, we are to find a longest common subsequence (LCS) of two given strings containing a third constraint string (given) as a substring. Previous solution to this problem runs in cubic time, i.e, O(nmk) time, where n,m and k are the length of the 3 input strings. In this paper, we present simple O(nm) time algorithms to solve the Substring-IC-CLCS problem. We also study the Generalized Substring-IC-LCS problem where we are given two strings of length n and m respectively and an ordered list of p strings and the goal is to find an LCS containing each of them as a substring in the order they appear in the list. We present an O(nmp) algorithm for this generalized version of the problem

Elsevier - Publisher Connector

An Algorithm for the Constrained Longest Common Subsequence and Substring Problem

Author: Deka J.
Deka K.
Li D.
Li R.
Publication venue
Publication date: 02/08/2023
Field of study

Let

\Sigma

be an alphabet. For two strings

X

Y

, and a constrained string

P

over the alphabet

\Sigma

, the constrained longest common subsequence and substring problem for two strings

X

and

Y

with respect to

P

is to find a longest string

Z

which is a subsequence of

X

, a substring of

Y

, and has

P

as a subsequence. In this paper, we propose an algorithm for the constrained longest common subsequence and substring problem for two strings with a constrained string.Comment: arXiv admin note: text overlap with arXiv:2308.0092

arXiv.org e-Print Archive

Longest common substring with approximately k mismatches

Author: Starikovskaia Tatiana
Publication venue
Publication date: 01/01/2016
Field of study

In the longest common substring problem we are given two strings of length n and must find a substring of maximal length that occurs in both strings. It is well-known that the problem can be solved in linear time, but the solution is not robust and can vary greatly when the input strings are changed even by one letter. To circumvent this, Leimester and Morgenstern introduced the problem of the longest common substring with k mismatches. Lately, this problem has received a lot of attention in the literature, and several algorithms have been suggested. The running time of these algorithms is n^{2-o(1)}, and unfortunately, conditional lower bounds have been shown which imply that there is little hope to improve this bound. In this paper we study a different but closely related problem of the longest common substring with approximately k mismatches and use computational geometry techniques to show that it admits a randomised solution with strongly subquadratic running time

INRIA a CCSD electronic archive server

Dagstuhl Research Online Publication Server

Explore Bristol Research

Longest common substrings with k mismatches

Author: Flouri Tomas
Giaquinta Emanuele
Kobert Kassian
Ukkonen Esko
Publication venue
Publication date: 01/01/2015
Field of study

The longest common substring with k-mismatches problem is to find, given two strings S-1 and S-2, a longest substring A(1) of S-1 and A(2) of S-2 such that the Hamming distance between A(1) and A(2) isPeer reviewe

arXiv.org e-Print Archive

Elsevier - Publisher Connector

Crossref

Aaltodoc Publication Archive

Helsingin yliopiston digitaalinen arkisto