Palindromic Decompositions with Gaps and Errors

Author: A Apostolico
A Frid
D Breslauer
D Gusfield
D Kosolobov
DE Knuth
G Fici
G Manacher
M Crochemore
M Crochemore
M Rubinchik
R Kolpakov
S Gupta
T I
X Droubay
X Droubay
Y Fujishige
Z Galil
Publication date: 27/03/2017

Identifying palindromes in sequences has been an interesting line of research in combinatorics on words and also in computational biology, after the discovery of the relation of palindromes in the DNA sequence with HIV. Efficient algorithms for the factorization of sequences into palindromes and maximal palindromes have been devised in recent years. We extend these studies by allowing gaps in decompositions and errors in palindromes, and also by imposing a lower bound on the length of acceptable palindromes. We first present an algorithm for obtaining a palindromic decomposition of a string of length n with the minimal total gap length in time O(n log n * g) and space O(n g), where g is the number of allowed gaps in the decomposition. We then consider a decomposition of the string into maximal \delta-palindromes (i.e. palindromes with \delta errors under the edit or Hamming distance) and g allowed gaps. We present an algorithm to obtain such a decomposition with the minimal total gap length in time O(n (g + \delta)) and space O(n g). Comment: accepted to CSR 201
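The decomposition problem in the abstract above can be illustrated with a small dynamic program. This is only a minimal sketch under stated assumptions: it handles exact palindromes (no \delta errors), it runs in roughly O(n^2 g) time rather than the O(n log n * g) bound stated in the abstract, and the names `min_gap_decomposition`, `min_len`, `end_pal` and `end_gap` are hypothetical, not taken from the paper.

```python
def min_gap_decomposition(s, g, min_len=1):
    """Minimal total gap length when s is split into palindromes of
    length >= min_len and at most g gaps; None if impossible.
    Quadratic-time illustration, not the algorithm from the paper."""
    n = len(s)
    INF = float("inf")
    # pal[l][r] is True when s[l:r+1] is a palindrome.
    pal = [[False] * n for _ in range(n)]
    for l in range(n - 1, -1, -1):
        for r in range(l, n):
            pal[l][r] = s[l] == s[r] and (r - l < 2 or pal[l + 1][r - 1])
    # end_pal[j][i]: best cost for s[:i] with exactly j gaps, ending in a
    # palindrome (or empty). end_gap[j][i]: same, but ending inside gap j.
    end_pal = [[INF] * (n + 1) for _ in range(g + 1)]
    end_gap = [[INF] * (n + 1) for _ in range(g + 1)]
    end_pal[0][0] = 0
    for i in range(1, n + 1):
        for j in range(g + 1):
            if j >= 1:
                # Extend the current gap, or open a new one after a palindrome.
                end_gap[j][i] = 1 + min(end_gap[j][i - 1], end_pal[j - 1][i - 1])
            best = INF
            for l in range(0, i - min_len + 1):
                if pal[l][i - 1]:
                    best = min(best, end_pal[j][l], end_gap[j][l])
            end_pal[j][i] = best
    answer = min(min(end_pal[j][n], end_gap[j][n]) for j in range(g + 1))
    return None if answer == INF else answer
```

For example, with `min_len=2` the string "aaxbby" needs two gaps ("x" and "y") to reach a total gap length of 2; with a single allowed gap the best option is the palindrome "aa" followed by the gap "xbby".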

arXiv.org e-Print Archive

Crossref

Finding approximate palindromes in strings

Author: Alexandre H.L. Porto
Apostolico
Baeza-Yates
Bondy
Breslauer
Galil
Gusfield
Jurka
Knuth
Landau
Landau
Levenstein
Manacher
Myers
Sankoff
Stephen
Ukkonen
Ukkonen
Valmir C. Barbosa
Wu
Publication venue: 'Elsevier BV'
Publication date: 01/01/2002

We introduce a novel definition of approximate palindromes in strings, and provide an algorithm to find all maximal approximate palindromes in a string with up to k errors. Our definition is based on the usual edit operations of approximate pattern matching, and the algorithm we give, for a string of size n on a fixed alphabet, runs in O(k^2 n) time. We also discuss two implementation-related improvements to the algorithm, and demonstrate their efficacy in practice by means of both experiments and an average-case analysis.
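To make the notion of a maximal approximate palindrome concrete, here is a minimal sketch that expands around every center while tolerating up to k mismatches. It is a simplification, not the method of the paper: it counts substitutions only (Hamming distance) instead of the full set of edit operations, and it takes quadratic time in the worst case rather than O(k^2 n). The function name `max_hamming_palindromes` is hypothetical.

```python
def max_hamming_palindromes(s, k):
    """For each of the 2n-1 centers of s, return the half-open span
    (start, end) of the longest substring centered there that is a
    palindrome up to at most k mismatched symmetric pairs."""
    n = len(s)
    spans = []
    for c in range(2 * n - 1):
        # Even c: center on a character (lo == hi).
        # Odd c: center between two characters (hi == lo + 1).
        lo, hi = c // 2, (c + 1) // 2
        errors = 0
        while lo >= 0 and hi < n:
            if s[lo] != s[hi]:
                if errors == k:
                    break  # one more mismatch would exceed the budget
                errors += 1
            lo -= 1
            hi += 1
        spans.append((lo + 1, hi))  # s[lo+1:hi] is the accepted span
    return spans
```

With `k = 1`, the string "abca" is accepted as a whole from the center between "b" and "c", since only the pair ("b", "c") mismatches.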

arXiv.org e-Print Archive

CiteSeerX

Crossref

Palindrome Recognition In The Streaming Model

Author: Azer Erfan Sadeqi
Berenbrink Petra
Ergün Funda
Mallmann-Trenn Frederik
Publication date: 28/01/2016

In the Palindrome Problem one tries to find all palindromes (palindromic substrings) in a given string. A palindrome is defined as a string which reads forwards the same as backwards, e.g., the string "racecar". A related problem is the Longest Palindromic Substring Problem in which finding an arbitrary one of the longest palindromes in the given string suffices. We regard the streaming version of both problems. In the streaming model the input arrives over time and at every point in time we are only allowed to use sublinear space. The main algorithms in this paper are the following: The first one is a one-pass randomized algorithm that solves the Palindrome Problem. It has an additive error and uses O(\sqrt{n}) space. The second algorithm is a two-pass algorithm which determines the exact locations of all longest palindromes. It uses the first algorithm as the first pass. The third algorithm is again a one-pass randomized algorithm, which solves the Longest Palindromic Substring Problem. It has a multiplicative error using only O(\log(n)) space. We also give two variants of the first algorithm which solve other related practical problems.
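A common building block for small-space palindrome detection is a pair of Karp-Rabin style fingerprints that are updated as each character arrives. The sketch below is not one of the three algorithms from the abstract; it only reports which prefixes of the stream are palindromes, using a constant number of machine words of state. The modulus, the `base` parameter and the name `palindromic_prefix_lengths` are assumptions made for illustration, and with a random base a false positive is possible with small probability.

```python
import random

MOD = (1 << 61) - 1  # a Mersenne prime, assumed here as the modulus


def palindromic_prefix_lengths(stream, base=None):
    """Yield every m such that the first m characters of the stream
    form a palindrome (up to fingerprint collisions)."""
    if base is None:
        base = random.randrange(256, MOD)
    fwd = 0    # fingerprint of the prefix: sum of s[i] * base**i
    rev = 0    # fingerprint of the reversed prefix
    power = 1  # base**i for the next position i
    for m, ch in enumerate(stream, start=1):
        x = ord(ch)
        fwd = (fwd + x * power) % MOD
        rev = (rev * base + x) % MOD
        power = (power * base) % MOD
        if fwd == rev:
            yield m
```

For instance, `list(palindromic_prefix_lengths("abacaba", base=131))` reports the prefixes "a", "aba" and "abacaba".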

arXiv.org e-Print Archive

CiteSeerX

Comparing Degenerate Strings

Author: Alzamel M. (Mai)
Ayad L.A.K. (Lorraine)
Bernardini G. (Giulia)
Grossi R. (Roberto)
Iliopoulos C.S. (Costas)
Pisanti N. (Nadia)
Pissis S. (Solon)
Rosone G. (Giovanna)
Publication venue: 'IOS Press'
Publication date: 01/01/2020

Uncertain sequences are compact representations of sets of similar strings. They highlight common segments by collapsing them, and explicitly represent varying segments by listing all possible options. A generalized degenerate string (GD string) is a type of uncertain sequence. Formally, a GD string S is a sequence of n sets of strings of total size N, where the i-th set contains strings of the same length k_i but this length can vary between different sets. We denote by W the sum of these lengths k_0, k_1, ..., k_{n-1}. Our main result is an O(N + M)-time algorithm for deciding whether two GD strings of total sizes N and M, respectively, over an integer alphabet, have a non-empty intersection. This result is based on a combinatorial result of independent interest: although the intersection of two GD strings can be exponential in the total size of the two strings, it can be represented in linear space. We then apply our string comparison tool to devise a simple algorithm for computing all palindromes in S in O(min{W, n^2} N)-time. We complement this upper bound by showing a similar conditional lower bound for computing maximal palindromes in S. We also show that a result, which is essentially the same as our string comparison linear-time algorithm, can be obtained by employing an automata-based approach.
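The definition of a GD string and of the intersection question can be illustrated by brute force. The sketch below represents a GD string as a list of sets of equal-length strings (a representation assumed here, not prescribed by the paper) and expands every string it encodes. This takes exponential time in general, which is exactly what the linear-time algorithm in the abstract avoids. The names `gd_language` and `gd_intersect` are hypothetical.

```python
from itertools import product


def gd_language(gd):
    """All strings spelled by picking one option from each set in order."""
    return {"".join(parts) for parts in product(*gd)}


def gd_intersect(a, b):
    """True when the two GD strings encode at least one common string.
    Exponential brute force, for illustration only."""
    return bool(gd_language(a) & gd_language(b))


# Usage: both GD strings below can spell "abe".
a = [{"ab", "cd"}, {"e"}]
b = [{"a", "c"}, {"be", "xx"}]
print(gd_intersect(a, b))
```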

Crossref

CWI's Institutional Repository

INRIA a CCSD electronic archive server

Archivio della Ricerca - Università di Pisa

King's Research Portal

31st International Symposium on Theoretical Aspects of Computer Science: STACS '14, March 5th to March 8th, 2014, Lyon, France

Author: STACS <31 2014, Lyon>
Publication venue: Schloss Dagstuhl - Leibniz-Zentrum für Informatik
Publication date: 01/03/2014

Digitale Bibliothek Thüringen

Parallel Detection of all Palindromes in a String

Author: Apostolico Alberto
Breslauer Dany
Galil Zvi
Publication venue: 'Purdue University (bepress)'
Publication date: 01/01/1993

This paper presents two efficient concurrent-read concurrent-write parallel algorithms that find all palindromes in a given string: 1. An O(log n) time, n-processor algorithm over general alphabets. In case of constant size alphabets the algorithm requires only n / log n processors, and thus achieves an optimal speedup. 2. An O(log log n) time, (n log n / log log n)-processor algorithm over general alphabets. This is the fastest possible time with the number of processors used. These new results improve on the known parallel palindrome detection algorithms by using smaller auxiliary space and either by making fewer operations or by achieving a faster running time.

1 Introduction

Palindromes are symmetric strings that read the same forward and backward. Palindromes have been studied for centuries as word puzzles and more recently have found several important uses in formal languages and computability theory. Formally, a non-empty string w is a palindrome if w = w^R, where w^R denotes..

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

Purdue E-Pubs

Hal-Diderot