Search CORE

16 research outputs found

A Minimal Periods Algorithm with Applications

Author: A. Apostolico
A.O. Slisenko
A.S. Fraenkel
B. Schieber
D. Beauquier
D. Gusfield
D. Gusfield
D. Harel
D. Knuth
E.M. McCreight
J. Duval
J. Stoye
L. Ilie
M. Crochemore
M. Crochemore
M. Crochemore
M. Main
M. Main
M.G. Main
R. Kolpakov
S.R. Kosaraju
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/11/2009
Field of study

Kosaraju in ``Computation of squares in a string'' briefly described a linear-time algorithm for computing the minimal squares starting at each position in a word. Using the same construction of suffix trees, we generalize his result and describe in detail how to compute in O(k|w|)-time the minimal k-th power, with period of length larger than s, starting at each position in a word w for arbitrary exponent

k\geq2

and integer

s\geq0

. We provide the complete proof of correctness of the algorithm, which is somehow not completely clear in Kosaraju's original paper. The algorithm can be used as a sub-routine to detect certain types of pseudo-patterns in words, which is our original intention to study the generalization.Comment: 14 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

NTRFinder: a software tool to find nested tandem repeats

Author: A. A. Matroud
C. P. Tuffley
Domanic
Fu
Hauth
Landau
M. D. Hendy
Matroud
Sagot
Wells
Wexler
Woodford
Publication venue: Oxford University Press
Publication date
Field of study

We introduce the software tool NTRFinder to search for a complex repetitive structure in DNA we call a nested tandem repeat (NTR). An NTR is a recurrence of two or more distinct tandem motifs interspersed with each other. We propose that NTRs can be used as phylogenetic and population markers. We have tested our algorithm on both real and simulated data, and present some real NTRs of interest. NTRFinder can be downloaded from http://www.maths.otago.ac.nz/~aamatroud/

Crossref

PubMed Central

String matching problems over free partially commutative monoids

Author: Hashiguchi Kosaburo
Yamada Kazuya
Publication venue: Published by Elsevier Inc.
Publication date: 31/12/1992
Field of study

AbstractThis paper studies two string matching problems over free partially commutative monoids. We analyze these two problems in detail, and present two efficient polynomial time algorithms for solving them

Elsevier - Publisher Connector

Linear time algorithms for finding and representing all the tandem repeats in a string

Author: Gusfield Dan
Stoye Jens
Publication venue: 'Elsevier BV'
Publication date: 01/01/2004
Field of study

Gusfield D, Stoye J. Linear time algorithms for finding and representing all the tandem repeats in a string. Journal of computer and system sciences. 2004;69(4):525-546.A tandem repeat (or square) is a string [alpha][alpha], where [alpha] is a non-empty string. We present an O(|S|)-time algorithm that operates on the suffix tree T(S) for a string S, finding and marking the endpoint in T(S) of every tandem repeat that occurs in S. This decorated suffix tree implicitly represents all occurrences of tandem repeats in S, and can be used to efficiently solve many questions concerning tandem repeats and tandem arrays in S. This improves and generalizes several prior efforts to efficiently capture large subsets of tandem repeats

Elsevier - Publisher Connector

Publications at Bielefeld University

ANÁLISE E APLICAÇÃO DE ESTRUTURAS DE SUFIXOS NA RESOLUÇÃO DO STRING MATCHING

Author: Assis da Silva Francisco
Augusto Pazoti Mario
Henrique Santos Miranda Guilherme
Luiz de Almeida Leandro
Roberto Pereira Danillo
Publication venue: Universidade do Oeste Paulista - UNOESTE
Publication date: 21/05/2018
Field of study

String Matching é o problema que busca responder a seguinte pergunta: “É possível encontrar determinado padrão dentro de um texto?”. É um problema amplamente estudado na Ciência da Computação e também na Biologia Computacional, devido à existência de suas diferentes modificações em ferramentas de pesquisa e também no processamento de cadeias de DNA. Já existem algoritmos que alcançaram a solução ótima para responder a pergunta do problema, entretanto tais soluções não possuem a mesma eficiência nas extensões e variações do problema. Dessa forma, diversas pesquisas tem estudado estruturas de dados relativas aos sufixos do texto para alcançar soluções que sejam capazes de resolver variações complexas do string matching. O presente trabalho realiza um estudo e análise aprofundada sobre a eficiência de dessas estruturas: a árvore de sufixos e o autômato de sufixos. Algoritmos clássicos também são abordados e comparados às estruturas enquanto o trabalho é discorrido. As análises seguem critérios estatísticos, tempos de execução e complexidade de algoritmos para obter maior grau de confiança nos resultados

Unoeste: Revistas Colloquium / Colloquium Journals (Universidade do Oeste Paulista)

ANÁLISE E APLICAÇÃO DE ESTRUTURAS DE SUFIXOS NA RESOLUÇÃO DO STRING MATCHING

Author: Henrique Santos Miranda Guilherme
Luiz de Almeida Leandro
Roberto Pereira Danillo
Augusto Pazoti Mario
Assis da Silva Francisco
Publication venue: Universidade do Oeste Paulista - UNOESTE
Publication date: 01/01/2002
Field of study

Unoeste: Revistas Colloquium / Colloquium Journals (Universidade do Oeste Paulista)

VTT Research System

Frequent Patterns Algorithm of Biological Sequences based on Pattern Prefix-tree

Author: Lin Peng
Liu Shuang
Xie Fei
Xue Linyan
Zhang Xiaoke
Publication venue: Agora University Press
Publication date: 05/08/2019
Field of study

In the application of bioinformatics, the existing algorithms cannot be directly and efficiently implement sequence pattern mining. Two fast and efficient biological sequence pattern mining algorithms for biological single sequence and multiple sequences are proposed in this paper. The concept of the basic pattern is proposed, and on the basis of mining frequent basic patterns, the frequent pattern is excavated by constructing prefix trees for frequent basic patterns. The proposed algorithms implement rapid mining of frequent patterns of biological sequences based on pattern prefix trees. In experiment the family sequence data in the pfam protein database is used to verify the performance of the proposed algorithm. The prediction results confirm that the proposed algorithms can’t only obtain the mining results with effective biological significance, but also improve the running time efficiency of the biological sequence pattern mining

Agora University Editing House: Journals

An Optimal O(log log n) Time Parallel Algorithm for Detecting all Squares in a String

Author
Publication venue: 'Aarhus University Library'
Publication date
Field of study

Crossref