163 research outputs found
Maximal Closed Substrings
A string is closed if it has length 1 or has a nonempty border without internal occurrences. In this paper we introduce the definition of a maximal closed substring (MCS), which is an occurrence of a closed substring that cannot be extended to the left nor to the right into a longer closed substring. MCSs with exponent at least 2 are commonly called runs; those with exponent smaller than 2, instead, are particular cases of maximal gapped repeats. We show that a string of length n contains O(n1.5) MCSs. We also provide an output-sensitive algorithm that, given a string of length n over a constant-size alphabet, locates all m MCSs the string contains in O(nlog n+ m) time
A Characterization of Infinite LSP Words
G. Fici proved that a finite word has a minimal suffix automaton if and only
if all its left special factors occur as prefixes. He called LSP all finite and
infinite words having this latter property. We characterize here infinite LSP
words in terms of -adicity. More precisely we provide a finite set of
morphisms and an automaton such that an infinite word is LSP if
and only if it is -adic and all its directive words are recognizable by
Minimal Forbidden Factors of Circular Words
Minimal forbidden factors are a useful tool for investigating properties of
words and languages. Two factorial languages are distinct if and only if they
have different (antifactorial) sets of minimal forbidden factors. There exist
algorithms for computing the minimal forbidden factors of a word, as well as of
a regular factorial language. Conversely, Crochemore et al. [IPL, 1998] gave an
algorithm that, given the trie recognizing a finite antifactorial language ,
computes a DFA recognizing the language whose set of minimal forbidden factors
is . In the same paper, they showed that the obtained DFA is minimal if the
input trie recognizes the minimal forbidden factors of a single word. We
generalize this result to the case of a circular word. We discuss several
combinatorial properties of the minimal forbidden factors of a circular word.
As a byproduct, we obtain a formal definition of the factor automaton of a
circular word. Finally, we investigate the case of minimal forbidden factors of
the circular Fibonacci words.Comment: To appear in Theoretical Computer Scienc
Timing of Millisecond Pulsars in NGC 6752: Evidence for a High Mass-to-Light Ratio in the Cluster Core
Using pulse timing observations we have obtained precise parameters,
including positions with about 20 mas accuracy, of five millisecond pulsars in
NGC 6752. Three of them, located relatively close to the cluster center, have
line-of-sight accelerations larger than the maximum value predicted by the
central mass density derived from optical observation, providing dynamical
evidence for a central mass-to-light ratio >~ 10, much higher than for any
other globular cluster. It is likely that the other two millisecond pulsars
have been ejected out of the core to their present locations at 1.4 and 3.3
half-mass radii, respectively, suggesting unusual non-thermal dynamics in the
cluster core.Comment: Accepted by ApJ Letter. 5 pages, 2 figures, 1 tabl
A Characterization of Bispecial Sturmian Words
A finite Sturmian word w over the alphabet {a,b} is left special (resp. right
special) if aw and bw (resp. wa and wb) are both Sturmian words. A bispecial
Sturmian word is a Sturmian word that is both left and right special. We show
as a main result that bispecial Sturmian words are exactly the maximal internal
factors of Christoffel words, that are words coding the digital approximations
of segments in the Euclidean plane. This result is an extension of the known
relation between central words and primitive Christoffel words. Our
characterization allows us to give an enumerative formula for bispecial
Sturmian words. We also investigate the minimal forbidden words for the set of
Sturmian words.Comment: Accepted to MFCS 201
Words with the Maximum Number of Abelian Squares
An abelian square is the concatenation of two words that are anagrams of one
another. A word of length can contain distinct factors that
are abelian squares. We study infinite words such that the number of abelian
square factors of length grows quadratically with .Comment: To appear in the proceedings of WORDS 201
Palindromic Decompositions with Gaps and Errors
Identifying palindromes in sequences has been an interesting line of research
in combinatorics on words and also in computational biology, after the
discovery of the relation of palindromes in the DNA sequence with the HIV
virus. Efficient algorithms for the factorization of sequences into palindromes
and maximal palindromes have been devised in recent years. We extend these
studies by allowing gaps in decompositions and errors in palindromes, and also
imposing a lower bound to the length of acceptable palindromes.
We first present an algorithm for obtaining a palindromic decomposition of a
string of length n with the minimal total gap length in time O(n log n * g) and
space O(n g), where g is the number of allowed gaps in the decomposition. We
then consider a decomposition of the string in maximal \delta-palindromes (i.e.
palindromes with \delta errors under the edit or Hamming distance) and g
allowed gaps. We present an algorithm to obtain such a decomposition with the
minimal total gap length in time O(n (g + \delta)) and space O(n g).Comment: accepted to CSR 201
Vertebrate-mediated seed rain and artificial perches contribute to overcome seed dispersal limitation in a Mediterranean old field
Natural regeneration of vegetation is a frequent outcome of land abandonment, although the rate and diversity of such regeneration may be severely restricted by seed dispersal limitation, among other factors. In spite of this, studies aiming to quantify seed rain and test methods to enhance it, such as artificial perches, are still underrepresented in the Mediterranean. In our study, we quantified seed rain density and richness and tested the effects of artificial perches on such rain over a distance gradient on seven Mediterranean island old fields. In each of the seven sites, we positioned three sampling stations, each consisting of 1 seed trap under an artificial perch and 1 as a control on the ground, distributed at 30, 60, and 90 m from natural vegetation remnant. All traps received seeds, suggesting no overall dispersal limitation. Of the 11 seed species found, 10 were fleshy-fruited and dispersed by vertebrates. Seed traps under perches received significantly higher seed rain of fleshy-fruited species dispersed by birds, while ground traps received significantly more seeds of the species also dispersed by mammals, especially Rubus ulmifolius. The distance from the seed source was nonsignificant in all cases. Our study demonstrates the key role of vertebrate-mediated seed dispersal services to overcome dispersal limitation in old fields, as well as the effective contribution of even small artificial perches in contrasting such limitation. The lack of differences over the distance gradient reveal that the upper spatial limit of dispersal limitation was not achieved
Algorithms for Jumbled Pattern Matching in Strings
The Parikh vector p(s) of a string s is defined as the vector of
multiplicities of the characters. Parikh vector q occurs in s if s has a
substring t with p(t)=q. We present two novel algorithms for searching for a
query q in a text s. One solves the decision problem over a binary text in
constant time, using a linear size index of the text. The second algorithm, for
a general finite alphabet, finds all occurrences of a given Parikh vector q and
has sub-linear expected time complexity; we present two variants, which both
use a linear size index of the text.Comment: 18 pages, 9 figures; article accepted for publication in the
International Journal of Foundations of Computer Scienc
- âŠ