51 research outputs found
Formal Properties of XML Grammars and Languages
XML documents are described by a document type definition (DTD). An
XML-grammar is a formal grammar that captures the syntactic features of a DTD.
We investigate properties of this family of grammars. We show that every
XML-language basically has a unique XML-grammar. We give two characterizations
of languages generated by XML-grammars, one is set-theoretic, the other is by a
kind of saturation property. We investigate decidability problems and prove
that some properties that are undecidable for general context-free languages
become decidable for XML-languages. We also characterize those XML-grammars
that generate regular XML-languages.Comment: 24 page
Transfinite Lyndon words
In this paper, we extend the notion of Lyndon word to transfinite words. We
prove two main results. We first show that, given a transfinite word, there
exists a unique factorization in Lyndon words that are densely non-increasing,
a relaxation of the condition used in the case of finite words.
In the annex, we prove that the factorization of a rational word has a
special form and that it can be computed from a rational expression describing
the word
Splicing systems and the Chomsky hierarchy
In this paper, we prove decidability properties and new results on the
position of the family of languages generated by (circular) splicing systems
within the Chomsky hierarchy. The two main results of the paper are the
following. First, we show that it is decidable, given a circular splicing
language and a regular language, whether they are equal. Second, we prove the
language generated by an alphabetic splicing system is context-free. Alphabetic
splicing systems are a generalization of simple and semi-simple splicin systems
already considered in the literature
Transfinite Lyndon words
In this paper, we extend the notion of Lyndon word to transfinite words. We
prove two main results. We first show that, given a transfinite word, there
exists a unique factorization in Lyndon words that are densely non-increasing,
a relaxation of the condition used in the case of finite words.
In the annex, we prove that the factorization of a rational word has a
special form and that it can be computed from a rational expression describing
the word
Splicing Systems from Past to Future: Old and New Challenges
A splicing system is a formal model of a recombinant behaviour of sets of
double stranded DNA molecules when acted on by restriction enzymes and ligase.
In this survey we will concentrate on a specific behaviour of a type of
splicing systems, introduced by P\u{a}un and subsequently developed by many
researchers in both linear and circular case of splicing definition. In
particular, we will present recent results on this topic and how they stimulate
new challenging investigations.Comment: Appeared in: Discrete Mathematics and Computer Science. Papers in
Memoriam Alexandru Mateescu (1952-2005). The Publishing House of the Romanian
Academy, 2014. arXiv admin note: text overlap with arXiv:1112.4897 by other
author
Hopcroft's automaton minimization algorithm and Sturmian words
This paper is concerned with the analysis of the worst case behavior of Hopcroft's algorithm for minimizing deterministic finite state automata. We extend a result of Castiglione, Restivo and Sciortino. They show that Hopcroft's algorithm has a worst case behavior for the automata recognizing Fibonacci words. We prove that the same holds for all standard Sturmian words having an ultimately periodic directive sequence (the directive sequence for Fibonacci words is )
Operations preserving recognizable languages
Given a strictly increasing sequence s of non-negative integers, filtering a word a_0a_1 ... a_n by s consists in deleting the letters ai such that i is not in the set {s_0, s_1, ...}. By a natural generalization, denote by L[s], where L is a language, the set of all words of L filtered by s. The filtering problem is to characterize the filters s such that, for every regular language L, L[s] is regular. In this paper, the filtering problem is solved, and a unified approach is provided to solve similar questions, including the removal problem considered by Seiferas and McNaughton. Our approach relies on a detailed study of various residual notions, notably residually ultimately periodic sequences and residually rational transductions
- …