480 research outputs found
Minimal Synthesis of String To String Functions From Examples
We study the problem of synthesizing string to string transformations from a
set of input/output examples. The transformations we consider are expressed
using deterministic finite automata (DFA) that read pairs of letters, one
letter from the input and one from the output. The DFA corresponding to these
transformations have additional constraints, ensuring that each input string is
mapped to exactly one output string.
We suggest that, given a set of input/output examples, the smallest DFA
consistent with the examples is a good candidate for the transformation the
user was expecting. We therefore study the problem of, given a set of examples,
finding a minimal DFA consistent with the examples and satisfying the
functionality and totality constraints mentioned above.
We prove that, in general, this problem (the corresponding decision problem)
is NP-complete. This is unlike the standard DFA minimization problem which can
be solved in polynomial time. We provide several NP-hardness proofs that show
the hardness of multiple (independent) variants of the problem.
Finally, we propose an algorithm for finding the minimal DFA consistent with
input/output examples, that uses a reduction to SMT solvers. We implemented the
algorithm, and used it to evaluate the likelihood that the minimal DFA indeed
corresponds to the DFA expected by the user.Comment: SYNT 201
Multiple Context-Free Tree Grammars: Lexicalization and Characterization
Multiple (simple) context-free tree grammars are investigated, where "simple"
means "linear and nondeleting". Every multiple context-free tree grammar that
is finitely ambiguous can be lexicalized; i.e., it can be transformed into an
equivalent one (generating the same tree language) in which each rule of the
grammar contains a lexical symbol. Due to this transformation, the rank of the
nonterminals increases at most by 1, and the multiplicity (or fan-out) of the
grammar increases at most by the maximal rank of the lexical symbols; in
particular, the multiplicity does not increase when all lexical symbols have
rank 0. Multiple context-free tree grammars have the same tree generating power
as multi-component tree adjoining grammars (provided the latter can use a
root-marker). Moreover, every multi-component tree adjoining grammar that is
finitely ambiguous can be lexicalized. Multiple context-free tree grammars have
the same string generating power as multiple context-free (string) grammars and
polynomial time parsing algorithms. A tree language can be generated by a
multiple context-free tree grammar if and only if it is the image of a regular
tree language under a deterministic finite-copying macro tree transducer.
Multiple context-free tree grammars can be used as a synchronous translation
device.Comment: 78 pages, 13 figure
Logical and Algebraic Characterizations of Rational Transductions
Rational word languages can be defined by several equivalent means: finite
state automata, rational expressions, finite congruences, or monadic
second-order (MSO) logic. The robust subclass of aperiodic languages is defined
by: counter-free automata, star-free expressions, aperiodic (finite)
congruences, or first-order (FO) logic. In particular, their algebraic
characterization by aperiodic congruences allows to decide whether a regular
language is aperiodic.
We lift this decidability result to rational transductions, i.e.,
word-to-word functions defined by finite state transducers. In this context,
logical and algebraic characterizations have also been proposed. Our main
result is that one can decide if a rational transduction (given as a
transducer) is in a given decidable congruence class. We also establish a
transfer result from logic-algebra equivalences over languages to equivalences
over transductions. As a consequence, it is decidable if a rational
transduction is first-order definable, and we show that this problem is
PSPACE-complete
Inferring Symbolic Automata
We study the learnability of symbolic finite state automata, a model shown useful in many applications in software verification. The state-of-the-art literature on this topic follows the query learning paradigm, and so far all obtained results are positive. We provide a necessary condition for efficient learnability of SFAs in this paradigm, from which we obtain the first negative result. The main focus of our work lies in the learnability of SFAs under the paradigm of identification in the limit using polynomial time and data. We provide a necessary condition and a sufficient condition for efficient learnability of SFAs in this paradigm, from which we derive a positive and a negative result
Learning Sequential Tree-to-Word Transducers
International audienceWe study the problem of learning sequential top-down tree-to- word transducers (STWs). First, we present a Myhill-Nerode characterization of the corresponding class of sequential tree-to-word transformations (STW). Next, we investigate what learning of stws means, identify fundamental obstacles, and propose a learning model with abstain. Finally, we present a polynomial learning algorithm.A long version is available here</p
Pebble transducers with unary output
Boja\'nczyk recently initiated an intensive study of deterministic pebble
transducers, which are two-way automata that can drop marks (named "pebbles")
on their input word, and produce an output word. They describe functions from
words to words. Two natural restrictions of this definition have been
investigated: marble transducers by Dou\'eneau-Tabot et al., and
comparison-free pebble transducers (that we rename here "blind transducers") by
Nguy\^en et al.
Here, we study the decidability of membership problems between the classes of
functions computed by pebble, marble and blind transducers that produce a unary
output. First, we show that pebble and marble transducers have the same
expressive power when the outputs are unary (which is false over non-unary
outputs). Then, we characterize 1-pebble transducers with unary output that
describe a function computable by a blind transducer, and show that the
membership problem is decidable. These results can be interpreted in terms of
automated simplification of programs.Comment: 39 page
- …