Search CORE

59 research outputs found

MaxSAT Evaluation 2018 : Solver and Benchmark Descriptions

Author
Publication venue: Department of Computer Science, University of Helsinki
Publication date: 01/01/2018
Field of study

Non peer reviewe

Helsingin yliopiston digitaalinen arkisto

Determinization and Minimization of Automata for Nested Words Revisited

Author: Niehren Joachim
Sakho Momar
Publication venue: 'MDPI AG'
Publication date: 24/02/2021
Field of study

International audienceWe consider the problem of determinizing and minimizing automata for nested words in practice. For this we compile the nested regular expressions (

NRE_s

) from the usual XPath benchmark to nested word automata (

NW

A_s

). The determinization of these

NW

A_s

, however, fails to produce reasonably small automata. In the best case, huge deterministic

NW

A_s

are produced after few hours, even for relatively small

NRE_s

of the benchmark. We propose a different approach to the determinization of automata for nested words. For this, we introduce stepwise hedge automata (

SHA_s

) that generalize naturally on both (stepwise) tree automata and on finite word automata. We then show how to determinize

SHA_s

, yielding reasonably small deterministic automata for the

NRE_s

from the XPath benchmark. The size of deterministic

SHA_s

automata can be reduced further by a novel minimization algorithm for a subclass of

SHA_s

. In order to understand why the new approach to determinization and minimization works so nicely, we investigate the relationship between

NWA_s

and

SHA_s

further. Clearly, deterministic

SHA_s

can be compiled to deterministic NWAs in linear time, and conversely,

NW

A_s

can be compiled to nondeterministic

SHA_s

in polynomial time. Therefore, we can use

SHA_s

as intermediates for determinizing

NWA_s

, while avoiding the huge size increase with the usual determinization algorithm for

NWA_s

. Notably, the NWAs obtained from the

SHA_s

perform bottom-up and left-to-right computations only, but no top-down computations. This

NWA

-behavior can be distinguished syntactically by the (weak) single-entry property, suggesting a close relationship between

SHA_s

and single-entry

NWA_s

. In particular, it turns out that the usual determinization algorithm for

NWA_s

behaves well for single-entry

NWA_s

, while it quickly explodes without the single-entry property. Furthermore, it is known that the class of deterministic multi-module single-entry

NWA_s

enjoys unique minimization. The subclass of deterministic

SHA_s

to which our novel minimization algorithm applies is different though, in that we do not impose multiple modules. As further optimizations for reducing the sizes of the constructed

SHA_s

, we propose schema-based cleaning and symbolic representations based on apply-else rules, that can be maintained by determinization. We implemented the optimizations and report the experimental results for the automata constructed for the XPathMark benchmark

Multidisciplinary Digital Publishing Institute

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Normalization of Sequential Top-Down Tree-to-Word Transducers

Author: Croft Rodney J.
Leung Sumie
Nathan Pradeep J.
O'Kane Joanne
O'Neill Barry V.
Oliva Jessica
Phan K. Luan
Stout Julie
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2010
Field of study

International audienceWe study normalization of deterministic sequential top-down tree-to-word transducers (STWs), that capture the class of deterministic top-down nested-word to word transducers. We identify the subclass of earliest STWs (eSTWs) that yield normal forms when minimized. The main result of this paper is an effective normalization procedure for STWs. It consists of two stages: we first convert a given STW to an equivalent eSTW, and then, we minimize the eSTW. Keywords: formal language theory, tree automata, transformations, XML databases, XSLTExtended Version: A long version is available here.</p

HAL - Lille 3

ResearchOnline at James Cook University

Swinburne Research Bank

Research Online

Deep Blue Documents at the University of Michigan

Synchronizing automata over nested words

Author: Chistikov Dmitry
Martyugin Pavel
Shirmohammadi Mahsa
Publication venue: Institut für Informatik · Justus-Liebig-Universität Giessen
Publication date: 01/01/2019
Field of study

We extend the concept of a synchronizing word from deterministic finite-state automata (DFA) to nested word automata (NWA): A well-matched nested word is called synchronizing if it resets the control state of any configuration, i. e., takes the NWA from all control states to a single control state. We show that although the shortest synchronizing word for an NWA, if it exists, can be (at most) exponential in the size of the NWA, the existence of such a word can still be decided in polynomial time. As our main contribution, we show that deciding the existence of a short synchronizing word (of at most given length) becomes PSPACE-complete (as opposed to NP-complete for DFA). The upper bound makes a connection to pebble games and Strahler numbers, and the lower bound goes via small-cost synchronizing words for DFA, an intermediate problem that we also show PSPACE-complete. We also characterize the complexity of a number of related problems, using the observation that the intersection nonemptiness problem for NWA is EXP-complete

HAL Descartes

Warwick Research Archives Portal Repository

Hal-Diderot

2008 Abstracts Collection -- IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science

Author: Hariharan Ramesh
Mukund Madhavan
Vinay V
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science
Publication date: 01/01/2008
Field of study

This volume contains the proceedings of the 28th international conference on the Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2008), organized under the auspices of the Indian Association for Research in Computing Science (IARCS)

Dagstuhl Research Online Publication Server

Streaming Tree Transducers

Author: B. Courcelle
F. Neven
H. Hosoya
H. Seidl
J. Engelfriet
J. Engelfriet
J. Engelfriet
J. Engelfriet
J. Esparza
P. Madhusudan
W. Martens
Publication venue
Publication date: 01/01/2012
Field of study

Theory of tree transducers provides a foundation for understanding expressiveness and complexity of analysis problems for specification languages for transforming hierarchically structured data such as XML documents. We introduce streaming tree transducers as an analyzable, executable, and expressive model for transforming unranked ordered trees in a single pass. Given a linear encoding of the input tree, the transducer makes a single left-to-right pass through the input, and computes the output in linear time using a finite-state control, a visibly pushdown stack, and a finite number of variables that store output chunks that can be combined using the operations of string-concatenation and tree-insertion. We prove that the expressiveness of the model coincides with transductions definable using monadic second-order logic (MSO). Existing models of tree transducers either cannot implement all MSO-definable transformations, or require regular look ahead that prohibits single-pass implementation. We show a variety of analysis problems such as type-checking and checking functional equivalence are solvable for our model.Comment: 40 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

ScholarlyCommons@Penn

A Bit of Nondeterminism Makes Pushdown Automata Expressive and Succinct

Author: Guha Shibashis
Jecker Ismaël
Lehtinen Karoliina
Zimmermann Martin
Publication venue
Publication date: 01/01/2021
Field of study

We study the expressiveness and succinctness of good-for-games pushdown automata (GFG-PDA) over finite words, that is, pushdown automata whose nondeterminism can be resolved based on the run constructed so far, but independently of the remainder of the input word. We prove that GFG-PDA recognise more languages than deterministic PDA (DPDA) but not all context-free languages (CFL). This class is orthogonal to unambiguous CFL. We further show that GFG-PDA can be exponentially more succinct than DPDA, while PDA can be double-exponentially more succinct than GFG-PDA. We also study GFGness in visibly pushdown automata (VPA), which enjoy better closure properties than PDA, and for which we show GFGness to be EXPTIME-complete. GFG-VPA can be exponentially more succinct than deterministic VPA, while VPA can be exponentially more succinct than GFG-VPA. Both of these lower bounds are tight. Finally, we study the complexity of resolving nondeterminism in GFG-PDA. Every GFG-PDA has a positional resolver, a function that resolves nondeterminism and that is only dependant on the current configuration. Pushdown transducers are sufficient to implement the resolvers of GFG-VPA, but not those of GFG-PDA. GFG-PDA with finite-state resolvers are determinisable

University of Liverpool Repository

HAL AMU

HAL Descartes

Dagstuhl Research Online Publication Server

IST Austria: PubRep (Institute of Science and Technology)

Hal-Diderot

Programming Using Automata and Transducers

Author: D\u27antoni Loris
Publication venue: ScholarlyCommons
Publication date: 01/01/2015
Field of study

Automata, the simplest model of computation, have proven to be an effective tool in reasoning about programs that operate over strings. Transducers augment automata to produce outputs and have been used to model string and tree transformations such as natural language translations. The success of these models is primarily due to their closure properties and decidable procedures, but good properties come at the price of limited expressiveness. Concretely, most models only support finite alphabets and can only represent small classes of languages and transformations. We focus on addressing these limitations and bridge the gap between the theory of automata and transducers and complex real-world applications: Can we extend automata and transducer models to operate over structured and infinite alphabets? Can we design languages that hide the complexity of these formalisms? Can we define executable models that can process the input efficiently? First, we introduce succinct models of transducers that can operate over large alphabets and design BEX, a language for analysing string coders. We use BEX to prove the correctness of UTF and BASE64 encoders and decoders. Next, we develop a theory of tree transducers over infinite alphabets and design FAST, a language for analysing tree-manipulating programs. We use FAST to detect vulnerabilities in HTML sanitizers, check whether augmented reality taggers conflict, and optimize and analyze functional programs that operate over lists and trees. Finally, we focus on laying the foundations of stream processing of hierarchical data such as XML files and program traces. We introduce two new efficient and executable models that can process the input in a left-to-right linear pass: symbolic visibly pushdown automata and streaming tree transducers. Symbolic visibly pushdown automata are closed under Boolean operations and can specify and efficiently monitor complex properties for hierarchical structures over infinite alphabets. Streaming tree transducers can express and efficiently process complex XML transformations while enjoying decidable procedures

CiteSeerX

ScholarlyCommons@Penn

Equivalence Problems for Tree Transducers: A Brief Survey

Author: Aho
Aho
Alur
Alur
Andre
Andre
Benedikt
Berstel
Bozapalidis
Caralp
Courcelle
Courcelle
Courcelle
Courcelle
Culik II
Culik II
Culik II
Engelfriet
Engelfriet
Engelfriet
Engelfriet
Engelfriet
Engelfriet
Engelfriet
Engelfriet
Engelfriet
Engelfriet
Engelfriet
Esparza
Filiot
Filiot
Fischer
Friese
Friese
Fülöp
Ginsburg
Ginsburg
Griffiths
Gurari
Hakuta
Honkala
Hopcroft
Karhumäki
Knuth
Lemay
Maneth
Maneth
Maneth
Milo
Nakano
Parikh
Perst
Plandowski
Raskin
Rounds
Rounds
Ruohonen
Sebastian Maneth
Seidl
Seidl
Seidl
Seidl
Servais
Staworko
Thatcher
Vogler
Voigtländer
Zachar
Zoltán Fülöp
Zoltán Ésik
Ésik
Publication venue: 'Open Publishing Association'
Publication date: 01/01/2014
Field of study

The decidability of equivalence for three important classes of tree transducers is discussed. Each class can be obtained as a natural restriction of deterministic macro tree transducers (MTTs): (1) no context parameters, i.e., top-down tree transducers, (2) linear size increase, i.e., MSO definable tree transducers, and (3) monadic input and output ranked alphabets. For the full class of MTTs, decidability of equivalence remains a long-standing open problem.Comment: In Proceedings AFL 2014, arXiv:1405.527

arXiv.org e-Print Archive

CiteSeerX

Crossref

Directory of Open Access Journals

Edinburgh Research Explorer