Search CORE

51 research outputs found

The Height of Piecewise-Testable Languages with Applications in Logical Complexity

Author: Karandikar Prateek
Schnoebelen Philippe
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 25th EACSL Annual Conference on Computer Science Logic (CSL 2016)
Publication date: 01/01/2016
Field of study

The height of a piecewise-testable language L is the maximum length of the words needed to define L by excluding and requiring given subwords. The height of L is an important descriptive complexity measure that has not yet been investigated in a systematic way. This paper develops a series of new techniques for bounding the height of finite languages and of languages obtained by taking closures by subwords, superwords and related operations. As an application of these results, we show that FO^2(A^*, subword), the two-variable fragment of the first-order logic of sequences with the subword ordering, can only express piecewise-testable properties and has elementary complexity

Dagstuhl Research Online Publication Server

On shuffle products, acyclic automata and piecewise-testable languages

Author: Halfon Simon
Schnoebelen Philippe
Publication venue: 'Elsevier BV'
Publication date: 01/02/2019
Field of study

We show that the shuffle L \unicode{x29E2} F of a piecewise-testable language

L

and a finite language

F

is piecewise-testable. The proof relies on a classic but little-used automata-theoretic characterization of piecewise-testable languages. We also discuss some mild generalizations of the main result, and provide bounds on the piecewise complexity of L \unicode{x29E2} F

arXiv.org e-Print Archive

The separation problem for regular languages by piecewise testable languages

Author: L. Van
M. Zeitoun
Rooijen
Publication venue
Publication date: 08/03/2013
Field of study

Separation is a classical problem in mathematics and computer science. It asks whether, given two sets belonging to some class, it is possible to separate them by another set of a smaller class. We present and discuss the separation problem for regular languages. We then give a direct polynomial time algorithm to check whether two given regular languages are separable by a piecewise testable language, that is, whether a

B{\Sigma}1(<)

sentence can witness that the languages are indeed disjoint. The proof is a reformulation and a refinement of an algebraic argument already given by Almeida and the second author

arXiv.org e-Print Archive

CiteSeerX

A Characterization for Decidable Separability by Piecewise Testable Languages

Author: Czerwiński Wojciech
Martens Wim
van Rooijen Lorijn
Zeitoun Marc
Zetzsche Georg
Publication venue
Publication date: 04/12/2017
Field of study

The separability problem for word languages of a class

\mathcal{C}

by languages of a class

\mathcal{S}

asks, for two given languages

I

and

E

from

\mathcal{C}

, whether there exists a language

S

from

\mathcal{S}

that includes

I

and excludes

E

, that is,

I \subseteq S

and

S\cap E = \emptyset

. In this work, we assume some mild closure properties for

\mathcal{C}

and study for which such classes separability by a piecewise testable language (PTL) is decidable. We characterize these classes in terms of decidability of (two variants of) an unboundedness problem. From this, we deduce that separability by PTL is decidable for a number of language classes, such as the context-free languages and languages of labeled vector addition systems. Furthermore, it follows that separability by PTL is decidable if and only if one can compute for any language of the class its downward closure wrt. the scattered substring ordering (i.e., if the set of scattered substrings of any language of the class is effectively regular). The obtained decidability results contrast some undecidability results. In fact, for all (non-regular) language classes that we present as examples with decidable separability, it is undecidable whether a given language is a PTL itself. Our characterization involves a result of independent interest, which states that for any kind of languages

I

and

E

, non-separability by PTL is equivalent to the existence of common patterns in

I

and

E

arXiv.org e-Print Archive

Episciences.org

The Edit Distance to k-Subsequence Universality

Author: Day Joel D.
Fleischmann Pamela
Kosche Maria
Manea Florin
Siemer Stefan
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 38th International Symposium on Theoretical Aspects of Computer Science (STACS 2021)
Publication date: 01/01/2021
Field of study

A word u is a subsequence of another word w if u can be obtained from w by deleting some of its letters. In the early 1970s, Imre Simon defined the relation ?_k (called now Simon-Congruence) as follows: two words having exactly the same set of subsequences of length at most k are ?_k-congruent. This relation was central in defining and analysing piecewise testable languages, but has found many applications in areas such as algorithmic learning theory, databases theory, or computational linguistics. Recently, it was shown that testing whether two words are ?_k-congruent can be done in optimal linear time. Thus, it is a natural next step to ask, for two words w and u which are not ?_k-equivalent, what is the minimal number of edit operations that we need to perform on w in order to obtain a word which is ?_k-equivalent to u. In this paper, we consider this problem in a setting which seems interesting: when u is a k-subsequence universal word. A word u with alph(u) = ? is called k-subsequence universal if the set of subsequences of length k of u contains all possible words of length k over ?. As such, our results are a series of efficient algorithms computing the edit distance from w to the language of k-subsequence universal words

Loughborough University Institutional Repository

Dagstuhl Research Online Publication Server

Longest Common Subsequence with Gap Constraints

Author: Adamson Duncan
Kosche Maria
Koß Tore
Manea Florin
Siemer Stefan
Publication venue
Publication date: 11/04/2023
Field of study

We consider the longest common subsequence problem in the context of subsequences with gap constraints. In particular, following Day et al. 2022, we consider the setting when the distance (i. e., the gap) between two consecutive symbols of the subsequence has to be between a lower and an upper bound (which may depend on the position of those symbols in the subsequence or on the symbols bordering the gap) as well as the case where the entire subsequence is found in a bounded range (defined by a single upper bound), considered by Kosche et al. 2022. In all these cases, we present effcient algorithms for determining the length of the longest common constrained subsequence between two given strings

arXiv.org e-Print Archive

Separating Regular Languages by Piecewise Testable and Unambiguous Languages

Author: A.N. Trahtman
C.J. Ash
H.B. Hunt III
I. Simon
I. Simon
J. Almeida
J. Almeida
J. Stern
J.-E. Pin
K. Henckell
K. Henckell
L. Ribes
M. Schützenberger
M. Schützenberger
W. Czerwiński
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Abstract. Separation is a classical problem asking whether, given two sets belonging to some class, it is possible to separate them by a set from another class. We discuss the separation problem for regular languages. We give a Ptime algorithm to check whether two given regular languages are separable by a piecewise testable language, that is, whether a BΣ1(<) sentence can witness that the languages are disjoint. The proof refines an algebraic argument from Almeida and the third author. When separation is possible, we also express a separator by saturating one of the original languages by a suitable congruence. Following the same line, we show that one can as well decide whether two regular languages can be separated by an unambiguous language, albeit with a higher complexity.

CiteSeerX

Crossref