Search CORE

4 research outputs found

Regular languages are Church-Rosser congruential

Author: Klaus Reinhardt (290801)
Manfred Kufleitner (4542364)
Tobias Walter (7168559)
Volker Diekert (7168556)
Publication venue
Publication date: 01/01/2015
Field of study

© 2015 ACM 0004-5411/2015/10-ART32 15.00. This article shows a general result about finite monoids and weight reducing string rewriting systems. As a consequence it proves a long standing conjecture in formal language theory: All regular languages are Church-Rosser congruential. The class of Church-Rosser congruential languages was introduced by McNaughton, Narendran, and Otto in 1988. A language L is Church-Rosser congruential if there exists a finite, confluent, and length-reducing semi-Thue system S such that L is a finite union of congruence classes modulo S. It was known that there are deterministic linear context-free languages which are not Church- Rosser congruential, but the conjecture was that all regular languages are of this form. The article offers a stronger statement: A language is regular if and only if it is strongly Church-Rosser congruential. It is the journal version of the conference abstract which was presented at ICALP 2012

Loughborough University Institutional Repository

The context-splittable normal form for Church–Rosser language systems

Author: Book
Book
Buntrock
Chomsky
Dahlhaus
Harrison
Jantzen
Jens R. Woinowski
Kapur
McNaughton
McNaughton
Niemann
Woinowski
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Languages Generated by Iterated Idempotencies.

Author: Leupold Klaus-Peter
Publication venue: 'Universitat Rovira I Virgili'
Publication date: 01/01/2006
Field of study

The rewrite relation with parameters m and n and with the possible length limit = k or :::; k we denote by w~, =kW~· or ::;kw~ respectively. The idempotency languages generated from a starting word w by the respective operations are wDAlso other special cases of idempotency languages besides duplication have come up in different contexts. The investigations of Ito et al. about insertion and deletion, Le., operations that are also observed in DNA molecules, have established that w5 and w~ both preserve regularity.Our investigations about idempotency relations and languages start out from the case of a uniform length bound. For these relations =kW~ the conditions for confluence are characterized completely. Also the question of regularity is -k n answered for aH the languages w- D 1 are more complicated and belong to the class of context-free languages.For a generallength bound, i.e."for the relations :"::kW~, confluence does not hold so frequently. This complicatedness of the relations results also in more complicated languages, which are often non-regular, as for example the languages WWithout any length bound, idempotency relations have a very complicated structure. Over alphabets of one or two letters we still characterize the conditions for confluence. Over three or more letters, in contrast, only a few cases are solved. We determine the combinations of parameters that result in the regularity of wDIn a second chapter sorne more involved questions are solved for the special case of duplication. First we shed sorne light on the reasons why it is so difficult to determine the context-freeness ofduplication languages. We show that they fulfiH aH pumping properties and that they are very dense. Therefore aH the standard tools to prove non-context-freness do not apply here.The concept of root in Formal Language ·Theory is frequently used to describe the reduction of a word to another one, which is in sorne sense elementary.For example, there are primitive roots, periodicity roots, etc. Elementary in connection with duplication are square-free words, Le., words that do not contain any repetition. Thus we define the duplication root of w to consist of aH the square-free words, from which w can be reached via the relation w~.Besides sorne general observations we prove the decidability of the question, whether the duplication root of a language is finite.Then we devise acode, which is robust under duplication of its code words.This would keep the result of a computation from being destroyed by dupli cations in the code words. We determine the exact conditions, under which infinite such codes exist: over an alphabet of two letters they exist for a length bound of 2, over three letters already for a length bound of 1.Also we apply duplication to entire languages rather than to single words; then it is interesting to determine, whether regular and context-free languages are closed under this operation. We show that the regular languages are closed under uniformly bounded duplication, while they are not closed under duplication with a generallength bound. The context-free languages are closed under both operations.The thesis concludes with a list of open problems related with the thesis' topics

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Tesis Doctorals en Xarxa

Repositori Institucional URV

Local divisors in formal languages

Author: Walter Tobias
Publication venue
Publication date: 01/01/2016
Field of study

Regular languages are exactly the class of recognizable subsets of the free monoid. In particular, the syntactic monoid of a regular language is finite. This is the starting point of algebraic language theory. In this thesis, the algebraic connection between regular languages and monoids is studied using a certain monoid construction - local divisors. Using the local divisor construction, we give a Rees decomposition of a monoid into smaller parts - the monoid is a Rees extension of a submonoid and a local divisor. Iterating this concept gives an iterated Rees decomposition of a monoid into groups appearing in the monoid. This decomposition is similar to the synthesis theorem of Rhodes and Allen. In particular, the Rees decomposition shows that closure of a variety V of finite monoids under Rees extensions is the variety H̅ induced by the groups H contained in V. Due to the connection between H̅ and local divisors, we turn our attention to a language description of H̅. The language description is a continuation of classical work of Schützenberger. He studied prefix codes of bounded synchronization delay and used those codes to give a language description of H̅ in the case that the variety H of groups contains only abelian groups. We use the local divisor approach to generalize Schützenberger's language description of H̅ for all varieties H of finite groups. The main ingredient of this generalization is the concept of group-controlled stars. The group-controlled star is an operation on prefix codes of bounded synchronization delay which generalizes the usual Kleene star. The language class SDH(A∞) is the smallest class which contains all finite languages and is closed under union, concatenation product and group-controlled stars for groups in H. We show that SDH(A∞) is the language class corresponding to H̅. As a by-product of the proof we give another language characterization of H̅: the localizable closure LocH(A∞) of H. In the last part of this thesis, we deal with Church-Rosser congruential languages (CRCL). A language is Church-Rosser congruential if it is a finite union of congruence classes modulo a finite, confluent and length-reducing semi-Thue system. This yields a linear time algorithm for the membership problem of a fixed language in CRCL. A natural question, which was open for over 25 years, is whether all regular languages are in CRCL. We give an affirmative answer to this question by proving a stronger statement: for every regular language L and for every weight, there exists a finite, confluent and weight-reducing semi-Thue system S such that A*/S is finite and recognizes L. Lifting the result from the special case of length-reducing to weight-reducing allows the use of local divisors. Next, we focus on Parikh-reducing Church-Rosser systems for regular languages. Instead of constructing a semi-Thue system for a fixed weight, a Parikh-reducing Church-Rosser system is weight-reducing for every weight. We construct such systems for all languages in A̅b̅, that is, for all languages such that the groups in the syntactic monoid are abelian. Additionally, small changes in the proof of this result also yield that for all languages L over a two letter alphabet there exists a Parikh-reducing Church-Rosser system S of finite index such that L is recognized by A*/S. Lastly, we deal with the size of the monoid A*/S for the constructed systems S. We show that in the group case this size has an exponential lower bound and a triple exponential upper bound. The key observation is that one can restrict the alphabet used in the inductive construction. Using the same observation, one can lower the upper bound in the general monoid case from a non-primitive function without this optimization to a quadruple exponential upper bound.Die Klasse der regulären Sprachen entspricht genau den erkennbaren Sprachen über dem freien Monoid. Äquivalent dazu ist die Klasse der Sprachen, deren syntaktisches Monoid endlich ist. Dies ist der Ausgangspunkt der algebraischen Sprachtheorie. In dieser Arbeit wird dieser algebraische Zusammenhang zwischen regulären Sprachen und Monoiden mit Hilfe einer Monoid-Konstruktion untersucht: den lokalen Divisoren. Zunächst werden lokale Divisoren benutzt um ein Monoid in kleinere Teile zu zerlegen. Die dabei verwendete Konstruktion ist ähnlich zur Rees-Matrix-Halbgruppe und liefert eine Zerlegung eines Monoids als sogenannte Rees-Erweiterung eines echten Untermonoids und eines lokalen Divisors. Wiederholtes Anwenden dieses Sachverhalts führt dann auf eine Rees-Zerlegung, bei der die grundlegenden Bausteine Gruppen sind, die im ursprünglichen Monoid vorkommen. Diese Zerlegung ist ähnlich zum Synthese-Theorem von Rhodes und Allen. Insbesondere liefert dies, dass der Abschluss einer Varietät V unter Rees-Erweiterungen die Varietät H̅ ist, wobei H die Varietät der endlichen Gruppen ist, die in V vorkommen. Aufgrund des Zusammenhangs zwischen lokalen Divisoren und den Varietäten H̅, werden als nächstes Sprachbeschreibungen der Varietäten H̅ untersucht. Dabei wird die Arbeit von Schützenberger über Sprachcharakterisierungen mit Hilfe von Präfix-Codes mit beschränkter Synchronisierungsverzögerung (englisch: bounded synchronization delay) fortgesetzt. Schützenberger benutzte diese Codes um die Varietäten der Form H̅ zu beschreiben, wobei V eine Varietät von endlichen abelschen Gruppen ist. Wir verallgemeinern seine Beschreibung um H̅ für alle Varietäten H von endlichen Gruppen zu charakterisieren. Das Hauptkonzept dieser Verallgemeinerung sind gruppen-kontrollierte Sterne. Dabei sind gruppen-kontrollierte Sterne Sprachoperationen, die auf Präfix-Codes mit beschränkter Synchronisierungsverzögerung aufbauen und als Spezialfall für die triviale Gruppe den Kleene-Stern liefern. Die Sprachklasse SDH(A∞) ist die kleinste Klasse von Sprachen, die alle endlichen Sprachen enthält und abgeschlossen ist unter Vereinigung, Konkatenationsprodukt und gruppen-kontrollierten Sternen, wobei die Gruppen aus H sind. Wir zeigen, dass SDH(A∞) die zu H̅ zugehörige Sprachklasse ist. Als Nebenprodukt des Beweises dieser Sprachcharakterisierung geben wir eine weitere Charakterisierung von H̅ an: der lokale Abschluss LocH(A∞) von H. Der letzte Abschnitt dieser Arbeit handelt von der Sprachklasse CRCL (Church-Rosser congruential languages). Eine Sprache ist in CRCL, falls sie eine endliche Vereinigung von Kongruenzklassen eines endlichen, konfluenten und längenreduzierenden Ersetzungssystems ist. Dies liefert direkt einen Linearzeit-Algorithmus für das Wortproblem von Sprachen aus CRCL. Eine 25 Jahre lang offene Fragestellung war, ob alle regulären Sprachen in CRCL enthalten sind. Wir beantworten diese Frage positiv, indem wir eine stärkere Aussage beweisen: Für alle regulären Sprachen L und alle Gewichtsfunktionen gibt es ein endliches, konfluentes und gewichtsreduzierendes Ersetzungssystem S, für das A*/S endlich ist und L erkennt. Durch das Erweitern der Aussage auf alle Gewichtsfunktionen erlaubt dies die Benutzung von lokalen Divisoren. Als nächstes werden Parikh-reduzierende Church-Rosser-Ersetzungssysteme betrachtet. Diese repräsentieren eine Vertauschung der Quantorenreihenfolge: Ein Parikh-reduzierendes Ersetzungssystem ist gewichtsreduzierend für alle Gewichtsfunktionen. Wir konstruieren solche Systeme für alle Sprachen in der Varietät A̅b̅, d.h. für alle Sprachen, in denen die im syntaktischem Monoid vorkommenden Gruppen abelsch sind. Zusätzlich liefert eine Abwandlung dieses Beweises dasselbe Resultat für alle regulären Sprachen über einem zwei-elementigem Alphabet. Als letztes beschäftigt sich die Arbeit mit Abschätzungen für die Größe von A*/S für die zuvor konstruierten Systeme S. Im Fall von Gruppensprachen ist die Größe von unten durch eine Exponentialfunktion und von oben durch eine dreifache Exponentialfunktion beschränkt. Für die obere Schranke wird dabei eine Beobachtung benutzt, wie man das Alphabet in der Induktion beschränken kann. Mit Hilfe dieser Beobachtung ist es ebenfalls möglich die obere Schranke im Monoid-Fall von einer nicht primitiven Funktion auf eine vierfach exponentielle Funktion zu reduzieren