4 research outputs found

    Regular languages are Church-Rosser congruential

    Get PDF
    © 2015 ACM 0004-5411/2015/10-ART32 15.00. This article shows a general result about finite monoids and weight reducing string rewriting systems. As a consequence it proves a long standing conjecture in formal language theory: All regular languages are Church-Rosser congruential. The class of Church-Rosser congruential languages was introduced by McNaughton, Narendran, and Otto in 1988. A language L is Church-Rosser congruential if there exists a finite, confluent, and length-reducing semi-Thue system S such that L is a finite union of congruence classes modulo S. It was known that there are deterministic linear context-free languages which are not Church- Rosser congruential, but the conjecture was that all regular languages are of this form. The article offers a stronger statement: A language is regular if and only if it is strongly Church-Rosser congruential. It is the journal version of the conference abstract which was presented at ICALP 2012

    Languages Generated by Iterated Idempotencies.

    Get PDF
    The rewrite relation with parameters m and n and with the possible length limit = k or :::; k we denote by w~, =kW~· or ::;kw~ respectively. The idempotency languages generated from a starting word w by the respective operations are wDAlso other special cases of idempotency languages besides duplication have come up in different contexts. The investigations of Ito et al. about insertion and deletion, Le., operations that are also observed in DNA molecules, have established that w5 and w~ both preserve regularity.Our investigations about idempotency relations and languages start out from the case of a uniform length bound. For these relations =kW~ the conditions for confluence are characterized completely. Also the question of regularity is -k n answered for aH the languages w- D 1 are more complicated and belong to the class of context-free languages.For a generallength bound, i.e."for the relations :"::kW~, confluence does not hold so frequently. This complicatedness of the relations results also in more complicated languages, which are often non-regular, as for example the languages WWithout any length bound, idempotency relations have a very complicated structure. Over alphabets of one or two letters we still characterize the conditions for confluence. Over three or more letters, in contrast, only a few cases are solved. We determine the combinations of parameters that result in the regularity of wDIn a second chapter sorne more involved questions are solved for the special case of duplication. First we shed sorne light on the reasons why it is so difficult to determine the context-freeness ofduplication languages. We show that they fulfiH aH pumping properties and that they are very dense. Therefore aH the standard tools to prove non-context-freness do not apply here.The concept of root in Formal Language ·Theory is frequently used to describe the reduction of a word to another one, which is in sorne sense elementary.For example, there are primitive roots, periodicity roots, etc. Elementary in connection with duplication are square-free words, Le., words that do not contain any repetition. Thus we define the duplication root of w to consist of aH the square-free words, from which w can be reached via the relation w~.Besides sorne general observations we prove the decidability of the question, whether the duplication root of a language is finite.Then we devise acode, which is robust under duplication of its code words.This would keep the result of a computation from being destroyed by dupli cations in the code words. We determine the exact conditions, under which infinite such codes exist: over an alphabet of two letters they exist for a length bound of 2, over three letters already for a length bound of 1.Also we apply duplication to entire languages rather than to single words; then it is interesting to determine, whether regular and context-free languages are closed under this operation. We show that the regular languages are closed under uniformly bounded duplication, while they are not closed under duplication with a generallength bound. The context-free languages are closed under both operations.The thesis concludes with a list of open problems related with the thesis' topics

    Local divisors in formal languages

    Get PDF
    Regular languages are exactly the class of recognizable subsets of the free monoid. In particular, the syntactic monoid of a regular language is finite. This is the starting point of algebraic language theory. In this thesis, the algebraic connection between regular languages and monoids is studied using a certain monoid construction - local divisors. Using the local divisor construction, we give a Rees decomposition of a monoid into smaller parts - the monoid is a Rees extension of a submonoid and a local divisor. Iterating this concept gives an iterated Rees decomposition of a monoid into groups appearing in the monoid. This decomposition is similar to the synthesis theorem of Rhodes and Allen. In particular, the Rees decomposition shows that closure of a variety V of finite monoids under Rees extensions is the variety H̅ induced by the groups H contained in V. Due to the connection between H̅ and local divisors, we turn our attention to a language description of H̅. The language description is a continuation of classical work of SchĂŒtzenberger. He studied prefix codes of bounded synchronization delay and used those codes to give a language description of H̅ in the case that the variety H of groups contains only abelian groups. We use the local divisor approach to generalize SchĂŒtzenberger's language description of H̅ for all varieties H of finite groups. The main ingredient of this generalization is the concept of group-controlled stars. The group-controlled star is an operation on prefix codes of bounded synchronization delay which generalizes the usual Kleene star. The language class SDH(A∞) is the smallest class which contains all finite languages and is closed under union, concatenation product and group-controlled stars for groups in H. We show that SDH(A∞) is the language class corresponding to H̅. As a by-product of the proof we give another language characterization of H̅: the localizable closure LocH(A∞) of H. In the last part of this thesis, we deal with Church-Rosser congruential languages (CRCL). A language is Church-Rosser congruential if it is a finite union of congruence classes modulo a finite, confluent and length-reducing semi-Thue system. This yields a linear time algorithm for the membership problem of a fixed language in CRCL. A natural question, which was open for over 25 years, is whether all regular languages are in CRCL. We give an affirmative answer to this question by proving a stronger statement: for every regular language L and for every weight, there exists a finite, confluent and weight-reducing semi-Thue system S such that A*/S is finite and recognizes L. Lifting the result from the special case of length-reducing to weight-reducing allows the use of local divisors. Next, we focus on Parikh-reducing Church-Rosser systems for regular languages. Instead of constructing a semi-Thue system for a fixed weight, a Parikh-reducing Church-Rosser system is weight-reducing for every weight. We construct such systems for all languages in A̅b̅, that is, for all languages such that the groups in the syntactic monoid are abelian. Additionally, small changes in the proof of this result also yield that for all languages L over a two letter alphabet there exists a Parikh-reducing Church-Rosser system S of finite index such that L is recognized by A*/S. Lastly, we deal with the size of the monoid A*/S for the constructed systems S. We show that in the group case this size has an exponential lower bound and a triple exponential upper bound. The key observation is that one can restrict the alphabet used in the inductive construction. Using the same observation, one can lower the upper bound in the general monoid case from a non-primitive function without this optimization to a quadruple exponential upper bound.Die Klasse der regulĂ€ren Sprachen entspricht genau den erkennbaren Sprachen ĂŒber dem freien Monoid. Äquivalent dazu ist die Klasse der Sprachen, deren syntaktisches Monoid endlich ist. Dies ist der Ausgangspunkt der algebraischen Sprachtheorie. In dieser Arbeit wird dieser algebraische Zusammenhang zwischen regulĂ€ren Sprachen und Monoiden mit Hilfe einer Monoid-Konstruktion untersucht: den lokalen Divisoren. ZunĂ€chst werden lokale Divisoren benutzt um ein Monoid in kleinere Teile zu zerlegen. Die dabei verwendete Konstruktion ist Ă€hnlich zur Rees-Matrix-Halbgruppe und liefert eine Zerlegung eines Monoids als sogenannte Rees-Erweiterung eines echten Untermonoids und eines lokalen Divisors. Wiederholtes Anwenden dieses Sachverhalts fĂŒhrt dann auf eine Rees-Zerlegung, bei der die grundlegenden Bausteine Gruppen sind, die im ursprĂŒnglichen Monoid vorkommen. Diese Zerlegung ist Ă€hnlich zum Synthese-Theorem von Rhodes und Allen. Insbesondere liefert dies, dass der Abschluss einer VarietĂ€t V unter Rees-Erweiterungen die VarietĂ€t H̅ ist, wobei H die VarietĂ€t der endlichen Gruppen ist, die in V vorkommen. Aufgrund des Zusammenhangs zwischen lokalen Divisoren und den VarietĂ€ten H̅, werden als nĂ€chstes Sprachbeschreibungen der VarietĂ€ten H̅ untersucht. Dabei wird die Arbeit von SchĂŒtzenberger ĂŒber Sprachcharakterisierungen mit Hilfe von PrĂ€fix-Codes mit beschrĂ€nkter Synchronisierungsverzögerung (englisch: bounded synchronization delay) fortgesetzt. SchĂŒtzenberger benutzte diese Codes um die VarietĂ€ten der Form H̅ zu beschreiben, wobei V eine VarietĂ€t von endlichen abelschen Gruppen ist. Wir verallgemeinern seine Beschreibung um H̅ fĂŒr alle VarietĂ€ten H von endlichen Gruppen zu charakterisieren. Das Hauptkonzept dieser Verallgemeinerung sind gruppen-kontrollierte Sterne. Dabei sind gruppen-kontrollierte Sterne Sprachoperationen, die auf PrĂ€fix-Codes mit beschrĂ€nkter Synchronisierungsverzögerung aufbauen und als Spezialfall fĂŒr die triviale Gruppe den Kleene-Stern liefern. Die Sprachklasse SDH(A∞) ist die kleinste Klasse von Sprachen, die alle endlichen Sprachen enthĂ€lt und abgeschlossen ist unter Vereinigung, Konkatenationsprodukt und gruppen-kontrollierten Sternen, wobei die Gruppen aus H sind. Wir zeigen, dass SDH(A∞) die zu H̅ zugehörige Sprachklasse ist. Als Nebenprodukt des Beweises dieser Sprachcharakterisierung geben wir eine weitere Charakterisierung von H̅ an: der lokale Abschluss LocH(A∞) von H. Der letzte Abschnitt dieser Arbeit handelt von der Sprachklasse CRCL (Church-Rosser congruential languages). Eine Sprache ist in CRCL, falls sie eine endliche Vereinigung von Kongruenzklassen eines endlichen, konfluenten und lĂ€ngenreduzierenden Ersetzungssystems ist. Dies liefert direkt einen Linearzeit-Algorithmus fĂŒr das Wortproblem von Sprachen aus CRCL. Eine 25 Jahre lang offene Fragestellung war, ob alle regulĂ€ren Sprachen in CRCL enthalten sind. Wir beantworten diese Frage positiv, indem wir eine stĂ€rkere Aussage beweisen: FĂŒr alle regulĂ€ren Sprachen L und alle Gewichtsfunktionen gibt es ein endliches, konfluentes und gewichtsreduzierendes Ersetzungssystem S, fĂŒr das A*/S endlich ist und L erkennt. Durch das Erweitern der Aussage auf alle Gewichtsfunktionen erlaubt dies die Benutzung von lokalen Divisoren. Als nĂ€chstes werden Parikh-reduzierende Church-Rosser-Ersetzungssysteme betrachtet. Diese reprĂ€sentieren eine Vertauschung der Quantorenreihenfolge: Ein Parikh-reduzierendes Ersetzungssystem ist gewichtsreduzierend fĂŒr alle Gewichtsfunktionen. Wir konstruieren solche Systeme fĂŒr alle Sprachen in der VarietĂ€t A̅b̅, d.h. fĂŒr alle Sprachen, in denen die im syntaktischem Monoid vorkommenden Gruppen abelsch sind. ZusĂ€tzlich liefert eine Abwandlung dieses Beweises dasselbe Resultat fĂŒr alle regulĂ€ren Sprachen ĂŒber einem zwei-elementigem Alphabet. Als letztes beschĂ€ftigt sich die Arbeit mit AbschĂ€tzungen fĂŒr die GrĂ¶ĂŸe von A*/S fĂŒr die zuvor konstruierten Systeme S. Im Fall von Gruppensprachen ist die GrĂ¶ĂŸe von unten durch eine Exponentialfunktion und von oben durch eine dreifache Exponentialfunktion beschrĂ€nkt. FĂŒr die obere Schranke wird dabei eine Beobachtung benutzt, wie man das Alphabet in der Induktion beschrĂ€nken kann. Mit Hilfe dieser Beobachtung ist es ebenfalls möglich die obere Schranke im Monoid-Fall von einer nicht primitiven Funktion auf eine vierfach exponentielle Funktion zu reduzieren
    corecore