8 research outputs found

    A discontinuity in pattern inference

    Get PDF
    This paper examines the learnability of a major subclass of E-pattern languages ā€“ also known as erasing or extended pattern languages ā€“ in Goldā€™s learning model: We show that the class of terminal-free E-pattern languages is inferrable from positive data if the corresponding terminal alphabet consists of three or more letters. Consequently, the recently presented negative result for binary alphabets is unique

    Unambiguous morphic images of strings

    Get PDF
    Motivated by the research on pattern languages, we study a fundamental combinatorial question on morphisms in free semigroups: With regard to any string Ī± over some alphabet we ask for the existence of a morphism Ļƒ such that Ļƒ(Ī±) is unambiguous, i.e. there is no morphism Ļ with Ļ ā‰  Ļƒ and Ļ(Ī±) = Ļƒ(Ī±). Our main result shows that a rich and natural class of strings is provided with unambiguous morphic images

    The unambiguity of segmented morphisms

    Get PDF
    This paper studies the ambiguity of morphisms in free monoids. A morphism Ļƒ is said to be ambiguous with respect to a string Ī± if there exists a morphism Ļ„ which differs from Ļƒ for a symbol occurring in Ī±, but nevertheless satisfies Ļ„(Ī±) = Ļƒ(Ī±); if there is no such Ļ„ then Ļƒ is called unambiguous. Motivated by the recent initial paper on the ambiguity of morphisms, we introduce the definition of a so-called segmented morphism Ļƒn, which, for any n āˆˆ N, maps every symbol in an infinite alphabet onto a word that consists of n distinct factors in ab+a, where a and b are different letters. For every n, we consider the set U(Ļƒn) of those finite strings over an infinite alphabet with respect to which Ļƒn is unambiguous, and we comprehensively describe its relation to any U(Ļƒm), m ā‰  n. Thus, our work features the first approach to a characterisation of sets of strings with respect to which certain fixed morphisms are unambiguous, and it leads to fairly counter-intuitive insights into the relations between such sets. Furthermore, it shows that, among the widely used homogeneous morphisms, most segmented morphisms are optimal in terms of being unambiguous for a preferably large set of strings. Finally, our paper yields several major improvements of crucial techniques previously used for research on the ambiguity of morphisms

    Unambiguous morphic images of strings

    Get PDF
    We study a fundamental combinatorial problem on morphisms in free semigroups: With regard to any string Ī± over some alphabet we ask for the existence of a morphism Ļƒ such that Ļƒ(Ī±) is unambiguous, i.e. there is no morphism T with T(i) ā‰  Ļƒ(i) for some symbol i in Ī± and, nevertheless, T(Ī±) = Ļƒ(Ī±). As a consequence of its elementary nature, this question shows a variety of connections to those topics in discrete mathematics which are based on finite strings and morphisms such as pattern languages, equality sets and, thus, the Post Correspondence Problem. Our studies demonstrate that the existence of unambiguous morphic images essen- tially depends on the structure of Ī±: We introduce a partition of the set of all finite strings into those that are decomposable (referred to as prolix) in a particular manner and those that are indecomposable (called succinct). This partition, that is also known to be of major importance for the research on pattern languages and on finite fixed points of morphisms, allows to formulate our main result according to which a string Ī± can be mapped by an injective morphism onto an unambiguous image if and only if Ī± is succinct

    Discontinuities in pattern inference

    Get PDF
    This paper deals with the inferrability of classes of E-pattern languagesā€”also referred to as extended or erasing pattern languagesā€”from positive data in Goldā€™s model of identification in the limit. The first main part of the paper shows that the recently presented negative result on terminal-free E-pattern languages over binary alphabets does not hold for other alphabet sizes, so that the full class of these languages is inferrable from positive data if and only if the corresponding terminal alphabet does not consist of exactly two distinct letters. The second main part yields the insight that the positive result on terminal-free E-pattern languages over alphabets with three or four letters cannot be extended to the class of general E-pattern languages. With regard to larger alphabets, the extensibility remains open. The proof methods developed for these main results do not directly discuss the (non-)existence of appropriate learning strategies, but they deal with structural properties of classes of E-pattern languages, and, in particular, with the problem of finding telltales for these languages. It is shown that the inferrability of classes of E-pattern languages is closely connected to some problems on the ambiguity of morphisms so that the technical contributions of the paper largely consist of combinatorial insights into morphisms in word monoids

    Discontinuities in pattern inference

    Get PDF
    This paper deals with the inferrability of classes of E-pattern languagesā€”also referred to as extended or erasing pattern languagesā€”from positive data in Goldā€™s model of identification in the limit. The first main part of the paper shows that the recently presented negative result on terminal-free E-pattern languages over binary alphabets does not hold for other alphabet sizes, so that the full class of these languages is inferrable from positive data if and only if the corresponding terminal alphabet does not consist of exactly two distinct letters. The second main part yields the insight that the positive result on terminal-free E-pattern languages over alphabets with three or four letters cannot be extended to the class of general E-pattern languages. With regard to larger alphabets, the extensibility remains open. The proof methods developed for these main results do not directly discuss the (non-)existence of appropriate learning strategies, but they deal with structural properties of classes of E-pattern languages, and, in particular, with the problem of finding telltales for these languages. It is shown that the inferrability of classes of E-pattern languages is closely connected to some problems on the ambiguity of morphisms so that the technical contributions of the paper largely consist of combinatorial insights into morphisms in word monoids

    Unambiguous Morphic Images of Strings

    Get PDF
    We study a fundamental combinatorial problem on morphisms in free semigroups: With regard to any string Ī± over some alphabet we ask for the existence of a morphism Ļƒ such that Ļƒ(Ī±) is unambiguous, i.e. there is no morphism T with T(i) ā‰  Ļƒ(i) for some symbol i in Ī± and, nevertheless, T(Ī±) = Ļƒ(Ī±). As a consequence of its elementary nature, this question shows a variety of connections to those topics in discrete mathematics which are based on finite strings and morphisms such as pattern languages, equality sets and, thus, the Post Correspondence Problem. Our studies demonstrate that the existence of unambiguous morphic images essen- tially depends on the structure of Ī±: We introduce a partition of the set of all finite strings into those that are decomposable (referred to as prolix) in a particular manner and those that are indecomposable (called succinct). This partition, that is also known to be of major importance for the research on pattern languages and on finite fixed points of morphisms, allows to formulate our main result according to which a string Ī± can be mapped by an injective morphism onto an unambiguous image if and only if Ī± is succinct

    Strongly prime PCP words

    No full text
    corecore