Search CORE

1,678 research outputs found

Computation of distances for regular and context-free probabilistic languages

Author: Nederhof Mark Jan
Satta Giorgio
Publication venue
Publication date: 01/01/2008
Field of study

Several mathematical distances between probabilistic languages have been investigated in the literature, motivated by applications in language modeling, computational biology, syntactic pattern matching and machine learning. In most cases, only pairs of probabilistic regular languages were considered. In this paper we extend the previous results to pairs of languages generated by a probabilistic context-free grammar and a probabilistic finite automaton.PostprintPeer reviewe

Elsevier - Publisher Connector

Crossref

Archivio istituzionale della ricerca - Università di Padova

University of St. Andrews - Pure

St Andrews Research Repository

On the generating power of regularly controlled bidirection grammars

Author: Asveld P.R.J.
Hogendorp J.A.
Hogendorp J.A.
Publication venue: Taylor & Francis
Publication date: 01/01/1991
Field of study

RCB-grammars or regularly controlled bidirectional grammars are context-free grammars of which the rules can be used in a productive and in a reductive fashion. In addition, the application of these\ud rules is controlled by a regular language. Several modes of derivation can be distinguished for this kind of grammar. In this paper the generating power of the derivation mode that uses right-occurrence rewriting (RO-mode) is determined. Furthermore, a new mode called RA is introduced, which is a better formalization of the intuitive idea of rightoccurrence rewriting than the RO-mode. The RO- and RA-mode have the same generating power, viz. the corresponding RCB-grammars both generate the recursively enumerable languages. Consequently, providing RCB/RO-grammars with a time bound results in a less powerful grammar model

University of Twente Research Information

On the Generating Power of Regularly Controlled Bidirectional Grammars

Author: Asveld Peter R.J.
Hogendorp Jan Anne
Publication venue: University of Twente, Department of Computer Science
Publication date: 01/01/1989
Field of study

RCB-grammars or regularly controlled bidirectional grammars are context-free grammars of which the rules can be used in a productive and in a reductive fashion. In addition, the application of these rules is controlled by a regular language. Several modes of derivation can be distinguished for this kind of grammar. In this paper the generating power of the derivation mode that uses right-occurrence rewriting (RO-mode) is determined. Furthermore, a new mode called RA is introduced, which is a better formalization of the intuitive idea of right-occurrence rewriting than the RO-mode. The RO- and RA-mode have the same generating power, viz. the corresponding RCB-grammars both generate the recursively enumerable languages. Consequently, providing RCB/RO-grammars with a time bound results in a less powerful grammar model

CiteSeerX

University of Twente Research Information

Controlled Bidirectional Grammars

Author: Hogendorp Jan Anne
Publication venue: University of Twente, Department of Computer Science
Publication date: 01/01/1988
Field of study

We investigate context-free grammars the rules of which can be used in a productive and in a reductive fashion, while the application of these rules is controlled by a regular language. We distinguish several modes of derivation for this kind of grammar. The resulting language families (properly) extend the family of context-free languages. We establish some closure properties of these language families and some grammatical transformations which yield a few normal forms for this type of grammar. Finally, we consider some special cases (viz. the context-free grammar is linear or left-linear), and generalizations, in particular, the use of arbitrary rather than regular control languages

University of Twente Research Information

A universal context-free grammar

Author: Kasai Takumi
Publication venue: Published by Elsevier Inc.
Publication date: 31/05/1975
Field of study

In this report we show that, for each alphabet Σ, there exists a context-free grammar G which satisfies the property that for each context-free language L ⊆ Σ* a regular control set C can be found such that LC(G) = L

Elsevier - Publisher Connector

The Acquisition of Morphology in Moroccan Heritage Speakers in France

Author: El Haimeur Amal
Publication venue: UWM Digital Commons
Publication date: 01/05/2019
Field of study

There are two major perspectives regarding heritage speakers’ (henceforth HS) ultimate attainment. Some researchers on HS in the U.S. conclude that HS have incomplete grammars (Benmamoun, Montrul & Polinsky, 2013). It is argued that heritage languages (henceforth HL) do not fully develop (Montrul, 2016), and they are not completely acquired because of shifting to a dominant language (Benmamoun et al., 2013). Other researchers argue that HS’ grammars are complete, but simply different as monolingual and HS experience different linguistic realities (Pascual y Cabo & Rothman, 2012). While there is abundant research on Arabic as a HL in the U.S., research on HS in Europe has been rather limited (Montrul, 2016). This dissertation focuses on Moroccan Arabic (henceforth MA) as a HL in France and aims at contributing to the understanding of the linguistic outcomes of the acquisition of Arabic as a HL in an immigrant context. The current study investigates the acquisition of nominal morphology and verbal-derivational patterns by Moroccan HS in France. Nominal morphology was represented by plural and diminutive formation, and verbal derivations were represented by four patterns. Nominal morphology gives insights into both concatenative and non-concatenative morphological processes. Verbal derivational processes are characterized by the use of non-concatenative morphological processes, and semantic notions such as causativeness and reciprocity are lexicalized within the MA verb-pattern system. The studied patterns are the basic (P1), causative (P2), medio-passive (P5), and reciprocal (P6). 15 Moroccan-French participants took part in this study. The data were gathered through three production experiments. Experiment 1 investigated the acquisition of 3 sound morphemes and 14 broken plurals. In experiment 2, participants were tested in diminutive formation, exemplified by the six types of diminutives, and experiment 3 examined the acquisition of verbal derivations. Experiment 1 revealed that participants’ plural system is mainly characterized by concatenative processes as just two sound plural morphemes were acquired by a significant number of participants. All the participants acquired the morpheme -at and 80% of the participants acquired the sound morpheme -in. Participants depend on overgeneralization and simplification of their plural system. The sound morphemes characterize the HL and were overgeneralized in broken plural targeted data. Additionally, the sound plural [-at] seems to be the underspecified default morpheme in the HL. The findings of experiment 2 show that the mean percentage of source-like use of diminutive forms is 38%. The results revealed that just two patterns were acquired by a significant number of participants: CCiCa and CCiCjCjəC. Diminutive forms that do not require complex processes are acquired by a significant number of participants and the percentage of source-like use is high as well. In this study, a brief analysis of diminutive derivational processes was given, and participants’ patterns of acquisition correspond to the suggested continuum of complexity. Irregular stems present difficulties to HS as complex processes are applied. Non-source like data is rule-governed as 69% of the non-source-like data shows the use of either initial consonant cluster or insertion of the glide /-j/, and these are the main processes characterizing diminutive processes. Additionally, the requirement of having two syllables was met. Participants tend to regularize diminutive formation and show a preference to the following processes: initial constant cluster and insertion of the palatal glide. Generally, HS’s variety is mainly characterized by two patterns. Experiment 3 reveals that the basic pattern (P1) was acquired by all the participants and 40% of the participants acquired the causative (P2). The medio-passive pattern (P5) and the reciprocal patterns (P6) were not acquired. ANOVA showed that there were statistically significant differences among the use of the four patterns. The main finding of this study is that semantic distinction realized by pattern alternation is neutralized in the HL. Specifically, the basic pattern (P1) and periphrastic constructions were used predominantly in P5 and P6 targeted data. It is likely that the basic pattern is used as a default morphological device because it unmarked. The three experiments demonstrate that participants omit irregularities and non-source like forms are rule governed. Less complex and less marked morphological structures characterize the HL. Specifically, morphological aspects thought to be acquired earlier in language development are the ones characterizing the HL. Additionally, the findings of the experiments propose implicational hierarchies for the acquisition of the studied morphological structures. Adopting overgeneralization in nominal morphology, and neutralization in verb patterns showed that HS speak a variety that is reanalyzed. Accordingly, HS in France have a distinct variety that was shaped by their linguistic experience. Their variety is different, reanalyzed and does not comprise all the patterns attested in the source language

University of Wisconsin-Milwaukee

Lexicalized non-local MCTAG with dominance links is NP-complete

Author: Champollion Lucas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

An NP-hardness proof for non-local Multicomponent Tree Adjoining Grammar (MCTAG) by Rambow and Satta (1st International Workshop on Tree Adjoining Grammers 1992), based on Dahlhaus and Warmuth (in J Comput Syst Sci 33:456–472, 1986), is extended to some linguistically relevant restrictions of that formalism. It is found that there are NP-hard grammars among non-local MCTAGs even if any or all of the following restrictions are imposed: (i) lexicalization: every tree in the grammar contains a terminal; (ii) dominance links: every tree set contains at most two trees, and in every such tree set, there is a link between the foot node of one tree and the root node of the other tree, indicating that the former node must dominate the latter in the derived tree. This is the version of MCTAG proposed in Becker et al. (Proceedings of the 5th conference of the European chapter of the Association for Computational Linguistics 1991) to account for German long-distance scrambling. This result restricts the field of possible candidates for an extension of Tree Adjoining Grammar that would be both mildly context-sensitive and linguistically adequate

Crossref

Publikationsserver der Universität Tübingen

New York University Faculty Digital Archive

Controlled Rewriting Using Productions and Reductions

Author: Hogendorp Jan Anne
Publication venue: University of Twente, Department of Computer Science
Publication date: 01/01/1988
Field of study

University of Twente Research Information