113 research outputs found

    MODELING DEPONENCY IN GERMANIC PRETERITE-PRESENT VERBS USING DATR

    Get PDF
    In certain Germanic languages, there is a group of verbs called preterite-present verbs that are often viewed as irregular, but in fact behave very predictably. They exhibit a morphological phenomenon called deponency, often in conjunction with another morphological phenomenon called heteroclisis. I examine the preterite-present verbs of three different languages: Old Norse, Modern Icelandic, and Modern German. Initially, I approach them from a historical perspective and then seek to reconcile their morphology with the modern perspective. A criteria is established for a canonical preterite-present verb, and then using a lexical programming language called DATR, I create code that generates the appropriate paradigms while also illustrating the morphological relationships between verb tenses and inflection classes, among other things. DATR is a programming language used specifically for language models

    Indexed Stems and Russian Word Formation: A Network Morphology Account of Russian Personal Nouns

    Get PDF
    Recent lexeme-based models have proposed that a lexeme carries an inventory of stems on which morphological rules operate. The various stems in the inventory are associated with different morphological rules, of both inflection and derivation. Furthermore, one stem may be selected by more than one rule. For this reason stems in the inventory are labeled with indexes, rather than being directly associated with a particular morphological function. It is claimed that an indexed-stem approach captures generalizations in the morphological system that would otherwise be missed. We argue that such an approach provides for greater generalization in the Russian morphological system. One area of Russian derivation that particularly lends itself to an indexed-stem approach is the highly productive system of personal-noun formation. We present a declarative account of Russian personal nouns that assumes indexed stems and show how such an account on the one hand obviates the need to posit either compound suffixes or concatenators and on the other dispenses with truncating/deleting rules. The account is couched within network morphology, a declarative lexeme-based framework that rests on the concept of default inheritance and is expressed in the computable lexical knowledge representation language D AT R

    GENERATING AMHARIC PRESENT TENSE VERBS: A NETWORK MORPHOLOGY & DATR ACCOUNT

    Get PDF
    In this thesis I attempt to model, that is, computationally reproduce, the natural transmission (i.e. inflectional regularities) of twenty present tense Amharic verbs (i.e. triradicals beginning with consonants) as used by the language’s speakers. I root my approach in the linguistic theory of network morphology (NM) and model it using the DATR evaluator. In Chapter 1, I provide an overview of Amharic and discuss the fidel as an abugida, the verb system’s root-and-pattern morphology, and how radicals of each lexeme interacts with prefixes and suffixes. I offer an overview of NM in Chapter 2 and DATR in Chapter 3. In both chapters I draw attention to and help interpret key terms used among scholars doing work in both fields. In Chapter 4 I set forth my full theory, along with notation, for generating the paradigms of twenty present tense Amharic verbs that follow four different patterns. Chapter 5, the final chapter, contains a summary and offers several conclusions. I provide the DATR output in the Appendix. In writing, my main hope is that this project will make a contribution, however minimal or sizeable, that might advance the field of Amharic studies in particular and (computational) linguistics in general

    Feature-based lexicons : an example and a comparison to DATR

    Get PDF
    A FEATURE-BASED lexicon is especially sensible for natural language processing systems which are feature-based. Feature-based lexicons offer the advantages: (i) having a maximally transparent (empty) interface to feature-based grammars and processors; (ii) supplying exactly the EXPRESSIVE CAPABILITY exploited in these systems; and (iii) providing concise, transparent, and elegantspecification possibilities for various lexical relationships, including both inflection and derivation. The development of TYPED feature description languages allows the use of INHERITANCE in lexical description, and recent work explores the use of DEFAULT INHERITANCE as a means of easing lexical development. TDL is the implementation of a TYPE DESCRIPTION LANGUAGE based on HPSG feature logics. It is employed for both lexical and grammatical specification. As a lexical specification tool, it not only realizes these advantages, but it also separates a linguistic and a computational view of lexical contents and supplies a development environment for lexicon engineering. The most important competitor for feature-based lexical work is the very competent special purpose tool DATR, whose interface to feature-based systems is, however, inherently problematic. It is argued that feature-based systems (such as TDL) and DATR look compatible because of their common mathematical interpretation as graph description languages for directed graphs, but that this masks radically different modeling conventions for the graphs themselves. The development of TDL is continuing at the German Artificial Intelligence Center (Deutsches Forschungszentrum für Künstliche Intelligenz - DFKI) in the natural language understanding project DISCO

    Dealing with Metonymic Readings of Named Entities

    Full text link
    The aim of this paper is to propose a method for tagging named entities (NE), using natural language processing techniques. Beyond their literal meaning, named entities are frequently subject to metonymy. We show the limits of current NE type hierarchies and detail a new proposal aiming at dynamically capturing the semantics of entities in context. This model can analyze complex linguistic phenomena like metonymy, which are known to be difficult for natural language processing but crucial for most applications. We present an implementation and some test using the French ESTER corpus and give significant results

    Natural language software registry (second edition)

    Get PDF

    An interpretation of paradigmatic morphology

    Get PDF
    The thesis has as its goal the extension of current approaches in the description of natural languages, based on logics of partial information, to the area of morphology. I review work in a number of areas which may inform the study of morphology. I define a system for the representation of lexical and morphological information similar in descriptive aims to the system of Word and Paradigm (WP) morphology developed by Matthews, although somewhat different in technical details. I show that this system has a simple mathematical structure and indicate how it is related to current proposals in the field of feature value logics for linguistic description. The descriptive use of the system is demonstrated by an analysis of verbal paradigms from Latin.The attested shortcomings of WP are reanalysed in the light of the formalization developed above, and I show that, contrary to previous claims, the structures developed for the formalization of WP may be both adequate for describing the morphology of non-inflecting languages and concise in so doing. These assertions are supported by sample analyses of the morphology of Turkish, taken as an exemplary agglutinating language, and of Semitic

    High-level methodologies for grammar engineering, introduction to the special issue

    Full text link
    • …
    corecore