40 research outputs found

    Encoding Lexicalized Tree Adjoining Grammars with a Nonmonotonic Inheritance Hierarchy

    Get PDF
    This paper shows how DATR, a widely used formal language for lexical knowledge representation, can be used to define an LTAG lexicon as an inheritance hierarchy with internal lexical rules. A bottom-up featural encoding is used for LTAG trees and this allows lexical rules to be implemented as covariation constraints within feature structures. Such an approach eliminates the considerable redundancy otherwise associated with an LTAG lexicon.Comment: Latex source, needs aclap.sty, 8 page

    GENERATING AMHARIC PRESENT TENSE VERBS: A NETWORK MORPHOLOGY & DATR ACCOUNT

    Get PDF
    In this thesis I attempt to model, that is, computationally reproduce, the natural transmission (i.e. inflectional regularities) of twenty present tense Amharic verbs (i.e. triradicals beginning with consonants) as used by the language’s speakers. I root my approach in the linguistic theory of network morphology (NM) and model it using the DATR evaluator. In Chapter 1, I provide an overview of Amharic and discuss the fidel as an abugida, the verb system’s root-and-pattern morphology, and how radicals of each lexeme interacts with prefixes and suffixes. I offer an overview of NM in Chapter 2 and DATR in Chapter 3. In both chapters I draw attention to and help interpret key terms used among scholars doing work in both fields. In Chapter 4 I set forth my full theory, along with notation, for generating the paradigms of twenty present tense Amharic verbs that follow four different patterns. Chapter 5, the final chapter, contains a summary and offers several conclusions. I provide the DATR output in the Appendix. In writing, my main hope is that this project will make a contribution, however minimal or sizeable, that might advance the field of Amharic studies in particular and (computational) linguistics in general

    Feature-based lexicons : an example and a comparison to DATR

    Get PDF
    A FEATURE-BASED lexicon is especially sensible for natural language processing systems which are feature-based. Feature-based lexicons offer the advantages: (i) having a maximally transparent (empty) interface to feature-based grammars and processors; (ii) supplying exactly the EXPRESSIVE CAPABILITY exploited in these systems; and (iii) providing concise, transparent, and elegantspecification possibilities for various lexical relationships, including both inflection and derivation. The development of TYPED feature description languages allows the use of INHERITANCE in lexical description, and recent work explores the use of DEFAULT INHERITANCE as a means of easing lexical development. TDL is the implementation of a TYPE DESCRIPTION LANGUAGE based on HPSG feature logics. It is employed for both lexical and grammatical specification. As a lexical specification tool, it not only realizes these advantages, but it also separates a linguistic and a computational view of lexical contents and supplies a development environment for lexicon engineering. The most important competitor for feature-based lexical work is the very competent special purpose tool DATR, whose interface to feature-based systems is, however, inherently problematic. It is argued that feature-based systems (such as TDL) and DATR look compatible because of their common mathematical interpretation as graph description languages for directed graphs, but that this masks radically different modeling conventions for the graphs themselves. The development of TDL is continuing at the German Artificial Intelligence Center (Deutsches Forschungszentrum für Künstliche Intelligenz - DFKI) in the natural language understanding project DISCO

    Enabling a legacy morphological parser to use DATR-based lexicons

    No full text

    Feature-based inheritance networks for computational lexicons

    Get PDF
    The virtues of viewing the lexicon as an inheritance network are its succinctness and its tendency to highlight significant clusters of linguistic properties. From its succinctness follow two practical advantages, namely its ease of maintenance and modification. In this paper we present a feature-based foundation for lexical inheritance. We argue that the feature-based foundation is both more economical and expressively more powerful than non-feature-based systems. It is more economical because it employs only mechanisms already assumed to be present elsewhere in the grammar (viz., in the feature system), and it is more expressive because feature systems are more expressive than other mechanisms used in expressing lexical inheritance (cf. DATR). The lexicon furthermore allows the use of default unification, based on the ideas of default unification, defined by Bouma. These claims are buttressed in sections sketching the opportunities for lexical description in feature-based lexicons in two central lexical topics, inflection and derivation. Briefly, we argue that the central notion of paradigm may be defined in feature structures, and that it may be more satisfactorily (in fact, immediately) linked to the syntactic information in this fashion. Our discussion of derivation is more programmatic; but here, too, we argue that feature structures of a suitably rich sort provide a foundation for the definition of lexical rules. We illustrate theoretical claims in application to German lexis. This work is currently under implementation in a natural language understanding effort (DISCO) at the German Artiffical Intelligence Center (Deutsches Forschungszentrum für Künstliche Intelligenz)

    Natural language software registry (second edition)

    Get PDF

    Representing grammar, meaning and knowledge

    Get PDF
    Among the expertises relevant for successful natural language understanding are grammar, meaning and background knowledge, all of which must be represented in order to decode messages from text (or speech). The present paper is a sketch of one cooperation of grammar and meaning representations -- with some remarks about knowledge representation -- which allows that the representations involved be heterogeneous even while cooperating closely. The modules cooperate in what might be called a PLURALIST fashion, with few assumptions about the representations involved. In point of fact, the proposal is compatible with state-of-the-art representations from all three areas. The paper proceeeds from the nearly universal assumption that the grammar formalism is feature-based and insufficiently expressive for use in meaning representation. It then demonstrates how feature formalisms may be employed as a semantic metalanguage in order that semantic constraints may be expressed in a single formalism with grammatical constraints. This allows a tight coupling of syntax and semantics, the incorporation of nonsyntactic constraints (e.g., from knowledge representation) and the opportunity to underspecify meanings in novel ways -- including, e.g., ways which distinguish ambiguity and underspecification (vagueness). We retain scepticism vis-à-vis more ASSIMILATIONIST proposals for the interaction of these -- i.e., proposals which foresee common formalisms for grammar, meaning and knowledge representation. While such proposals rightfully claim to allow for closer integration, they fail to account for the motivations which distinguish formalisms - elaborate expressive strength in the case of semantic representations, monotonic (and preferably decidable) computation in the case of grammar formalisms, and the characterization of taxonomic reasoning in the case of knowledge representation

    An interpretation of paradigmatic morphology

    Get PDF
    The thesis has as its goal the extension of current approaches in the description of natural languages, based on logics of partial information, to the area of morphology. I review work in a number of areas which may inform the study of morphology. I define a system for the representation of lexical and morphological information similar in descriptive aims to the system of Word and Paradigm (WP) morphology developed by Matthews, although somewhat different in technical details. I show that this system has a simple mathematical structure and indicate how it is related to current proposals in the field of feature value logics for linguistic description. The descriptive use of the system is demonstrated by an analysis of verbal paradigms from Latin.The attested shortcomings of WP are reanalysed in the light of the formalization developed above, and I show that, contrary to previous claims, the structures developed for the formalization of WP may be both adequate for describing the morphology of non-inflecting languages and concise in so doing. These assertions are supported by sample analyses of the morphology of Turkish, taken as an exemplary agglutinating language, and of Semitic
    corecore