416 research outputs found

    Marrying Universal Dependencies and Universal Morphology

    Full text link
    The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language. Each project also provides corpora of annotated text in many languages - UD at the token level and UniMorph at the type level. As each corpus is built by different annotators, language-specific decisions hinder the goal of universal schemata. With compatibility of tags, each project's annotations could be used to validate the other's. Additionally, the availability of both type- and token-level resources would be a boon to tasks such as parsing and homograph disambiguation. To ease this interoperability, we present a deterministic mapping from Universal Dependencies v2 features into the UniMorph schema. We validate our approach by lookup in the UniMorph corpora and find a macro-average of 64.13% recall. We also note incompatibilities due to paucity of data on either side. Finally, we present a critical evaluation of the foundations, strengths, and weaknesses of the two annotation projects.Comment: UDW1

    Meaning to Form: Measuring Systematicity as Information

    Full text link
    A longstanding debate in semiotics centers on the relationship between linguistic signs and their corresponding semantics: is there an arbitrary relationship between a word form and its meaning, or does some systematic phenomenon pervade? For instance, does the character bigram \textit{gl} have any systematic relationship to the meaning of words like \textit{glisten}, \textit{gleam} and \textit{glow}? In this work, we offer a holistic quantification of the systematicity of the sign using mutual information and recurrent neural networks. We employ these in a data-driven and massively multilingual approach to the question, examining 106 languages. We find a statistically significant reduction in entropy when modeling a word form conditioned on its semantic representation. Encouragingly, we also recover well-attested English examples of systematic affixes. We conclude with the meta-point: Our approximate effect size (measured in bits) is quite small---despite some amount of systematicity between form and meaning, an arbitrary relationship and its resulting benefits dominate human language.Comment: Accepted for publication at ACL 201

    Weird inflects but OK : Making sense of morphological generation errors

    Get PDF
    We conduct a manual error analysis of the CoNLL-SIGMORPHON 2017 Shared Task on Morphological Reinflection. In this task, systems are given a word in citation form (e.g., hug) and asked to produce the corresponding inflected form (e.g., the simple past hugged). This design lets us analyze errors much like we might analyze children's production errors. We propose an error taxonomy and use it to annotate errors made by the top two systems across twelve languages. Many of the observed errors are related to inflectional patterns sensitive to inherent linguistic properties such as animacy or affect; many others are failures to predict truly unpredictable inflectional behaviors. We also find nearly one quarter of the residual "errors" reflect errors in the gold data. © 2019 Association for Computational Linguistics.Peer reviewe

    Dynamics and Instabilities of Planar Tensile Cracks in Heterogeneous Media

    Full text link
    The dynamics of tensile crack fronts restricted to advance in a plane are studied. In an ideal linear elastic medium, a propagating mode along the crack front with a velocity slightly less than the Rayleigh wave velocity, is found to exist. But the dependence of the effective fracture toughness Γ(v)\Gamma(v) on the crack velocity is shown to destabilize the crack front if (dΓ)/(dv)<0(d\Gamma)/(dv)<0. Short wavelength radiation due to weak random heterogeneities leads to this instability at low velocities. The implications of these results for the crack dynamics are discussed.Comment: 12 page

    Spiral cracks in drying precipitates

    Full text link
    We investigate the formation of spiral crack patterns during the desiccation of thin layers of precipitates in contact with a substrate. This symmetry-breaking fracturing mode is found to arise naturally not from torsion forces, but from a propagating stress front induced by the fold-up of the fragments. We model their formation mechanism using a coarse-grain model for fragmentation and successfully reproduce the spiral cracks. Fittings of experimental and simulation data show that the spirals are logarithmic, corresponding to constant deviation from a circular crack path. Theoretical aspects of the logarithmic spirals are discussed. In particular we show that this occurs generally when the crack speed is proportional to the propagating speed of stress front.Comment: 4 pages, 5 figures, RevTe

    Density scaling and quasiuniversality of flow-event statistics for athermal plastic flows

    Get PDF
    Athermal plastic flows were simulated for the Kob-Andersen binary Lennard-Jones system and its repulsive version in which the sign of the attractive terms is changed to a plus. Properties evaluated from simulations at different densities include the distributions of energy drops, stress drops, and strain intervals between the flow events. By reference to hidden scale invariance we show that simulations at a single density in conjunction with an equilibrium-liquid simulation at the same density allows one to predict the plastic flow-event properties at other densities. We furthermore demonstrate quasiuniversality of the flow-event statistics
    corecore