416 research outputs found
Marrying Universal Dependencies and Universal Morphology
The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects
each present schemata for annotating the morphosyntactic details of language.
Each project also provides corpora of annotated text in many languages - UD at
the token level and UniMorph at the type level. As each corpus is built by
different annotators, language-specific decisions hinder the goal of universal
schemata. With compatibility of tags, each project's annotations could be used
to validate the other's. Additionally, the availability of both type- and
token-level resources would be a boon to tasks such as parsing and homograph
disambiguation. To ease this interoperability, we present a deterministic
mapping from Universal Dependencies v2 features into the UniMorph schema. We
validate our approach by lookup in the UniMorph corpora and find a
macro-average of 64.13% recall. We also note incompatibilities due to paucity
of data on either side. Finally, we present a critical evaluation of the
foundations, strengths, and weaknesses of the two annotation projects.Comment: UDW1
Meaning to Form: Measuring Systematicity as Information
A longstanding debate in semiotics centers on the relationship between
linguistic signs and their corresponding semantics: is there an arbitrary
relationship between a word form and its meaning, or does some systematic
phenomenon pervade? For instance, does the character bigram \textit{gl} have
any systematic relationship to the meaning of words like \textit{glisten},
\textit{gleam} and \textit{glow}? In this work, we offer a holistic
quantification of the systematicity of the sign using mutual information and
recurrent neural networks. We employ these in a data-driven and massively
multilingual approach to the question, examining 106 languages. We find a
statistically significant reduction in entropy when modeling a word form
conditioned on its semantic representation. Encouragingly, we also recover
well-attested English examples of systematic affixes. We conclude with the
meta-point: Our approximate effect size (measured in bits) is quite
small---despite some amount of systematicity between form and meaning, an
arbitrary relationship and its resulting benefits dominate human language.Comment: Accepted for publication at ACL 201
Weird inflects but OK : Making sense of morphological generation errors
We conduct a manual error analysis of the CoNLL-SIGMORPHON 2017 Shared Task on Morphological Reinflection. In this task, systems are given a word in citation form (e.g., hug) and asked to produce the corresponding inflected form (e.g., the simple past hugged). This design lets us analyze errors much like we might analyze children's production errors. We propose an error taxonomy and use it to annotate errors made by the top two systems across twelve languages. Many of the observed errors are related to inflectional patterns sensitive to inherent linguistic properties such as animacy or affect; many others are failures to predict truly unpredictable inflectional behaviors. We also find nearly one quarter of the residual "errors" reflect errors in the gold data. © 2019 Association for Computational Linguistics.Peer reviewe
Dynamics and Instabilities of Planar Tensile Cracks in Heterogeneous Media
The dynamics of tensile crack fronts restricted to advance in a plane are
studied. In an ideal linear elastic medium, a propagating mode along the crack
front with a velocity slightly less than the Rayleigh wave velocity, is found
to exist. But the dependence of the effective fracture toughness on
the crack velocity is shown to destabilize the crack front if
. Short wavelength radiation due to weak random
heterogeneities leads to this instability at low velocities. The implications
of these results for the crack dynamics are discussed.Comment: 12 page
Spiral cracks in drying precipitates
We investigate the formation of spiral crack patterns during the desiccation
of thin layers of precipitates in contact with a substrate. This
symmetry-breaking fracturing mode is found to arise naturally not from torsion
forces, but from a propagating stress front induced by the fold-up of the
fragments. We model their formation mechanism using a coarse-grain model for
fragmentation and successfully reproduce the spiral cracks. Fittings of
experimental and simulation data show that the spirals are logarithmic,
corresponding to constant deviation from a circular crack path. Theoretical
aspects of the logarithmic spirals are discussed. In particular we show that
this occurs generally when the crack speed is proportional to the propagating
speed of stress front.Comment: 4 pages, 5 figures, RevTe
Density scaling and quasiuniversality of flow-event statistics for athermal plastic flows
Athermal plastic flows were simulated for the Kob-Andersen binary
Lennard-Jones system and its repulsive version in which the sign of the
attractive terms is changed to a plus. Properties evaluated from simulations at
different densities include the distributions of energy drops, stress drops,
and strain intervals between the flow events. By reference to hidden scale
invariance we show that simulations at a single density in conjunction with an
equilibrium-liquid simulation at the same density allows one to predict the
plastic flow-event properties at other densities. We furthermore demonstrate
quasiuniversality of the flow-event statistics
- …