32 research outputs found

    Exploiting Lexical Conceptual Structure for paraphrase generation

    Get PDF
    Abstract. Lexical Conceptual Structure (LCS) represents verbs as semantic structures with a limited number of semantic predicates. This paper attempts to exploit how LCS can be used to explain the regularities underlying lexical and syntactic paraphrases, such as verb alternation, compound word decomposition, and lexical derivation. We propose a paraphrase generation model which transforms LCSs of verbs, and then conduct an empirical experiment taking the paraphrasing of Japanese light-verb constructions as an example. Experimental results justify that syntactic and semantic properties of verbs encoded in LCS are useful to semantically constrain the syntactic transformation in paraphrase generation.

    Non-crossing dependencies: Least effort, not grammar

    Get PDF
    The use of null hypotheses (in a statistical sense) is common in hard sciences but not in theoretical linguistics. Here the null hypothesis that the low frequency of syntactic dependency crossings is expected by an arbitrary ordering of words is rejected. It is shown that this would require star dependency structures, which are both unrealistic and too restrictive. The hypothesis of the limited resources of the human brain is revisited. Stronger null hypotheses taking into account actual dependency lengths for the likelihood of crossings are presented. Those hypotheses suggests that crossings are likely to reduce when dependencies are shortened. A hypothesis based on pressure to reduce dependency lengths is more parsimonious than a principle of minimization of crossings or a grammatical ban that is totally dissociated from the general and non-linguistic principle of economy.Postprint (author's final draft

    Comparative constructions of similarity in Northern Samoyedic languages

    Get PDF
    The purpose of this paper is to analyze the suffixes which are used in Northern Samoyedic languages to build comparative constructions of equality. Depending on the language, the suffixes may perform three functions: word-building, form-building, and inflectional. When they mark the noun, they serve as simulative suffixes and are employed to build object comparison. In the inflectional function, these suffixes mark the verb and are a means of constructing situational comparison. In this case, they signal the formation of a special mood termed the Approximative. This paper provides a detailed description of the Approximative from paradigmatic and syntagmatic perspectives

    English cum, a borrowed coordinator turned complex-compound marker

    No full text
    International audienceThis article takes issue with the traditional view of English compounds such as governess-cum- piano-teacher, according to which the medial morpheme -cum- is insignificant. The study is first centered on the appearance of the linking element in the English language. New insight into its distribution and function is then provided by scrutinizing a list of 259 compounds extracted from a present-day newspaper corpus. It is found that -cum- appears exclusively in non-institutionalized coordinate nominal and adjectival compounds and that it plays a distinctive role which sets -cum- compounds apart from asyndetic compounds: the linking element is predominantly used in complex compounds to simultaneously mark the internal boundary (boundaries) within the construction and the coordinate relation that holds between the compounding elements. The discussion finally focuses on the status of -cum-, which appears to be a hybrid syntactic-morphological unit of present-day English

    Mapping protein energy landscapes with amide hydrogen exchange and mass spectrometry: I. A generalized model for a two-state protein and comparison with experiment

    No full text
    Protein amide hydrogen exchange (HDX) is a convoluted process, whose kinetics is determined by both dynamics of the protein and the intrinsic exchange rate of labile hydrogen atoms fully exposed to solvent. Both processes are influenced by a variety of intrinsic and extrinsic factors. A mathematical formalism initially developed to rationalize exchange kinetics of individual amide hydrogen atoms is now often used to interpret global exchange kinetics (e.g., as measured in HDX MS experiments). One particularly important advantage of HDX MS is direct visualization of various protein states by observing distinct protein ion populations with different levels of isotope labeling under conditions favoring correlated exchange (the so-called EX1 exchange mechanism). However, mildly denaturing conditions often lead to a situation where the overall HDX kinetics cannot be clearly classified as either EX1 or EX2. The goal of this work is to develop a framework for a generalized exchange model that takes into account multiple processes leading to amide hydrogen exchange, and does not require that the exchange proceed strictly via EX1 or EX2 kinetics. To achieve this goal, we use a probabilistic approach that assigns a transition probability and a residual protection to each equilibrium state of the protein. When applied to a small protein chymotrypsin inhibitor 2, the algorithm allows complex HDX patterns observed experimentally to be modeled with remarkably good fidelity. On the basis of the model we are now in a position to begin to extract quantitative dynamic information from convoluted exchange kinetics

    DILUCT: An Open-Source Spanish Dependency Parser based on Rules, Heuristics, and Selectional Preferences

    No full text
    A method for recognizing syntactic patterns for Spanish is presented. This method is based on dependency parsing using heuristic rules to infer dependency relationships between words, and word co-occurrence statistics (learnt in an unsupervised manner) to resolve ambiguities such as prepositional phrase attachment. If a complete parse cannot be produced, a partial structure is built with some (if not all) dependency relations identified. Evaluation shows that in spite of its simplicity, the parser’s accuracy is superior to the available existing parsers for Spanish. Though certain grammar rules, as well as the lexical resources used, are specific for Spanish, the suggested approach is language-independent
    corecore