206 research outputs found

    Identification of sense selection in regular polysemy using shallow features

    Get PDF
    Proceedings of the 18th Nordic Conference of Computational Linguistics NODALIDA 2011. Editors: Bolette Sandford Pedersen, Gunta Nešpore and Inguna Skadiņa. NEALT Proceedings Series, Vol. 11 (2011), 18-25. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/16955

    Benchmarking Joint Lexical and Syntactic Analysis on Multiword-Rich Data

    Get PDF
    International audienceThis article evaluates the extension of a dependency parser that performs joint syntactic analysis and multiword expression identification. We show that, given sufficient training data, the parser benefits from explicit multiword information and improves overall labeled accuracy score in eight of the ten evaluation cases

    Benchmarking Joint Lexical and Syntactic Analysis on Multiword-Rich Data

    Get PDF
    International audienceThis article evaluates the extension of a dependency parser that performs joint syntactic analysis and multiword expression identification. We show that, given sufficient training data, the parser benefits from explicit multiword information and improves overall labeled accuracy score in eight of the ten evaluation cases

    Universal Dependencies for the AnCora treebanks

    Get PDF
    International audienceAbstract: The present article describes the conversion of the Catalan and Spanish AnCora treebanks to the Universal Dependencies formalism. We describe the conversion process and assess the quality of the resulting treebank in terms of parsing accuracy by means of monolingual, cross-lingual and cross-domain parsing evaluation. The converted treebanks show an internal consistency comparable to the one shown by the original CoNLL09 distribution of AnCora, and indicate some differences in terms of multiword expression inventory with regards to the already existing UD Spanish treebank. The two new converted treebanks will be released in version 1.3 of Universal Dependencies

    When is multitask learning effective? Semantic sequence prediction under varying data conditions

    Get PDF
    International audienceMultitask learning has been applied successfully to a range of tasks, mostly mor-phosyntactic. However, little is known on when MTL works and whether there are data characteristics that help to determine its success. In this paper we evaluate a range of semantic sequence labeling tasks in a MTL setup. We examine different auxiliary tasks, amongst which a novel setup, and correlate their impact to data-dependent conditions. Our results show that MTL is not always effective, significant improvements are obtained only for 1 out of 5 tasks. When successful, auxiliary tasks with compact and more uniform label distributions are preferable

    Multilingual projection for parsing truly low resource languages

    Get PDF
    International audienceWe propose a novel approach to cross-lingual part-of-speech tagging and dependency parsing for truly low-resource languages. Our annotation projection-based approach yields tagging and parsing models for over 100 languages. All that is needed are freely available parallel texts, and taggers and parsers for resource-rich languages. The empirical evaluation across 30 test languages shows that our method consistently provides top-level accuracies , close to established upper bounds, and outperforms several competitive baselines

    Determination of the minimum integral entropy, water sorption and glass transition temperature to establishing critical storage conditions of beetroot juice microcapsules by spray drying

    Get PDF
    The aim of this work was to microencapsulate beetroot juice (BJ) (Beta vulgaris L.) by spray-drying using as protective colloid gum Arabic. The adsorption isotherms of the microcapsules and the minimum integral entropy (∆S int)T were determined at 25, 35 and 40 ◦C. The glass transition temperature (Tg) was measured by differential scanning calorimetry and modeled by GordonTaylor equation. The water contents-water activity (M-aW ) sets obtained from (∆S int)T , and critical water content (CWC) and critical water activity (CWA) from the Tg were similar, being in the range of water content of 5.11-7.5 kg H2O/100 kg d.s. and in the water activity range of 0.532-0.590. These critical storage conditions were considered as the best conditions for increase the stability of the microcapsules, where the percentage of retention Betanin in the microcapsules was higher compared with other storage conditions in the temperature and aw range studied. Keywords: beetroot juice, microcapsules, minimum integral entropy, glass transition temperature, critical water content, critical water activity

    Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer

    Get PDF
    International audienceWe present an efficient and accurate method for transferring annotations between two different treebanks of the same language. This method led to the creation of a new instance of the French Treebank (Abeillé et al., 2003), which follows the Universal Dependency annotation scheme and which was proposed to the participants of the CoNLL 2017 Universal Dependency parsing shared task (Zeman et al., 2017). Strong results from an evaluation on our gold standard (94.75% of LAS, 99.40% UAS on the test set) demonstrate the quality of this new annotated data set and validate our approach

    Enfrentando los riesgos socionaturales

    Get PDF
    El objetivo del libro es comprender la magnitud de los Riesgos Socionaturales en México y Latinoamérica, para comprender el peligro que existe por algún tipo de desastre, ya sea inundaciones, sismos, remoción en masa, entre otros, además conocer qué medidas preventivas, correctivas y de contingencias existen para estar atentos ante alguna señal que la naturaleza esté enviando y así evitar alguna catástrofe. El libro se enfoca en los aspectos básicos de análisis de los peligros, escenarios de riesgo, vulnerabilidad y resiliencia, importantes para la gestión prospectiva o preventiva
    corecore