942 research outputs found

    Boosting terminology extraction through crosslingual resources

    Get PDF
    Terminology Extraction is an important Natural Language Processing task with multiple applications in many areas. The task has been approached from different points of view using different techniques. Language and domain independent systems have been proposed as well. Our contribution in this paper focuses on the improvements on Terminology Extraction using crosslingual resources and specifically the Wikipedia and on the use of a variant of PageRank for scoring the candidate terms. // La extracción de terminología es una tarea de procesamiento de la lengua sumamente importante y aplicable en numerosas áreas. La tarea se ha abordado desde múltiples perspectivas y utilizando técnicas diversas. También se han propuesto sistemas independientes de la lengua y del dominio. La contribución de este artículo se centra en las mejoras que los sistemas de extracción de terminología pueden lograr utilizando recursos translingües, y concretamente la Wikipedia y en el uso de una variante de PageRank para valorar los candidatos a términoPeer ReviewedPostprint (published version

    TALP-UPC at MediaEval 2014 Placing Task: Combining geographical knowledge bases and language models for large-scale textual georeferencing

    Get PDF
    This paper describes our Georeferencing approaches, experiments, and results at the MediaEval 2014 Placing Task evaluation. The task consists of predicting the most probable geographical coordinates of Flickr images and videos using its visual, audio and metadata associated features. Our approaches used only Flickr users textual metadata annotations and tagsets. We used four approaches for this task: 1) an approach based on Geographical Knowledge Bases (GeoKB), 2) the Hiemstra Language Model (HLM) approach with Re-Ranking, 3) a combination of the GeoKB and the HLM (GeoFusion). 4) a combination of the GeoFusion with a HLM model derived from the English Wikipedia georeferenced pages. The HLM approach with Re-Ranking showed the best performance within 10m to 1km distances. The GeoFusion approaches achieved the best results within the margin of errors from 10km to 5000km. This work has been supported by the Spanish Research Department (SKATER Project: TIN2012-38584-C06-01). TALP Research Center is recognized as a Quality Research Group (2014 SGR 1338) by AGAUR, the Research Department of the Catalan Government.Peer ReviewedPostprint (published version

    Spanish named entity recognition in the biomedical domain

    Get PDF
    Named Entity Recognition in the clinical domain and in languages different from English has the difficulty of the absence of complete dictionaries, the informality of texts, the polysemy of terms, the lack of accordance in the boundaries of an entity, the scarcity of corpora and of other resources available. We present a Named Entity Recognition method for poorly resourced languages. The method was tested with Spanish radiology reports and compared with a conditional random fields system.Peer ReviewedPostprint (author's final draft

    A Machine learning approach to POS tagging

    Get PDF
    We have applied inductive learning of statistical decision trees and relaxation labelling to the Natural Language Processing (NLP) task of morphosyntactic disambiguation (Part Of Speech Tagging). The learning process is supervised and obtains a language model oriented to resolve POS ambiguities. This model consists of a set of statistical decision trees expressing distribution of tags and words in some relevant contexts. The acquired language models are complete enough to be directly used as sets of POS disambiguation rules, and include more complex contextual information than simple collections of n-grams usually used in statistical taggers. We have implemented a quite simple and fast tagger that has been tested and evaluated on the Wall Street Journal (WSJ) corpus with a remarkable accuracy. However, better results can be obtained by translating the trees into rules to feed a flexible relaxation labelling based tagger. In this direction we describe a tagger which is able to use information of any kind (n-grams, automatically acquired constraints, linguistically motivated manually written constraints, etc.), and in particular to incorporate the machine learned decision trees. Simultaneously, we address the problem of tagging when only small training material is available, which is crucial in any process of constructing, from scratch, an annotated corpus. We show that quite high accuracy can be achieved with our system in this situation.Postprint (published version

    Museo rural en Pozorrubio de Santiago

    Full text link
    En número dedicado a: La provincia de Cuenc

    Topic modeling for entity linking using keyphrase

    Get PDF
    This paper proposes an Entity Linking system that applies a topic modeling ranking. We apply a novel approach in order to provide new relevant elements to the model. These elements are keyphrases related to the queries and gathered from a huge Wikipedia-based knowledge resourcePeer ReviewedPostprint (author’s final draft

    Innovación y tendencias alimentarias

    Get PDF
    La obesidad, las enfermedades cardiovasculares, y la diabetes son las principalescausas de muerte de los adultos en el mundo. Muchas de estas enfermedadespueden prevenirse a través de la alimentación saludable, y así lo ha percibido lapoblación la cual demanda alimentos funcionales que aporten salud, juventud,belleza. Así las ventas de alimentos saludables alcanzaron los 900 mil millonesde US$ durante el año 2015 y en muchos países supone un 20% del gasto de lacesta de la compra. El nuevo consumidor da importancia al origen e historia delos productos, prefiere alimentos sin gluten, orgánicos, busca una alimentaciónconsciente, personalizada a su medida y quiere vivir una experiencia alimentaria.Por ello las empresas han generado una oferta amplia de nuevos productos loscuales han tenido éxito en los supermercados como belVita u Organic Valley,pero queda trabajo por realizar y generar nuevos productos que tengan un “healthclaim” avalado científicamente con estudios rigurosos.Palabras claves: Alimentación Saludable, Innovación, Tendencia

    Los hierros en Molina de Aragón

    Full text link

    El "pelegri" de Tossa

    Full text link
    En número dedicado a: Gerona: comarcas de La Selva el Gironés y el Bajo Ampurdá

    El carnaval de Bielsa

    Full text link
    En número dedicado a: Huesc
    corecore