19 research outputs found

    Information extraction from bibliography for Marker Assisted Selection in wheat

    Get PDF
    Improvement of most animal and plant species of agronomical interest in the near future has become an international stake because of the increasing demand for feeding a growing world population and to mitigate the reduction of the industrial resources. The recent advent of genomic tools contributed to improve the discovery of linkage between molecular markers and genes that are involved in the control of traits of agronomical interest such as grain number or disease resistance. This information is mostly published as scientific papers but rarely available in databases. Here, we present a method aiming at automatically extract this information from the scientific literature and relying on a knowledge model of the target information and on the WheatPhenotype ontology that we developed for this purpose. The information extraction results were evaluated and integrated into the on-line semantic search engine [i]AlvisIR WheatMarker.[/i

    Event extraction of bacteria biotopes: a knowledge-intensive NLP-based approach

    Get PDF
    International audienceBackground: Bacteria biotopes cover a wide range of diverse habitats including animal and plant hosts, natural, medical and industrial environments. The high volume of publications in the microbiology domain provides a rich source of up-to-date information on bacteria biotopes. This information, as found in scientific articles, is expressed in natural language and is rarely available in a structured format, such as a database. This information is of great importance for fundamental research and microbiology applications (e.g., medicine, agronomy, food, bioenergy). The automatic extraction of this information from texts will provide a great benefit to the field

    Corpus-based extension of termino-ontology by linguistic analysis: a use case in biomedical event extraction

    No full text
    The automatic population of a termino-ontology is a difficult and challenging task. We propose a text-based ontology extension method that was experimented and evaluated on, for a semantic annota-tion task in the biomedical domain. It is based on the linguistic analysis of terms and their heads. The head-based method improves both the identification of rele-vant areas of a termino-ontology and the matching of the corpus terms within the-se areas

    Corpus-based extension of termino-ontology by linguistic analysis: a use case in biomedical event extraction

    No full text
    National audienceThe automatic population of a termino-ontology is a difficult and challenging task. We propose a text-based ontology extension method that was experimented and evaluated on, for a semantic annota-tion task in the biomedical domain. It is based on the linguistic analysis of terms and their heads. The head-based method improves both the identification of rele-vant areas of a termino-ontology and the matching of the corpus terms within the-se areas

    Improving term extraction with linguistic analysis in the biomedical domain

    No full text
    International audienceThis paper presents a linguistic-based approach to term extraction in the biomedical domain. The method is based on a linguistic analysis of constraints on terms and their context, focusing on participles and prepositional complements. The purpose of our approach is to obtain terms that are relevant for knowledge acquisition applications, such as the creation and enrichment of terminologies and ontologies. We report on the evaluations conducted following two complementary strategies, using a reference terminology and a manual validation. They were applied to two corpora of differing genre and domain, namely pharmacology patents and animal physiology scientific articles. Our work shows that the linguistic analysis-based developments significantly improve extraction results. The method is especially efficient when dealing with gerunds and "to" prepositional modifier

    Improving term extraction with linguistic analysis in the biomedical domain

    No full text
    This paper presents a linguistic-based approach to term extraction in the biomedical domain. The method is based on a linguistic analysis of constraints on terms and their context, focusing on participles and prepositional complements. The purpose of our approach is to obtain terms that are relevant for knowledge acquisition applications, such as the creation and enrichment of terminologies and ontologies. We report on the evaluations conducted following two complementary strategies, using a reference terminology and a manual validation. They were applied to two corpora of differing genre and domain, namely pharmacology patents and animal physiology scientific articles. Our work shows that the linguistic analysis-based developments significantly improve extraction results. The method is especially efficient when dealing with gerunds and "to" prepositional modifier

    Improving term extraction with linguistic analysis in the biomedical domain

    No full text
    à ce jour 31/01/2014 cette parution est en " draft version "International audienceThis paper presents a linguistic-based approach to term extraction in the biomedical domain. The method is based on a linguistic analysis of constraints on terms and their context, focusing on participles and prepositional complements. The purpose of our approach is to obtain terms that are relevant for knowledge acquisition applications, such as the creation and enrichment of terminologies and ontologies. We report on the evaluations conducted following two complementary strategies, using a reference terminology and a manual validation. They were applied to two corpora of differing genre and domain, namely pharmacology patents and animal physiology scientific articles. Our work shows that the linguistic analysis-based developments significantly improve extraction results. The method is especially efficient when dealing with gerunds and "to" prepositional modifier

    Building large lexicalized ontologies from text: a use case in automatic indexing of biotechnology patents

    No full text
    International audienceThis paper presents a tool, TyDI, and methods experimented in the building of a termino-ontology, i.e. a lexicalized ontology aimed at fine-grained indexation for semantic search applications. TyDI provides facilities for knowledge engineers and domain experts to efficiently collaborate to validate, organize and conceptualize corpus extracted terms. A use case on biotechnology patent search demonstrates TyDI’s potential
    corecore