2,395 research outputs found

    Open Data Platform for Knowledge Access in Plant Health Domain : VESPA Mining

    Important data are locked in ancient literature. It would be uneconomical to produce these data again today, or to extract them without the help of text mining technologies. Vespa is a text mining project whose aim is to extract data on pest and crop interactions, to model and predict attacks on crops, and to reduce the use of pesticides. A few attempts have proposed agricultural information access. Another original aspect of our work is that documents are parsed in a way that depends on the document architecture.

    Mining Domain-Specific Thesauri from Wikipedia: A case study

    Domain-specific thesauri are high-cost, high-maintenance, high-value knowledge structures. We show how the classic thesaurus structure of terms and links can be mined automatically from Wikipedia. In a comparison with a professional thesaurus for agriculture we find that Wikipedia contains a substantial proportion of its concepts and semantic relations; furthermore, it has impressive coverage of contemporary documents in the domain. Thesauri derived using our techniques capitalize on existing public efforts and tend to reflect contemporary language usage better than their costly, painstakingly constructed manual counterparts.

    A semantic-based platform for the digital analysis of architectural heritage

    This essay focuses on the fields of architectural documentation and digital representation. We present research on the development of an information system at the scale of architecture, taking into account the relationships that can be established between the representation of buildings (shape, dimension, state of conservation, hypothetical restitution) and heterogeneous information from various fields (such as the technical, the documentary, or the historical). The proposed approach aims to organize multiple representations (and associated information) around a semantic description model, with the goal of defining a system for the multi-field analysis of buildings.

    Scholars Forum: A New Model For Scholarly Communication

    Scholarly journals have flourished for over 300 years because they successfully address a broad range of authors' needs: to communicate findings to colleagues, to establish precedence of their work, to gain validation through peer review, to establish their reputation, to know the final version of their work is secure, and to know their work will be accessible by future scholars. Eventually, the development of comprehensive paper and then electronic indexes allowed past work to be readily identified and cited. Just as postal service made it possible to share scholarly work regularly and among a broad readership, the Internet now provides a distribution channel with the power to reduce publication time and to expand traditional print formats by supporting multi-media options and threaded discourse. Despite widespread acceptance of the web by the academic and research community, the incorporation of advanced network technology into a new paradigm for scholarly communication by the publishers of print journals has not materialized. Nor have journal publishers used the lower cost of distribution on the web to make online versions of journals available at lower prices than print versions. It is becoming increasingly clear to the scholarly community that we must envision and develop for ourselves a new, affordable model for disseminating and preserving results, that synthesizes digital technology and the ongoing needs of scholars. In March 1997, with support from the Engineering Information Foundation, Caltech sponsored a Conference on Scholarly Communication to open a dialogue around key issues and to consider the feasibility of alternative undertakings. A general consensus emerged recognizing that the certification of scholarly articles through peer review could be "decoupled" from the rest of the publishing process, and that the peer review process is already supported by the universities whose faculty serve as editors, members of editorial boards, and referees. 
In the meantime, pressure to enact regressive copyright legislation has added another important element. The ease with which electronic files may be copied and forwarded has encouraged publishers and other owners of copyrighted material to seek means for denying access to anything they own in digital form to all but active subscribers or licensees. Furthermore, should publishers retain the only version of a publication in digital form, there is a significant risk that this material may eventually be lost through culling little-used or unprofitable back-files, through not investing in conversion expense as technology evolves, through changes in ownership, or through catastrophic physical events. Such a scenario presents an intolerable threat to the future of scholarship.

    Report of the Stanford Linked Data Workshop

    The Stanford University Libraries and Academic Information Resources (SULAIR), with the Council on Library and Information Resources (CLIR), conducted a week-long workshop on the prospects for a large-scale, multi-national, multi-institutional prototype of a Linked Data environment for discovery of and navigation among the rapidly, chaotically expanding array of academic information resources. As preparation for the workshop, CLIR sponsored a survey by Jerry Persons, Chief Information Architect emeritus of SULAIR, that was published originally for workshop participants as background to the workshop and is now publicly available. The original intention of the workshop was to devise a plan for such a prototype. However, such was the diversity of knowledge, experience, and views of the potential of Linked Data approaches that the workshop participants turned to two more fundamental goals: building common understanding and enthusiasm on the one hand, and identifying opportunities and challenges to be confronted in the preparation of the intended prototype and its operation on the other. In pursuit of those objectives, the workshop participants produced:
    1. a value statement addressing the question of why a Linked Data approach is worth prototyping;
    2. a manifesto for Linked Libraries (and Museums and Archives and …);
    3. an outline of the phases in a life cycle of Linked Data approaches;
    4. a prioritized list of known issues in generating, harvesting & using Linked Data;
    5. a workflow with notes for converting library bibliographic records and other academic metadata to URIs;
    6. examples of potential “killer apps” using Linked Data; and
    7. a list of next steps and potential projects.
    This report includes a summary of the workshop agenda, a chart showing the use of Linked Data in cultural heritage venues, and short biographies and statements from each of the participants.

    a digital research tool for Social Sciences, Arts and Humanities

    LISBOA-01-0145-FEDER-022139
    ROSSIO Infrastructure is building a free, open-access platform that aims to aggregate, organise and connect digital resources related to Social Sciences, Arts and Humanities located in Portuguese educational and cultural institutions. This paper presents the ROSSIO infrastructure, the institutions involved, its main goals and the services it will provide, such as a discovery portal, exhibitions, collections and a virtual research environment. Underlying these services is a metadata aggregation approach that brings into ROSSIO the metadata on digital objects from the providing institutions. The aggregated dataset is transformed into linked data and enriched with entities from controlled vocabularies, which are defined by ROSSIO. We will detail this process, including the applications employed and how they interoperate. Finally, we will reflect on the potential of these services for public dissemination of science, taking into account the FAIR principles.

    Enhancing Heritage fruition through 3D semantic modelling and digital tools: the INCEPTION project

    The INCEPTION project, “Inclusive Cultural Heritage in Europe through 3D Semantic Modelling”, which started in June 2015 and lasts four years, aims at developing advanced 3D modelling for accessing and understanding European cultural assets. One of the main challenges of the project is to close the gap between effective user experiences of Cultural Heritage via digital tools and representations, and the enrichment of scientific knowledge. Within this framework, the INCEPTION project goals are consistently aligned with the main objectives of accessing, understanding and strengthening European cultural heritage by means of enriched 3D models. At the end of its third year of activity, the project now faces several challenging actions that build on advances already made in 3D data capturing and holistic digital documentation, across interdisciplinary and cross-cutting fields of knowledge. In this direction, the approach and methodology for semantic organization and data management toward H-BIM modelling will be presented, as well as a preliminary nomenclature for the semantic enrichment of heritage 3D models. According to the overall INCEPTION workflow, the H-BIM modelling procedure starts with documenting the needs of users, both experts and non-experts. Identifying the semantic ontology of Cultural Heritage buildings and the data structure for the information catalogue will allow the integration of semantic attributes with hierarchically and mutually aggregated 3D digital geometric models for the management of heritage information.

    a digital humanities platform to explore the Portuguese cultural heritage

    LISBOA-01-0145-FEDER-022139
    The ROSSIO Infrastructure is developing a free and open-access platform for aggregating, organising, and connecting the digital resources in the Social Sciences, Arts and Humanities provided by Portuguese higher education and cultural institutions. This paper presents an overview of the ROSSIO Infrastructure, its main objectives, the institutions involved, and the services the infrastructure aims to offer through its platform, namely a discovery portal, digital exhibitions, collections, and a virtual research environment. These services rely on a metadata-aggregation solution for bringing the digital objects’ metadata from the providing institutions into ROSSIO. The aggregated datasets are converted into linked data and undergo an enrichment process based on controlled vocabularies, which are developed and published by ROSSIO. The paper will describe this process, the applications involved, and how they interoperate. We will further reflect on how these services may enhance the dissemination of science, considering the FAIR principles.