9 research outputs found

    Proceedings of the 6th Dutch-Belgian Information Retrieval Workshop

    Get PDF

    Dataset Preservation for the Long Term: Results of the DareLux Project

    Full text link

    Critique of Architectures for Long-Term Digital Preservation

    Get PDF
    Evolving technology and fading human memory threaten the long-term intelligibility of many kinds of documents. Furthermore, some records are susceptible to improper alterations that make them untrustworthy. Trusted Digital Repositories (TDRs) and Trustworthy Digital Objects (TDOs) seem to be the only broadly applicable digital preservation methodologies proposed. We argue that the TDR approach has shortfalls as a method for long-term digital preservation of sensitive information. Comparison of TDR and TDO methodologies suggests differentiating near-term preservation measures from what is needed for the long term. TDO methodology addresses these needs, providing for making digital documents durably intelligible. It uses EDP standards for a few file formats and XML structures for text documents. For other information formats, intelligibility is assured by using a virtual computer. To protect sensitive information—content whose inappropriate alteration might mislead its readers, the integrity and authenticity of each TDO is made testable by embedded public-key cryptographic message digests and signatures. Key authenticity is protected recursively in a social hierarchy. The proper focus for long-term preservation technology is signed packages that each combine a record collection with its metadata and that also bind context—Trustworthy Digital Objects.

    Digital Curation and Costs: Approaches and Perceptions

    Get PDF
    The production of large volumes of scientific information, considering its cost, requires approaches that ensure its maintenance, reuse and recovery. These concerns prompted the emergence of digital curation. We intend to discuss the relevant thinking concerning the costs of digital curation. This means addressing the definition of the concept and the issue of costs, based on the studies related to cost models. A literature review was conducted using B-On and RCAAP as research sources, exploring the perceptions of the authors regarding the digital curation and its costs. The views expressed were organized around a scheme based on the Digital Curation Centre (DCC) lifecycle and the reference model Open Archival Information System (OAIS). It is proposed a systematization of digital curation issues bridging the DCC life cycle view of the digital object curation to the OAIS reference model approach, using a cross view seized by cost models and plan/data management policies

    Planning for the Lifecycle Management and Long-Term Preservation of Research Data: A Federated Approach

    Get PDF
    Outcomes of the grant are archived here.The “data deluge” is a recent but increasingly well-understood phenomenon of scientific and social inquiry. Large-scale research instruments extend our observational power by many orders of magnitude but at the same time generate massive amounts of data. Researchers work feverishly to document and preserve changing or disappearing habitats, cultures, languages, and artifacts resulting in volumes of media in various formats. New software tools mine a growing universe of historical and modern texts and connect the dots in our semantic environment. Libraries, archives, and museums undertake digitization programs creating broad access to unique cultural heritage resources for research. Global-scale research collaborations with hundreds or thousands of participants, drive the creation of massive amounts of data, most of which cannot be recreated if lost. The University of Kansas (KU) Libraries in collaboration with two partners, the Greater Western Library Alliance (GWLA) and the Great Plains Network (GPN), received an IMLS National Leadership Grant designed to leverage collective strengths and create a proposal for a scalable and federated approach to the lifecycle management of research data based on the needs of GPN and GWLA member institutions.Institute for Museum and Library Services LG-51-12-0695-1

    Surveying the state of data curation: a review of policy and practice in UK HEIs

    Get PDF
    As the value of research data has become increasingly recognized in the United Kingdom by public funding organizations, pressure has been placed on higher education institutions to provide access to research data or risk future funding. As a result, research data services have emerged rapidly over the past few years. However, it is not clear whether these services effectively ensure the long-term preservation of research data or apply appropriate data curation measures. Through a three-part methodology, the dissertation research aimed to provide a clear picture of the current state of data curation in UK HEIs, including adherence to best practices and the existence of provisions for data curation efforts. A survey questionnaire was disseminated as the primary method of data collection, and additional information was gathered through a literature review and an analysis of online resources and institutional policies. Data curation practices were found to be mostly inconsistent with best practices and were largely focused on facilitating access to research data. However, there is an awareness of the underdeveloped areas of data curation, especially preservation, and efforts are being made to improve these areas. Institutional policies were found to be mostly documents that defined roles and responsibilities and provided little guidance for follow-through. The role of researchers was repeatedly emphasized in both policy and practice and was essential in understanding the current state of data curation

    Research Data Curation and Management Bibliography

    Get PDF
    This e-book includes over 800 selected English-language articles and books that are useful in understanding the curation of digital research data in academic and other research institutions. It covers topics such as research data creation, acquisition, metadata, provenance, repositories, management, policies, support services, funding agency requirements, open access, peer review, publication, citation, sharing, reuse, and preservation. It has live links to included works. Abstracts are included in this bibliography if a work is under certain Creative Commons Licenses. This book is licensed under a Creative Commons Attribution 4.0 International License. Cite as: Bailey, Charles W., Jr. Research Data Curation and Management Bibliography. Houston: Digital Scholarship, 2021

    Long-time preservation of data sets, results of the DareLux project

    No full text
    corecore