27,359 research outputs found
Critique of Architectures for Long-Term Digital Preservation
Evolving technology and fading human memory threaten the long-term intelligibility of many kinds of documents. Furthermore, some records are susceptible to improper alterations that make them untrustworthy. Trusted Digital Repositories (TDRs) and Trustworthy Digital Objects (TDOs) seem to be the only broadly applicable digital preservation methodologies proposed. We argue that the TDR approach has shortfalls as a method for long-term digital preservation of sensitive information. Comparison of TDR and TDO methodologies suggests differentiating near-term preservation measures from what is needed for the long term.
TDO methodology addresses these needs, providing for making digital documents durably intelligible. It uses EDP standards for a few file formats and XML structures for text documents. For other information formats, intelligibility is assured by using a virtual computer. To protect sensitive information—content whose inappropriate alteration might mislead its readers, the integrity and authenticity of each TDO is made testable by embedded public-key cryptographic message digests and signatures. Key authenticity is protected recursively in a social hierarchy. The proper focus for long-term preservation technology is signed packages that each combine a record collection with its metadata and that also bind context—Trustworthy Digital Objects.
Facilitating Collaborative Metadata Creation for Faculty-initiated Digital Projects
The usability and long-term preservation of digital humanities projects, such as a digital archive or other project built around digitized materials, depend on thoughtful and thorough metadata creation. The variety of expertise required to create high-quality metadata for digital humanities projects practically requires a collaborative approach. Putting the call for collaboration into practice requires tools that are accessible and functional for all collaborators. Research on tools for metadata creation has tended to focus either on tools for librarians to manage digital project metadata or on tools for independent author metadata creation (Greenberg, 2003; Crystal & Greenberg, 2005). The literature has also tended to focus solely on the use of spreadsheets for metadata creation. Lincoln (2018) has discussed best practices for Google Sheets in archival metadata entry, and Broman and Woo (2017) have discussed best practices for spreadsheet data entry in general. This article positions tool selection and configuration as site of collaboration for the creation of digital project metadata through its examination of a Google Forms-based workflow for the creation and organization of metadata
Managing long-term access to digital data objects: a metadata approach
As society becomes increasingly reliant on information technology for data
exchange and long-term data storage the need for a system of data management to document and provide access to the 'societal memory' is becoming imperative. An examination of both the literature and current 'best Practice' underlines the absence to date of a proven universal conceptual basis to digital data preservation. The
examination of differences in nature and sources of origin, between traditional 'print-based' and digital objects leads to a re-appraisal of current practices of data selection and preservation. The need to embrace past, present and future metadata developments in a rapidly changing environment is considered. Various
hypotheses were formulated and supported regarding; the similarities and differences required in selection criteria for different types of Digital Data Objects (DDOs), the ability to define universal threshold standards for a framework of metadata for digital data preservation, and the role of selection criteria in such a framework. The research uses Soft Systems Methodology to investigate the
potential of the metadata concept as the key to universal data management. Semi-structured
interviews were conducted to explore the attitudes of information professionals in the United Kingdom towards the challenges facing information-dependent
organisations attempting to preserve digital data over the long-term. In
particular, the nature of DDOs being encountered by stakeholders, the reasons,
policies, and procedures for preserving them, together with a range of specific
issues such as; the role of metadata, access to, and rights management of DDOs.
The societal need for selection to ensure efficient long-term access is considered.
Drawing on - SSM modelling, this research develops a flexible, long-term management framework for digital data at a level higher than metadata, with
selection as an essential component. The framework's conceptual feasibility has
been examined from both financial and societal benefit perspectives, together with
the recognition of constraints. The super-metadata framework provides a possible
systematic approach to managing a wide range of digital data in a variety of
formats, created/owned by a spectrum of information-dependent organisations
Application of Preservation Metadata for Long-Term Accessibility of Digital Objects
In the process of digital preservation metadata management for long-term accessibility of
digital objects has been an important discussion point internationally. However, there is a gap
in the implementation of preservation metadata standards from theory to practice. Hitherto
little research has been conducted to show the application of preservation metadata and
therefore new case studies on both implementation and use of metadata standards in
preservation strategies is needed. The aim of this thesis is to study the extent of implementing
standard preservation metadata in the preservation practice at memory institutions.
This study adopts a qualitative method based upon a pragmatic approach and uses the case
study strategy. Metadata experts/specialists in three memory institutions (National Library of
Estonia, National Archives of Estonia and National Library of Wales) were interviewed using
semi-structured interviews, accompanied by document analysis.
Results of the study show that these memory institutions are recording a wide range of
metadata in all categories: descriptive, structural as well as administrative metadata (including
the rights, provenance, and technical metadata). They use metadata elements from a variety of
metadata standards/schema to suit their practical purposes. However, the level of exploitation
of preservation metadata standards differs in scale, data management practices as well as
heterogeneity of metadata recorded. Metadata is recorded about different digital objects like
books, WebPages, photographs, audio, video, and their files and bitstreams. The level of
implementation of metadata for each object type varies between institutions. The application
of the PREMIS metadata standard entities varies from institution to institution as it ranges
from reviewing/analyzing stage to practical implementation. Significant differences have also
been seen between national libraries and archives in mission, process of ingest, influence of
their traditional cataloguing practices and types of standards used for the development of their
metadata specification. In managing the metadata on the digital preservation processes
different problems and challenges have been faced and investigated by these memory
institutions and further research should be carried out to study other aspects of metadata
implementation.Joint Master Degree in Digital Library Learning (DILL
A semi-automated digital preservation system based on semantic Web services
This paper describes a Web-services-based system which we have developed to enable organizations to semi -automatically preserve their digital collections by dynamically discovering and invoking the most appropriate preservation service, as it is required. By periodically comparing preservation metadata for digital objects in a collection with a software version registry, potential object obsolescence can be detected and a notification message sent to the relevant agent. By making preservation software modules available as Web services and describing them semantically using a machine-processable ontology (OWL-S), the most appropriate preservation service(s) for each object can then be automatically discovered, composed and invoked by software agents (with optional human input at critical decision-making steps). We believe that this approach represents a significant advance towards providing a viable, cost-effective solution to the long term preservation of large-scale collections of digital objects
Diagnóstico de preservação digital dos repositórios institucionais das universidades públicas nacionais: metadados de preservação
The digital preservation metadata of institutional repositories are tools that ensure the preservation of digital objects in national public universities. The approach to this initiative in its informational or repositories preservation policies indicate that there are information retrieval. It aims to propose preservation strategies for the use of these metadata, based on the bibliography, information policies and preservation of institutional repositories. It analyzes how the description of preservation metadata sets occurs in the policies of these repositories and compares the strategies proposed by the researchers, using Bardin's content analysis method. The research is descriptive and explanatory, and the research approach will be quanti-qualitative. It concludes that preservation metadata in Institutional Repositories are not common in the digital preservation strategies of most universities. It also proposes guidelines for preservation metadata, making their description unique and accurate, especially regarding the objects entrusted to them for better reproduction and long-term accessibility.Os metadados de preservação digital dos repositórios institucionais são ferramentas que asseguram a preservação dos objetos digitais nas universidades públicas nacionais. A abordagem sobre essa iniciativa em suas políticas informacionais ou de preservação dos repositórios indica que há a recuperação da informação. Este trabalho objetiva propor estratégias de preservação para o uso desses metadados, com base na bibliografia, nas políticas de informação e preservação dos repositórios institucionais. Analisa como ocorre a descrição dos conjuntos de metadados de preservação nas políticas desses repositórios e compara as estratégias propostas pelos pesquisadores, usando o método de análise de conteúdo de Bardin. A pesquisa caracteriza-se descritiva e explicativa e, quanto à abordagem de pesquisa, será a quanti-qualitativa. Conclui-se que os metadados de preservação nos Repositório Institucionais não são comuns nas estratégias de preservação digital da maioria das universidades. Propõe diretrizes para metadados de preservação, tornando sua descrição singular e precisa, sobretudo quanto aos objetos a eles confiados para uma melhor reprodução e acessibilidade em longo prazo
Exploring digital preservation requirements: a case study from the National Geoscience Data Centre (NGDC)
Purpose
This case study is based on an MSc dissertation research undertaken at Northumbria University. The aim was to explore digital preservation requirements within the wider NGDC organisational framework in preparation for developing a preservation policy and integrating associated preservation workflows throughout the existing research data management processes.
Design/methodology/approach
This mixed methods case study used quantitative and qualitative data to explore the preservation requirements and triangulation to strengthen the design validity. Corporate and the wider scientific priorities were identified through literature and a stakeholder survey. Organisational preparedness was investigated through staff interviews.
Findings
Stakeholders expect data to be reliable, reusable, and available in preferred formats. To ensure digital continuity, the creation of high quality metadata is critical, and data depositors need data management training to achieve this. Recommendations include completing a risk assessment, creating a digital asset register, and a technology watch to mitigate against risks.
Research limitations/implications
The main constraint in this study is the lack of generalisability of results. As the NGDC is a unique organisation, it may not be possible to generalise the organisational findings although those relating to research data management may be transferrable.
Originality/value
This research examines the specific nature of geoscience data retention requirements and looks at existing NGDC procedures in terms of enhancing digital continuity, providing new knowledge on the preservation requirements for a number of national datasets
Digital preservation at Big Data scales: proposing a step-change in preservation system architectures
Purpose
The purpose of this paper is to consider how digital preservation system architectures will support business analysis of large-scale collections of preserved resources, and the use of Big Data analyses by future researchers.
Design/methodology/approach
This paper reviews the architecture of existing systems, then discusses experimental surveys of large digital collections using existing digital preservation tools at Big Data scales. Finally, it introduces the design of a proposed new architecture to work with Big Data volumes of preserved digital resources – also based upon experience of managing a collection of 30 million digital images.
Findings
Modern visualisation tools enable business analyses based on file-related metadata, but most currently available systems need more of this functionality “out-of-the-box”. Scalability of preservation architecture to Big Data volumes depends upon the ability to run preservation processes in parallel, so indexes that enable effective sub-division of collections are vital. Not all processes scale easily: those that do not require complex management.
Practical implications
The complexities caused by scaling up to Big Data volumes can be seen as being at odds with preservation, where simplicity matters. However, the sustainability of preservation systems relates directly to their usefulness, and maintaining usefulness will increasingly depend upon being able to process digital resources at Big Data volumes. An effective balance between these conflicting situations must be struck.
Originality/value
Preservation systems are at a step-change as they move to Big Data scale architectures and respond to more technical research processes. This paper is a timely illustration of the state of play at this pivotal moment.
The Polonsky Foundatio
- …