254 research outputs found
BlogForever: D3.1 Preservation Strategy Report
This report describes preservation planning approaches and strategies recommended by the BlogForever project as a core component of a weblog repository design. More specifically, we start by discussing why weblogs are worth preserving in the first place and what exactly we are trying to preserve. We further present a review of past and present work and highlight why current practices in web archiving do not address the needs of weblog preservation adequately. We make three distinctive contributions in this volume: a) we propose transferable practical workflows for applying a combination of established metadata and repository standards in developing a weblog repository, b) we provide an automated approach to identifying significant properties of weblog content that uses the notion of communities, and discuss how this affects previous strategies, c) we propose a sustainability plan that draws upon community knowledge through innovative repository design.
Authenticity and Provenance in Long Term Digital Preservation: Modeling and Implementation in Preservation Aware Storage
http://www.usenix.org/event/tapp09/tech/full_papers/factor/factor.pdf (Guercio, M.)
Building LABDRIVE, a Petabyte Scale, OAIS/ISO 16363 Conformant, Environmentally Sustainable Archive, Tested by Large Scientific Organisations to Preserve their Raw and Processed Data, Software and Documents
Vast amounts of scientific, cultural, social, business, government and other information are being created every day. There are billions of objects, in a multitude of formats, with varied semantics and associated software. Much, perhaps the majority, of this information is transitory, but there is still an immense amount which should be preserved for the medium and long term, perhaps even indefinitely.
Preservation requires that the information continues to be usable, not simply printed or displayed. Of course, the digital objects (the bits) must be preserved, as must the “metadata” which enables the bits to be understood, which includes the software.
Before LABDRIVE no system could adequately preserve such information, especially in such gigantic volume and variety.
In this paper we describe the development of LABDRIVE and its ability to preserve tens or hundreds of petabytes in a way which is conformant to the OAIS Reference Model and capable of being ISO 16363 certified.
The Preservation of Digital Objects in German Repositories: Three Case Studies (Die Archivierung digitaler Objekte in deutschen Repositorien: Drei Fallstudien)
Taking its cue from the increasing amount of digital content deposited into institutional and subject repositories, as well as the open question of repositories' role in long-term preservation, this study presents case studies of three German institutional and subject repositories, all of which are at a different stage of establishing a (cooperative) framework for the long-term preservation of their digital collections. Drawing on different sets of criteria for trustworthy repositories, it investigates which strategies the selected repositories pursue to preserve the digital assets in their collections, and how these strategies are implemented with the help of both human repository staff and the repository software used. The following repositories are considered: pedocs (Deutsches Institut für Internationale Pädagogische Forschung), JUWEL (Forschungszentrum Jülich), and Qucosa (SLUB Dresden). Because these can be regarded as examples of common types of (German) repositories, the results of this study might, on the one hand, serve as a guideline for repositories that intend, like the ones described here, to explore questions of long-term preservation in the near future, or are even taking their first concrete steps in this field. On the other hand, it is hoped that this work can at least give some hints as to the stage and status of long-term preservation in the German repository landscape.
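Ontologies and technological solutions for the integration of and access to digital cultural heritage: the Lithuanian experience (Ontologijos ir technologiniai sprendimai skaitmeninio kultūros paveldo integravimui ir prieigai: Lietuvos patirtis)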
Web technologies are key to meeting the full range of user needs in the digital age. On the other hand, the unified representation of digital content from diverse memory institutions, needed to ensure semantic integrity, remains a matter of urgency. Semantic interoperability of information and data is essential in an integrated system. In this paper, we analyze and describe an ontology-based metadata interoperability approach and how this approach could be applied to memory institution data from diverse sources which do not support ontologies. In particular, we describe the use of the CIDOC CRM ontology as a mediating schema within Lithuania’s Information System of the Virtual Electronic Heritage (hereinafter “VEPIS”). The paper introduces the role of the CIDOC CRM based Thesaurus of Personal Names, Geographical Names and Historical Chronology (hereinafter “BAVIC”), which operates as a core ontology within VEPIS by making it possible to understand things and the relationships between them, as well as to identify their time and space. The paper also focuses on trust in cultural information on the Web. Users make trust judgments based on provenance that may or may not be explicitly offered to them. In particular, we describe how provenance is managed within digital preservation and access processes within VEPIS and determine whether this management meets the W3C Provenance Incubator Group’s Requirements for Provenance on the Web. The paper is based on the results of research initiated in 2018–2019 at the Faculty of Communication and the Faculty of Mathematics and Informatics of Vilnius University by the authors of this paper.
Web technologies make it possible to meet the diverse information needs of users in the digital era. On the other hand, the semantic integrity of the digitised content that memory institutions publish on the web remains a pressing problem. Semantic interoperability of information and data content is especially relevant for integrated systems. The article describes the concept of ontology-based metadata and the role of the CIDOC/CRM ontology as a mediating schema in the VEPIS system. The article also introduces the Thesaurus of Personal Names, Geographical Names and Historical Chronology (BAVIC), which serves as the core ontology in VEPIS (it makes it possible to understand entities and their relationships, and their relation to time and space). Another problem analysed in the article concerns the trustworthiness of cultural information content on the web. Users judge the trustworthiness of information and data on the basis of provenance, which may or may not be presented to them explicitly. The article analyses how provenance is managed during the long-term preservation and dissemination of digitised content in VEPIS, and determines whether these processes meet the W3C Provenance Incubator Group's requirements for provenance on the web. The article draws on the results of research initiated by its authors in 2018–2019 at Vilnius University's Faculty of Communication and Faculty of Mathematics.
A Framework for Applying the Concept of Significant Properties to Datasets
The concept of significant properties, properties that must be identified and preserved in any successful digital object preservation, is now common in data curation. Although this notion has clearly demonstrated its usefulness in cultural heritage domains, its application to the preservation of scientific datasets is not as well developed. One obstacle to this application is that the familiar preservation models are not sufficiently explicit to identify the relevant entities, properties, and relationships involved in dataset preservation. We present a logic-based formal framework of dataset concepts that provides the levels of abstraction necessary to identify and correctly assign significant properties to their appropriate entities. A distinctive feature of this model is that it recognizes that a typed symbol structure is a requirement specific to datasets, but not to other information objects.
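A study on describing the provenance of metadata schemas to improve the long-term usability of metadata in networked information environments (ネットワーク情報環境におけるメタデータの長期利用性向上のためのメタデータスキーマの来歴記述に関する研究)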
筑波大学 (University of Tsukuba), 201
"No Going Back?" The final report of the Effective Records Management Project
The overall objective of the Effective Records Management project was the provision of ‘protocols and tools for the effective management of information in the digital order, with particular attention to information held in a document-based form’. The project developed a demonstrator system which addressed the full range of issues involved in the management of a ‘testbed’ of digital records in the form of documents, from their creation through distribution and use to their final disposal or permanent retention. The testbed selected was the records of a subset of the university’s committees, and it was used to develop ideas and to assess how well they worked in practice. This project report indicates that the ERM project provides an investment in the future, opportunities for better information use or re-use, identification of legal risks and identification of good practice, whether the reader is an administrator, a computing scientist, an archivist, a librarian or an information services manager.