32,669 research outputs found

    Piloting access to the Belgian web-archive for scientific research: a methodological exploration

    The web is fraught with contradiction. On the one hand, the web has become a central means of information in everyday life and therefore holds the primary sources of our history created by a large variety of people (Milligan, 2016; Winters, 2017). Yet, much less importance is attached to its preservation, meaning that potentially interesting sources for future (humanities) research are lost. Web archiving therefore is a direct result of the computational turn and has a role to play in knowledge production and dissemination as demonstrated by a number of publications (e.g. Brügger & Schroeder, 2017) and research initiatives related to the research use of web archives (e.g. https://resaw.eu/). However, conducting research, and answering research questions based on web archives - in short, ‘using web archives as a data resource for digital scholars’ (Vlassenroot et al., 2019) - demonstrates that this so-called ‘computational turn’ in humanities and social sciences (i.e. the increased incorporation of advanced computational research methods and large datasets into disciplines which have traditionally dealt with considerably more limited collections of evidence) indeed requires new skills and new software. In December 2016, a pilot web-archiving project called PROMISE (PReserving Online Multiple Information: towards a Belgian StratEgy) was funded. The aim of the project was to (i) identify current best practices in web-archiving and apply them to the Belgian context, (ii) pilot Belgian web-archiving, (iii) pilot access (and use) of the pilot Belgian web archive for scientific research, and (iv) make recommendations for a sustainable web-archiving service for Belgium. Now that the project is moving towards its final stages, the project team is focusing on the third objective of the project, namely how to pilot access to the Belgian web archive for scientific research.
The aim of this presentation is to discuss how the PROMISE team approached piloting access to the Belgian web-archive for scientific research, including: a) reviewing how existing web-archives provide access to their collections for research, b) assessing the needs of researchers based on a range of initiatives focussing on research-use of web-archives (e.g. RESAW, BUDDAH, WARCnet, IIPC Research Working Group, etc.) and c) exploring how the five personas created as part of the French National Library’s Corpus project (Moiraghi, 2018) could help us to explore how different types of academic researchers might use web archives in their research. Finally, we will introduce the emerging Digital Research Lab at the Royal Library of Belgium (KBR) as part of a long-term collaboration with the Ghent Centre for Digital Humanities (GhentCDH) which aims to facilitate data-level access to KBR’s digitised and born-digital collections and could potentially provide the solution for offering research access to the Belgian web-archive.

    The selection, appraisal and retention of digital scientific data: highlights of an ERPANET/CODATA workshop

    CODATA and ERPANET collaborated to convene an international archiving workshop on the selection, appraisal, and retention of digital scientific data, which was held on 15-17 December 2003 at the Biblioteca Nacional in Lisbon, Portugal. The workshop brought together more than 65 researchers, data and information managers, archivists, and librarians from 13 countries to discuss the issues involved in making critical decisions regarding the long-term preservation of the scientific record. One of the major aims for this workshop was to provide an international forum to exchange information about data archiving policies and practices across different scientific, institutional, and national contexts. Highlights from the workshop discussions are presented

    Open access self-archiving: An author study

    This, our second international, cross-disciplinary author study on open access, had 1296 respondents. Its focus was on self-archiving. Almost half (49%) of the respondent population have self-archived at least one article during the last three years. Use of institutional repositories for this purpose has doubled and usage has increased by almost 60% for subject-based repositories. Self-archiving activity is greatest amongst those who publish the largest number of papers. There is still a substantial proportion of authors unaware of the possibility of providing open access to their work by self-archiving. Of the authors who have not yet self-archived any articles, 71% remain unaware of the option. With 49% of the author population having self-archived in some way, this means that 36% of the total author population (71% of the remaining 51%) has not yet been apprised of this way of providing open access. Authors have frequently expressed reluctance to self-archive because of the perceived time required and possible technical difficulties in carrying out this activity, yet findings here show that only 20% of authors found some degree of difficulty with the first act of depositing an article in a repository, and that this dropped to 9% for subsequent deposits. Another author worry is about infringing agreed copyright agreements with publishers, yet only 10% of authors currently know of the SHERPA/RoMEO list of publisher permissions policies with respect to self-archiving, where clear guidance as to what a publisher permits is provided. Where it is not known if permission is required, however, authors are not seeking it and are self-archiving without it. Communicating their results to peers remains the primary reason for scholars publishing their work; in other words, researchers publish to have an impact on their field.
The vast majority of authors (81%) would willingly comply with a mandate from their employer or research funder to deposit copies of their articles in an institutional or subject-based repository. A further 13% would comply reluctantly; 5% would not comply with such a mandate.
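
The 36% figure in the abstract above follows from two of its reported percentages. The short check below reproduces that arithmetic; the variable names are illustrative and not taken from the study.

```python
# Percentages as reported in the abstract.
self_archived = 0.49           # share of authors who have self-archived
unaware_among_rest = 0.71      # share of non-self-archivers unaware of the option

# Share of authors who have not self-archived.
not_self_archived = 1 - self_archived

# Share of ALL authors who are unaware of self-archiving.
unaware_overall = unaware_among_rest * not_self_archived

print(f"{not_self_archived:.0%} have not self-archived")
print(f"{unaware_overall:.1%} of all authors are unaware")
```

Rounded to whole percentages, the result matches the 36% quoted in the abstract.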

    Bots, Seeds and People: Web Archives as Infrastructure

    The field of web archiving provides a unique mix of human and automated agents collaborating to achieve the preservation of the web. Centuries-old theories of archival appraisal are being transplanted into the sociotechnical environment of the World Wide Web with varying degrees of success. The work of the archivist and bots in contact with the material of the web presents a distinctive and understudied CSCW-shaped problem. To investigate this space we conducted semi-structured interviews with archivists and technologists who were directly involved in the selection of content from the web for archives. These semi-structured interviews identified thematic areas that inform the appraisal process in web archives, some of which are encoded in heuristics and algorithms. Making the infrastructure of web archives legible to the archivist, the automated agents and the future researcher is presented as a challenge to the CSCW and archival community.

    Developing a model for e-prints and open access journal content in UK further and higher education

    A study carried out for the UK Joint Information Systems Committee examined models for the provision of access to material in institutional and subject-based archives and in open access journals. Their relative merits were considered, addressing not only technical concerns but also how e-print provision (by authors) can be achieved – an essential factor for an effective e-print delivery service (for users). A "harvesting" model is recommended, where the metadata of articles deposited in distributed archives are harvested, stored and enhanced by a national service. This model has major advantages over the alternatives of a national centralized service or a completely decentralized one. Options for the implementation of a service based on the harvesting model are presented
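
The "harvesting" model described in the abstract above can be pictured with a small sketch: metadata is copied from distributed archives into a central store and then enhanced, while the articles themselves stay where they were deposited. Everything here is hypothetical (the archive names, the `Record` fields, and the crude keyword step standing in for "enhancement"); it is not the service design proposed in the study.

```python
from dataclasses import dataclass, field


@dataclass
class Record:
    identifier: str
    title: str
    source: str = ""                      # filled in by the harvester
    subjects: list = field(default_factory=list)


# Hypothetical distributed archives, each exposing its own metadata records.
ARCHIVES = {
    "institutional-a": [Record("a:1", "E-print delivery models")],
    "subject-b": [Record("b:7", "Open access journals in the UK")],
}


def harvest(archives):
    """Copy metadata from every archive into one central store."""
    store = {}
    for name, records in archives.items():
        for rec in records:
            store[rec.identifier] = Record(rec.identifier, rec.title, source=name)
    return store


def enhance(store):
    """Toy enhancement: derive subject keywords from longer title words."""
    for rec in store.values():
        rec.subjects = sorted({w.lower() for w in rec.title.split() if len(w) > 5})
    return store


national_index = enhance(harvest(ARCHIVES))
for rec in national_index.values():
    print(rec.identifier, rec.source, rec.subjects)
```

The design point the sketch tries to show is the middle position of the model: deposits remain decentralized, while discovery runs against one enhanced index.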

    Journal publishing and author self-archiving: Peaceful Co-Existence and Fruitful Collaboration

    The UK Research Funding Councils (RCUK) have proposed that all RCUK fundees should self-archive on the web, free for all, their own final drafts of all journal articles reporting their RCUK-funded research, in order to maximise their usage and impact. ALPSP (a learned publishers' association) now seeks to delay and block the RCUK proposal, arguing that it will ruin journals. All objective evidence from the past decade and a half of self-archiving, however, shows that self-archiving can and does co-exist peacefully with journals while greatly enhancing both author/article and journal impact, to the benefit of both. Journal publishers should not be trying to delay and block self-archiving policy; they should be collaborating with the research community on ways to share its vast benefits

    The Open Challenge: A Brief History

    Milestones in the history of the Open Access (OA) Movement, especially the 1994 "Subversive Proposal" for authors to self-archive their peer-reviewed journal articles, the creation of the first OAI-compliant open source software for creating an Institutional Repository (EPrints, 2000), the evidence for the OA impact advantage (2001), the first OA Self-Archiving Mandate (U. Southampton ECS 2002), the OA Mandates Registry (ROARMAP, 2003), and the creation of the OA Policy Guidance organization for universities worldwide, EnablingOpenScholarship (EOS 2010)

    The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives

    The Archives Unleashed project aims to improve scholarly access to web archives through a multi-pronged strategy involving tool creation, process modeling, and community building, all proceeding concurrently in mutually reinforcing efforts. As we near the end of our initially-conceived three-year project, we report on our progress and share lessons learned along the way. The main contribution articulated in this paper is a process model that decomposes scholarly inquiries into four main activities: filter, extract, aggregate, and visualize. Based on the insight that these activities can be disaggregated across time, space, and tools, it is possible to generate "derivative products", using our Archives Unleashed Toolkit, that serve as useful starting points for scholarly inquiry. Scholars can download these products from the Archives Unleashed Cloud and manipulate them just like any other dataset, thus providing access to web archives without requiring any specialized knowledge. Over the past few years, our platform has processed over a thousand different collections from over two hundred users, totaling around 300 terabytes of web archives. This research was supported by the Andrew W. Mellon Foundation, the Social Sciences and Humanities Research Council of Canada, as well as Start Smart Labs, Compute Canada, the University of Waterloo, and York University. We’d like to thank Jeremy Wiebe, Ryan Deschamps, and Gursimran Singh for their contributions.
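
The four activities named in the abstract above (filter, extract, aggregate, visualize) can be illustrated with a minimal sketch. It runs on a handful of made-up capture records and uses only the Python standard library; it does not use, and should not be read as, the Archives Unleashed Toolkit's actual API. The record fields and function names are assumptions for illustration.

```python
from collections import Counter
from urllib.parse import urlparse

# Hypothetical records standing in for parsed web-archive captures.
RECORDS = [
    {"url": "https://www.kbr.be/en/", "crawl_date": "2019-03-01"},
    {"url": "https://www.kbr.be/nl/", "crawl_date": "2019-03-02"},
    {"url": "https://www.ugent.be/", "crawl_date": "2019-03-02"},
    {"url": "https://www.ugent.be/en", "crawl_date": "2018-11-20"},
]


def filter_records(records, year):
    """Filter: keep only captures crawled in the given year."""
    return [r for r in records if r["crawl_date"].startswith(year)]


def extract_domains(records):
    """Extract: pull the host name out of each capture URL."""
    return [urlparse(r["url"]).netloc for r in records]


def aggregate(domains):
    """Aggregate: count captures per domain (a small 'derivative product')."""
    return Counter(domains)


def visualize(counts):
    """Visualize: render the counts as a plain-text bar chart."""
    return "\n".join(
        f"{domain:<15} {'#' * n} ({n})" for domain, n in counts.most_common()
    )


counts = aggregate(extract_domains(filter_records(RECORDS, "2019")))
print(visualize(counts))
```

Because each stage only consumes the previous stage's output, the aggregated counts could be saved and visualized later with a different tool, which is the kind of disaggregation the abstract describes.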

    The European Landscape of Qualitative Social Research Archives: Methodological and Practical Issues

    In this article I set about describing current practices in archiving and reusing qualitative data. I discuss where you can find archived sources of qualitative data, and discuss some of the debates surrounding methodological, ethical and theoretical considerations relating to re-using data. I then address more pragmatic issues involved in acquiring, preserving, providing access to and supporting the use of the data. Where best do qualitative data collections sit: in traditional libraries or archives alongside historical documents, or as part of more holistic digital collections of contemporary social science research resources? This question relates to accessibility, resource discovery and cataloging methods, data preparation and documentation and promotional and outreach efforts to encourage data use. The ESDS Qualidata unit at the UK Data Archive is used as a case study for showcasing archival practices, and is situated within the broader European landscape of social science-oriented data archives. Infrastructure requirements for running an archive are discussed, along with a look forward to future developments.

    Publish and Die
