Search CORE

32,669 research outputs found

Piloting access to the Belgian web-archive for scientific research: a methodological exploration

Author: Chambers Sally
Geeraert Friedel
Mechant Peter
Vlassenroot Eveline
Publication venue: Maynooth University Arts and Humanities Institute
Publication date: 01/01/2020

The web is fraught with contradiction. On the one hand, the web has become a central means of information in everyday life and therefore holds the primary sources of our history created by a large variety of people (Milligan, 2016; Winters, 2017). On the other hand, much less importance is attached to its preservation, meaning that potentially interesting sources for future (humanities) research are lost. Web archiving therefore is a direct result of the computational turn and has a role to play in knowledge production and dissemination, as demonstrated by a number of publications (e.g. Brügger & Schroeder, 2017) and research initiatives related to the research use of web archives (e.g. https://resaw.eu/). However, conducting research and answering research questions based on web archives - in short, ‘using web archives as a data resource for digital scholars’ (Vlassenroot et al., 2019) - demonstrates that this so-called ‘computational turn’ in humanities and social sciences (i.e. the increased incorporation of advanced computational research methods and large datasets into disciplines which have traditionally dealt with considerably more limited collections of evidence) indeed requires new skills and new software. In December 2016, a pilot web-archiving project called PROMISE (PReserving Online Multiple Information: towards a Belgian StratEgy) was funded. The aim of the project was to (i) identify current best practices in web-archiving and apply them to the Belgian context, (ii) pilot Belgian web-archiving, (iii) pilot access (and use) of the pilot Belgian web archive for scientific research, and (iv) make recommendations for a sustainable web-archiving service for Belgium. Now that the project is moving towards its final stages, the project team is focusing on the third objective of the project, namely how to pilot access to the Belgian web archive for scientific research.
The aim of this presentation is to discuss how the PROMISE team approached piloting access to the Belgian web-archive for scientific research, including: a) reviewing how existing web-archives provide access to their collections for research, b) assessing the needs of researchers based on a range of initiatives focussing on research-use of web-archives (e.g. RESAW, BUDDAH, WARCnet, IIPC Research Working Group, etc.) and c) exploring how the five personas created as part of the French National Library’s Corpus project (Moiraghi, 2018) could help us to explore how different types of academic researchers might use web archives in their research. Finally, we will introduce the emerging Digital Research Lab at the Royal Library of Belgium (KBR) as part of a long-term collaboration with the Ghent Centre for Digital Humanities (GhentCDH), which aims to facilitate data-level access to KBR’s digitised and born-digital collections and could potentially provide the solution for offering research access to the Belgian web-archive.

Ghent University Academic Bibliography

The selection, appraisal and retention of digital scientific data: highlights of an ERPANET/CODATA workshop

Author: Anderson W.
Davidson J.
Esanu E.
Ross S.
Publication venue: Ubiquity Press Ltd.
Publication date: 01/12/2004

CODATA and ERPANET collaborated to convene an international archiving workshop on the selection, appraisal, and retention of digital scientific data, which was held on 15-17 December 2003 at the Biblioteca Nacional in Lisbon, Portugal. The workshop brought together more than 65 researchers, data and information managers, archivists, and librarians from 13 countries to discuss the issues involved in making critical decisions regarding the long-term preservation of the scientific record. One of the major aims for this workshop was to provide an international forum to exchange information about data archiving policies and practices across different scientific, institutional, and national contexts. Highlights from the workshop discussions are presented

Enlighten

Open access self-archiving: An author study

Author: Brown Sheridan
Swan Alma
Publication venue
Publication date: 01/01/2005

This, our second international, cross-disciplinary author study on open access, had 1296 respondents. Its focus was on self-archiving. Almost half (49%) of the respondent population have self-archived at least one article during the last three years. Use of institutional repositories for this purpose has doubled and usage has increased by almost 60% for subject-based repositories. Self-archiving activity is greatest amongst those who publish the largest number of papers. There is still a substantial proportion of authors unaware of the possibility of providing open access to their work by self-archiving. Of the authors who have not yet self-archived any articles, 71% remain unaware of the option. With 49% of the author population having self-archived in some way, this means that 36% of the total author population (71% of the remaining 51%) has not yet been apprised of this way of providing open access. Authors have frequently expressed reluctance to self-archive because of the perceived time required and possible technical difficulties in carrying out this activity, yet findings here show that only 20% of authors found some degree of difficulty with the first act of depositing an article in a repository, and that this dropped to 9% for subsequent deposits. Another author worry is about infringing agreed copyright agreements with publishers, yet only 10% of authors currently know of the SHERPA/RoMEO list of publisher permissions policies with respect to self-archiving, where clear guidance as to what a publisher permits is provided. Where it is not known if permission is required, however, authors are not seeking it and are self-archiving without it. Communicating their results to peers remains the primary reason for scholars publishing their work; in other words, researchers publish to have an impact on their field.
The vast majority of authors (81%) would willingly comply with a mandate from their employer or research funder to deposit copies of their articles in an institutional or subject-based repository. A further 13% would comply reluctantly; 5% would not comply with such a mandate

Southampton (e-Prints Soton)

CogPrints Cognitive Sciences Eprint Archive

Bots, Seeds and People: Web Archives as Infrastructure

Author: Booms Hans
Botticelli Peter
Cook Terry
Couture Carol
Geertz Clifford
Jordan Brigitte
Mohr Gordon
Niu Jinfang
Seaver Nick
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 08/11/2016

The field of web archiving provides a unique mix of human and automated agents collaborating to achieve the preservation of the web. Centuries-old theories of archival appraisal are being transplanted into the sociotechnical environment of the World Wide Web with varying degrees of success. The work of the archivist and bots in contact with the material of the web presents a distinctive and understudied CSCW-shaped problem. To investigate this space we conducted semi-structured interviews with archivists and technologists who were directly involved in the selection of content from the web for archives. These semi-structured interviews identified thematic areas that inform the appraisal process in web archives, some of which are encoded in heuristics and algorithms. Making the infrastructure of web archives legible to the archivist, the automated agents and the future researcher is presented as a challenge to the CSCW and archival community

arXiv.org e-Print Archive

Crossref

Developing a model for e-prints and open access journal content in UK further and higher education

Author: Brown Sheridan
Hardy Rachel
Muir Adrienne
Needham Paul
Oppenheim Charles
O’Brien Ann
Probets Steve
Rowland Fytton
Swan Alma
Publication venue
Publication date: 01/01/2005

A study carried out for the UK Joint Information Systems Committee examined models for the provision of access to material in institutional and subject-based archives and in open access journals. Their relative merits were considered, addressing not only technical concerns but also how e-print provision (by authors) can be achieved – an essential factor for an effective e-print delivery service (for users). A "harvesting" model is recommended, where the metadata of articles deposited in distributed archives are harvested, stored and enhanced by a national service. This model has major advantages over the alternatives of a national centralized service or a completely decentralized one. Options for the implementation of a service based on the harvesting model are presented

Southampton (e-Prints Soton)

E-LIS

Crossref

Cranfield CERES

CogPrints Cognitive Sciences Eprint Archive

Journal publishing and author self-archiving: Peaceful Co-Existence and Fruitful Collaboration

Author: Berners-Lee Tim
De Roure Dave
Harnad Stevan
Shadbolt Nigel
Publication venue
Publication date: 01/01/2005

The UK Research Funding Councils (RCUK) have proposed that all RCUK fundees should self-archive on the web, free for all, their own final drafts of all journal articles reporting their RCUK-funded research, in order to maximise their usage and impact. ALPSP (a learned publishers' association) now seeks to delay and block the RCUK proposal, arguing that it will ruin journals. All objective evidence from the past decade and a half of self-archiving, however, shows that self-archiving can and does co-exist peacefully with journals while greatly enhancing both author/article and journal impact, to the benefit of both. Journal publishers should not be trying to delay and block self-archiving policy; they should be collaborating with the research community on ways to share its vast benefits

Southampton (e-Prints Soton)

The Open Challenge: A Brief History

Author: Harnad Stevan
Publication venue
Publication date: 01/01/2010

Milestones in the history of the Open Access (OA) Movement, especially the 1994 "Subversive Proposal" for authors to self-archive their peer-reviewed journal articles, the creation of the first OAI-compliant open source software for creating an Institutional Repository (EPrints, 2000), the evidence for the OA impact advantage (2001), the first OA Self-Archiving Mandate (U. Southampton ECS 2002), the OA Mandates Registry (ROARMAP, 2003), and the creation of the OA Policy Guidance organization for universities worldwide, EnablingOpenScholarship (EOS 2010)

Southampton (e-Prints Soton)

Archipel - Université du Québec à Montréal

The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives

Author: Fritz Samantha
Lin Jimmy
Milligan Ian
Ruest Nick
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 15/01/2020

The Archives Unleashed project aims to improve scholarly access to web archives through a multi-pronged strategy involving tool creation, process modeling, and community building -- all proceeding concurrently in mutually reinforcing efforts. As we near the end of our initially-conceived three-year project, we report on our progress and share lessons learned along the way. The main contribution articulated in this paper is a process model that decomposes scholarly inquiries into four main activities: filter, extract, aggregate, and visualize. Based on the insight that these activities can be disaggregated across time, space, and tools, it is possible to generate "derivative products", using our Archives Unleashed Toolkit, that serve as useful starting points for scholarly inquiry. Scholars can download these products from the Archives Unleashed Cloud and manipulate them just like any other dataset, thus providing access to web archives without requiring any specialized knowledge. Over the past few years, our platform has processed over a thousand different collections from over two hundred users, totaling around 300 terabytes of web archives. This research was supported by the Andrew W. Mellon Foundation, the Social Sciences and Humanities Research Council of Canada, as well as Start Smart Labs, Compute Canada, the University of Waterloo, and York University. We’d like to thank Jeremy Wiebe, Ryan Deschamps, and Gursimran Singh for their contributions

arXiv.org e-Print Archive

Crossref

YorkSpace

The European Landscape of Qualitative Social Research Archives: Methodological and Practical Issues

Author: Corti Louise
Publication venue: FQS
Publication date: 01/01/2011

In this article I set about describing current practices in archiving and reusing qualitative data. I discuss where you can find archived sources of qualitative data, and discuss some of the debates surrounding methodological, ethical and theoretical considerations relating to re-using data. I then address more pragmatic issues involved in acquiring, preserving, providing access to and supporting the use of the data. Where best do qualitative data collections sit: in traditional libraries or archives alongside historical documents, or as part of more holistic digital collections of contemporary social science research resources? This question relates to accessibility, resource discovery and cataloging methods, data preparation and documentation, and promotional and outreach efforts to encourage data use. The ESDS Qualidata unit at the UK Data Archive is used as a case study for showcasing archival practices, and is situated within the broader European landscape of social science-oriented data archives. Infrastructure requirements for running an archive are discussed, along with a look forward to future developments

University of Essex Research Repository

Directory of Open Access Journals

SSOAR - Social Science Open Access Repository

Forum Qualitative Sozialforschung (Forum: Qualitative Social Research)

Publish and Die

Author: Biggs Simon
Publication venue
Publication date: 01/01/2010

Edinburgh Research Explorer