221,651 research outputs found

    HELIN Task Force on Electronic Archiving Report

    Get PDF
    Report of the HELIN Electronic Archiving Task Force, appointed from the HELIN Serials Committee and the HELIN Collection Development Committee

    Report of the DELAMAN Costing Case Study

    Get PDF
    DELAMAN member archives estimated the costs of archiving two sample deposits of language documentation data. Comparing these costs to the cost of funding a documentation project which would generate this amount of data, the costs of archiving can be estimated to 8% of the total direct costs of the funded project. DELAMAN proposes that grantees and grantors use this 8% figure as a more simplified way to calculate archiving costs which better reflect the nature of archiving as basic infrastructure for endangered language research.http://delaman.or

    Mandated data archiving greatly improves access to research data

    Full text link
    The data underlying scientific papers should be accessible to researchers both now and in the future, but how best can we ensure that these data are available? Here we examine the effectiveness of four approaches to data archiving: no stated archiving policy, recommending (but not requiring) archiving, and two versions of mandating data deposition at acceptance. We control for differences between data types by trying to obtain data from papers that use a single, widespread population genetic analysis, STRUCTURE. At one extreme, we found that mandated data archiving policies that require the inclusion of a data availability statement in the manuscript improve the odds of finding the data online almost a thousand-fold compared to having no policy. However, archiving rates at journals with less stringent policies were only very slightly higher than those with no policy at all. At one extreme, we found that mandated data archiving policies that require the inclusion of a data availability statement in the manuscript improve the odds of finding the data online almost a thousand fold compared to having no policy. However, archiving rates at journals with less stringent policies were only very slightly higher than those with no policy at all. We also assessed the effectiveness of asking for data directly from authors and obtained over half of the requested datasets, albeit with about 8 days delay and some disagreement with authors. Given the long term benefits of data accessibility to the academic community, we believe that journal based mandatory data archiving policies and mandatory data availability statements should be more widely adopted

    Archiving scientific data

    Get PDF
    We present an archiving technique for hierarchical data with key structure. Our approach is based on the notion of timestamps whereby an element appearing in multiple versions of the database is stored only once along with a compact description of versions in which it appears. The basic idea of timestamping was discovered by Driscoll et. al. in the context of persistent data structures where one wishes to track the sequences of changes made to a data structure. We extend this idea to develop an archiving tool for XML data that is capable of providing meaningful change descriptions and can also efficiently support a variety of basic functions concerning the evolution of data such as retrieval of any specific version from the archive and querying the temporal history of any element. This is in contrast to diff-based approaches where such operations may require undoing a large number of changes or significant reasoning with the deltas. Surprisingly, our archiving technique does not incur any significant space overhead when contrasted with other approaches. Our experimental results support this and also show that the compacted archive file interacts well with other compression techniques. Finally, another useful property of our approach is that the resulting archive is also in XML and hence can directly leverage existing XML tools

    Repositories for Institutional Open Access: Mandated Deposit Policies

    No full text
    Only 15% of articles are currently being made Open Access (OA) through spontaneous self-archiving efforts by their authors. They average 25%-250% more citations in all 12 disciplines tested so far. Ninety-four percent of journals endorse immediate OA self-archiving. There is no evidence that self-archiving induces subscription cancellations. The “OA advantage” consists of: Early Advantage (early self-archiving produces both earlier and more citations), Usage Advantage (more downloads for OA articles, correlated with later citations), Competitive Advantage (relative citation advantage of OA over non-OA articles: disappears at 100% OA), Quality Advantage (OA advantage is higher, the higher the quality of the article) and Quality Bias (authors selectively self-archiving their higher quality articles – a non-causal component: disappears at 100% OA). We are currently comparing the OA advantage for mandated and spontaneous (self-selected) self-archiving. Deposit rates in Institutional Repositories (IRs) remain at 15% if unmandated, but climb toward 100% OA if mandated, confirming surveys that predicted 95% compliance. In the UK, 4 of the 8 research funding councils and the Wellcome Trust mandate self-archiving and it is being considered by the European Commission and the US federal FRPAA. There is no reason for universities to wait for the passage of the legislation. Five universities and two research institutions (including CERN) have already mandated it, with documented success. An Immediate-Deposit/Optional-Access Mandate covers all cases and moots all legal issues: metadata are immediately visible webwide and, where needed, access to the postprint can be set as Closed Access instead of OA throughout any embargo period. Software to support this approach (that allows the author to email individual copies of non-Open Access papers to individual requesters) has been created for both EPrints and DSpace repository platforms

    Archiving the Relaxed Consistency Web

    Full text link
    The historical, cultural, and intellectual importance of archiving the web has been widely recognized. Today, all countries with high Internet penetration rate have established high-profile archiving initiatives to crawl and archive the fast-disappearing web content for long-term use. As web technologies evolve, established web archiving techniques face challenges. This paper focuses on the potential impact of the relaxed consistency web design on crawler driven web archiving. Relaxed consistent websites may disseminate, albeit ephemerally, inaccurate and even contradictory information. If captured and preserved in the web archives as historical records, such information will degrade the overall archival quality. To assess the extent of such quality degradation, we build a simplified feed-following application and simulate its operation with synthetic workloads. The results indicate that a non-trivial portion of a relaxed consistency web archive may contain observable inconsistency, and the inconsistency window may extend significantly longer than that observed at the data store. We discuss the nature of such quality degradation and propose a few possible remedies.Comment: 10 pages, 6 figures, CIKM 201

    The use of paper in everyday student life

    Get PDF
    The information we encounter in modern life, in developed countries, is a hybrid of the physical and the digital. Personal archiving tools allow users to capture and retrieve aspects of their everyday lives in digital form. In this paper we use a diary study of students’ interactions with paper-based information to inform the design of such archiving tools

    Maximizing research impact through institutional and national open-access self-archiving mandates

    No full text
    No research institution can afford all the journals its researchers may need, so all articles are losing research impact (usage and citations). Articles made “Open Access,” (OA) by self-archiving them on the web are cited twice as much, but only 15% of articles are being spontaneously self-archived. The only institutions approaching 100% self-archiving are those that mandate it. Surveys show that 95% of authors will comply with a self-archiving mandate; the actual experience of institutions with mandates has confirmed this. What institutions and funders need to mandate is that (1) immediately upon acceptance for publication, (2) the author’s final draft must be (3) deposited into the Institutional Repository. Only the depositing needs to be mandated; setting access privileges to the full-text as either OA or Restricted Access (RA) can be left up to the author. For articles published in the 93% of journals that have already endorsed self-archiving, access can be set as OA immediately; for the remaining 7%, authors can email the eprint in response to individual email requests automatically forwarded by the Repository
    • 

    corecore