
    The Repeatability Experiment of SIGMOD 2008

    SIGMOD 2008 was the first database conference that offered to test submitters' programs against their data in order to verify the published experiments. This paper discusses the rationale for this effort, the community's reaction, our experiences, and advice for future similar efforts.

    A Provenance-Based Infrastructure to Support the Life Cycle of Executable Papers

    As publishers establish a greater online presence as well as infrastructure to support the distribution of more varied information, the idea of an executable paper that enables greater interaction has developed. An executable paper provides more information for computational experiments and results than the text, tables, and figures of standard papers. Executable papers can bundle computational content that allows readers and reviewers to interact, validate, and explore experiments. By including such content, authors facilitate future discoveries by lowering the barrier to reproducing and extending results. We present an infrastructure for creating, disseminating, and maintaining executable papers. Our approach is rooted in provenance, the documentation of exactly how data, experiments, and results were generated. We seek to improve the experience for everyone involved in the life cycle of an executable paper. The automated capture of provenance information allows authors to easily integrate and update results into papers as they write, and also helps reviewers better evaluate approaches by enabling them to explore experimental results by varying parameters or data. With a provenance-based system, readers are able to examine exactly how a result was developed to better understand and extend published findings.
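
    The following is a minimal, hypothetical sketch (in Python) of the general idea of provenance capture described above; it is not the paper's actual infrastructure, and the function names and the record layout are assumptions made for illustration. It records, for one computational step, the parameters, content hashes of the inputs, and a hash of the output, so that a reader or reviewer could re-run the step with different parameters and compare the resulting records.

        # Illustrative sketch, not the paper's system: capture a provenance
        # record for one computational step. `run_step` and the JSON layout
        # are hypothetical names introduced for this example.
        import hashlib
        import json
        import time

        def sha256_of(data: bytes) -> str:
            # Content hash used to identify inputs and outputs immutably.
            return hashlib.sha256(data).hexdigest()

        def run_step(step_name, func, inputs, params):
            # Run func(inputs, params) and write a provenance record alongside the result.
            started = time.time()
            result = func(inputs, params)
            record = {
                "step": step_name,
                "params": params,
                "inputs": {k: sha256_of(repr(v).encode()) for k, v in inputs.items()},
                "output": sha256_of(repr(result).encode()),
                "duration_s": time.time() - started,
            }
            with open(f"{step_name}.prov.json", "w") as fh:
                json.dump(record, fh, indent=2)
            return result

        if __name__ == "__main__":
            # A reviewer could vary the threshold and diff the new provenance
            # record against the published one.
            data = {"values": [0.2, 0.7, 0.9, 0.4]}
            run_step("filter",
                     lambda i, p: [v for v in i["values"] if v > p["threshold"]],
                     data, {"threshold": 0.5})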

    Sciunits: Reusable Research Objects

    Science is conducted collaboratively, often requiring knowledge sharing about computational experiments. When experiments include only datasets, they can be shared using Uniform Resource Identifiers (URIs) or Digital Object Identifiers (DOIs). An experiment, however, seldom includes only datasets; more often it also includes software, its past executions, provenance, and associated documentation. The Research Object has recently emerged as a comprehensive and systematic method for aggregating and identifying the diverse elements of computational experiments. While necessary, mere aggregation is not sufficient for sharing computational experiments: other users must be able to easily recompute on these shared research objects. In this paper, we present the sciunit, a reusable research object in which aggregated content is recomputable. We describe a Git-like client that efficiently creates, stores, and repeats sciunits. We show through analysis that sciunits repeat computational experiments with minimal storage and processing overhead. Finally, we give an overview of a sharing and reproducibility cyberinfrastructure based on sciunits that is gaining adoption in the geosciences.
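
    As a rough illustration of the "Git-like client" idea, the sketch below (in Python) records a command together with content hashes of its input files and can repeat it later after checking that those inputs are unchanged. This is not the actual sciunit tool or its command set; the class and method names are assumptions made for the example.

        # Illustrative sketch only, not the sciunit client: a tiny "research
        # object" that records an execution and can repeat it later.
        import hashlib
        import json
        import pathlib
        import subprocess

        class ResearchObject:
            def __init__(self, path="research_object.json"):
                self.path = pathlib.Path(path)
                self.executions = []

            def record(self, command, input_files):
                # Hash the input files, run the command, and persist the execution.
                digests = {f: hashlib.sha256(pathlib.Path(f).read_bytes()).hexdigest()
                           for f in input_files}
                subprocess.run(command, check=True)
                self.executions.append({"command": command, "inputs": digests})
                self.path.write_text(json.dumps(self.executions, indent=2))

            def repeat(self, index=-1):
                # Re-run a recorded execution, failing if any input has changed.
                entry = self.executions[index]
                for f, digest in entry["inputs"].items():
                    current = hashlib.sha256(pathlib.Path(f).read_bytes()).hexdigest()
                    if current != digest:
                        raise RuntimeError(f"input {f} changed since it was recorded")
                subprocess.run(entry["command"], check=True)

        # Hypothetical usage: ro = ResearchObject()
        # ro.record(["python", "analysis.py"], ["data.csv"]); ro.repeat()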

    The Truth, the Whole Truth, and Nothing but the Truth: A Pragmatic Guide to Assessing Empirical Evaluations

    An unsound claim can misdirect a field, encouraging the pursuit of unworthy ideas and the abandonment of promising ideas. An inadequate description of a claim can make it difficult to reason about the claim, for example to determine whether the claim is sound. Many practitioners will acknowledge the threat of unsound claims or inadequate descriptions of claims to their field. We believe that this situation is exacerbated and even encouraged by the lack of a systematic approach to exploring, exposing, and addressing the source of unsound claims and poor exposition. This paper proposes a framework that identifies three sins of reasoning that lead to unsound claims and two sins of exposition that lead to poorly described claims. Sins of exposition obfuscate the objective of determining whether or not a claim is sound, while sins of reasoning lead directly to unsound claims. Our framework provides practitioners with a principled way of critiquing the integrity of their own work and the work of others. We hope that this will help individuals conduct better science and encourage a cultural shift in our research community to identify and promulgate sound claims.

    Principles of Experimental Evaluation

    Experimental validation has become essential for the conclusions of a scientific study to be accepted, particularly in the field of data management. At the same time, carrying out a complete and credible experimental study requires mastering a set of techniques and good practices aimed at organizing the experimental study, analyzing the results so as to interpret them correctly, and making the experiments easy to reproduce, so that others can analyze and/or repeat them. This tutorial focuses on two aspects of experimentation: the planning of experiments, and good practices for making experiments reproducible. An extended version was presented at the ICDE 2008 and EDBT 2009 conferences.

    Quantifying Reproducibility in Computational Biology: The Case of the Tuberculosis Drugome

    How easy is it to reproduce the results found in a typical computational biology paper? Either through experience or intuition, the reader will already know that the answer is: with difficulty, or not at all. In this paper we attempt to quantify this difficulty by reproducing a previously published paper for different classes of users (ranging from users with little expertise to domain experts) and suggest ways in which the situation might be improved. Quantification is achieved by estimating the time required to reproduce each of the steps in the method described in the original paper and to make them part of an explicit workflow that reproduces the original results. Reproducing the method took several months of effort, and required using new versions and new software that posed challenges to reconstructing and validating the results. The quantification leads to “reproducibility maps” that reveal that novice researchers would only be able to reproduce a few of the steps in the method, and that only expert researchers with advanced knowledge of the domain would be able to reproduce the method in its entirety. The workflow itself is published as an online resource together with supporting software and data. The paper concludes with a brief discussion of the complexities of requiring reproducibility in terms of cost versus benefit, and a set of desiderata with our observations and guidelines for improving reproducibility. This has implications not only for reproducing the work of others from published papers, but also for reproducing work from one’s own laboratory.