Sciunits: Reusable Research Objects

Author: Fils Gabriel
Malik Tanu
That Dai Hai Ton
Yuan Zhihao
Publication date: 11/09/2017

Science is conducted collaboratively, often requiring knowledge sharing about computational experiments. When experiments include only datasets, they can be shared using Uniform Resource Identifiers (URIs) or Digital Object Identifiers (DOIs). An experiment, however, seldom includes only datasets, but more often includes software, its past execution, provenance, and associated documentation. The Research Object has recently emerged as a comprehensive and systematic method for aggregation and identification of diverse elements of computational experiments. While a necessary method, mere aggregation is not sufficient for the sharing of computational experiments. Other users must be able to easily recompute on these shared research objects. In this paper, we present the sciunit, a reusable research object in which aggregated content is recomputable. We describe a Git-like client that efficiently creates, stores, and repeats sciunits. We show through analysis that sciunits repeat computational experiments with minimal storage and processing overhead. Finally, we provide an overview of sharing and reproducible cyberinfrastructure based on sciunits gaining adoption in the domain of geosciences.

arXiv.org e-Print Archive

Crossref

Four level provenance support to achieve portable reproducibility of scientific workflows

Author: Bánáti Anna
Kacsuk Péter
Kozlovszky Miklós
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015

Crossref

SZTAKI Publication Repository

Classification of Scientific Workflows Based on Reproducibility Analysis

Author: Bánáti Anna
Kacsuk Péter
Kozlovszky Miklós
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016

Crossref

SZTAKI Publication Repository

Minimal sufficient information about the scientific workflows to create reproducible experiment

Author: Bánáti Anna
Kacsuk Péter
Kozlovszky Miklós
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015

Crossref

SZTAKI Publication Repository

Workflow-centric research objects: First class citizens in scholarly discourse.

Author: Bechhofer Sean
Belhajjame Khalid
Corcho Oscar
De Roure David
García Cuesta Esteban
Garijo Daniel
Goble Carole A.
Gómez-Pérez José Manuel
Klyne Graham
Missier Paolo
Newman David
Page Kevin
Palma Raúl
Roos Marco
Ruiz José Enrique
Soiland-Reyes Stian
Verdes-Montenegro Lourdes
Zhao Jun
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2012

A workflow-centric research object bundles a workflow, the provenance of the results obtained by its enactment, other digital objects that are relevant for the experiment (papers, datasets, etc.), and annotations that semantically describe all these objects. In this paper, we propose a model to specify workflow-centric research objects, and show how the model can be grounded using semantic technologies and existing vocabularies, in particular the Object Reuse and Exchange (ORE) model and the Annotation Ontology (AO). We describe the life-cycle of a research object, which resembles the life-cycle of a scientific experiment.

CiteSeerX

University of Birmingham Research Portal

The University of Manchester - Institutional Repository

Archivo Digital UPM

Digital libraries for the preservation of research methods and associated artifacts

Author: Corcho Oscar
Hołubowicz Piotr
Mazurek Cezary
Page K.
Palma R.
Pérez Álvarez Sara
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2013

New digital artifacts are emerging in data-intensive science. For example, scientific workflows are executable descriptions of scientific procedures that define the sequence of computational steps in an automated data analysis, supporting reproducible research and the sharing and replication of best-practice and know-how through reuse. Workflows are specified at design time and interpreted through their execution in a variety of situations, environments, and domains. Hence it is essential to preserve both their static and dynamic aspects, along with the research context in which they are used. To achieve this, we propose the use of multidimensional digital objects (Research Objects) that aggregate the resources used and/or produced in scientific investigations, including workflow models, provenance of their executions, and links to the relevant associated resources, along with the provision of technological support for their preservation and efficient retrieval and reuse. In this direction, we specified a software architecture for the design and implementation of a Research Object preservation system, and realized this architecture with a set of services and clients, drawing together practices in digital libraries, preservation systems, workflow management, social networking and Semantic Web technologies. In this paper, we describe the backbone system of this realization, a digital library system built on top of dLibra.

Archivo Digital UPM

ROHub – A digital library of Research Objects supporting scientists towards reproducible science

Author: Cezary Mazurek
José Manuel Gómez-Pérez
Oscar Corcho
Raúl Palma
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2014

Research Objects (ROs) are semantic aggregations of related scientific resources, their annotations and research context. They are meant to help scientists to incorporate and refer to all the research materials that they are working with in the course of an investigation. ROHub is a digital library system for ROs that supports their storage, lifecycle management and preservation. It provides a Web interface and a set of RESTful APIs. ROHub enables the sharing of scientific findings via ROs and includes features that help scientists throughout the research lifecycle to create and maintain high-quality ROs that can be interpreted and reproduced in the future. For instance, during the RO creation, scientists can assess and visualise the conformance of the RO to a set of predefined requirements. Scientists can also create at any point in time RO Snapshots. Snapshots may be useful to release the current version of research outcomes, submit it to be peer reviewed or published, share it with supervisors or collaborators, or for acknowledgement and citation purposes. ROHub can also generate nested ROs for workflow runs, exposing their full content and annotations, and includes monitoring features, such as fixity checking and RO quality, which generate notifications when changes are detected.

CiteSeerX

Structuring research methods and data with the research object model: genomics workflows as a case study

Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Crossref

Utilizing Provenance in Reusable Research Objects

Author: Fils Gabriel
Kothari Siddhant
Malik Tanu
That Dai Hai Ton
Yuan Zhihao
Publication venue: 'MDPI AG'
Publication date: 01/03/2018

Science is conducted collaboratively, often requiring the sharing of knowledge about computational experiments. When experiments include only datasets, they can be shared using Uniform Resource Identifiers (URIs) or Digital Object Identifiers (DOIs). An experiment, however, seldom includes only datasets, but more often includes software, its past execution, provenance, and associated documentation. The Research Object has recently emerged as a comprehensive and systematic method for aggregation and identification of diverse elements of computational experiments. While a necessary method, mere aggregation is not sufficient for the sharing of computational experiments. Other users must be able to easily recompute on these shared research objects. Computational provenance is often the key to enable such reuse. In this paper, we show how reusable research objects can utilize provenance to correctly repeat a previous reference execution, to construct a subset of a research object for partial reuse, and to reuse existing contents of a research object for modified reuse. We describe two methods to summarize provenance that aid in understanding the contents and past executions of a research object. The first method obtains a process-view by collapsing low-level system information, and the second method obtains a summary graph by grouping related nodes and edges with the goal to obtain a graph view similar to application workflow. Through detailed experiments, we show the efficacy and efficiency of our algorithms.

arXiv.org e-Print Archive

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals