Provenance and scientific workflows: challenges and opportunities

By Susan B. Davidson and Juliana Freire

Abstract

Provenance in the context of workflows, both for the data they derive and for their specification, is an essential component to allow for result reproducibility, sharing, and knowledge re-use in the scientific community. Several workshops have been held on the topic, and it has been the focus of many research projects and prototype systems. This tutorial provides an overview of research issues in provenance for scientific workflows, with a focus on recent literature and technology in this area. It is aimed at a general database research audience and at people who work with scientific data and workflows. We will (1) provide a general overview of scientific workflows, (2) describe research on provenance for scientific workflows and show in detail how provenance is supported in existing systems, (3) discuss emerging applications that are enabled by provenance, and (4) outline open problems and new directions for database-related research.

Topics: H.2 [Database Management]: General; General Terms: Documentation, Experimentation; Keywords: provenance, scientific workflows
Year: 2008
OAI identifier: oai:CiteSeerX.psu:10.1.1.216.7796
Provided by: CiteSeerX
