Search CORE

2 research outputs found

FAIR Data Pipeline: provenance-driven data management for traceable scientific workflows

Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used. Public trust in policy decisions resulting from such analyses is easily damaged and is often low, with cynicism arising where claims of "following the science" are made without accompanying evidence. Tracing the provenance of such decisions back through open software to primary data would clarify this evidence, enhancing the transparency of the decision-making process. Here, we demonstrate a Findable, Accessible, Interoperable and Reusable (FAIR) data pipeline developed during the COVID-19 pandemic that allows easy annotation of data as they are consumed by analyses, while tracing the provenance of scientific outputs back through the analytical source code to data sources. Such a tool provides a mechanism for the public, and fellow scientists, to better assess the trust that should be placed in scientific evidence, while allowing scientists to support policy-makers in openly justifying their decisions. We believe that tools such as this should be promoted for use across all areas of policy-facing research

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

PubMed Central

Edinburgh Research Explorer

Enlighten

White Rose Research Online

Hal-Diderot

SRUC - Scotland's Rural College

FAIR Data Pipeline: provenance-driven data management for traceable scientific workflows

Author: Aaron Reeves
Aaron Reeves
Aaron Reeves
Alejandra N. Gonzalez-Beltran
Alejandra N. Gonzalez-Beltran
Alejandra N. Gonzalez-Beltran
Alys Brett
Alys Brett
Alys Brett
Andrew Lahiff
Andrew Lahiff
Andrew Lahiff
Antony Wilson
Antony Wilson
Antony Wilson
Blair Archibald
Blair Archibald
Blair Archibald
Bram Boskamp
Bram Boskamp
Bram Boskamp
Bruno Viola
Bruno Viola
Bruno Viola
Christopher David Hughes
Christopher David Hughes
Christopher David Hughes
Christopher Mark Pooley
Christopher Mark Pooley
Christopher Mark Pooley
Ciaran Mcmonagle
Ciaran Mcmonagle
Ciaran Mcmonagle
Claire Harris
Claire Harris
Claire Harris
Dennis Reddyhoff
Dennis Reddyhoff
Dennis Reddyhoff
Dominic Mellor
Dominic Mellor
Dominic Mellor
Edward Townsend
Edward Townsend
Edward Townsend
Glenn Marion
Glenn Marion
Glenn Marion
Iain J. Mckendrick
Iain J. Mckendrick
Iain J. Mckendrick
Ian Hinder
Ian Hinder
Ian Hinder
Jeremy Walton
Jeremy Walton
Jeremy Walton
Jessica Enright
Jessica Enright
Jessica Enright
Jonathan Hollocombe
Jonathan Hollocombe
Jonathan Hollocombe
Kristian Zarebski
Kristian Zarebski
Kristian Zarebski
Lisa A Boden
Lisa A Boden
Lisa A Boden
Louise Matthews
Louise Matthews
Louise Matthews
Martin Burke
Martin Burke
Martin Burke
Martin Knight
Martin Knight
Martin Knight
Nathan Cummings
Nathan Cummings
Nathan Cummings
Paul Bessell
Paul Bessell
Paul Bessell
Richard Blackwell
Richard Blackwell
Richard Blackwell
Richard Reeve
Richard Reeve
Richard Reeve
Robert Turner
Robert Turner
Robert Turner
Ruth Dundas
Ruth Dundas
Ruth Dundas
Ryan Field
Ryan Field
Ryan Field
Sam Brett
Sam Brett
Sam Brett
Sibylle Mohr
Sibylle Mohr
Sibylle Mohr
Sonia Natalie Mitchell
Sonia Natalie Mitchell
Sonia Natalie Mitchell
Thibaud Porphyre
Thibaud Porphyre
Thibaud Porphyre
Vino Mano
Vino Mano
Vino Mano
Publication venue: HAL CCSD
Publication date: 08/08/2022
Field of study

INRIA a CCSD electronic archive server

Hal-Diderot