Search CORE

4 research outputs found

Detailed Implementation of a Reproducible Machine Learning-Enabled Workflow

Author: Charles E. Cook
Heidi J. Imker
Kenneth E. Schackart III
Publication venue: Ubiquity Press
Publication date: 01/04/2024
Field of study

Machine learning (ML) and advanced computational methods are powerful tools for processing and deriving value from large data volumes. These methods are being developed and deployed rapidly, but best practices are still evolving regarding code and data standards, leading to irreproducibility of ML-enabled research. In this Practice Paper, we describe our efforts to make a ML-enabled research project to create a global inventory of biodata resources open and reproducible. To contribute to community conversations on evolving norms and expectations, we present our experiences as a practical, real-world case study that includes the implementation details as well as our overall approach and subsequent decisions. Our goal in openly sharing this experience is to provide a concrete example that others may consider as they look to vet, adapt, and adopt similar strategies to make their own work open and reproducible

Directory of Open Access Journals

A machine learning-enabled open biodata resource inventory from the scientific literature

Author: Ana-Maria Istrate
Charles E. Cook
Heidi J. Imker
Kenneth E. Schackart
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2023
Field of study

Directory of Open Access Journals

Biodata Resource Inventory Dataset

Author: Cook Charles E.
Imker Heidi
Istrate Ana-Maria
Schackart Kenneth
Publication venue: Zenodo
Publication date: 10/11/2023
Field of study

<p>final_inventory_2022.csv is the result of the Biodata Resource Inventory conducted in 2022. data_dictionary.csv provides an explanation of the columns in the inventory file.</p&gt

ZENODO

Biodata Resource Inventory Supplemental Materials

Author: Cook Charles E.
Imker Heidi J.
Istrate Ana-Maria
Schackart III Kenneth E.
Publication venue: Zenodo
Publication date: 26/05/2023
Field of study

<p>This is a preprint version of the supplemental materials associated with the manuscript titled A Machine Learning-Enabled Open Biodata Resource Inventory from the Scientific Literature by Imker et al. 2023.</p&gt

ZENODO