OpenML Benchmarking Suites

Bischl, Bernd; Casalicchio, Giuseppe; Feurer, Matthias; Hutter, Frank; Lang, Michel; Mantovani, Rafael G.; van Rijn, Jan N.; Vanschoren, Joaquin

Publication date: 24 September 2019

Abstract

Machine learning research depends on objectively interpretable, comparable, and reproducible algorithm benchmarks. Therefore, we advocate the use of curated, comprehensive suites of machine learning tasks to standardize the setup, execution, and reporting of benchmarks. We enable this through software tools that help to create and leverage these benchmarking suites. These are seamlessly integrated into the OpenML platform, and accessible through interfaces in Python, Java, and R. OpenML benchmarking suites (a) are easy to use through standardized data formats, APIs, and client libraries; (b) are machine-readable, with extensive meta-information on the included datasets; and (c) allow benchmarks to be shared and reused in future studies. We also present a first, carefully curated and practical benchmarking suite for classification: the OpenML Curated Classification benchmarking suite 2018 (OpenML-CC18).

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:1708.03731

Last time updated on 08/09/2017