Repeated Random Sampling for Minimizing the Time-to-Accuracy of Machine Learning
Methods for carefully selecting or generating a small set of training data to
learn from, i.e., data pruning, coreset selection, and data distillation, have
been shown to be effective in reducing the ever-increasing cost of training
neural networks. Behind this success are rigorously designed strategies for
identifying informative training examples out of large datasets. However, these
strategies come with additional computational costs associated with subset
selection or data distillation before training begins, and furthermore, many
have even been shown to underperform random sampling in high data compression
regimes. As such, many data pruning, coreset selection, or distillation methods
may not reduce 'time-to-accuracy', which has become a critical efficiency
measure of training deep neural networks over large datasets. In this work, we
revisit a powerful yet overlooked random sampling strategy to address these
challenges and introduce an approach called Repeated Sampling of Random Subsets
(RSRS or RS2), where we randomly sample a subset of training data for each
epoch of model training. We test RS2 against thirty state-of-the-art data
pruning and data distillation methods across four datasets including ImageNet.
Our results demonstrate that RS2 significantly reduces time-to-accuracy
compared to existing techniques. For example, when training on ImageNet in the
high-compression regime (using less than 10% of the dataset each epoch), RS2
yields accuracy improvements of up to 29% over competing pruning methods
while offering a runtime reduction of 7x. Beyond the above meta-study, we
provide a convergence analysis for RS2 and discuss its generalization
capability. The primary goal of our work is to establish RS2 as a competitive
baseline for future data selection or distillation techniques aimed at
efficient training.
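The core idea, resampling a fresh random subset at the start of every epoch, is simple enough to sketch directly. Below is a minimal PyTorch-style illustration of this per-epoch resampling; the function name, parameters, and defaults are illustrative assumptions, not taken from the paper's released code.

```python
import torch
from torch.utils.data import DataLoader, Subset


def rs2_train(model, dataset, optimizer, loss_fn, num_epochs,
              subset_fraction=0.1, batch_size=256, device="cpu"):
    """Sketch of Repeated Sampling of Random Subsets (RS2): draw a new
    uniform random subset of the training set each epoch and train on it."""
    subset_size = int(subset_fraction * len(dataset))
    model.to(device)
    for epoch in range(num_epochs):
        # Re-sample the subset for this epoch (uniform, without replacement).
        indices = torch.randperm(len(dataset))[:subset_size].tolist()
        loader = DataLoader(Subset(dataset, indices),
                            batch_size=batch_size, shuffle=True)
        model.train()
        for inputs, targets in loader:
            inputs, targets = inputs.to(device), targets.to(device)
            optimizer.zero_grad()
            loss = loss_fn(model(inputs), targets)
            loss.backward()
            optimizer.step()
    return model
```

Unlike one-shot pruning or distillation, there is no selection step before training, so the only overhead per epoch is the random permutation, which is negligible relative to the forward and backward passes.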
Preparing Distributed Computing Operations for the HL-LHC Era With Operational Intelligence
As a joint effort from various communities involved in the Worldwide LHC Computing Grid, the Operational Intelligence project aims at increasing the level of automation in computing operations and reducing human interventions. The distributed computing systems currently deployed by the LHC experiments have proven to be mature and capable of meeting the experiment goals by allowing timely delivery of scientific results. However, a substantial number of interventions from software developers, shifters, and operational teams are needed to efficiently manage such heterogeneous infrastructures. Under the scope of the Operational Intelligence project, experts from several areas have gathered to propose and work on “smart” solutions. Machine learning, data mining, log analysis, and anomaly detection are only some of the tools we have evaluated for our use cases. In this Community Study contribution, we report on the development of a suite of Operational Intelligence services covering various use cases: workload management, data management, and site operations.