On the Convergence to a Global Solution of Shuffling-Type Gradient Algorithms

Nguyen, Lam M.; Tran, Trang H.

Authors: Lam M. Nguyen, Trang H. Tran
Publication date: 25 October 2023

Abstract

The stochastic gradient descent (SGD) algorithm is the method of choice in many machine learning tasks thanks to its scalability and efficiency on large-scale problems. In this paper, we focus on the shuffling version of SGD, which matches mainstream practical heuristics. We show that shuffling SGD converges to a global solution for a class of non-convex functions in over-parameterized settings. Our analysis relies on more relaxed non-convex assumptions than the previous literature. Nevertheless, we maintain the computational complexity that shuffling SGD achieves in the general convex setting.

Comment: The 37th Conference on Neural Information Processing Systems (NeurIPS 2023)
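To make the shuffling scheme mentioned in the abstract concrete, here is a minimal sketch in plain Python. It is not the authors' algorithm or experimental setup: the least-squares objective, the constant step size `lr`, the problem sizes, and the helper names `make_problem`, `loss`, and `shuffling_sgd` are all assumptions chosen for illustration. The only feature it is meant to show is that each epoch visits every sample exactly once, in a freshly shuffled order, on a problem with more parameters than samples.

```python
import random


def make_problem(n=10, d=30, seed=1):
    """Hypothetical over-parameterized least-squares problem (d > n)."""
    rng = random.Random(seed)
    A = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
    b = [rng.gauss(0.0, 1.0) for _ in range(n)]
    return A, b


def loss(A, b, w):
    """Average squared residual: (1 / 2n) * sum_i (a_i . w - b_i)^2."""
    residuals = [
        sum(a_j * w_j for a_j, w_j in zip(a, w)) - b_i
        for a, b_i in zip(A, b)
    ]
    return 0.5 * sum(r * r for r in residuals) / len(b)


def shuffling_sgd(A, b, epochs=500, lr=0.01, seed=0):
    """Shuffling SGD: one pass over a new random permutation per epoch."""
    rng = random.Random(seed)
    n, d = len(A), len(A[0])
    w = [0.0] * d
    order = list(range(n))
    for _ in range(epochs):
        rng.shuffle(order)  # reshuffle the sample order every epoch
        for i in order:     # each sample is used exactly once per epoch
            r = sum(a_j * w_j for a_j, w_j in zip(A[i], w)) - b[i]
            for j in range(d):
                # gradient of 0.5 * (a_i . w - b_i)^2 with respect to w_j
                w[j] -= lr * r * A[i][j]
    return w


if __name__ == "__main__":
    A, b = make_problem()
    w = shuffling_sgd(A, b)
    print("initial loss:", loss(A, b, [0.0] * len(A[0])))
    print("final loss:  ", loss(A, b, w))
```

Because the sketch has more unknowns than equations, an interpolating solution with zero loss generally exists, which is the sense of "over-parameterized" assumed here. The final loss should therefore be far below the initial one. The step size schedule analyzed in the paper may differ from the constant `lr` used in this sketch.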

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2206.05869

Last updated on 20/08/2022