11 research outputs found

    Smooth minimization of nonsmooth functions with parallel coordinate descent methods

    We study the performance of a family of randomized parallel coordinate descent methods for minimizing the sum of a nonsmooth function and a separable convex function. The problem class includes as special cases L1-regularized L1 regression and the minimization of the exponential loss (the "AdaBoost problem"). We assume the input data defining the loss function is contained in a sparse m×n matrix A with at most ω nonzeros in each row. Our methods need O(nβ/τ) iterations to find an approximate solution with high probability, where τ is the number of processors and β = 1 + (ω−1)(τ−1)/(n−1) for the fastest variant. The O notation hides dependence on quantities such as the required accuracy, the confidence level, and the distance of the starting iterate from an optimal point. Since β/τ is a decreasing function of τ, the method needs fewer iterations when more processors are used. Certain variants of our algorithms perform on average only O(nnz(A)/n) arithmetic operations during a single iteration per processor and, because β decreases when ω does, fewer iterations are needed for sparser problems.
    Comment: 39 pages, 1 algorithm, 3 figures, 2 tables
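    As a rough illustration of the bound above (not from the paper; the values of n and ω are assumptions), the sketch below computes β = 1 + (ω−1)(τ−1)/(n−1) and the resulting iteration count n·β/τ for a few processor counts, showing that β/τ, and hence the number of iterations, shrinks as τ grows.

```python
# Hypothetical illustration of the iteration bound O(n*beta/tau) with
# beta = 1 + (omega-1)(tau-1)/(n-1) for the fastest variant.

def beta(n, omega, tau):
    """Parallelization penalty factor for tau processors."""
    return 1.0 + (omega - 1) * (tau - 1) / (n - 1)

n, omega = 10_000, 50          # assumed problem size and max nonzeros per row
for tau in (1, 8, 64, 512):
    b = beta(n, omega, tau)
    print(f"tau={tau:4d}  beta={b:6.2f}  n*beta/tau={n * b / tau:10.1f}")
# n*beta/tau decreases as tau grows: more processors, fewer iterations.
```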

    Parallel Successive Convex Approximation for Nonsmooth Nonconvex Optimization

    Consider the problem of minimizing the sum of a smooth (possibly non-convex) function and a convex (possibly nonsmooth) function involving a large number of variables. A popular approach to solving this problem is the block coordinate descent (BCD) method, whereby at each iteration only one variable block is updated while the remaining variables are held fixed. With recent advances in multi-core parallel processing technology, it is desirable to parallelize the BCD method by allowing multiple blocks to be updated simultaneously at each iteration of the algorithm. In this work, we propose an inexact parallel BCD approach where, at each iteration, a subset of the variables is updated in parallel by minimizing convex approximations of the original objective function. We investigate the convergence of this parallel BCD method for both randomized and cyclic variable selection rules. We analyze the asymptotic and non-asymptotic convergence behavior of the algorithm for both convex and non-convex objective functions. The numerical experiments suggest that, for the special case of the Lasso problem, the cyclic block selection rule can outperform the randomized rule.
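    To make the scheme concrete, here is a minimal sketch (not the authors' implementation) of a parallel BCD step for the Lasso problem min_x 0.5‖Ax − b‖² + λ‖x‖₁, using single-coordinate blocks, both cyclic and randomized selection rules, and a conservative step scaling for the simultaneous updates; the block count p, the scaling, and the synthetic data are illustrative assumptions.

```python
import numpy as np

def soft_threshold(z, t):
    """Proximal operator of t*|.|, applied elementwise."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def parallel_bcd_lasso(A, b, lam, p=4, iters=300, rule="cyclic", seed=0):
    """Parallel BCD sketch for 0.5*||Ax - b||^2 + lam*||x||_1."""
    rng = np.random.default_rng(seed)
    m, n = A.shape
    x = np.zeros(n)
    L = (A ** 2).sum(axis=0) + 1e-12       # per-coordinate curvature constants
    r = A @ x - b                          # residual, kept in sync with x
    for k in range(iters):
        if rule == "cyclic":
            S = [(k * p + j) % n for j in range(p)]
        else:                              # randomized selection rule
            S = rng.choice(n, size=p, replace=False)
        grad = A[:, S].T @ r               # partial gradients of the smooth part
        step = p * L[S]                    # conservative scaling so p simultaneous
                                           # updates cannot overshoot
        x_new = soft_threshold(x[S] - grad / step, lam / step)
        r += A[:, S] @ (x_new - x[S])      # the p coordinate updates are independent
        x[S] = x_new                       # and could be executed in parallel
    return x

# Tiny usage example on synthetic data (illustrative only).
rng = np.random.default_rng(1)
A = rng.standard_normal((80, 40))
b = rng.standard_normal(80)
x_cyclic = parallel_bcd_lasso(A, b, lam=0.1, rule="cyclic")
x_random = parallel_bcd_lasso(A, b, lam=0.1, rule="randomized")
```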

    Smooth Primal-Dual Coordinate Descent Algorithms for Nonsmooth Convex Optimization

    We propose a new randomized coordinate descent method for a convex optimization template with broad applications. Our analysis relies on a novel combination of four ideas applied to the primal-dual gap function: smoothing, acceleration, homotopy, and coordinate descent with non-uniform sampling. As a result, our method features convergence rate guarantees that are the best known among coordinate descent methods under a variety of common structure assumptions on the template. We provide numerical evidence to support the theoretical results, with a comparison to state-of-the-art algorithms.
    Comment: NIPS 2017
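    As a small illustration of just one of the four ingredients, coordinate descent with non-uniform sampling, the sketch below draws coordinate i with probability proportional to its curvature constant on a plain least-squares objective; the smoothing, acceleration, and homotopy components of the actual method are omitted, and the objective and sampling probabilities here are illustrative assumptions rather than the paper's template.

```python
import numpy as np

def cd_nonuniform(A, b, iters=2000, seed=0):
    """Minimize 0.5*||Ax - b||^2 by coordinate descent with importance sampling."""
    rng = np.random.default_rng(seed)
    m, n = A.shape
    x = np.zeros(n)
    L = (A ** 2).sum(axis=0) + 1e-12
    probs = L / L.sum()                # sample coordinates proportionally to L_i
    r = A @ x - b
    for _ in range(iters):
        i = rng.choice(n, p=probs)
        g = A[:, i] @ r                # i-th partial derivative
        delta = -g / L[i]
        x[i] += delta
        r += delta * A[:, i]           # keep the residual consistent with x
    return x
```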
