23 research outputs found

    Graph Oracle Models, Lower Bounds, and Gaps for Parallel Stochastic Optimization

    We suggest a general oracle-based framework that captures different parallel stochastic optimization settings described by a dependency graph, and derive generic lower bounds in terms of this graph. We then use the framework to derive lower bounds for several specific parallel optimization settings, including delayed updates and parallel processing with intermittent communication. We highlight gaps between the lower and upper bounds on oracle complexity, and cases where the "natural" algorithms are not known to be optimal.
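
    One of the settings mentioned above, delayed updates, can be illustrated with a minimal sketch of SGD run with stale gradients (the delay parameter tau, the step size, and the quadratic toy objective are illustrative assumptions, not details from the paper):

        import numpy as np

        def delayed_sgd(grad_oracle, x0, tau, steps, lr):
            """SGD where each update uses a stochastic gradient evaluated tau iterates ago."""
            x = x0.copy()
            history = [x0.copy()] * (tau + 1)        # recent iterates, newest last
            for _ in range(steps):
                stale_x = history[0]                 # iterate from tau steps back
                x = x - lr * grad_oracle(stale_x)    # apply the delayed stochastic gradient
                history = history[1:] + [x.copy()]
            return x

        # Toy usage: noisy gradients of f(x) = ||x||^2 / 2
        rng = np.random.default_rng(0)
        oracle = lambda z: z + 0.1 * rng.standard_normal(z.shape)
        print(delayed_sgd(oracle, np.ones(5), tau=3, steps=500, lr=0.05))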

    Parallel Submodular Function Minimization

    We consider the parallel complexity of submodular function minimization (SFM). We provide a pair of methods which obtain two new query versus depth trade-offs for minimizing a submodular function defined on subsets of $n$ elements that has integer values between $-M$ and $M$. The first method has depth $2$ and query complexity $n^{O(M)}$, and the second method has depth $\widetilde{O}(n^{1/3} M^{2/3})$ and query complexity $O(\mathrm{poly}(n, M))$. Despite a line of work on improved parallel lower bounds for SFM, prior to our work the only known algorithms for parallel SFM either followed from more general methods for sequential SFM or from highly-parallel minimization of convex $\ell_2$-Lipschitz functions. Interestingly, to obtain our second result we provide the first highly-parallel algorithm for minimizing $\ell_\infty$-Lipschitz functions over the hypercube which achieves near-optimal depth for obtaining constant accuracy.
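
    The hypercube formulation in the last sentence is the usual route from SFM to convex minimization: one minimizes a convex extension of the set function over $[0,1]^n$. As a minimal, hedged illustration, here is the greedy evaluation of the Lovász extension, the standard such extension (this is not the paper's parallel algorithm; the toy cut function is an assumption for the example):

        def lovasz_extension(F, x):
            # Greedy evaluation of the Lovasz extension of a set function F
            # (normalized so that F(set()) == 0) at a point x in [0,1]^n.
            n = len(x)
            order = sorted(range(n), key=lambda i: -x[i])    # indices by decreasing x_i
            val, S, prev_F = 0.0, set(), F(set())
            for i in order:
                S = S | {i}
                cur_F = F(S)
                val += x[i] * (cur_F - prev_F)               # marginal value weighted by x_i
                prev_F = cur_F
            return val

        # Toy example: cut function of a triangle graph, a classic submodular function
        edges = [(0, 1), (1, 2), (0, 2)]
        cut = lambda S: sum(1 for u, v in edges if (u in S) != (v in S))
        print(lovasz_extension(cut, [0.9, 0.5, 0.1]))        # 1.6 = sum of |x_u - x_v| over edges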

    No Quantum Speedup over Gradient Descent for Non-Smooth Convex Optimization

    We study the first-order convex optimization problem, where we have black-box access to a (not necessarily smooth) function $f:\mathbb{R}^n \to \mathbb{R}$ and its (sub)gradient. Our goal is to find an $\epsilon$-approximate minimum of $f$ starting from a point that is distance at most $R$ from the true minimum. If $f$ is $G$-Lipschitz, then the classic gradient descent algorithm solves this problem with $O((GR/\epsilon)^{2})$ queries. Importantly, the number of queries is independent of the dimension $n$, and gradient descent is optimal in this regard: no deterministic or randomized algorithm can achieve better complexity that is still independent of the dimension $n$. In this paper we reprove the randomized lower bound of $\Omega((GR/\epsilon)^{2})$ using a simpler argument than previous lower bounds. We then show that although the function family used in the lower bound is hard for randomized algorithms, it can be solved using $O(GR/\epsilon)$ quantum queries. We then show an improved lower bound against quantum algorithms using a different set of instances and establish our main result that in general even quantum algorithms need $\Omega((GR/\epsilon)^2)$ queries to solve the problem. Hence there is no quantum speedup over gradient descent for black-box first-order convex optimization without further assumptions on the function family.
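
    For reference, a minimal sketch of the classical baseline discussed above: the projected subgradient method run for $T = \lceil (GR/\epsilon)^2 \rceil$ iterations with the standard fixed step size (the averaging scheme, the projection onto a ball around the start, and the toy objective are textbook choices assumed here, not details from the paper):

        import numpy as np

        def subgradient_descent(subgrad, x0, G, R, eps):
            # Projected subgradient method for a G-Lipschitz convex f whose
            # minimizer lies within distance R of x0; uses O((G*R/eps)^2) queries.
            T = int(np.ceil((G * R / eps) ** 2))
            eta = R / (G * np.sqrt(T))                  # standard fixed step size
            x, avg = x0.astype(float), np.zeros_like(x0, dtype=float)
            for _ in range(T):
                avg += x / T                            # running average of iterates
                x = x - eta * subgrad(x)
                d = np.linalg.norm(x - x0)
                if d > R:                               # project back onto the ball around x0
                    x = x0 + (x - x0) * (R / d)
            return avg

        # Toy usage: f(x) = ||x||_1 in dimension 4, which is 2-Lipschitz in the l2 norm
        print(subgradient_descent(np.sign, np.full(4, 0.5), G=2.0, R=1.0, eps=0.1))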

    Memory-Query Tradeoffs for Randomized Convex Optimization

    We show that any randomized first-order algorithm which minimizes a $d$-dimensional, $1$-Lipschitz convex function over the unit ball must either use $\Omega(d^{2-\delta})$ bits of memory or make $\Omega(d^{1+\delta/6-o(1)})$ queries, for any constant $\delta\in (0,1)$ and when the precision $\epsilon$ is quasipolynomially small in $d$. Our result implies that cutting plane methods, which use $\tilde{O}(d^2)$ bits of memory and $\tilde{O}(d)$ queries, are Pareto-optimal among randomized first-order algorithms, and that quadratic memory is required to achieve optimal query complexity for convex optimization.
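
    To make the memory comparison concrete, here is a hedged sketch of one central-cut update of the ellipsoid method, a textbook cutting-plane method: its state is the $d \times d$ matrix describing the current ellipsoid, i.e. $\Theta(d^2)$ numbers, matching the quadratic-memory regime above (the quadratic toy objective is an illustrative assumption):

        import numpy as np

        def ellipsoid_step(c, A, g):
            # Shrink the ellipsoid {x : (x - c)^T A^{-1} (x - c) <= 1} using the
            # central cut {x : g^T (x - c) <= 0}; standard update formulas.
            d = len(c)
            b = A @ g / np.sqrt(g @ A @ g)
            c_new = c - b / (d + 1)
            A_new = (d ** 2 / (d ** 2 - 1.0)) * (A - (2.0 / (d + 1)) * np.outer(b, b))
            return c_new, A_new

        # Illustrative run: subgradient cuts for f(x) = ||x - x_star||^2 inside the unit ball.
        # The state carried between steps is the center c (d numbers) plus the matrix A (d^2 numbers).
        d = 8
        x_star = np.full(d, 0.1)
        f = lambda x: float(np.sum((x - x_star) ** 2))
        c, A = np.zeros(d), np.eye(d)
        best = c.copy()
        for _ in range(200):
            if f(c) < f(best):
                best = c.copy()                         # keep the best center visited so far
            g = 2 * (c - x_star)                        # gradient cut at the current center
            if np.linalg.norm(g) < 1e-12:
                break
            c, A = ellipsoid_step(c, A, g)
        print(f(best))                                  # best visited value approaches the minimum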

    Submodular Maximization with Matroid and Packing Constraints in Parallel

    We consider the problem of maximizing the multilinear extension of a submodular function subject to a single matroid constraint or multiple packing constraints, with a small number of adaptive rounds of evaluation queries. We obtain the first algorithms with low adaptivity for submodular maximization with a matroid constraint. Our algorithms achieve a $1-1/e-\epsilon$ approximation for monotone functions and a $1/e-\epsilon$ approximation for non-monotone functions, which nearly matches the best guarantees known in the fully adaptive setting. The number of rounds of adaptivity is $O(\log^2{n}/\epsilon^3)$, which is an exponential speedup over the existing algorithms. We obtain the first parallel algorithm for non-monotone submodular maximization subject to packing constraints. Our algorithm achieves a $1/e-\epsilon$ approximation using $O(\log(n/\epsilon) \log(1/\epsilon) \log(n+m)/\epsilon^2)$ parallel rounds, which is again an exponential speedup in parallel time over the existing algorithms. For monotone functions, we obtain a $1-1/e-\epsilon$ approximation in $O(\log(n/\epsilon)\log(m)/\epsilon^2)$ parallel rounds. The number of parallel rounds of our algorithm matches that of the state-of-the-art algorithm for solving packing LPs with a linear objective. Our results apply more generally to the problem of maximizing a diminishing-returns submodular (DR-submodular) function.
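
    The objective here, the multilinear extension $F(x) = \mathbb{E}[f(R(x))]$ (where $R(x)$ includes each element $i$ independently with probability $x_i$), is typically accessed through sampled evaluation queries. A minimal, hedged sketch of such an estimator (the coverage function and sample count below are illustrative assumptions, not the paper's algorithm):

        import numpy as np

        def multilinear_extension(f, x, samples=2000, seed=0):
            # Monte Carlo estimate of F(x) = E[f(R)], where R contains each
            # element i independently with probability x[i].
            rng = np.random.default_rng(seed)
            n, total = len(x), 0.0
            for _ in range(samples):
                R = {i for i in range(n) if rng.random() < x[i]}
                total += f(R)
            return total / samples

        # Toy coverage function (monotone submodular): number of target sets hit by S
        targets = [{0, 1}, {1, 2}, {3}]
        coverage = lambda S: sum(1 for T in targets if S & T)
        print(multilinear_extension(coverage, [0.5, 0.5, 0.0, 1.0]))   # about 2.25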