A Coordinate Descent Primal-Dual Algorithm and Application to Distributed Asynchronous Optimization

Authors: Bianchi Pascal, Hachem Walid, Iutzeler Franck
Publication date: 30/09/2015

Based on the idea of randomized coordinate descent of $\alpha$-averaged operators, a randomized primal-dual optimization algorithm is introduced, where a random subset of coordinates is updated at each iteration. The algorithm builds upon a variant of a recent (deterministic) algorithm proposed by Vũ and Condat that includes the well-known ADMM as a particular case. The obtained algorithm is used to solve a distributed optimization problem asynchronously. A network of agents, each having a separate cost function containing a differentiable term, seeks a consensus on the minimum of the aggregate objective. The method yields an algorithm where, at each iteration, a random subset of agents wake up, update their local estimates, exchange some data with their neighbors, and go idle. Numerical results demonstrate the attractive performance of the method. The general approach can be naturally adapted to other situations where coordinate descent convex optimization algorithms are used with a random choice of the coordinates.

Comment: 10 page
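As a rough illustration of the underlying idea only, and not of the primal-dual algorithm from the paper, the sketch below applies randomized coordinate updates to a simple averaged operator: the gradient step T(x) = x - gamma (A x - b) for a strongly convex quadratic. The matrix `A`, vector `b`, step size `gamma`, and activation probability `prob` are hypothetical values chosen for the example; at each iteration only a random subset of coordinates is overwritten with the corresponding entries of T(x).

```python
import random


def gradient_step(x, A, b, gamma):
    """Return T(x) = x - gamma * (A x - b), the gradient step for
    f(x) = 0.5 * x^T A x - b^T x (an averaged operator for small gamma)."""
    n = len(x)
    return [
        x[i] - gamma * (sum(A[i][j] * x[j] for j in range(n)) - b[i])
        for i in range(n)
    ]


def randomized_coordinate_iteration(A, b, gamma, num_iters, prob=0.5, seed=0):
    """Fixed-point iteration in which each coordinate is refreshed
    independently with probability `prob` (hypothetical sampling rule)."""
    rng = random.Random(seed)
    n = len(b)
    x = [0.0] * n
    for _ in range(num_iters):
        tx = gradient_step(x, A, b, gamma)
        # Random subset of coordinates that "wake up" at this iteration.
        active = [i for i in range(n) if rng.random() < prob]
        for i in active:
            x[i] = tx[i]  # inactive coordinates keep their old value
    return x


# Hypothetical 2x2 symmetric positive definite example.
A = [[2.0, 0.5], [0.5, 1.0]]
b = [1.0, 1.0]
x = randomized_coordinate_iteration(A, b, gamma=0.4, num_iters=2000)
print(x)  # approaches the solution of A x = b
```

In a distributed reading of this sketch, each coordinate would play the role of one agent's local variable, and the random subset would correspond to the agents that wake up in a given round.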

arXiv.org e-Print Archive

HAL-CentraleSupelec

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL-Rennes 1

Lipschitz Bandits: Regret Lower Bounds and Optimal Algorithms

Authors: Combes Richard, Magureanu Stefan, Proutiere Alexandre
Publication date: 01/01/2014

We consider stochastic multi-armed bandit problems where the expected reward is a Lipschitz function of the arm, and where the set of arms is either discrete or continuous. For discrete Lipschitz bandits, we derive asymptotic problem-specific lower bounds for the regret satisfied by any algorithm, and propose OSLB and CKL-UCB, two algorithms that efficiently exploit the Lipschitz structure of the problem. In fact, we prove that OSLB is asymptotically optimal, as its asymptotic regret matches the lower bound. The regret analysis of our algorithms relies on a new concentration inequality for weighted sums of KL divergences between the empirical distributions of rewards and their true distributions. For continuous Lipschitz bandits, we propose to first discretize the action space, and then apply OSLB or CKL-UCB, algorithms that provably exploit the structure efficiently. This approach is shown, through numerical experiments, to significantly outperform existing algorithms that deal directly with the continuous set of arms. Finally, the results and algorithms are extended to contextual bandits with similarities.

Comment: COLT 201
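The "discretize, then run a discrete bandit algorithm" recipe for continuous arms can be sketched as follows. This is a minimal illustration under stated assumptions: it substitutes the standard UCB1 index for OSLB and CKL-UCB (whose details are not given in the abstract), so it does not exploit the Lipschitz structure, and the reward function `lipschitz_mean`, the grid size, and the horizon are hypothetical.

```python
import math
import random


def discretize(num_arms):
    """Uniform grid of arm locations on the interval [0, 1]."""
    return [(k + 0.5) / num_arms for k in range(num_arms)]


def lipschitz_mean(x):
    """Hypothetical 1-Lipschitz expected reward on [0, 1], with values in [0.05, 0.8]."""
    return 0.8 - abs(x - 0.25)


def ucb1(arms, mean_reward, horizon, seed=0):
    """Run UCB1 (a stand-in, not OSLB or CKL-UCB) with Bernoulli rewards.

    Returns the number of times each discretized arm was played."""
    rng = random.Random(seed)
    counts = [0] * len(arms)
    sums = [0.0] * len(arms)
    for t in range(1, horizon + 1):
        if t <= len(arms):
            k = t - 1  # play every arm once to initialize the estimates
        else:
            # Empirical mean plus the UCB1 exploration bonus.
            k = max(
                range(len(arms)),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2.0 * math.log(t) / counts[i]),
            )
        reward = 1.0 if rng.random() < mean_reward(arms[k]) else 0.0
        counts[k] += 1
        sums[k] += reward
    return counts


arms = discretize(10)
counts = ucb1(arms, lipschitz_mean, horizon=5000)
print(list(zip(arms, counts)))
```

A finer grid reduces the discretization error but adds arms to explore, which is the trade-off a structure-aware algorithm such as those in the abstract is meant to ease.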

arXiv.org e-Print Archive

HAL-CentraleSupelec

Publikationer från KTH

CiteSeerX

Digitala Vetenskapliga Arkivet - Academic Archive On-line

HAL-Rennes 1