Search CORE

34 research outputs found

Online variance minimization

Author: A. Agarwal
A. Kalai
D. Helmbold
D. Helmbold
D. Helmbold
D. Helmbold
D. Kuzmin
D. Kuzmin
D. P. Helmbold
D. S. Bernstein
Dima Kuzmin
E. Hazan
E. Hazan
G. J. Gordon
J. Abernethy
J. Kivinen
J. Kivinen
J. Kivinen
J. Kivinen
K. Tsuda
M. A. Nielsen
M. Herbster
M. Herbster
M. K. Warmuth
M. K. Warmuth
M. K. Warmuth
M. K. Warmuth
M. K. Warmuth
M. K. Warmuth
M. K. Warmuth
M. Zinkevich
Manfred K. Warmuth
N. Cesa-Bianchi
N. Cesa-Bianchi
N. Littlestone
N. Littlestone
O. Bousquet
R. Bhatia
R. Jain
S. Arora
S. Arora
S. Boyd
S. Shalev-Shwartz
T. M. Cover
V. Vovk
Y. Freund
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

The Computational Power of Optimization in Online Learning

Author: Agarwal A.
Agarwal A.
Dani V.
Dud´ık M.
Gofer E.
Hazan E.
Kakade S.
McMahan H. B.
Shalev-Shwartz S.
Zinkevich M.
Publication venue
Publication date: 27/01/2016
Field of study

We consider the fundamental problem of prediction with expert advice where the experts are "optimizable": there is a black-box optimization oracle that can be used to compute, in constant time, the leading expert in retrospect at any point in time. In this setting, we give a novel online algorithm that attains vanishing regret with respect to

N

experts in total

\widetilde{O}(\sqrt{N})

computation time. We also give a lower bound showing that this running time cannot be improved (up to log factors) in the oracle model, thereby exhibiting a quadratic speedup as compared to the standard, oracle-free setting where the required time for vanishing regret is

\widetilde{\Theta}(N)

. These results demonstrate an exponential gap between the power of optimization in online learning and its power in statistical learning: in the latter, an optimization oracle---i.e., an efficient empirical risk minimizer---allows to learn a finite hypothesis class of size

N

in time

O(\log{N})

. We also study the implications of our results to learning in repeated zero-sum games, in a setting where the players have access to oracles that compute, in constant time, their best-response to any mixed strategy of their opponent. We show that the runtime required for approximating the minimax value of the game in this setting is

\widetilde{\Theta}(\sqrt{N})

, yielding again a quadratic improvement upon the oracle-free setting, where

\widetilde{\Theta}(N)

is known to be tight

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref

QIP = PSPACE

Author: Bennett C.
John Watrous
Nielsen M.
Rahul Jain
Sarvagya Upadhyay
Zhengfeng Ji
Publication venue
Publication date: 02/08/2009
Field of study

We prove that the complexity class QIP, which consists of all problems having quantum interactive proof systems, is contained in PSPACE. This containment is proved by applying a parallelized form of the matrix multiplicative weights update method to a class of semidefinite programs that captures the computational power of quantum interactive proofs. As the containment of PSPACE in QIP follows immediately from the well-known equality IP = PSPACE, the equality QIP = PSPACE follows.Comment: 21 pages; v2 includes corrections and minor revision

arXiv.org e-Print Archive

CiteSeerX

Crossref

OPUS - University of Technology Sydney

ScholarBank@NUS

Institute Of Software, Chinese Academy Of Sciences

Mistake Bounds for Binary Matrix Completion

Author: Herbster MJ
Pasteris S
Pontil M
Publication venue: NIPS 2016
Publication date: 01/12/2016
Field of study

We study the problem of completing a binary matrix in an online learning setting.On each trial we predict a matrix entry and then receive the true entry. We propose a Matrix Exponentiated Gradient algorithm [1] to solve this problem. We provide a mistake bound for the algorithm, which scales with the margin complexity [2, 3] of the underlying matrix. The bound suggests an interpretation where each row of the matrix is a prediction task over a finite set of objects, the columns. Using this we show that the algorithm makes a number of mistakes which is comparable up to a logarithmic factor to the number of mistakes made by the Kernel Perceptron with an optimal kernel in hindsight. We discuss applications of the algorithm to predicting as well as the best biclustering and to the problem of predicting the labeling of a graph without knowing the graph in advance

UCL Discovery