
    Tight Lower Bounds for Multiplicative Weights Algorithmic Families

    We study the fundamental problem of prediction with expert advice and develop regret lower bounds for a large family of algorithms for this problem. We develop simple adversarial primitives that lend themselves to various combinations, leading to sharp lower bounds for many algorithmic families. We use these primitives to show that the classic Multiplicative Weights Algorithm (MWA) has a regret of $\sqrt{\frac{T \ln k}{2}}$, thereby completely closing the gap between upper and lower bounds. We further show a regret lower bound of $\frac{2}{3}\sqrt{\frac{T\ln k}{2}}$ for a much more general family of algorithms than MWA, where the learning rate can be arbitrarily varied over time, or even picked from arbitrary distributions over time. We also use our primitives to construct adversaries in the geometric horizon setting for MWA to precisely characterize the regret at $\frac{0.391}{\sqrt{\delta}}$ for the case of $2$ experts, and a lower bound of $\frac{1}{2}\sqrt{\frac{\ln k}{2\delta}}$ for the case of an arbitrary number of experts $k$.
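    For context, here is a minimal sketch of MWA in the standard setup, assuming per-round expert losses in [0, 1] and the fixed learning rate eta = sqrt(8 ln k / T) that achieves the matching sqrt(T ln k / 2) upper bound; the function name and the random losses in the example are illustrative, not from the paper.

    import numpy as np

    def multiplicative_weights(loss_matrix, eta):
        """Run MWA on a T x k matrix of expert losses in [0, 1]; return
        the algorithm's expected loss and its regret vs. the best expert."""
        T, k = loss_matrix.shape
        weights = np.ones(k)
        alg_loss = 0.0
        for t in range(T):
            probs = weights / weights.sum()           # play experts in proportion to weights
            alg_loss += probs @ loss_matrix[t]        # expected loss this round
            weights *= np.exp(-eta * loss_matrix[t])  # exponential multiplicative update
        return alg_loss, alg_loss - loss_matrix.sum(axis=0).min()

    # Example: eta tuned so the regret guarantee is sqrt(T ln k / 2).
    T, k = 1000, 10
    total, regret = multiplicative_weights(np.random.rand(T, k),
                                           eta=np.sqrt(8 * np.log(k) / T))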

    Improved Generalization Bounds for Robust Learning

    We consider a model of robust learning in an adversarial environment. The learner gets uncorrupted training data with access to possible corruptions that may be applied by the adversary during testing. The learner's goal is to build a robust classifier that would be tested on future adversarial examples. We use a zero-sum game between the learner and the adversary as our game-theoretic framework. The adversary is limited to $k$ possible corruptions for each input. Our model is closely related to the adversarial examples model of Schmidt et al. (2018) and Madry et al. (2017). Our main results consist of generalization bounds for binary and multi-class classification, as well as the real-valued case (regression). For the binary classification setting, we both tighten the generalization bound of Feige, Mansour, and Schapire (2015), and are also able to handle an infinite hypothesis class $H$. The sample complexity is improved from $O(\frac{1}{\epsilon^4}\log(\frac{|H|}{\delta}))$ to $O(\frac{1}{\epsilon^2}(k\log(k)\,\mathrm{VC}(H)+\log\frac{1}{\delta}))$. Additionally, we extend the algorithm and generalization bound from the binary to the multi-class and real-valued cases. Along the way, we obtain results on the fat-shattering dimension and Rademacher complexity of $k$-fold maxima over function classes; these may be of independent interest. For binary classification, the algorithm of Feige et al. (2015) uses a regret-minimization algorithm and an ERM oracle as a black box; we adapt it for the multi-class and regression settings. The algorithm provides us with near-optimal policies for the players on a given training sample.

    Comment: Appearing at the 30th International Conference on Algorithmic Learning Theory (ALT 2019).
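    A rough sketch of the game-solving template the abstract describes (a no-regret adversary against an ERM best-responding learner), assuming binary labels and that the $k$ allowed corruptions of each input are given explicitly; erm_oracle, rounds, and eta are hypothetical names and parameters, and this is a sketch of the general template, not the paper's exact procedure.

    import numpy as np

    def robust_learn(X_corruptions, y, erm_oracle, rounds=100, eta=0.1):
        """Approximately solve the learner-vs-adversary zero-sum game.

        X_corruptions: (n, k, d) array of the k allowed corruptions of each
        of the n inputs; y: labels in {0, 1}. erm_oracle(X, y) is assumed to
        return a hypothesis h whose h.predict(X) minimizes empirical error.
        Per-example multiplicative weights play the adversary; the learner
        best-responds via ERM, and the mixture of returned hypotheses is
        the robust classifier."""
        n, k, d = X_corruptions.shape
        W = np.ones((n, k))              # adversary's weights over corruptions
        hypotheses = []
        for _ in range(rounds):
            P = W / W.sum(axis=1, keepdims=True)
            # Adversary samples one corruption per example from its mixed strategy.
            idx = np.array([np.random.choice(k, p=P[i]) for i in range(n)])
            h = erm_oracle(X_corruptions[np.arange(n), idx], y)  # learner best-responds
            hypotheses.append(h)
            # Reward each corruption by whether it makes h err, update weights.
            errs = np.stack([h.predict(X_corruptions[:, j]) != y
                             for j in range(k)], axis=1)
            W *= np.exp(eta * errs)
        return hypotheses                # predict by majority vote over these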