Search CORE

210 research outputs found

Bandit Online Optimization Over the Permutahedron

Author: D. Suehiro
D.P. Helmbold
J. Yellott
L.G. Valiant
M. Jerrum
N. Cesa-Bianchi
P. Auer
S. Beggs
S. Yasutake
Publication venue
Publication date: 01/01/2014
Field of study

The permutahedron is the convex polytope with vertex set consisting of the vectors

(\pi(1),\dots, \pi(n))

for all permutations (bijections)

\pi

over

\{1,\dots, n\}

. We study a bandit game in which, at each step

t

, an adversary chooses a hidden weight weight vector

s_t

, a player chooses a vertex

\pi_t

of the permutahedron and suffers an observed loss of

\sum_{i=1}^n \pi(i) s_t(i)

. A previous algorithm CombBand of Cesa-Bianchi et al (2009) guarantees a regret of

O(n\sqrt{T \log n})

for a time horizon of

T

. Unfortunately, CombBand requires at each step an

n

-by-

n

matrix permanent approximation to within improved accuracy as

T

grows, resulting in a total running time that is super linear in

T

, making it impractical for large time horizons. We provide an algorithm of regret

O(n^{3/2}\sqrt{T})

with total time complexity

O(n^3T)

. The ideas are a combination of CombBand and a recent algorithm by Ailon (2013) for online optimization over the permutahedron in the full information setting. The technical core is a bound on the variance of the Plackett-Luce noisy sorting process's "pseudo loss". The bound is obtained by establishing positive semi-definiteness of a family of 3-by-3 matrices generated from rational functions of exponentials of 3 parameters

arXiv.org e-Print Archive

Crossref

Leading strategies in competitive on-line prediction

Author: A.P. Dawid
A.P. Dawid
A.P. Dawid
C.P. Schnorr
D. Blackwell
D.P. Helmbold
D.R. Cox
G. Shafer
J. Kivinen
K.S. Azoury
L.A. Levin
L.M. Bregman
M. Herbster
N. Cesa-Bianchi
N. Cesa-Bianchi
P. Auer
P. Martin-Löf
R.A. Adams
R.J. Solomonoff
V. Vovk
V. Vovk
V. Vovk
V. Vovk
V. Vovk
Y.M. Kabanov
Publication venue
Publication date: 01/01/2006
Field of study

We start from a simple asymptotic result for the problem of on-line regression with the quadratic loss function: the class of continuous limited-memory prediction strategies admits a "leading prediction strategy", which not only asymptotically performs at least as well as any continuous limited-memory strategy but also satisfies the property that the excess loss of any continuous limited-memory strategy is determined by how closely it imitates the leading strategy. More specifically, for any class of prediction strategies constituting a reproducing kernel Hilbert space we construct a leading strategy, in the sense that the loss of any prediction strategy whose norm is not too large is determined by how closely it imitates the leading strategy. This result is extended to the loss functions given by Bregman divergences and by strictly proper scoring rules.Comment: 20 pages; a conference version is to appear in the ALT'2006 proceeding

arXiv.org e-Print Archive

CiteSeerX

Royal Holloway Research Online

Elsevier - Publisher Connector

Crossref

Royal Holloway - Pure

Optimal dynamic portfolio selection with earnings-at-risk

Author: A. Chen
A. Lucas
A.D. Roy
D. Li
D.P. Helmbold
H. Markowitz
H. Yang
P. Artzner
P. Gänssler
P. Jorion
P.A. Samuelson
R. Litterman
R.C. Merton
S. Basak
S. Emmer
V. Boginski
V. Boginski
X. T. Deng
X.Y. Zhou
Z. F. Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

In this paper we investigate a continuous-time portfolio selection problem. Instead of using the classical variance as usual, we use earnings-at-risk (EaR) of terminal wealth as a measure of risk. In the settings of Black-Scholes type financial markets and constantly-rebalanced portfolio (CRP) investment strategies, we obtain closed-form expressions for the best CRP investment strategy and the efficient frontier of the mean-EaR problem, and compare our mean-EaR analysis to the classical mean-variance analysis and to the mean-CaR (capital-at-risk) analysis. We also examine some economic implications arising from using the mean-EaR model. © 2007 Springer Science+Business Media, LLC.postprin

Crossref

HKU Scholars Hub

Regret to the best vs. regret to the average

Author: A. Kalai
D. Helmbold
Eyal Even-Dar
Jennifer Wortman
Michael Kearns
N. Cesa-Bianchi
N. Cesa-Bianchi
N. Littlestone
P. Auer
T. Cover
V. Vovk
Y. Freund
Yishay Mansour
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A new PAC bound for intersection-closed concept classes

Author: A. Blumer
A. Ehrenfeucht
A. Floyd
D. Haussler
D. Helmbold
N. Sauer
P. Auer
Peter Auer
Ronald Ortner
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref