Search CORE

2 research outputs found

A stochastic multi-armed bandit approach to nonparametric H∞-norm estimation

Author: Müller Matias I.
Proutiere Alexandre
Rojas Cristian R.
Valenzuela Patricio Esteban
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

We study the problem of estimating the largest gain of an unknown linear and time-invariant filter, which is also known as the H∞ norm of the system. By using ideas from the stochastic multi-armed bandit framework, we present a new algorithm that sequentially designs an input signal in order to estimate this quantity by means of input-output data. The algorithm is shown empirically to beat an asymptotically optimal method, known as Thompson Sampling, in the sense of its cumulative regret function. Finally, for a general class of algorithms, a lower bound on the performance of finding the H-infinity norm is derived.QC 20180306</p

Publikationer från KTH

Crossref

Digitala Vetenskapliga Arkivet - Academic Archive On-line