Search CORE

208 research outputs found

Learning Linear Dynamical Systems via Spectral Filtering

Author: Hazan Elad
Singh Karan
Zhang Cyril
Publication venue
Publication date: 01/01/2017
Field of study

We present an efficient and practical algorithm for the online prediction of discrete-time linear dynamical systems with a symmetric transition matrix. We circumvent the non-convex optimization problem using improper learning: carefully overparameterize the class of LDSs by a polylogarithmic factor, in exchange for convexity of the loss functions. From this arises a polynomial-time algorithm with a near-optimal regret guarantee, with an analogous sample complexity bound for agnostic learning. Our algorithm is based on a novel filtering technique, which may be of independent interest: we convolve the time series with the eigenvectors of a certain Hankel matrix.Comment: Published as a conference paper at NIPS 201

arXiv.org e-Print Archive

Princeton University Open Access Repository

Regret Minimization in Partially Observable Linear Quadratic Control

Author: Anandkumar Anima
Azizzadenesheli Kamyar
Hassibi Babak
Lale Sahin
Publication venue
Publication date: 31/01/2020
Field of study

We study the problem of regret minimization in partially observable linear quadratic control systems when the model dynamics are unknown a priori. We propose ExpCommit, an explore-then-commit algorithm that learns the model Markov parameters and then follows the principle of optimism in the face of uncertainty to design a controller. We propose a novel way to decompose the regret and provide an end-to-end sublinear regret upper bound for partially observable linear quadratic control. Finally, we provide stability guarantees and establish a regret upper bound of O(T^(2/3)) for ExpCommit, where T is the time horizon of the problem

arXiv.org e-Print Archive

Caltech Authors