
    Hyperparameter Learning via Distributional Transfer

    Bayesian optimisation is a popular technique for hyperparameter learning, but it typically requires initial exploration even when similar prior tasks have already been solved. We propose to transfer information across tasks using learnt representations of the training datasets used in those tasks. This results in a joint Gaussian process model over hyperparameters and data representations. The representations build on the framework of distribution embeddings into reproducing kernel Hilbert spaces. The resulting method converges faster than existing baselines, in some cases requiring only a few evaluations of the target objective.
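
    A minimal sketch of the underlying idea, assuming random Fourier features as an approximation to the kernel mean embedding of each training dataset and scikit-learn's GP regressor as the joint surrogate; the function names, the toy data and the simple lower-confidence-bound rule are illustrative assumptions, not the authors' implementation.

        # Sketch: joint GP surrogate over (dataset embedding, hyperparameter) pairs.
        import numpy as np
        from sklearn.gaussian_process import GaussianProcessRegressor
        from sklearn.gaussian_process.kernels import RBF

        rng = np.random.default_rng(0)

        def rff_embedding(X, W, b):
            """Random Fourier feature approximation of a Gaussian-kernel mean
            embedding: phi(D) = mean_x sqrt(2/m) * cos(W x + b)."""
            m = W.shape[0]
            return np.sqrt(2.0 / m) * np.cos(X @ W.T + b).mean(axis=0)

        d, m = 5, 64                                  # input dim, number of random features
        W = rng.normal(size=(m, d))
        b = rng.uniform(0, 2 * np.pi, size=m)

        # Toy "prior tasks": datasets plus (hyperparameter, validation loss) observations.
        tasks = [rng.normal(size=(100, d)) for _ in range(3)]
        hypers = rng.uniform(-3, 0, size=(3, 10, 1))  # e.g. log learning rates
        losses = np.array([[np.sin(3 * h) + 0.1 * t for h in hp[:, 0]]
                           for t, hp in enumerate(hypers)])   # synthetic objectives

        # Joint inputs: [dataset embedding, hyperparameter].
        X_joint, y = [], []
        for D, hp, ls in zip(tasks, hypers, losses):
            phi = rff_embedding(D, W, b)
            for h, l in zip(hp, ls):
                X_joint.append(np.concatenate([phi, h]))
                y.append(l)

        gp = GaussianProcessRegressor(kernel=RBF(length_scale=1.0), normalize_y=True)
        gp.fit(np.array(X_joint), np.array(y))

        # For a new target task, the posterior at the new dataset's embedding gives
        # an informed surrogate before any target-objective evaluations.
        D_new = rng.normal(size=(80, d))
        phi_new = rff_embedding(D_new, W, b)
        cand = np.linspace(-3, 0, 50)[:, None]
        X_cand = np.hstack([np.tile(phi_new, (50, 1)), cand])
        mu, sd = gp.predict(X_cand, return_std=True)
        print("suggested hyperparameter:", cand[np.argmin(mu - sd)])  # simple LCB rule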

    A PAC-Bayesian bound for Lifelong Learning

    Transfer learning has received a lot of attention in the machine learning community in recent years, and several effective algorithms have been developed. However, relatively little is known about their theoretical properties, especially in the setting of lifelong learning, where the goal is to transfer information to tasks for which no data have been observed so far. In this work we study lifelong learning from a theoretical perspective. Our main result is a PAC-Bayesian generalization bound that offers a unified view on existing paradigms for transfer learning, such as the transfer of parameters or the transfer of low-dimensional representations. We also use the bound to derive two principled lifelong learning algorithms, and we show that these yield results comparable with existing methods.
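
    For context, the standard single-task PAC-Bayesian bound (for losses bounded in [0, 1]) has the form below; the exact constants vary between versions, and this generic statement is not the paper's lifelong-learning bound, which additionally couples a task-level prior and posterior shared across tasks.

        With probability at least $1-\delta$ over an i.i.d. sample of size $n$,
        simultaneously for all posteriors $Q$ over hypotheses,
        \[
          \mathbb{E}_{h \sim Q}[R(h)] \;\le\; \mathbb{E}_{h \sim Q}[\hat{R}(h)]
          + \sqrt{\frac{\mathrm{KL}(Q \,\|\, P) + \ln\frac{2\sqrt{n}}{\delta}}{2n}},
        \]
        where $P$ is a prior fixed before seeing the data, $R$ denotes the expected
        risk and $\hat{R}$ the empirical risk.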

    Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations

    Control applications often feature tasks with similar, but not identical, dynamics. We introduce the Hidden Parameter Markov Decision Process (HiP-MDP), a framework that parametrizes a family of related dynamical systems with a low-dimensional set of latent factors, and we propose a semiparametric regression approach for learning its structure from data. In the control setting, we show that a learned HiP-MDP rapidly identifies the dynamics of a new task instance, allowing an agent to adapt flexibly to task variations.
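
    A schematic sketch of the HiP-MDP idea under simplifying assumptions: the authors learn the structure with a semiparametric (Gaussian-process-based) regression model, whereas the linear dynamics basis, the least-squares latent inference and all names below are illustrative only.

        # Sketch: shared dynamics basis modulated by a low-dimensional latent
        # parameter per task instance; a new instance is identified by fitting
        # its latent from a handful of observed transitions.
        import numpy as np

        rng = np.random.default_rng(1)
        state_dim, action_dim, latent_dim = 4, 2, 2

        # Shared basis: next_state = sum_k theta_k * B_k @ [state; action]
        B = rng.normal(scale=0.1, size=(latent_dim, state_dim, state_dim + action_dim))

        def step(state, action, theta):
            """Transition of one task instance, parametrized by its latent theta."""
            x = np.concatenate([state, action])
            return np.tensordot(theta, B, axes=1) @ x

        def infer_latent(transitions):
            """Least-squares estimate of theta from (s, a, s') transitions of a
            new task instance, with the shared basis B held fixed."""
            A_rows, y_rows = [], []
            for s, a, s_next in transitions:
                x = np.concatenate([s, a])
                A_rows.append((B @ x).T)   # shape (state_dim, latent_dim)
                y_rows.append(s_next)
            theta, *_ = np.linalg.lstsq(np.vstack(A_rows), np.concatenate(y_rows), rcond=None)
            return theta

        # Simulate a new task instance with an unknown latent, then recover it.
        theta_true = rng.normal(size=latent_dim)
        transitions = []
        for _ in range(10):
            s = rng.normal(size=state_dim)
            a = rng.normal(size=action_dim)
            transitions.append((s, a, step(s, a, theta_true)))

        theta_hat = infer_latent(transitions)
        print("true latent:", theta_true, "estimated:", theta_hat)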