An Adaptive Strategy for Active Learning with Smooth Decision Boundary
We present the first adaptive strategy for active learning in the setting of
classification with smooth decision boundary. The problem of adaptivity (to
unknown distributional parameters) has remained open since the seminal work
of Castro and Nowak (2007), which first established (active learning) rates for
this setting. While some recent advances on this problem establish adaptive
rates in the case of univariate data, adaptivity in the more practical setting
of multivariate data has so far remained elusive. Combining insights from
various recent works, we show that, for the multivariate case, a careful
reduction to univariate-adaptive strategies yields near-optimal rates without
prior knowledge of distributional parameters.
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
Policy gradient methods have had great success in solving continuous control
tasks, yet the stochastic nature of such problems makes deterministic value
estimation difficult. We propose an approach which instead estimates a
distribution by fitting the value function with a Bayesian Neural Network. We
optimize an α-divergence objective with Bayesian dropout approximation
to learn and estimate this distribution. We show that using the Monte Carlo
posterior mean of the Bayesian value function distribution, rather than a
deterministic network, improves stability and performance of policy gradient
methods in continuous control MuJoCo simulations. Comment: Accepted to the Bayesian Deep Learning Workshop at NIPS 201
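As a toy illustration of the Monte Carlo posterior mean described above, the sketch below keeps dropout active at evaluation time and averages T stochastic forward passes of a value network. The two-layer network, its weights, and the state vector are all made-up assumptions for illustration, not the paper's implementation.

```python
import random

random.seed(0)

# Made-up weights for a tiny 4-16-1 value network (illustrative only).
W1 = [[random.gauss(0, 1) for _ in range(16)] for _ in range(4)]
W2 = [random.gauss(0, 1) for _ in range(16)]
p_drop = 0.1  # dropout probability, kept active at evaluation time

def value_mc_dropout(state, T=100):
    """Approximate the posterior mean/std of the value by averaging
    T stochastic forward passes with dropout left on."""
    samples = []
    for _ in range(T):
        # ReLU hidden layer
        h = [max(0.0, sum(state[i] * W1[i][j] for i in range(4)))
             for j in range(16)]
        # Inverted dropout: zero a unit with prob p_drop, rescale the rest
        h = [hj / (1 - p_drop) if random.random() >= p_drop else 0.0
             for hj in h]
        samples.append(sum(hj * w for hj, w in zip(h, W2)))
    mean = sum(samples) / T
    std = (sum((s - mean) ** 2 for s in samples) / T) ** 0.5
    return mean, std

mean_v, std_v = value_mc_dropout([0.5, -0.2, 1.0, 0.3])
```

Using `mean_v` in place of a single deterministic forward pass is the stabilizing substitution the abstract advocates; `std_v` is a free by-product that quantifies the value estimate's uncertainty.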
Log-Distributional Approach for Learning Covariate Shift Ratios
Distributional Reinforcement Learning theory suggests that distributional fixed points could play a fundamental role in learning non-additive value functions. In particular, we propose a distributional approach for learning Covariate Shift Ratios, whose update rule is originally multiplicative.
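The log-domain reformulation this abstract alludes to can be shown in a few lines: a multiplicative update on a ratio, w ← w·g, becomes an additive update on its logarithm, log w ← log w + log g. The update factors below are made up for illustration.

```python
import math

# Multiplicative update on the ratio vs. equivalent additive update in
# log space; made-up factors g, purely to illustrate the identity.
w, log_w = 1.0, 0.0
for g in [1.5, 0.8, 1.2, 0.9]:
    w *= g                # multiplicative update on the ratio itself
    log_w += math.log(g)  # additive update on its logarithm

assert abs(math.exp(log_w) - w) < 1e-12  # the two trajectories agree
```

Working in log space turns the multiplicative fixed-point update into an ordinary additive one, the form standard temporal-difference machinery expects.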
Active Nearest-Neighbor Learning in Metric Spaces
We propose a pool-based non-parametric active learning algorithm for general
metric spaces, called MArgin Regularized Metric Active Nearest Neighbor
(MARMANN), which outputs a nearest-neighbor classifier. We give prediction
error guarantees that depend on the noisy-margin properties of the input
sample, and are competitive with those obtained by previously proposed passive
learners. We prove that the label complexity of MARMANN is significantly lower
than that of any passive learner with similar error guarantees. MARMANN is
based on a generalized sample compression scheme, and a new label-efficient
active model-selection procedure.
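The output of MARMANN is a nearest-neighbor classifier over a compressed prototype set. A minimal stand-in for that final predictor, with an illustrative Euclidean metric and made-up prototypes (not the paper's compression or selection procedure), looks like this:

```python
import math

def predict_1nn(x, prototypes, labels):
    """Label a point by its nearest prototype under Euclidean distance."""
    d = [math.dist(x, p) for p in prototypes]
    return labels[d.index(min(d))]

# Made-up compressed sample: two prototypes, one per class.
prototypes = [(0.0, 0.0), (1.0, 1.0)]
labels = ["neg", "pos"]

print(predict_1nn((0.9, 0.8), prototypes, labels))  # -> pos
```

Any metric can replace `math.dist` here; the paper's guarantees concern which labeled points end up in the prototype set, not the nearest-neighbor rule itself.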