4 research outputs found

    Bayesian Reward Filtering

    Full text link

    From Supervised to Reinforcement Learning: a Kernel-based Bayesian Filtering Framework

    No full text
    In many applications, engineers have to estimate a function linked to the state of a dynamic system. To do so, a sequence of samples drawn from this unknown function is observed while the system transits from state to state, and the problem is to generalize these observations to unvisited states. Several solutions can be envisioned, among them regressing a family of parameterized functions so that it best fits the observed samples. This is the first problem addressed with the proposed kernel-based Bayesian filtering approach, which also quantifies the reduction in uncertainty obtained as more samples are acquired. Classical methods cannot handle the case where the actual samples are not directly observable and only a nonlinear mapping of them is available, which happens when a special sensor has to be used or when solving the Bellman equation in order to control the system. The approach proposed in this paper, however, can be extended to this more difficult case. Moreover, an application of this indirect function approximation scheme to reinforcement learning is presented. A set of experiments is also reported to demonstrate the efficiency of this kernel-based Bayesian approach.
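    To make the filtering idea concrete, here is a minimal sketch (not the paper's implementation): the unknown function is written as a weighted sum of Gaussian kernels, so each observed sample becomes a noisy linear measurement of the weight vector, and a plain Kalman filter tracks both the weight estimate and its remaining uncertainty. The kernel centres, widths, noise levels and the sine target in the example are illustrative assumptions.

```python
import numpy as np

# Sketch only: the unknown function is represented as f(x) = w . phi(x), where
# phi(x) collects Gaussian kernels, and the weight vector w is tracked by a
# Kalman filter.  Each sample (x_t, y_t) is a noisy linear measurement of w,
# so the posterior mean and covariance of w quantify the remaining uncertainty.

def gaussian_features(x, centres, width=0.5):
    """Evaluate Gaussian kernels centred at `centres` for a scalar input x."""
    return np.exp(-0.5 * ((x - centres) / width) ** 2)

def kalman_regression(samples, centres, obs_noise=0.05, prior_var=10.0):
    """Sequentially fit kernel weights with a Kalman filter.

    Returns the posterior mean and covariance of the weights after
    processing all samples."""
    n = len(centres)
    mean = np.zeros(n)                 # prior mean of the weights
    cov = prior_var * np.eye(n)        # prior covariance (initial uncertainty)
    for x, y in samples:
        h = gaussian_features(x, centres)      # measurement vector
        innov = y - h @ mean                   # prediction error
        s = h @ cov @ h + obs_noise            # innovation variance
        gain = cov @ h / s                     # Kalman gain
        mean = mean + gain * innov             # posterior mean update
        cov = cov - np.outer(gain, h @ cov)    # posterior covariance update
    return mean, cov

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    centres = np.linspace(-3, 3, 15)
    target = np.sin                            # hypothetical unknown function
    xs = rng.uniform(-3, 3, 200)
    samples = [(x, target(x) + 0.05 * rng.normal()) for x in xs]
    mean, cov = kalman_regression(samples, centres)
    # The posterior variance of a prediction shrinks as more samples are
    # observed, illustrating the uncertainty-reduction property.
    xq = 1.0
    h = gaussian_features(xq, centres)
    print("prediction:", h @ mean, "variance:", h @ cov @ h)
```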

    Bayesian Reward Filtering

    No full text
    A wide variety of function approximation schemes have been applied to reinforcement learning. However, Bayesian filtering approaches, which have proven efficient in other fields such as neural network training, have been little studied. We propose a general Bayesian filtering framework for reinforcement learning, as well as a specific implementation based on sigma-point Kalman filtering and kernel machines. This allows us to derive an efficient off-policy, model-free approximate temporal-differences algorithm, which is demonstrated on two simple benchmarks.
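    As a rough illustration of the sigma-point filtering idea, the sketch below treats the value-function parameters as the hidden state of an unscented Kalman filter and models each observed reward as a Bellman-residual measurement of those parameters. The tiny tanh value parameterisation, the noise levels and the toy chain are assumptions made for the example; the paper's kernel-machine implementation and off-policy treatment are not reproduced here.

```python
import numpy as np

# Sketch only: the filter state is the parameter vector theta of a value
# function V_theta(s); the observed reward r_t is modelled as the Bellman
# residual V_theta(s_t) - gamma * V_theta(s_{t+1}) plus noise.  Because this
# mapping is nonlinear in theta, the update uses sigma points (unscented
# transform) rather than a linearisation.

GAMMA = 0.95

def value(theta, s):
    """Hypothetical nonlinear value parameterisation: one tanh hidden unit."""
    w1, b1, w2 = theta
    return w2 * np.tanh(w1 * s + b1)

def sigma_points(mean, cov, kappa=1.0):
    """Standard symmetric sigma-point set for the unscented transform."""
    n = len(mean)
    sqrt_cov = np.linalg.cholesky((n + kappa) * cov)
    pts = [mean] + [mean + sqrt_cov[:, i] for i in range(n)] \
                 + [mean - sqrt_cov[:, i] for i in range(n)]
    weights = [kappa / (n + kappa)] + [1.0 / (2 * (n + kappa))] * (2 * n)
    return np.array(pts), np.array(weights)

def ukf_reward_update(mean, cov, s, r, s_next, obs_noise=0.1, process_noise=1e-4):
    """One unscented Kalman update of theta from an observed transition."""
    cov = cov + process_noise * np.eye(len(mean))   # random-walk parameter drift
    pts, w = sigma_points(mean, cov)
    # Predicted reward for each sigma point under the Bellman-residual model.
    preds = np.array([value(p, s) - GAMMA * value(p, s_next) for p in pts])
    r_hat = w @ preds                                      # predicted reward
    p_rr = w @ (preds - r_hat) ** 2 + obs_noise            # its variance
    p_tr = (w[:, None] * (pts - mean)).T @ (preds - r_hat) # cross-covariance
    gain = p_tr / p_rr                                     # Kalman gain
    mean = mean + gain * (r - r_hat)
    cov = cov - np.outer(gain, gain) * p_rr
    return mean, cov

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    mean, cov = np.array([1.0, 0.0, 1.0]), np.eye(3)
    s = 0.0
    for _ in range(200):
        s_next = np.clip(s + rng.normal(scale=0.3), -1, 1)  # toy chain
        r = 0.3 * s + 0.1 * rng.normal()                    # toy reward signal
        mean, cov = ukf_reward_update(mean, cov, s, r, s_next)
        s = s_next
    print("estimated parameters:", mean)
```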