5,281 research outputs found

    Basis Function Construction in Reinforcement Learning using Cascade-Correlation Learning Architecture

    In reinforcement learning, it is common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data in a more informative form that facilitates and improves subsequent steps. Since a "good" set of basis functions results in better solutions, and defining such functions becomes a challenge as problem complexity increases, it is beneficial to be able to generate them automatically. In this paper, we propose a new approach, based on the Bellman residual, for constructing basis functions using the cascade-correlation learning architecture. We show how this approach can be applied to the Least Squares Policy Iteration algorithm to obtain a better approximation of the value function, and consequently to improve the performance of the resulting policies. We also demonstrate the effectiveness of the method empirically on some benchmark problems.

    Incremental Basis Function Expansion in Reinforcement Learning using Cascade-Correlation Networks

    In reinforcement learning, it is common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data in a more informative form that facilitates and improves subsequent steps. Since a "good" set of basis functions results in better solutions, and defining such functions becomes a challenge as problem complexity increases, it is beneficial to be able to generate them automatically. In this paper, we propose a new approach, based on the Bellman residual, for constructing basis functions using the cascade-correlation learning architecture. We show how this approach can be applied to the Least Squares Policy Iteration algorithm to obtain a better approximation of the value function, and consequently to improve the performance of the resulting policies. We also demonstrate the effectiveness of the method empirically on some benchmark problems.
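
The basis-construction loop these abstracts describe can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it substitutes direct Bellman-residual minimization for the full LSPI/LSTD-Q machinery, trains a single sigmoid cascade unit by gradient ascent on its covariance with the residual (the cascade-correlation criterion), and every name here (`brm_weights`, `cascade_unit`, the toy chain MDP) is hypothetical.

```python
import numpy as np

GAMMA = 0.9

def brm_weights(Phi, rewards, Phi_next, reg=1e-8):
    """Weights minimizing the Bellman residual ||r + gamma*Phi'w - Phi*w||^2."""
    M = Phi - GAMMA * Phi_next
    return np.linalg.solve(M.T @ M + reg * np.eye(M.shape[1]), M.T @ rewards)

def bellman_residuals(Phi, rewards, Phi_next, w):
    """Per-sample Bellman residual of the current value-function fit."""
    return rewards + GAMMA * (Phi_next @ w) - Phi @ w

def cascade_unit(states, residuals, lr=0.1, epochs=500, seed=0):
    """Train one sigmoid hidden unit to maximize the magnitude of its
    covariance with the residuals (cascade-correlation criterion);
    return its activations as a new basis function."""
    rng = np.random.default_rng(seed)
    v = rng.normal(scale=0.1, size=states.shape[1])
    c = residuals - residuals.mean()
    for _ in range(epochs):
        h = 1.0 / (1.0 + np.exp(-states @ v))
        S = np.sum((h - h.mean()) * c)               # covariance with residual
        grad = np.sign(S) * ((c * h * (1.0 - h)) @ states)
        v += lr * grad / len(states)
    return 1.0 / (1.0 + np.exp(-states @ v))

# Toy 3-state chain 0 -> 1 -> 2 -> 2, reward only at state 2,
# starting from a bias-only feature set that cannot represent V.
states = np.eye(3)                  # one-hot raw state encoding
nxt = np.array([1, 2, 2])
rewards = np.array([0.0, 0.0, 1.0])
Phi = np.ones((3, 1))               # bias feature only
w = brm_weights(Phi, rewards, Phi[nxt])
res0 = bellman_residuals(Phi, rewards, Phi[nxt], w)

# Grow the basis with one cascade-correlation unit fit to the residual.
h = cascade_unit(states, res0)
Phi2 = np.column_stack([Phi, h])
w2 = brm_weights(Phi2, rewards, Phi2[nxt])
res1 = bellman_residuals(Phi2, rewards, Phi2[nxt], w2)
```

Each new unit is trained to correlate with whatever the current basis cannot explain, so appending its activations enlarges the column space the least-squares fit searches over; in this residual-minimization variant the Bellman residual norm is therefore non-increasing as the basis grows.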

    Basis Expansion in Natural Actor Critic Methods

    In reinforcement learning, the aim of the agent is to find a policy that maximizes its expected return. Policy gradient methods try to accomplish this goal by directly approximating the policy using a parametric function approximator: the expected return of the current policy is estimated, and the parameters are updated by steepest ascent in the direction of the gradient of the expected return with respect to the policy parameters. In general, the policy is defined in terms of a set of basis functions that capture important features of the problem. Since the quality of the resulting policies depends directly on the set of basis functions, and defining them gets harder as the complexity of the problem increases, it is important to be able to find them automatically. In this paper, we propose a new approach that uses the cascade-correlation learning architecture for automatically constructing a set of basis functions within the context of Natural Actor-Critic (NAC) algorithms. Such basis functions allow more complex policies to be represented, and consequently improve the performance of the resulting policies. We also demonstrate the effectiveness of the method empirically.
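
The NAC setting this abstract builds on rests on a standard identity: for a policy whose "compatible" features are the gradients of its log-probability, the natural gradient equals the least-squares weights that fit the advantage function. A rough sketch of that identity for a softmax policy over basis features; the names (`softmax_probs`, `compatible_features`, `natural_gradient`) and the synthetic recovery check are illustrative assumptions, not the paper's code.

```python
import numpy as np

def softmax_probs(theta, phi):
    """phi: (n_actions, d) action features; returns softmax action probabilities."""
    prefs = phi @ theta
    e = np.exp(prefs - prefs.max())
    return e / e.sum()

def compatible_features(theta, phi, action):
    """Gradient of log pi(a|s): the action's features minus the
    policy-averaged features (the 'compatible' basis of NAC)."""
    p = softmax_probs(theta, phi)
    return phi[action] - p @ phi

def natural_gradient(grads, advantages, reg=1e-6):
    """With compatible features as regressors, the natural gradient F^{-1} g
    equals the least-squares weights w solving advantages ~= grads @ w."""
    G = np.asarray(grads)
    A = G.T @ G + reg * np.eye(G.shape[1])
    return np.linalg.solve(A, G.T @ np.asarray(advantages))

# Sanity checks on a tiny synthetic setup: 3 actions, 2 basis features.
rng = np.random.default_rng(1)
theta = np.zeros(2)
phi = rng.normal(size=(3, 2))
p = softmax_probs(theta, phi)
G = np.array([compatible_features(theta, phi, a) for a in range(3)])
mean_g = p @ G                      # policy-averaged score; should vanish

# Recover known advantage weights from compatible-feature regressors.
w_true = np.array([1.0, -2.0])
samples = rng.normal(size=(50, 2))
w_hat = natural_gradient(samples, samples @ w_true)
```

Because the compatible features average to zero under the policy, regressing sampled advantages on them isolates exactly the direction the natural gradient update follows; richer basis functions for `phi` (e.g. ones grown by cascade-correlation, as the paper proposes) widen the class of policies this update can reach.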

    Session 5: Development, Neuroscience and Evolutionary Psychology

    Proceedings of the Pittsburgh Workshop in History and Philosophy of Biology, Center for Philosophy of Science, University of Pittsburgh, March 23-24, 2001. Session 5: Development, Neuroscience and Evolutionary Psychology.

    Constructivism, epistemology and information processing

    The author analyzes the main models of artificial intelligence that deal with the transition from one stage to another, a central problem in development. He describes the contributions of rule-based systems and connectionist systems to an explanation of this transition. He considers that artificial intelligence models, in spite of their limitations, establish fruitful points of contact with the constructivist position.
