
    Learning Team Strategies: Soccer Case Studies

    We use simulated soccer to study multiagent learning. Each team's players (agents) share an action set and policy, but may behave differently due to position-dependent inputs. All agents making up a team are rewarded or punished collectively when goals are scored. We conduct simulations with varying team sizes and compare several learning algorithms: TD-Q learning with linear neural networks (TD-Q), Probabilistic Incremental Program Evolution (PIPE), and a PIPE version that learns by coevolution (CO-PIPE). TD-Q is based on learning evaluation functions (EFs) mapping input/action pairs to expected reward. PIPE and CO-PIPE search policy space directly. They use adaptive probability distributions to synthesize programs that calculate action probabilities from current inputs. Our results show that linear TD-Q encounters several difficulties in learning appropriate shared EFs. PIPE and CO-PIPE, however, do not depend on EFs and find good policies faster and more reliably. This suggests that in some multiagent learning scenarios direct search in policy space can offer advantages over EF-based approaches.
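
    As an illustration of the EF-based approach described above, here is a minimal sketch of a shared linear TD-Q evaluation function in Python. The class name, feature encoding and hyperparameters (alpha, gamma) are assumptions made for illustration only, not taken from the paper; all players of a team would share one such object and receive the same collective reward when a goal is scored.

    import numpy as np

    class LinearTDQ:
        """Sketch of a shared linear evaluation function (EF): one weight
        vector per action maps the current input features to an estimate
        of expected reward (assumed structure, not the paper's code)."""

        def __init__(self, n_inputs, n_actions, alpha=0.01, gamma=0.9):
            self.w = np.zeros((n_actions, n_inputs))  # linear EF weights
            self.alpha = alpha  # learning rate
            self.gamma = gamma  # discount factor

        def value(self, x, a):
            # EF(x, a): estimated expected reward of action a for input x
            return self.w[a] @ x

        def best_action(self, x):
            return int(np.argmax(self.w @ x))

        def update(self, x, a, reward, x_next, done):
            # TD-Q update on the shared weights; every agent in the team
            # applies this with the same collective reward signal.
            target = reward if done else reward + self.gamma * np.max(self.w @ x_next)
            td_error = target - self.value(x, a)
            self.w[a] += self.alpha * td_error * x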

    A linear estimation-of-distribution gp system

    We present N-gram GP, an estimation of distribution algorithm for the evolution of linear computer programs. The algorithm learns and samples a joint probability distribution of triplets of instructions (or 3-grams) while simultaneously learning and sampling a program length distribution. We have tested N-gram GP on symbolic regression problems where the target function is a polynomial of up to degree 12, and on lawn-mower problems with lawn sizes of up to 12 × 12. Results show that the algorithm is effective and scales better on these problems than either linear GP or simple stochastic hill-climbing.
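
    To make the description above concrete, the sketch below shows one way the learned model could be represented and sampled: 3-gram instruction counts plus an empirical program-length distribution. The instruction set, symbol names and uniform back-off rule are assumptions for illustration; the fitness evaluation and model-update schedule of the actual algorithm are omitted.

    import random
    from collections import defaultdict

    INSTRUCTIONS = ["ADD", "SUB", "MUL", "DIV", "DUP", "SWAP", "X", "CONST"]  # hypothetical set
    START = "<s>"  # padding symbol for the first two positions

    class NGramModel:
        """Sketch of the distribution N-gram GP maintains: counts of
        instruction triplets plus a program-length distribution."""

        def __init__(self):
            self.trigrams = defaultdict(lambda: defaultdict(int))
            self.lengths = defaultdict(int)

        def learn(self, program):
            # Update counts from one (presumably fit) linear program.
            self.lengths[len(program)] += 1
            padded = [START, START] + list(program)
            for i in range(2, len(padded)):
                self.trigrams[(padded[i - 2], padded[i - 1])][padded[i]] += 1

        def sample(self):
            # Draw a length, then instructions conditioned on the last two.
            lengths, weights = zip(*self.lengths.items())
            target_len = random.choices(lengths, weights)[0]
            program, prev = [], (START, START)
            for _ in range(target_len):
                counts = self.trigrams.get(prev)
                if counts:
                    instr = random.choices(list(counts), list(counts.values()))[0]
                else:
                    instr = random.choice(INSTRUCTIONS)  # back off to uniform
                program.append(instr)
                prev = (prev[1], instr)
            return program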

    Memetic algorithms: The polynomial local search complexity theory perspective

    In previous work (Krasnogor, http://www.cs.nott.ac.uk/~nxk/papers.html. In: Studies on the Theory and Design Space of Memetic Algorithms. Ph.D. thesis, University of the West of England, Bristol, UK, 2002; Krasnogor and Smith, IEEE Trans Evol Algorithms 9(6):474-488, 2005) we developed a syntax-only classification of evolutionary algorithms, in particular so-called memetic algorithms (MAs). When "syntactic sugar" is added to our model, we are able to investigate the polynomial local search (PLS) complexity of memetic algorithms. In this paper we show the PLS-completeness of whole classes of problems that arise when memetic algorithms are applied to the travelling salesman problem using a range of mutation, crossover and local search operators. Our PLS-completeness results shed light on the worst-case behaviour that can be expected of a memetic algorithm under these circumstances. Moreover, we point out that memetic algorithms for graph partitioning and maximum network flow (both with important practical applications) also give rise to PLS-complete problems. © 2007 Springer Science + Business Media B.V.
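
    For readers unfamiliar with the algorithms the results apply to, below is a minimal generic memetic algorithm for the travelling salesman problem combining crossover, mutation and 2-opt local search. The operators and parameters are common textbook choices picked for illustration, not the specific operator classes analysed in the paper; the 2-opt loop runs to a local optimum, which is the kind of object PLS complexity reasons about.

    import random

    def tour_length(tour, dist):
        return sum(dist[tour[i]][tour[(i + 1) % len(tour)]] for i in range(len(tour)))

    def two_opt(tour, dist):
        # 2-opt local search: reverse segments until no improving move exists,
        # i.e. until a local optimum of the 2-opt neighbourhood is reached.
        improved = True
        while improved:
            improved = False
            for i in range(1, len(tour) - 1):
                for j in range(i + 1, len(tour)):
                    new = tour[:i] + tour[i:j + 1][::-1] + tour[j + 1:]
                    if tour_length(new, dist) < tour_length(tour, dist):
                        tour, improved = new, True
        return tour

    def order_crossover(p1, p2):
        # OX crossover: copy a slice from p1, fill the rest in p2's order.
        a, b = sorted(random.sample(range(len(p1)), 2))
        child = p1[a:b]
        child += [c for c in p2 if c not in child]
        return child

    def swap_mutation(tour):
        i, j = random.sample(range(len(tour)), 2)
        tour = tour[:]
        tour[i], tour[j] = tour[j], tour[i]
        return tour

    def memetic_tsp(dist, pop_size=20, generations=50):
        n = len(dist)
        pop = [two_opt(random.sample(range(n), n), dist) for _ in range(pop_size)]
        for _ in range(generations):
            p1, p2 = random.sample(pop, 2)
            child = two_opt(swap_mutation(order_crossover(p1, p2)), dist)
            worst = max(range(pop_size), key=lambda k: tour_length(pop[k], dist))
            if tour_length(child, dist) < tour_length(pop[worst], dist):
                pop[worst] = child  # steady-state replacement of the worst tour
        return min(pop, key=lambda t: tour_length(t, dist))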