25,128 research outputs found

Evolutionary Algorithms for Reinforcement Learning

Author: Grefenstette J. J.
Moriarty D. E.
Schultz A. C.
Publication venue: 'AI Access Foundation'
Publication date: 01/06/2011

There are two distinct approaches to solving reinforcement learning problems: searching in value function space and searching in policy space. Temporal difference methods and evolutionary algorithms are well-known examples of these approaches. Kaelbling, Littman and Moore recently provided an informative survey of temporal difference methods. This article focuses on the application of evolutionary algorithms to the reinforcement learning problem, emphasizing alternative policy representations, credit assignment methods, and problem-specific genetic operators. Strengths and weaknesses of the evolutionary approach to reinforcement learning are presented, along with a survey of representative applications.
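To make "searching in policy space" concrete, the sketch below evolves a tabular policy directly, scoring each candidate only by its episode return. Everything in it is an illustrative assumption rather than material from the article: the five-state corridor task, the reward (the state index after each step), the truncation selection, and the names `episode_return`, `mutate`, and `evolve`.

```python
import random

# Hypothetical toy task (not from the article): a 5-state corridor.
# Action 1 moves right, action 0 moves left; the reward after each step
# is the index of the state reached, so staying far right pays most.
N_STATES = 5
MAX_STEPS = 10

def episode_return(policy):
    """Run one deterministic episode and return the summed reward."""
    state, total = 0, 0
    for _ in range(MAX_STEPS):
        if policy[state] == 1:
            state = min(state + 1, N_STATES - 1)
        else:
            state = max(state - 1, 0)
        total += state
    return total

def mutate(policy, rng, rate=0.2):
    """Flip each action independently with probability `rate`."""
    return [1 - a if rng.random() < rate else a for a in policy]

def evolve(pop_size=20, generations=30, seed=0):
    """Truncation selection with elitism over whole policies."""
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(N_STATES)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        # Fitness is the episode return alone; no value function is learned.
        population.sort(key=episode_return, reverse=True)
        parents = population[: pop_size // 2]
        children = [mutate(rng.choice(parents), rng)
                    for _ in range(pop_size - len(parents))]
        population = parents + children
    return max(population, key=episode_return)

best_policy = evolve()
```

Credit is assigned to a whole policy at once, which is the contrast with temporal difference methods that update value estimates for individual states.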

arXiv.org e-Print Archive

Crossref

Learning behavior in abstract memory schemes for dynamic optimization problems

Author: Richter H
Yang S
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

This is the post-print version of this article. Copyright © 2009 Springer Verlag. Integrating memory into evolutionary algorithms is one major approach to enhance their performance in dynamic environments. An abstract memory scheme has been recently developed for evolutionary algorithms in dynamic environments, where the abstraction of good solutions is stored in the memory instead of good solutions themselves to improve future problem solving. This paper further investigates this abstract memory with a focus on understanding the relationship between learning and memory, which is an important but poorly studied issue for evolutionary algorithms in dynamic environments. The experimental study shows that the abstract memory scheme enables learning processes and hence efficiently improves the performance of evolutionary algorithms in dynamic environments. The work by S. Yang was supported by the Engineering and Physical Sciences Research Council (EPSRC) of UK under Grant EP/E060722/1.

De Montfort University Open Research Archive

Brunel University Research Archive

Leicester Research Archive

Digital Ecosystems: Ecosystem-Oriented Architectures

Author: Briscoe Gerard
De Wilde Philippe
Sadedin Suzanne
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/08/2011

We view Digital Ecosystems to be the digital counterparts of biological ecosystems. Here, we are concerned with the creation of these Digital Ecosystems, exploiting the self-organising properties of biological ecosystems to evolve high-level software applications. Therefore, we created the Digital Ecosystem, a novel optimisation technique inspired by biological ecosystems, where the optimisation works at two levels: a first optimisation, migration of agents which are distributed in a decentralised peer-to-peer network, operating continuously in time; this process feeds a second optimisation based on evolutionary computing that operates locally on single peers and is aimed at finding solutions to satisfy locally relevant constraints. The Digital Ecosystem was then measured experimentally through simulations, with measures originating from theoretical ecology, evaluating its likeness to biological ecosystems. This included its responsiveness to requests for applications from the user base, as a measure of the ecological succession (ecosystem maturity). Overall, we have advanced the understanding of Digital Ecosystems, creating Ecosystem-Oriented Architectures where the word ecosystem is more than just a metaphor. Comment: 39 pages, 26 figures, journa

arXiv.org e-Print Archive

Heriot Watt Pure

Crossref

Kent Academic Repository

Multi-criteria Evolution of Neural Network Topologies: Balancing Experience and Performance in Autonomous Systems

Author: Behjat Amir
Chidambaran Sharat
Chowdhury Souma
Publication date: 20/07/2018

The majority of Artificial Neural Network (ANN) implementations in autonomous systems use a fixed, user-prescribed network topology, leading to sub-optimal performance and low portability. The existing NeuroEvolution of Augmenting Topologies (NEAT) paradigm offers a powerful alternative by allowing the network topology and the connection weights to be simultaneously optimized through an evolutionary process. However, most NEAT implementations allow the consideration of only a single objective. There also persists the question of how to tractably introduce topological diversification that mitigates overfitting to training scenarios. To address these gaps, this paper develops a multi-objective neuro-evolution algorithm. While adopting the basic elements of NEAT, important modifications are made to the selection, speciation, and mutation processes. With the backdrop of small-robot path-planning applications, an experience-gain criterion is derived to encapsulate the amount of diverse local environment encountered by the system. This criterion facilitates the evolution of genes that support exploration, thereby seeking to generalize from a smaller set of mission scenarios than possible with performance maximization alone. The effectiveness of the single-objective (optimizing performance) and the multi-objective (optimizing performance and experience-gain) neuro-evolution approaches is evaluated on two different small-robot cases, with ANNs obtained by the multi-objective optimization observed to provide superior performance in unseen scenarios.
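The abstract does not say how the two objectives are combined during selection. One common choice in multi-objective evolution is Pareto dominance, sketched below under that assumption. The score tuples `(performance, experience_gain)` and the helper names `dominates` and `pareto_front` are hypothetical and are not taken from the paper.

```python
def dominates(a, b):
    """True if `a` is no worse than `b` on every objective and strictly
    better on at least one (both objectives are maximized)."""
    return (all(x >= y for x, y in zip(a, b))
            and any(x > y for x, y in zip(a, b)))

def pareto_front(scores):
    """Keep the candidates that no other candidate dominates."""
    return [s for s in scores
            if not any(dominates(other, s) for other in scores)]

# Hypothetical (performance, experience_gain) scores for four networks.
candidates = [(0.9, 0.2), (0.7, 0.6), (0.5, 0.5), (0.3, 0.9)]
front = pareto_front(candidates)
```

Here `(0.5, 0.5)` is dropped because `(0.7, 0.6)` beats it on both objectives, while the other three survive as different trade-offs between performance and exploration.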

arXiv.org e-Print Archive

Crossref

Born to learn: The inspiration, progress, and future of evolved plastic artificial neural networks

Author: Andrea Soltoggio
Kenneth O. Stanley
Sebastian Risi
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018

Biological plastic neural networks are systems of extraordinary computational capabilities shaped by evolution, development, and lifetime learning. The interplay of these elements leads to the emergence of adaptive behavior and intelligence. Inspired by such intricate natural phenomena, Evolved Plastic Artificial Neural Networks (EPANNs) use simulated evolution in silico to breed plastic neural networks with a large variety of dynamics, architectures, and plasticity rules: these artificial systems are composed of inputs, outputs, and plastic components that change in response to experiences in an environment. These systems may autonomously discover novel adaptive algorithms, and lead to hypotheses on the emergence of biological adaptation. EPANNs have seen considerable progress over the last two decades. Current scientific and technological advances in artificial neural networks are now setting the conditions for radically new approaches and results. In particular, the limitations of hand-designed networks could be overcome by more flexible and innovative solutions. This paper brings together a variety of inspiring ideas that define the field of EPANNs. The main methods and results are reviewed. Finally, new opportunities and developments are presented.
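As a rough illustration of a "plastic component", the sketch below applies a parameterised Hebbian update to a single connection weight. The abstract does not specify any rule. The form used here (a learning rate plus four coefficients that evolution would tune) is one that appears in neuroevolution work and is assumed purely for illustration, as are the names `plastic_update` and `rule`.

```python
def plastic_update(weight, pre, post, rule):
    """Return the new weight after one experience.

    `rule` is a hypothetical evolvable genome (eta, a, b, c, d):
    delta_w = eta * (a*pre*post + b*pre + c*post + d)
    """
    eta, a, b, c, d = rule
    return weight + eta * (a * pre * post + b * pre + c * post + d)

# Pure Hebbian rule: only the correlation term is active.
hebbian = (0.5, 1.0, 0.0, 0.0, 0.0)
w = plastic_update(0.0, pre=1.0, post=1.0, rule=hebbian)
```

In this framing evolution searches over `rule` (and possibly the architecture), while the weights themselves change during the lifetime of the network in response to activity.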

arXiv.org e-Print Archive

Loughborough University Institutional Repository

Crossref

The IT University of Copenhagen's Repository

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Knowledge, understanding and the dynamics of medical innovation

Author: Consoli Davide
Ramlogan Ronnie

This paper investigates the processes by which scientific knowledge is created and legitimized. It focuses on scientific developments in a branch of medicine and explores the pathways through which the growth of knowledge enables advances in medical science and in clinical practice. This work draws conceptually on evolutionary approaches to technological change. The empirical part presents a longitudinal analysis of a database of scientific publications in the field of ophthalmology over a period of 50 years. Such an exercise allows us to identify pathways of shared understanding on a disease area, and to map out distinctive trajectories followed by the ophthalmology research community. The paper also contributes to general understanding of the innovation process by supporting the notion that knowledge coordination is a distributed process that cuts across and connects complementary areas of expertise.

Research Papers in Economics

Deep Neuroevolution of Recurrent and Discrete World Models

Author: Risi Sebastian
Stanley Kenneth O.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2019

Neural architectures inspired by our own human cognitive system, such as the recently introduced world models, have been shown to outperform traditional deep reinforcement learning (RL) methods in a variety of different domains. Instead of the relatively simple architectures employed in most RL experiments, world models rely on multiple different neural components that are responsible for visual information processing, memory, and decision-making. However, so far the components of these models have to be trained separately and through a variety of specialized training methods. This paper demonstrates the surprising finding that models with the same precise parts can instead be efficiently trained end-to-end through a genetic algorithm (GA), reaching performance comparable to the original world model by solving a challenging car racing task. An analysis of the evolved visual and memory system indicates that they include a similar effective representation to the system trained through gradient descent. Additionally, in contrast to gradient descent methods that struggle with discrete variables, GAs also work directly with such representations, opening up opportunities for classical planning in latent space. This paper adds further evidence on the effectiveness of deep neuroevolution for tasks that require the intricate orchestration of multiple components in complex heterogeneous architectures.

arXiv.org e-Print Archive

Crossref

The IT University of Copenhagen's Repository