Search CORE

6 research outputs found

Evolving Static Representations for Task Transfer

Author: Stanley Kenneth O.
Verbancsics Phillip
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 01/01/2010
Field of study

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Effective Task Transfer Through Indirect Encoding

Author: Verbancsics Phillip
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 01/01/2011
Field of study

An important goal for machine learning is to transfer knowledge between tasks. For example, learning to play RoboCup Keepaway should contribute to learning the full game of RoboCup soccer. Often approaches to task transfer focus on transforming the original representation to fit the new task. Such representational transformations are necessary because the target task often requires new state information that was not included in the original representation. In RoboCup Keepaway, changing from the 3 vs. 2 variant of the task to 4 vs. 3 adds state information for each of the new players. In contrast, this dissertation explores the idea that transfer is most effective if the representation is designed to be the same even across different tasks. To this end, (1) the bird’s eye view (BEV) representation is introduced, which can represent different tasks on the same two-dimensional map. Because the BEV represents state information associated with positions instead of objects, it can be scaled to more objects without manipulation. In this way, both the 3 vs. 2 and 4 vs. 3 Keepaway tasks can be represented on the same BEV, which is (2) demonstrated in this dissertation. Yet a challenge for such representation is that a raw two-dimensional map is highdimensional and unstructured. This dissertation demonstrates how this problem is addressed naturally by the Hypercube-based NeuroEvolution of Augmenting Topologies (HyperNEAT) approach. HyperNEAT evolves an indirect encoding, which compresses the representation by exploiting its geometry. The dissertation then explores further exploiting the power of such encoding, beginning by (3) enhancing the configuration of the BEV with a focus on iii modularity. The need for further nonlinearity is then (4) investigated through the addition of hidden nodes. Furthermore, (5) the size of the BEV can be manipulated because it is indirectly encoded. Thus the resolution of the BEV, which is dictated by its size, is increased in precision and culminates in a HyperNEAT extension that is expressed at effectively infinite resolution. Additionally, scaling to higher resolutions through gradually increasing the size of the BEV is explored. Finally, (6) the ambitious problem of scaling from the Keepaway task to the Half-field Offense task is investigated with the BEV. Overall, this dissertation demonstrates that advanced representations in conjunction with indirect encoding can contribute to scaling learning techniques to more challenging tasks, such as the Half-field Offense RoboCup soccer domain

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning

Author: Stone P.
Taylor M.E.
Whiteson S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

International Migration, Integration and Social Cohesion online publications

Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning

Author: A. G. Barto
A. Moore
C. Watkins
D. E. Goldberg
D. E. Moriarty
D. E. Moriarty
D. Whitley
G. Tesauro
G. Tesauro
I. Menache
I. Noda
I. Szita
J. Baxter
J. Pollack
J. S. Albus
K. O. Stanley
K. O. Stanley
M. A. Potter
M. G. Lagoudakis
M. Kearns
M. Powell
Matthew E. Taylor
N. J. Radcliffe
N. Kohl
N. Saravanan
P. Stagge
P. Stone
P. Stone
Peter Stone
R. E. Bellman
R. E. Bellman
R. H. Crites
R. I. Brafman
R. S. Sutton
R. S. Sutton
S. Whiteson
S. Whiteson
Shimon Whiteson
T. P. Runarsson
X. Yao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Learning and Using Multimodal Stochastic Models : A Unified Approach

Author: Edgington Mark
Publication venue
Publication date: 01/01/2016
Field of study

This dissertation presents a principled approach to representing and using instance-based knowledge. Perceptions and actions are probabilistically modelled in a unified structure which allows for simultaneous perception modelling and reasoning about desired actions. In particular, a new method for online instance-based learning of such models is presented and analyzed. This method, called Dynamic Gaussian Mixture Estimation (DGME), adapts a model's complexity to the process being modelled. The models produced by DGME are evaluated on several classification, prediction, and control applications, and its characteristics are compared with other state-of-the-art methods. In the context of control applications, an additional novel method, Gaussian Mixture Control (GMC), is introduced for precisely controlling systems that exhibit multimodality

E-LIB Dokumentserver - Staats und Universitätsbibliothek Bremen

Evolving developmental, recurrent and convolutional neural networks for deliberate motion planning in sparse reward tasks

Author: Jolley Benjamin Paul
Publication venue
Publication date: 01/06/2020
Field of study

Motion planning algorithms have seen a diverse set of approaches in a variety of disciplines. In the domain of artificial evolutionary systems, motion planning has been included in models to achieve sophisticated deliberate behaviours. These algorithms rely on fixed rules or little evolutionary influence which compels behaviours to conform within those specific policies, rather than allowing the model to establish its own specialised behaviour. In order to further these models, the constraints imposed by planning algorithms must be removed to grant greater evolutionary control over behaviours. That is the focus of this thesis. An examination of prevailing neuroevolution methods led to the use of two distinct approaches, NEAT and HyperNEAT. Both were used to gain an understanding of the components necessary to create neuroevolution planning. The findings accumulated in the formation of a novel convolutional neural network architecture with a recurrent convolution process. The architecture’s goal was to iteratively disperse local activations to greater regions of the feature space. Experimentation showed significantly improved robustness over contemporary neuroevolution techniques as well as an efficiency increase over a static rule set. Greater evolutionary responsibility is given to the model with multiple network combinations; all of which continually demonstrated the necessary behaviours. In comparison, these behaviours were shown to be difficult to achieve in a state-of-the-art deep convolutional network. Finally, the unique use of recurrent convolution is relocated to a larger convolutional architecture on an established benchmarking platform. Performance improvements are seen on a number of domains which illustrates that this recurrent mechanism can be exploited in alternative areas outside of planning. By presenting a viable neuroevolution method for motion planning a potential emerges for further systems to adopt and examine the capability of this work in prospective domains, as well as further avenues of experimentation in convolutional architectures

Keele Research Repository