Search CORE

124 research outputs found

Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception

Author: Anthony Bagnall
Bull L.
Bull L.
Butz M.V.
Butz M.V.
Cassandra A.R.
Gerard P.
Hoffman J.
Holland J.H.
Holmes M.
Hurst J.
Lanzi P.L.
Lanzi P.L.
Lanzi P.L.
Littman M.L.
Littman M.L.
McCallum A.R.
Miyazaki K.
Métivier M.
Nevison C.
O'Hara T.
Pavlov I.P.
Pear J.
Russell S.
Skinner B.F.
Studley M.
Sutton R.S.
Thorndike E.L.
Zatuchna Z.V.
Zatuchna Z.V.
Zatuchna Z.V.
Zhanna V. Zatuchna
Publication venue: 'SAGE Publications'
Publication date: 09/03/2009
Field of study

Learning classifier systems (LCSs) belong to a class of algorithms based on the principle of self-organization and have frequently been applied to the task of solving mazes, an important type of reinforcement learning (RL) problem. Maze problems represent a simplified virtual model of real environments that can be used for developing core algorithms of many real-world applications related to the problem of navigation. However, the best achievements of LCSs in maze problems are still mostly bounded to non-aliasing environments, while LCS complexity seems to obstruct a proper analysis of the reasons of failure. We construct a new LCS agent that has a simpler and more transparent performance mechanism, but that can still solve mazes better than existing algorithms. We use the structure of a predictive LCS model, strip out the evolutionary mechanism, simplify the reinforcement learning procedure and equip the agent with the ability of associative perception, adopted from psychology. To improve our understanding of the nature and structure of maze environments, we analyze mazes used in research for the last two decades, introduce a set of maze complexity characteristics, and develop a set of new maze environments. We then run our new LCS with associative perception through the old and new aliasing mazes, which represent partially observable Markov decision problems (POMDP) and demonstrate that it performs at least as well as, and in some cases better than, other published systems

XCS Classifier System with Experience Replay

Author: Hähner Jörg
Maier Roland
Rosenbauer Lukas
Stein Anthony
Publication venue
Publication date: 01/01/2020
Field of study

XCS constitutes the most deeply investigated classifier system today. It bears strong potentials and comes with inherent capabilities for mastering a variety of different learning tasks. Besides outstanding successes in various classification and regression tasks, XCS also proved very effective in certain multi-step environments from the domain of reinforcement learning. Especially in the latter domain, recent advances have been mainly driven by algorithms which model their policies based on deep neural networks -- among which the Deep-Q-Network (DQN) is a prominent representative. Experience Replay (ER) constitutes one of the crucial factors for the DQN's successes, since it facilitates stabilized training of the neural network-based Q-function approximators. Surprisingly, XCS barely takes advantage of similar mechanisms that leverage stored raw experiences encountered so far. To bridge this gap, this paper investigates the benefits of extending XCS with ER. On the one hand, we demonstrate that for single-step tasks ER bears massive potential for improvements in terms of sample efficiency. On the shady side, however, we reveal that the use of ER might further aggravate well-studied issues not yet solved for XCS when applied to sequential decision problems demanding for long-action-chains

arXiv.org e-Print Archive

OPUS Augsburg

HYBRIDIZED POLYMERIC NANO-ASSEMBLIES: KEY INSIGHTS INTO ADDRESSING MDR INFECTIONS

Author: Landis Ryan
Publication venue: ScholarWorks@UMass Amherst
Publication date: 20/03/2019
Field of study

Multidrug-resistant (MDR) bacteria contribute to more than 700,000 annual deaths world-wide. Millions more suffer from limb amputations or face high healthcare treatment costs where prolonged and costly therapeutic regimens are used to counter MDR infections. While there is an international push to develop novel and more powerful antimicrobials to address the impending threat, one particularly interesting approach that has re-emerged are essential oils, phytochemical extracts derived from plant sources. While their antimicrobial activity demonstrates a promising avenue, their stability in aqueous media, limits their practical use in or on mammals. Inspired by the versatility of polymer nanotechnology and the sustainability of traditional medicine, I employed a hybridization approach to improve the stability and subsequently the antimicrobial activity of phytochemical extracts. This approach was accomplished through a crosslinked Nano-emulsification templating strategy, generating a highly robust and reproducible library of potent oil-in-water Nano-assemblies. These assemblies, stabilized using synthetic or natural polymers, demonstrated long-term shelf life, high stability in serum-containing aqueous environments, and most notably, were demonstrated to penetrate highly refractory biofilm infections, eliminating a broad-spectrum of pathogenic bacteria where accumulated resistance towards these materials were not observed during the course of laboratory experiments. Taken together, the technology presented herein, offers key insight into addressing MDR-associated infections with hopes that future platforms can be built from to tackle the rising dangers of MDR infections

Symbiogenesis in learning classifier systems

Author: Andy Tomlinson
Larry Bull
Lindgren K.
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2001
Field of study

Abstract Symbiosis is the phenomenon in which organisms of different species live together in close association, resulting in a raised level of fitness for one or more of the organisms. Symbiogenesis is the name given to the process by which symbiotic partners combine and unify, that is, become genetically linked, giving rise to new morphologies and physiologies evolutionarily more advanced than their constituents. The importance of this process in the evolution of complexity is now well established. Learning classifier systems are a machine learning technique that uses both evolutionary computing techniques and reinforcement learning to develop a population of cooperative rules to solve a given task. In this article we examine the use of symbiogenesis within the classifier system rule base to improve their performance. Results show that incorporating simple rule linkage does not give any benefits. The concept of (temporal) encapsulation is then added to the symbiotic rules and shown to improve performance in ambiguous/non-Markov environments

CiteSeerX

A Cognitive Architecture Based on a Learning Classifier System with Spiking Classifiers

Author: AG Pipe
AL Hodgkin
AW Churchill
C Fernando
C Fernando
D Floreano
D Rumelhart
David Howard
G Buzsaki
H Hagras
H Shouval
I Rechenberg
J Hurst
J Hurst
JA Boyan
JH Holland
JH Holland
JH Holland
JY Donnart
L Bull
Larry Bull
M Dorigo
O Michel
Pier-Luca Lanzi
R Preen
RD Beer
RS Sutton
RS Sutton
S Fauer
SR Quartz
SW Wilson
W Gerstner
W Maass
W Schultz
WM Kistler
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

© 2015, Springer Science+Business Media New York. Learning classifier systems (LCS) are population-based reinforcement learners that were originally designed to model various cognitive phenomena. This paper presents an explicitly cognitive LCS by using spiking neural networks as classifiers, providing each classifier with a measure of temporal dynamism. We employ a constructivist model of growth of both neurons and synaptic connections, which permits a genetic algorithm to automatically evolve sufficiently-complex neural structures. The spiking classifiers are coupled with a temporally-sensitive reinforcement learning algorithm, which allows the system to perform temporal state decomposition by appropriately rewarding “macro-actions”, created by chaining together multiple atomic actions. The combination of temporal reinforcement learning and neural information processing is shown to outperform benchmark neural classifier systems, and successfully solve a robotic navigation task

Archivio istituzionale della ricerca - Politecnico di Milano

Model-free reconstruction of neuronal network connectivity from calcium imaging signals

Author: Demian Battaglia
John Beggs
Jordi Soriano
Olav Stetter
Theo Geisel
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 28/05/2012
Field of study

A systematic assessment of global neural network connectivity through direct electrophysiological assays has remained technically unfeasible even in dissociated neuronal cultures. We introduce an improved algorithmic approach based on Transfer Entropy to reconstruct approximations to network structural connectivities from network activity monitored through calcium fluorescence imaging. Based on information theory, our method requires no prior assumptions on the statistics of neuronal firing and neuronal connections. The performance of our algorithm is benchmarked on surrogate time-series of calcium fluorescence generated by the simulated dynamics of a network with known ground-truth topology. We find that the effective network topology revealed by Transfer Entropy depends qualitatively on the time-dependent dynamic state of the network (e.g., bursting or non-bursting). We thus demonstrate how conditioning with respect to the global mean activity improves the performance of our method. [...] Compared to other reconstruction strategies such as cross-correlation or Granger Causality methods, our method based on improved Transfer Entropy is remarkably more accurate. In particular, it provides a good reconstruction of the network clustering coefficient, allowing to discriminate between weakly or strongly clustered topologies, whereas on the other hand an approach based on cross-correlations would invariantly detect artificially high levels of clustering. Finally, we present the applicability of our method to real recordings of in vitro cortical cultures. We demonstrate that these networks are characterized by an elevated level of clustering compared to a random graph (although not extreme) and by a markedly non-local connectivity.Comment: 54 pages, 8 figures (+9 supplementary figures), 1 table; submitted for publicatio

arXiv.org e-Print Archive

Directory of Open Access Journals

FigShare