Search CORE

327 research outputs found

Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception

Author: Anthony Bagnall
Bull L.
Bull L.
Butz M.V.
Butz M.V.
Cassandra A.R.
Gerard P.
Hoffman J.
Holland J.H.
Holmes M.
Hurst J.
Lanzi P.L.
Lanzi P.L.
Lanzi P.L.
Littman M.L.
Littman M.L.
McCallum A.R.
Miyazaki K.
Métivier M.
Nevison C.
O'Hara T.
Pavlov I.P.
Pear J.
Russell S.
Skinner B.F.
Studley M.
Sutton R.S.
Thorndike E.L.
Zatuchna Z.V.
Zatuchna Z.V.
Zatuchna Z.V.
Zhanna V. Zatuchna
Publication venue: 'SAGE Publications'
Publication date: 09/03/2009
Field of study

Learning classifier systems (LCSs) belong to a class of algorithms based on the principle of self-organization and have frequently been applied to the task of solving mazes, an important type of reinforcement learning (RL) problem. Maze problems represent a simplified virtual model of real environments that can be used for developing core algorithms of many real-world applications related to the problem of navigation. However, the best achievements of LCSs in maze problems are still mostly bounded to non-aliasing environments, while LCS complexity seems to obstruct a proper analysis of the reasons of failure. We construct a new LCS agent that has a simpler and more transparent performance mechanism, but that can still solve mazes better than existing algorithms. We use the structure of a predictive LCS model, strip out the evolutionary mechanism, simplify the reinforcement learning procedure and equip the agent with the ability of associative perception, adopted from psychology. To improve our understanding of the nature and structure of maze environments, we analyze mazes used in research for the last two decades, introduce a set of maze complexity characteristics, and develop a set of new maze environments. We then run our new LCS with associative perception through the old and new aliasing mazes, which represent partially observable Markov decision problems (POMDP) and demonstrate that it performs at least as well as, and in some cases better than, other published systems

Crossref

University of East Anglia digital repository

A fuzzy-XCS classifier system with linguistic hedges

Author: Marin-Blázquez Javier
Shen Qiang
Publication venue
Publication date: 01/01/2008
Field of study

Aberystwyth Research Portal

MILCS: A mutual information learning classifier system

Author: Jiang MK
Smith RE
Publication venue: UCL (University College London)
Publication date: 27/08/2007
Field of study

This paper introduces a new variety of learning classifier system (LCS), called MILCS, which utilizes mutual information as fitness feedback. Unlike most LCSs, MILCS is specifically designed for supervised learning. MILCS's design draws on an analogy to the structural learning approach of cascade correlation networks. We present preliminary results, and contrast them to results from XCS. We discuss the explanatory power of the resulting rule sets, and introduce a new technique for visualizing explanatory power. Final comments include future directions for this research, including investigations in neural networks and other systems. Copyright 2007 ACM

UCL Discovery

A Census of the High-Density Molecular Gas in M82

Author: B. J. Naylor
Bayet
Bayet
Bradford
Bradford
Bradford
C. M. Bradford
Christopher
Colbert
Dame
Earle
Falgarone
Fuente
Förster Schreiber
Gao
Gao
Glenn
H. Inami
H. Matsuhara
H. T. Nguyen
Hailey-Dunsheath
Henkel
Hennebelle
Herrmann
Huettemeister
Hughes
Hunter
J. E. Aguirre
J. Glenn
J. J. Bock
J. Kamenetzky
J. Zmuidzinas
Kaufman
Knudsen
Koester
L. Earle
Larson
Le Floc'h
Leeuw
Mac Low
Mao
Martín
Mauersberger
Mauersberger
Mühle
Nakai
Nguyen-Q-Rieu
P. R. Maloney
Pan
Papovich
Pérez-González
Sakai
Sanders
Satyapal
Schmidt-Burgk
Seaquist
Seaquist
Strickland
Thuma
Ward
Ward
Wild
Wilson
Wu
Yao
Yao
Publication venue: 'IOP Publishing'
Publication date: 26/08/2010
Field of study

We present a three-pointing study of the molecular gas in the starburst nucleus of M82 based on 190 - 307 GHz spectra obtained with Z-Spec at the Caltech Submillimeter Observatory. We present intensity measurements, detections and upper limits, for 20 transitions, including several new detections of CS, HNC, C2H, H2CO, and CH3CCH lines. We combine our measurements with previously-published measurements at other frequencies for HCN, HNC, CS, C34S, and HCO+ in a multi-species likelihood analysis constraining gas mass, density and temperature, and the species' relative abundances. We find some 1.7 - 2.7 x 10^8 M_sun of gas with n_H2 between 1 - 6 x 10^4 cm^-3 and T > 50 K. While the mass and temperature are comparable to values inferred from mid-J CO transitions, the thermal pressure is a factor of 10 - 20 greater. The molecular interstellar medium is largely fragmented and is subject to ultraviolet irradiation from the star clusters. It is also likely subject to cosmic rays and mechanical energy input from the supernovae, and is warmer on average than the molecular gas in the massive star formation regions in the Milky Way. The typical conditions in the dense gas in M82's central kpc appear unfavorable for further star formation; if any appreciable stellar populations are currently forming, they are likely biased against low mass stars, producing a top-heavy initial mass function.Comment: 15 pages (using emulateapj.cls), 6 figures, Astrophysical Journal, in pres

arXiv.org e-Print Archive

Crossref

A brief history of learning classifier systems: from CS-1 to XCS and its variants

Author: A Fernández
A Fraser
A Orriols-Puig
A Tomlinson
AL Samuel
AL Samuel
B Farley
C Fernando
C Shannon
C Stone
D Cliff
E Bernado Mansilla
G Box
H Dam
J Casillas
J Greensmith
J Hoffmann
J Seward
J Timmis
JD Farmer
JH Holland
JH Holland
JH Holland
L Booker
L Bull
L Bull
L Bull
L Castro De
Larry Bull
M Iqbal
M Iqbal
M Studley
MV Butz
MV Butz
MV Butz
MV Butz
MV Butz
MV Butz
MV Butz
N Coufal
P Frey
P Stalph
P Stalph
P-L Lanzi
P-L Lanzi
P-L Lanzi
R Preen
R Smith
R Smith
R Sutton
R Tibshirani
R Urbanowicz
S Becker
S Vijayakumar
SW Wilson
SW Wilson
SW Wilson
W Schultz
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/09/2015
Field of study

© 2015, Springer-Verlag Berlin Heidelberg. The direction set by Wilson’s XCS is that modern Learning Classifier Systems can be characterized by their use of rule accuracy as the utility metric for the search algorithm(s) discovering useful rules. Such searching typically takes place within the restricted space of co-active rules for efficiency. This paper gives an overview of the evolution of Learning Classifier Systems up to XCS, and then of some of the subsequent developments of Wilson’s algorithm to different types of learning

Crossref

UWE Bristol Research Repository

Optimality-based Analysis of XCSF Compaction in Discrete Reinforcement Learning

Author: E Bernadó-Mansilla
F Kharbat
MV Butz
MV Butz
MV Butz
MV Butz
PW Dixon
RJ Urbanowicz
RJ Urbanowicz
RS Sutton
SW Wilson
SW Wilson
SW Wilson
SW Wilson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/09/2020
Field of study

Learning classifier systems (LCSs) are population-based predictive systems that were originally envisioned as agents to act in reinforcement learning (RL) environments. These systems can suffer from population bloat and so are amenable to compaction techniques that try to strike a balance between population size and performance. A well-studied LCS architecture is XCSF, which in the RL setting acts as a Q-function approximator. We apply XCSF to a deterministic and stochastic variant of the FrozenLake8x8 environment from OpenAI Gym, with its performance compared in terms of function approximation error and policy accuracy to the optimal Q-functions and policies produced by solving the environments via dynamic programming. We then introduce a novel compaction algorithm (Greedy Niche Mass Compaction - GNMC) and study its operation on XCSF's trained populations. Results show that given a suitable parametrisation, GNMC preserves or even slightly improves function approximation error while yielding a significant reduction in population size. Reasonable preservation of policy accuracy also occurs, and we link this metric to the commonly used steps-to-goal metric in maze-like environments, illustrating how the metrics are complementary rather than competitive

arXiv.org e-Print Archive

Crossref

A time series classifier

Author: Gore Christopher Mark
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2008
Field of study

A time series is a sequence of data measured at successive time intervals. Time series analysis refers to all of the methods employed to understand such data, either with the purpose of explaining the underlying system producing the data or to try to predict future data points in the time series...An evolutionary algorithm is a non-deterministic method of searching a solution space, and modeled after biological evolutionary processes. A learning classifier system (LCS) is a form of evolutionary algorithm that operates on a population of mapping rules. We introduce the time series classifier TSC, a new type of LCS that allows for the modeling and prediction of time series data, derived from Wilson\u27s XCSR, an LCS designed for use with real-valued inputs. Our method works by modifying the makeup of the rules in the LCS so that they are suitable for use on a time series...We tested TSC on real-world historical stock data --Abstract, page iii

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Toward Open-Set Text-Independent Speaker Identification in Tactical Communications

Author: Jae C Oh
Matt B Wolf
Misty K Blowers
Wonkyung Park
Publication venue
Publication date: 11/04/2020
Field of study

Abstract-We present the design and implementation of an open-set textindependent speaker identification system using genetic Learning Classifier Systems (LCS). We examine the use of this system in a real-number problem domain, where there is strong interest in its application to tactical communications. We investigate different encoding methods for representing real-number knowledge and study the efficacy of each method for speaker identification. We also identify several difficulties in solving the speaker identification problems with LCS and introduce new approaches to resolve the difficulties. Experimental results show that our system successfully learns 200 voice features at accuracies of 90% to 100% and 15,000 features to more than 80% for the closed-set problem, which is considered a strong result in the speaker identification community. The open-set capability is also comparable to existing numeric-based methods

CiteSeerX

学習戦略に基づく学習分類子システムの設計

Author: Masaya Nakata
中田雅也
Publication venue
Publication date: 02/09/2016
Field of study

On Learning Classifier Systems dubbed LCSs a leaning strategy which defines how LCSs cover a state-action space in a problem can be one of the most fundamental options in designing LCSs. There lacks an intensive study of the learning strategy to understand whether and how the learning strategy affects the performance of LCSs. This lack has resulted in the current design methodology of LCS which does not carefully consider the types of learning strategy. The thesis clarifies a need of a design methodology of LCS based on the learning strategy. That is, the thesis shows the learning strategy can be an option that determines the potential performance of LCSs and then claims that LCSs should be designed on the basis of the learning strategy in order to improve the performance of LCSs. First, the thesis empirically claims that the current design methodology of LCS, without the consideration of learning strategy, can be limited to design a proper LCS to solve a problem. This supports the need of design methodology based on the learning strategy. Next, the thesis presents an example of how LCS can be designed on the basis of the learning strategy. The thesis empirically show an adequate learning strategy improving the performance of LCS can be decided depending on a type of problem difficulties such as missing attributes. Then, the thesis draws an inclusive guideline that explains which learning strategy should be used to address which types of problem difficulties. Finally, the thesis further shows, on an application of LCS for a human daily activity recognition problem, the adequate learning strategy according to the guideline effectively improves the performance of the application. The thesis concludes that the learning strategy is the option of the LCS design which determines the potential performance of LCSs. Thus, before designing any type of LCSs including their applications, the learning strategy should be adequately selected at first, because their performance degrades when they employ an inadequate learning strategy to a problem they want to solve. In other words, LCSs should be designed on the basis of the adequate learning strategy.電気通信大学201

Creative Repository of Electro-Communications