Search CORE

805 research outputs found

Reinforcement Learning via AIXI Approximation

Author: Hutter Marcus
Ng Kee Siong
Silver David
Veness Joel
Publication venue
Publication date: 01/01/2010
Field of study

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the affirmative, by providing the first computationally feasible approximation to the AIXI agent. To develop our approximation, we introduce a Monte Carlo Tree Search algorithm along with an agent-specific extension of the Context Tree Weighting algorithm. Empirically, we present a set of encouraging results on a number of stochastic, unknown, and partially observable domains.Comment: 8 LaTeX pages, 1 figur

arXiv.org e-Print Archive

CiteSeerX

UCL Discovery

The Australian National University

Association for the Advancement of Artificial Intelligence: AAAI Publications

Context tree switching

Author: Bowling Michael
Hutter Marcus
Ng Kee Siong
Veness Joel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

This paper describes the Context Tree Switching technique, a modification of Context Tree Weighting for the prediction of binary, stationary, n-Markov sources. By modifying Context Tree Weighting’s recursive weighting scheme, it is possible to mix over a strictly larger class of models without increasing the asymptotic time or space complexity of the original algorithm. We prove that this generalization preserves the desirable theoretical properties of Context Tree Weighting on stationary n-Markov sources, and show empirically that this new technique leads to consistent improvements over Context Tree Weighting as measured on the Calgary Corpus

The Australian National University

Context Tree Switching

Author: Bowling Michael
Hutter Marcus
Ng Kee Siong
Veness Joel
Publication venue
Publication date: 24/02/2016
Field of study

The Australian National University

Reinforcement Learning via AIXI Approximation

Author: Hutter Marcus
Ng Kee Siong
Silver David
Veness Joel
Publication venue: AAAI Press
Publication date: 24/02/2016
Field of study

The Australian National University

Structural Color 3D Printing By Shrinking Photonic Crystals

Author: Dong Zhaogang
Greer Julia R.
Hall-Chen Valerian H.
Ho Jinfa
Koay Eleen H. H.
Liu Hailong
Liu Yejing
Ng Ray J. H.
Ng Ryan C.
Qiu Cheng-Wei
Wang Hao
Yang Joel K. W.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

The rings, spots and stripes found on some butterflies, Pachyrhynchus weevils, and many chameleons are notable examples of natural organisms employing photonic crystals to produce colorful patterns. Despite advances in nanotechnology, we still lack the ability to print arbitrary colors and shapes in all three dimensions at this microscopic length scale. Commercial nanoscale 3D printers based on two-photon polymerization are incapable of patterning photonic crystal structures with the requisite ~300 nm lattice constant to achieve photonic stopbands/ bandgaps in the visible spectrum and generate colors. Here, we introduce a means to produce 3D-printed photonic crystals with a 5x reduction in lattice constants (periodicity as small as 280 nm), achieving sub-100-nm features with a full range of colors. The reliability of this process enables us to engineer the bandstructures of woodpile photonic crystals that match experiments, showing that observed colors can be attributed to either slow light modes or stopbands. With these lattice structures as 3D color volumetric elements (voxels), we printed 3D microscopic scale objects, including the first multi-color microscopic model of the Eiffel Tower measuring only 39-microns tall with a color pixel size of 1.45 microns. The technology to print 3D structures in color at the microscopic scale promises the direct patterning and integration of spectrally selective devices, such as photonic crystal-based color filters, onto free-form optical elements and curved surfaces

arXiv.org e-Print Archive

Caltech Authors

ScholarBank@NUS

Covariance Matrix Adaptation for the Rapid Illumination of Behavior Space

Author: Alvarez Alberto
Arulkumaran Kai
Bhatt Aditya
Cuccu Giuseppe
Hansen N.
Lehman Joel
Lehman Joel
Ng Andrew Y.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 07/05/2020
Field of study

We focus on the challenge of finding a diverse collection of quality solutions on complex continuous domains. While quality diver-sity (QD) algorithms like Novelty Search with Local Competition (NSLC) and MAP-Elites are designed to generate a diverse range of solutions, these algorithms require a large number of evaluations for exploration of continuous spaces. Meanwhile, variants of the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) are among the best-performing derivative-free optimizers in single-objective continuous domains. This paper proposes a new QD algorithm called Covariance Matrix Adaptation MAP-Elites (CMA-ME). Our new algorithm combines the self-adaptation techniques of CMA-ES with archiving and mapping techniques for maintaining diversity in QD. Results from experiments based on standard continuous optimization benchmarks show that CMA-ME finds better-quality solutions than MAP-Elites; similarly, results on the strategic game Hearthstone show that CMA-ME finds both a higher overall quality and broader diversity of strategies than both CMA-ES and MAP-Elites. Overall, CMA-ME more than doubles the performance of MAP-Elites using standard QD performance metrics. These results suggest that QD algorithms augmented by operators from state-of-the-art optimization algorithms can yield high-performing methods for simultaneously exploring and optimizing continuous search spaces, with significant applications to design, testing, and reinforcement learning among other domains.Comment: Accepted to GECCO 202

arXiv.org e-Print Archive

Crossref

The UN Sustainable Development Goals (SDGs): Contributions from the Humanities

Author: Biriotti Maurice
Christie Edward
Fallstrom Helena
Miller Nicola
Ng Stephanie
Stokes Joel
Publication venue: UCL Institute of Advanced Studies (IAS)
Publication date: 01/01/2022
Field of study

It is widely agreed that achieving the United Nations Sustainable Development Goals requires the insights, knowledge and comparative perspectives of Humanities disciplines. The UNESCO Hangzhou Declaration of 2013 highlighted the importance of culture as ‘an enabler and driver of sustainable development’. Yet the Humanities have so far featured relatively little in work on the SDGs

UCL Discovery

An Empirical Evaluation of Deep Learning on Highway Driving

Author: Andriluka Mykhaylo
Cheng-Yue Royce
Coates Adam
Huval Brody
Kiske Jeff
Migimatsu Toki
Mujica Fernando
Ng Andrew Y.
Pazhayampallil Joel
Rajpurkar Pranav
Song Will
Tandon Sameep
Wang Tao
Publication venue
Publication date: 01/01/2015
Field of study

Numerous groups have applied a variety of deep learning techniques to computer vision problems in highway perception scenarios. In this paper, we presented a number of empirical evaluations of recent deep learning advances. Computer vision, combined with deep learning, has the potential to bring about a relatively inexpensive, robust solution to autonomous driving. To prepare deep learning for industry uptake and practical applications, neural networks will require large data sets that represent all possible driving environments and scenarios. We collect a large data set of highway data and apply deep learning and computer vision algorithms to problems such as car and lane detection. We show how existing convolutional neural networks (CNNs) can be used to perform lane and vehicle detection while running at frame rates required for a real-time system. Our results lend credence to the hypothesis that deep learning holds promise for autonomous driving.Comment: Added a video for lane detectio

arXiv.org e-Print Archive

MPG.PuRe