10,704 research outputs found

Practical Open-Loop Optimistic Planning

Author: D Silver
J-F Hren
L Buşoniu
O Cappé
R Bellman
R Coulom
Publication date: 09/04/2019

We consider the problem of online planning in a Markov Decision Process when given only access to a generative model, restricted to open-loop policies - i.e. sequences of actions - and under a budget constraint. In this setting, the Open-Loop Optimistic Planning (OLOP) algorithm enjoys good theoretical guarantees but is overly conservative in practice, as we show in numerical experiments. We propose a modified version of the algorithm with tighter upper-confidence bounds, KLOLOP, that leads to better practical performance while retaining the sample complexity bound. Finally, we propose an efficient implementation that significantly improves the time complexity of both algorithms.
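The abstract does not spell out the form of the tighter bounds. As a rough illustration only, the sketch below compares a Hoeffding-style upper-confidence bound with a Kullback-Leibler bound for rewards in [0, 1], in the style of the standard KL-UCB index. The function names, the `threshold` parameter, and the bisection depth are assumptions made for this example, not details taken from the paper.

```python
import math


def kl_bernoulli(p, q):
    """KL divergence between Bernoulli(p) and Bernoulli(q), clamped away from 0 and 1."""
    eps = 1e-12
    p = min(max(p, eps), 1 - eps)
    q = min(max(q, eps), 1 - eps)
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))


def kl_upper_bound(mean, count, threshold, iterations=50):
    """Largest q in [mean, 1] such that count * KL(mean, q) <= threshold (by bisection)."""
    low, high = mean, 1.0
    for _ in range(iterations):
        mid = (low + high) / 2
        if count * kl_bernoulli(mean, mid) <= threshold:
            low = mid   # mid is still plausible, search higher
        else:
            high = mid  # mid is too optimistic, search lower
    return low


def hoeffding_upper_bound(mean, count, threshold):
    """Hoeffding-style bound, clipped to the reward range [0, 1]."""
    return min(1.0, mean + math.sqrt(threshold / (2 * count)))


if __name__ == "__main__":
    threshold = math.log(100)  # hypothetical exploration budget term
    for mean in (0.1, 0.5, 0.9):
        print(mean,
              hoeffding_upper_bound(mean, 20, threshold),
              kl_upper_bound(mean, 20, threshold))
```

By Pinsker's inequality the KL bound never exceeds the unclipped Hoeffding bound, and near the ends of the reward range it stays strictly below 1 where the Hoeffding bound saturates.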

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Spaceflight performance of several types of silicon solar cells on the LIPS 3 satellite

Author: Silver J.
Warfield D.

Results from exposure of several types of Solarex silicon cells to a space environment for nearly two years on the LIPS 3 satellite are presented. Experiments include standard thickness (10 mil) cells with and without back surface fields, and ultrathin (2 mil) cells also with and without back surface fields. A comparison between a widely used coverslide adhesive, DC 93-500, and a potential alternate is also presented. The major findings from the data are that the 2 mil cells without a back surface field show the smallest normalized short circuit current degradation and that the 10 mil back surface field cells show the greatest absolute power output for the radiation exposures and temperatures encountered. The new encapsulant (McGhan Nusil CV-2500) exhibits a degradation comparable to DC 93-500. Each of the cell types in this experiment is compared with expectations based on JPL Radiation Handbook data.

NASA Technical Reports Server

Cathodoluminescence of nanocrystalline Y2O3:Eu3+ with various Eu3+ concentrations

Author: den Engelsen D
Harris P
Ireland T
Silver J
Publication venue: 'The Electrochemical Society'
Publication date: 03/11/2014

© The Author(s) 2014. Published by ECS. This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 License (CC BY, http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse of the work in any medium, provided the original work is properly cited. This article has been made available through the Brunel Open Access Publishing Fund. Herein a study on the preparation and cathodoluminescence of monosized spherical nanoparticles of Y2O3:Eu3+ having a Eu3+ concentration that varies between 0.01 and 10% is described. The luminous efficiency and decay time have been determined at a low current density, whereas cathodoluminescence-microscopy has been carried out at high current density; the latter led to substantial saturation of certain spectral transitions. A novel theory is presented to evaluate the critical distance for energy transfer from Eu3+ ions in S6 to Eu3+ ions in C2 sites. It was found that Y2O3:Eu3+ with 1–2% Eu3+ has the highest luminous efficiency of 16 lm/W at 15 keV electron energy. Decay times of the emission from 5D0 (C2) and 5D1 (C2) and 5D0 (S6) levels were determined. The difference in decay time from the 5D0 (C2) and 5D1 (C2) levels largely explained the observed phenomena in the cathodoluminescence-micrographs recorded with our field emission scanning electron microscope.

Crossref

Brunel University Research Archive

Cathodoluminescence of Double Layers of Phosphor Particles

Author: den Engelsen D
Harris P
Ireland T
Silver J
Publication venue: 'The Electrochemical Society'
Publication date: 01/01/2014

This article has been made available through the Brunel Open Access Publishing Fund. We present radiance measurements of particle layers of ZnO:Zn, Y2O3:Eu and Y2O2S:Eu bombarded with electrons at anode voltages between 1 and 15 kV. The layers described in this work refer to single component layers, double layers and two component mixtures. The phosphor layers are deposited on ITO-coated glass slides by settling; the efficiency of the cathodoluminescence is determined by summing the radiances and luminances in the reflected and transmitted modes respectively. The efficiency of a double layer of Y2O3:Eu on top of ZnO:Zn at high electron energy is significantly larger than the efficiency of a corresponding layer in which the two components are mixed. This result is interpreted in terms of the penetration-model, which predicts a larger efficiency for a high-voltage phosphor on top of a low-voltage phosphor. When a layer of the low-voltage phosphor ZnO:Zn is on top of the high-voltage phosphor Y2O3:Eu, we also observe a higher efficiency than that of the corresponding layer with both components mixed. In this case the efficiency increases due to suppression of charging in the Y2O3:Eu layer. Double layers of ZnO:Zn and Y2O2S:Eu did not show enhanced efficiency, because the size of the Y2O2S:Eu particles was too large to evoke the penetration effect. © The Author(s) 2014. Published by ECS.

Crossref

Brunel University Research Archive

Hi-Val: Iterative Learning of Hierarchical Value Functions for Policy Generation

Author: D Silver
G Chowdhary
G Konidaris
J Hostetler
Levente Kocsis
M Jun
P Auer
RS Sutton
TG Dietterich
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019

Task decomposition is effective in manifold applications where the global complexity of a problem makes planning and decision-making too demanding. This is true, for example, in high-dimensional robotics domains, where (1) unpredictabilities and modeling limitations typically prevent the manual specification of robust behaviors, and (2) learning an action policy is challenging due to the curse of dimensionality. In this work, we borrow the concept of Hierarchical Task Networks (HTNs) to decompose the learning procedure, and we exploit Upper Confidence Tree (UCT) search to introduce HOP, a novel iterative algorithm for hierarchical optimistic planning with learned value functions. To obtain better generalization and generate policies, HOP simultaneously learns and uses action values. These are used to formalize constraints within the search space and to reduce the dimensionality of the problem. We evaluate our algorithm both on a fetching task using a simulated 7-DOF KUKA lightweight arm and on a pick-and-delivery task with a Pioneer robot.

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Optimization of design of space experiments from the standpoint of data processing Semiannual report, 1 Oct. 1967 - 31 Mar. 1968

Author: Algazi V. R.
Sakrison D. J.
Silver S.

Design and construction work on a spacecraft array processor for onboard processing of experimental data

NASA Technical Reports Server

Quantum Phase Transitions in Bosonic Heteronuclear Pairing Hamiltonians

Author: Bhaseen M. J.
Hohenadler M.
Silver A. O.
Simons B. D.
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2010

We explore the phase diagram of two-component bosons with Feshbach resonant pairing interactions in an optical lattice. It has been shown in previous work to exhibit a rich variety of phases and phase transitions, including a paradigmatic Ising quantum phase transition within the second Mott lobe. We discuss the evolution of the phase diagram with system parameters and relate this to the predictions of Landau theory. We extend our exact diagonalization studies of the one-dimensional bosonic Hamiltonian and confirm additional Ising critical exponents for the longitudinal and transverse magnetic susceptibilities within the second Mott lobe. The numerical results for the ground state energy and transverse magnetization are in good agreement with exact solutions of the Ising model in the thermodynamic limit. We also provide details of the low-energy spectrum, as well as density fluctuations and superfluid fractions in the grand canonical ensemble. Comment: 11 pages, 14 figures. To appear in Phys. Rev.

arXiv.org e-Print Archive

King's Research Portal

Cathodoluminescence studies of phosphors in a scanning electron microscope

Author: den Engelsen D
Fern G
Harris P
Ireland T
Silver J
Publication venue: 'IOP Publishing'
Publication date: 17/06/2015

Cathodoluminescence studies are reported of phosphors in a field emission scanning electron microscope (FESEM). A number of phosphor materials have been studied and exhibited a pronounced comet-like structure at high scan rates, because the particle continued to emit light after the beam had moved on to subsequent pixels. Image analysis has been used to study the loss of brightness along the tail and hence to determine the decay time of the materials. This technique provides a simple and convenient way to study the decay times of individual particles.
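The abstract does not give the analysis formula. A minimal sketch of how a decay time could be extracted from the tail brightness follows. It assumes a single-exponential decay and a known, constant pixel dwell time. Both are assumptions of this example, and the function name and sample values are hypothetical.

```python
import math


def decay_time_from_tail(brightness, dwell_time):
    """Estimate the decay time tau from brightness samples along the tail.

    Assumes I(t) = I0 * exp(-t / tau), with sample k taken at t = k * dwell_time.
    A least-squares line through log(brightness) versus time has slope -1 / tau.
    """
    times = [k * dwell_time for k in range(len(brightness))]
    logs = [math.log(value) for value in brightness]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    covariance = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    variance = sum((t - mean_t) ** 2 for t in times)
    slope = covariance / variance
    return -1.0 / slope


if __name__ == "__main__":
    # Synthetic tail: true decay time 1 ms, dwell time 0.1 ms per pixel.
    tail = [100.0 * math.exp(-k * 1e-4 / 1e-3) for k in range(20)]
    print(decay_time_from_tail(tail, 1e-4))
```

Real tails would need background subtraction and a cut-off before the brightness reaches the noise floor, since the logarithm amplifies noise in dim pixels.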

Brunel University Research Archive

Assessing the Potential of Classical Q-learning in General Game Playing

Author: CB Browne
CJCH Watkins
CP Robert
D Silver
H Wang
J Hu
J Méhat
M Genesereth
M Świechowski
RS Sutton
V Mnih
Publication date: 14/10/2018

After the recent groundbreaking results of AlphaGo and AlphaZero, we have seen strong interests in deep reinforcement learning and artificial general intelligence (AGI) in game playing. However, deep learning is resource-intensive and the theory is not yet well developed. For small games, simple classical table-based Q-learning might still be the algorithm of choice. General Game Playing (GGP) provides a good testbed for reinforcement learning to research AGI. Q-learning is one of the canonical reinforcement learning methods, and has been used by (Banerjee & Stone, IJCAI 2007) in GGP. In this paper we implement Q-learning in GGP for three small-board games (Tic-Tac-Toe, Connect Four, Hex) (source code: https://github.com/wh1992v/ggp-rl), to allow comparison to Banerjee et al. We find that Q-learning converges to a high win rate in GGP. For the ε-greedy strategy, we propose a first enhancement, the dynamic ε algorithm. In addition, inspired by (Gelly & Silver, ICML 2007) we combine online search (Monte Carlo Search) to enhance offline learning, and propose QM-learning for GGP. Both enhancements improve the performance of classical Q-learning. In this work, GGP allows us to show, if augmented by appropriate enhancements, that classical table-based Q-learning can perform well in small games. Comment: arXiv admin note: substantial text overlap with arXiv:1802.0594
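As a rough illustration of the table-based approach the abstract refers to, the sketch below shows a textbook tabular Q-learning update with ε-greedy action selection. The abstract does not define the dynamic ε algorithm, so the linear decay in `decayed_epsilon` is only a hypothetical stand-in. All names and default parameters here are assumptions of this example.

```python
import random
from collections import defaultdict


def decayed_epsilon(episode, total_episodes, start=1.0, end=0.05):
    """Hypothetical schedule: move linearly from `start` to `end` over training."""
    fraction = min(episode / max(total_episodes, 1), 1.0)
    return start + (end - start) * fraction


def epsilon_greedy(q_table, state, actions, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda action: q_table[(state, action)])


def q_update(q_table, state, action, reward, next_state, next_actions,
             alpha=0.1, gamma=0.9):
    """Standard Q-learning update; terminal states pass an empty next_actions."""
    best_next = max((q_table[(next_state, a)] for a in next_actions), default=0.0)
    target = reward + gamma * best_next
    q_table[(state, action)] += alpha * (target - q_table[(state, action)])


if __name__ == "__main__":
    q_table = defaultdict(float)  # unseen (state, action) pairs start at 0
    q_update(q_table, "s0", 0, 1.0, "terminal", [])
    epsilon = decayed_epsilon(episode=50, total_episodes=100)
    print(q_table[("s0", 0)], epsilon, epsilon_greedy(q_table, "s0", [0, 1], epsilon))
```

A large ε early in training favours exploration of the game tree; shrinking it later lets the learned table drive play.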

arXiv.org e-Print Archive

Crossref

Leiden University Scholarly Publications

Feshbach Resonance in Optical Lattices and the Quantum Ising Model

Author: Bhaseen M. J.
Hohenadler M.
Silver A. O.
Simons B. D.
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2009

Motivated by experiments on heteronuclear Feshbach resonances in Bose mixtures, we investigate s-wave pairing of two species of bosons in an optical lattice. The zero temperature phase diagram supports a rich array of superfluid and Mott phases and a network of quantum critical points. This topology reveals an underlying structure that is succinctly captured by a two-component Landau theory. Within the second Mott lobe we establish a quantum phase transition described by the paradigmatic longitudinal and transverse field Ising model. This is confirmed by exact diagonalization of the 1D bosonic Hamiltonian. We also find this transition in the homonuclear case. Comment: 5 pages, 4 figures

arXiv.org e-Print Archive

Crossref

King's Research Portal