10,704 research outputs found

    Practical Open-Loop Optimistic Planning

    Get PDF
    We consider the problem of online planning in a Markov Decision Process when given only access to a generative model, restricted to open-loop policies - i.e. sequences of actions - and under budget constraint. In this setting, the Open-Loop Optimistic Planning (OLOP) algorithm enjoys good theoretical guarantees but is overly conservative in practice, as we show in numerical experiments. We propose a modified version of the algorithm with tighter upper-confidence bounds, KLOLOP, that leads to better practical performances while retaining the sample complexity bound. Finally, we propose an efficient implementation that significantly improves the time complexity of both algorithms

    Spaceflight performance of several types of silicon solar cells on the LIPS 3 satellite

    Get PDF
    Results from exposure of several types of Solarex silicon cells to a space environment for nearly two years on the LIPS 3 satellite are presented. Experiments include standard thickness (10 mil) cells with and without back surface fields, and ultrathin (2 mil) cells also with and without back surface fields. A comparison between a widely used coverslide adhesive, DC 93-500 and a potential alternate is also presented. The major findings from the data are that the 2 mil cells without a back surface field show the smallest normalized short circuit current degradation and that the 10 mil back surface field cells show the greatest absolute power output for the radiation exposures and temperatures encountered. The new encapsulant (McGhan Nusil CV-2500) exhibits a degradation comparable to DC 93-500. A comparison is made with each of the cell types in this experiment with expectations based on JPL Radiation Handbook data

    Cathodoluminescence of nanocrystalline Y2O3:Eu3+ with various Eu3+ concentrations

    Get PDF
    © The Author(s) 2014. Published by ECS. This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 License (CC BY, http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse of the work in any medium, provided the original work is properly cited.This article has been made available through the Brunel Open Access Publishing Fund.Herein a study on the preparation and cathodoluminescence of monosized spherical nanoparticles of Y2O3:Eu3+ having a Eu3+ concentration that varies between 0.01 and 10% is described. The luminous efficiency and decay time have been determined at low a current density, whereas cathodoluminescence-microscopy has been carried out at high current density, the latter led to substantial saturation of certain spectral transitions. A novel theory is presented to evaluate the critical distance for energy transfer from Eu3+ ions in S6 to Eu3+ ions in C2 sites. It was found that Y2O3:Eu3+ with 1–2% Eu3+ has the highest luminous efficiency of 16lm/w at 15keV electron energy. Decay times of the emission from 5D0 (C2) and 5D1 (C2) and 5D0 (S6) levels were determined. The difference in decay time from the 5D0 (C2) and 5D1 (C2) levels largely explained the observed phenomena in the cathodoluminescence-micrographs recorded with our field emission scanning electron microscope

    Cathodoluminescence of Double Layers of Phosphor Particles

    Get PDF
    This article has been made available through the Brunel Open Access Publishing Fund.We present radiance measurements of particle layers of ZnO:Zn, Y2O3:Eu and Y2O2S:Eu bombarded with electrons at anode voltages between 1 and 15 kV. The layers described in this work refer to single component layers, double layers and two component mixtures. The phosphor layers are deposited on ITO-coated glass slides by settling; the efficiency of the cathodoluminescence is determined by summing the radiances and luminances in the reflected and transmitted modes respectively. The efficiency of a double layer of Y2O3:Eu on top of ZnO:Zn at high electron energy is significantly larger than the efficiency of a corresponding layer in which the two components are mixed. This result is interpreted in terms of the penetration-model, which predicts a larger efficiency for a high-voltage phosphor on top of a low-voltage phosphor. When a layer of the low-voltage phosphor ZnO:Zn is on top of the high-voltage phosphor Y2O3:Eu, we also observe a higher efficiency than that of the corresponding layer with both components mixed. In this case the efficiency increases due to suppression of charging in the Y2O3:Eu layer. Double layers of ZnO:Zn and Y2O2S:Eu did not show enhanced efficiency, because the size of the Y2O2S:Eu particles was too large to evoke the penetration effect. © The Author(s) 2014. Published by ECS

    Hi-Val: Iterative Learning of Hierarchical Value Functions for Policy Generation

    Get PDF
    Task decomposition is effective in manifold applications where the global complexity of a problem makes planning and decision-making too demanding. This is true, for example, in high-dimensional robotics domains, where (1) unpredictabilities and modeling limitations typically prevent the manual specification of robust behaviors, and (2) learning an action policy is challenging due to the curse of dimensionality. In this work, we borrow the concept of Hierarchical Task Networks (HTNs) to decompose the learning procedure, and we exploit Upper Confidence Tree (UCT) search to introduce HOP, a novel iterative algorithm for hierarchical optimistic planning with learned value functions. To obtain better generalization and generate policies, HOP simultaneously learns and uses action values. These are used to formalize constraints within the search space and to reduce the dimensionality of the problem. We evaluate our algorithm both on a fetching task using a simulated 7-DOF KUKA light weight arm and, on a pick and delivery task with a Pioneer robot

    Optimization of design of space experiments from the standpoint of data processing Semiannual report, 1 Oct. 1967 - 31 Mar. 1968

    Get PDF
    Design and construction work on spacecraft array processor for onboard processing of experimental dat

    Quantum Phase Transitions in Bosonic Heteronuclear Pairing Hamiltonians

    Full text link
    We explore the phase diagram of two-component bosons with Feshbach resonant pairing interactions in an optical lattice. It has been shown in previous work to exhibit a rich variety of phases and phase transitions, including a paradigmatic Ising quantum phase transition within the second Mott lobe. We discuss the evolution of the phase diagram with system parameters and relate this to the predictions of Landau theory. We extend our exact diagonalization studies of the one-dimensional bosonic Hamiltonian and confirm additional Ising critical exponents for the longitudinal and transverse magnetic susceptibilities within the second Mott lobe. The numerical results for the ground state energy and transverse magnetization are in good agreement with exact solutions of the Ising model in the thermodynamic limit. We also provide details of the low-energy spectrum, as well as density fluctuations and superfluid fractions in the grand canonical ensemble.Comment: 11 pages, 14 figures. To appear in Phys. Rev.

    Assessing the Potential of Classical Q-learning in General Game Playing

    Get PDF
    After the recent groundbreaking results of AlphaGo and AlphaZero, we have seen strong interests in deep reinforcement learning and artificial general intelligence (AGI) in game playing. However, deep learning is resource-intensive and the theory is not yet well developed. For small games, simple classical table-based Q-learning might still be the algorithm of choice. General Game Playing (GGP) provides a good testbed for reinforcement learning to research AGI. Q-learning is one of the canonical reinforcement learning methods, and has been used by (Banerjee &\& Stone, IJCAI 2007) in GGP. In this paper we implement Q-learning in GGP for three small-board games (Tic-Tac-Toe, Connect Four, Hex)\footnote{source code: https://github.com/wh1992v/ggp-rl}, to allow comparison to Banerjee et al.. We find that Q-learning converges to a high win rate in GGP. For the ϵ\epsilon-greedy strategy, we propose a first enhancement, the dynamic ϵ\epsilon algorithm. In addition, inspired by (Gelly &\& Silver, ICML 2007) we combine online search (Monte Carlo Search) to enhance offline learning, and propose QM-learning for GGP. Both enhancements improve the performance of classical Q-learning. In this work, GGP allows us to show, if augmented by appropriate enhancements, that classical table-based Q-learning can perform well in small games.Comment: arXiv admin note: substantial text overlap with arXiv:1802.0594

    Feshbach Resonance in Optical Lattices and the Quantum Ising Model

    Full text link
    Motivated by experiments on heteronuclear Feshbach resonances in Bose mixtures, we investigate s-wave pairing of two species of bosons in an optical lattice. The zero temperature phase diagram supports a rich array of superfluid and Mott phases and a network of quantum critical points. This topology reveals an underlying structure that is succinctly captured by a two-component Landau theory. Within the second Mott lobe we establish a quantum phase transition described by the paradigmatic longitudinal and transverse field Ising model. This is confirmed by exact diagonalization of the 1D bosonic Hamiltonian. We also find this transition in the homonuclear case.Comment: 5 pages, 4 figure
    corecore