7,008 research outputs found

    Successor features for transfer in reinforcement learning

    Get PDF
    Transfer in reinforcement learning refers to the notion that generalization should occur not only within a task but also across tasks. Our focus is on transfer where the reward functions vary across tasks while the environment's dynamics remain the same. The method we propose rests on two key ideas: "successor features," a value function representation that decouples the dynamics of the environment from the rewards, and "generalized policy improvement," a generalization of dynamic programming's policy improvement step that considers a set of policies rather than a single one. Put together, the two ideas lead to an approach that integrates seamlessly within the reinforcement learning framework and allows transfer to take place between tasks without any restriction. The proposed method also provides performance guarantees for the transferred policy even before any learning has taken place. We derive two theorems that set our approach in firm theoretical ground and present experiments that show that it successfully promotes transfer in practice

    A combinatorial approach to knot recognition

    Full text link
    This is a report on our ongoing research on a combinatorial approach to knot recognition, using coloring of knots by certain algebraic objects called quandles. The aim of the paper is to summarize the mathematical theory of knot coloring in a compact, accessible manner, and to show how to use it for computational purposes. In particular, we address how to determine colorability of a knot, and propose to use SAT solving to search for colorings. The computational complexity of the problem, both in theory and in our implementation, is discussed. In the last part, we explain how coloring can be utilized in knot recognition

    Competition Between Antiferromagnetic Order and Spin-Liquid Behavior in the Two-Dimensional Periodic Anderson Model at Half-Filling

    Full text link
    We study the two-dimensional periodic Anderson model at half-filling using quantum Monte Carlo (QMC) techniques. The ground state undergoes a magnetic order-disorder transition as a function of the effective exchange coupling between the conduction and localized bands. Low-lying spin and charge excitations are determined using the maximum entropy method to analytically continue the QMC data. At finite temperature we find a competition between the Kondo effect and antiferromagnetic order which develops in the localized band through Ruderman-Kittel-Kasuya-Yosida interactions.Comment: Revtex 3.0, 10 pages + 5 figures, UCSBTH-94-2

    Polaron band formation in the Holstein model

    Full text link
    We present numerical exact results for the polaronic band structure of the Holstein molecular crystal model in one and two dimensions. The use of direct Lanczos diagonalization technique, preserving the full dynamics and quantum nature of phonons, allows us to analyze in detail the renormalization of both quasiparticle bandwidth and dispersion by the electron-phonon interaction. For the two-dimensional case some of our exact data are compared with the results obtained in the framework of a recently developed finite cluster strong-coupling perturbation theory.Comment: 10 pages (LaTeX), 6 figures (ps), submitted to Phys. Rev.

    Description of recent large-qq neutron inclusive scattering data from liquid 4^4He

    Get PDF
    We report dynamical calculations for large-qq structure functions of liquid 4^4He at TT=1.6 and 2.3 K and compare those with recent MARI data. We extend those calculations far beyond the experimental range q\le 29\Ain in order to study the approach of the response to its asymptotic limit for a system with interactions having a strong short-range repulsion. We find only small deviations from theoretical 1/q1/q behavior, valid for smooth VV. We repeat an extraction by Glyde et al of cumulant coefficients from data. We argue that fits determine the single atom momentum distribution, but express doubt as to the extraction of meaningful Final State Interaction parameters.Comment: 37 pages, 13 postscript fig

    Changes in energy content of lunchtime purchases from fast food restaurants after introduction of calorie labelling: cross sectional customer surveys

    Get PDF
    Objective To assess the impact of fast food restaurants adding calorie labelling to menu items on the energy content of individual purchases

    Fast algorithm for calculating two-photon absorption spectra

    Full text link
    We report a numerical calculation of the two-photon absorption coefficient of electrons in a binding potential using the real-time real-space higher-order difference method. By introducing random vector averaging for the intermediate state, the task of evaluating the two-dimensional time integral is reduced to calculating two one-dimensional integrals. This allows the reduction of the computation load down to the same order as that for the linear response function. The relative advantage of the method compared to the straightforward multi-dimensional time integration is greater for the calculation of non-linear response functions of higher order at higher energy resolution.Comment: 4 pages, 2 figures. It will be published in Phys. Rev. E on 1, March, 199

    LP 133-373: A New Chromospherically Active Eclipsing dMe Binary with a Distant, Cool White Dwarf Companion

    Get PDF
    We report the discovery of the partially eclipsing binary LP 133-373. Nearly identical eclipses along with observed photometric colors and spectroscopy indicate that it is a pair of chromospherically active dM4 stars in a circular 1.6 day orbit. Light and velocity curve modeling to our differential photometry and velocity data show that each star has a mass and radius of 0.340+/-0.014 Msolar and 0.33+/-0.02 Rsolar. The binary is itself part of a common proper motion pair with LP 133-374, a cool DC or possible DA white dwarf with a mass of 0.49-0.82 Msolar, which would make the system at least 3 Gyr ol
    corecore