2,471 research outputs found

    Multi-Task Policy Search for Robotics

    No full text
    © 2014 IEEE.Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for continuous task variations, requiring more principled approaches to share and transfer knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, which allows learning a single policy that generalizes across multiple known and unknown tasks. Applications of our novel approach to reinforcement and imitation learning in realrobot experiments are shown

    Statistics of dressed modes in a thermal state

    Full text link
    By a Wigner-function calculation, we evaluate the trace of a certain Gaussian operator arising in the theory of a boson system subject to both finite temperature and (weak) interaction. Thereby we rederive (and generalize) a recent result by Kocharovsky, Kocharovsky, and Scully [Phys. Rev. A, vol. 61, art. 053606 (2000)] in a way that is technically much simpler. One step uses a special case of the response of Wigner functions to linear transformations, and we demonstrate the general case by simple means. As an application we extract the counting statistics for each mode of the Bose gas.Comment: to appear in Optics Communications, 10 page

    Interacting Bosons at Finite Temperature: How Bogolubov Visited a Black Hole and Came Home Again

    Get PDF
    The structure of the thermal equilibrium state of a weakly interacting Bose gas is of current interest. We calculate the density matrix of that state in two ways. The most effective method, in terms of yielding a simple, explicit answer, is to construct a generating function within the traditional framework of quantum statistical mechanics. The alternative method, arguably more interesting, is to construct the thermal state as a vector state in an artificial system with twice as many degrees of freedom. It is well known that this construction has an actual physical realization in the quantum thermodynamics of black holes, where the added degrees of freedom correspond to the second sheet of the Kruskal manifold and the thermal vector state is a state of the Unruh or the Hartle-Hawking type. What is unusual about the present work is that the Bogolubov transformation used to construct the thermal state combines in a rather symmetrical way with Bogolubov's original transformation of the same form, used to implement the interaction of the nonideal gas in linear approximation. In addition to providing a density matrix, the method makes it possible to calculate efficiently certain expectation values directly in terms of the thermal vector state of the doubled system.Comment: 25 pages, LaTeX. To appear in a special issue of Foundations of Physics in honor of Jacob Bekenstei

    A global fit of top quark effective theory to data

    Get PDF
    In this paper we present a global fit of beyond the Standard Model (BSM) dimension six operators relevant to the top quark sector to currently available data. Experimental measurements include parton-level top-pair and single top production from the LHC and the Tevatron. Higher order QCD corrections are modelled using differential and global K-factors, and we use novel fast-fitting techniques developed in the context of Monte Carlo event generator tuning to perform the fit. This allows us to provide new, fully correlated and model-independent bounds on new physics effects in the top sector from the most current direct hadron-collider measurements in light of the involved theoretical and experimental systematics. As a by-product, our analysis constitutes a proof-of-principle that fast fitting of theory to data is possible in the top quark sector, and paves the way for a more detailed analysis including top quark decays, detector corrections and precision observables.Comment: Additional references and preprint code added. Minor error in generation of plots fixed, no conclusions affecte

    Multi-Task Policy Search

    No full text
    Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for continuous task variations, requiring more principled approaches to share and transfer knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, which allows learning a single policy that generalizes across multiple known and unknown tasks. Applications of our novel approach to reinforcement and imitation learning in real-robot experiments are shown
    corecore