87 research outputs found

    Embodied imitation-enhanced reinforcement learning in multi-agent systems

    Get PDF
    Imitation is an example of social learning in which an individual observes and copies another's actions. This paper presents a new method for using imitation as a way of enhancing the learning speed of individual agents that employ a well-known reinforcement learning algorithm, namely Q-learning. Compared with other research that uses imitation with reinforcement learning, our method uses imitation of purely observed behaviours to enhance learning, with no internal state access or sharing of experiences between agents. The paper evaluates our imitation-enhanced reinforcement learning approach in both simulation and with real robots in continuous space. Both simulation and real robot experimental results show that the learning speed of the group is improved. © The Author(s) 2013

    The helicity amplitudes A1/2_{1/2} and A3/2_{3/2} for the D13(1520)_{13}(1520) resonance obtained from the γppπ0\vec{\gamma} \vec{p} \to p \pi^0 reaction}

    Full text link
    The helicity dependence of the γppπ0\vec{\gamma} \vec{p} \to p \pi^0 reaction has been measured for the first time in the photon energy range from 550 to 790 MeV. The experiment, performed at the Mainz microtron MAMI, used a 4π\pi-detector system, a circularly polarized, tagged photon beam, and a longitudinally polarized frozen-spin target. These data are predominantly sensitive to the D13(1520)D_{13}(1520) resonance and are used to determine its parameters.Comment: 5 pages, 4 figure

    First measurement of the Gerasimov-Drell-Hearn integral for Hydrogen from 200 to 800 MeV

    Full text link
    A direct measurement of the helicity dependence of the total photoabsorption cross section on the proton was carried out at MAMI (Mainz) in the energy range 200 < E_gamma < 800 MeV. The experiment used a 4π\pi detection system, a circularly polarized tagged photon beam and a frozen spin target. The contributions to the Gerasimov-Drell-Hearn sum rule and to the forward spin polarizability γ0\gamma_0 determined from the data are 226 \pm 5 (stat)\pm 12(sys) \mu b and -187 \pm 8 (stat)\pm 10(sys)10^{-6} fm^4, respectively, for 200 < E_\gamma < 800 MeV.Comment: 6 pages, 3 figures, 3 table

    Experimental determination of the complete spin structure for anti-proton + proton -> anti-\Lambda + \Lambda at anti-proton beam momentum of 1.637 GeV/c

    Get PDF
    The reaction anti-proton + proton -> anti-\Lambda + \Lambda -> anti-proton + \pi^+ + proton + \pi^- has been measured with high statistics at anti-proton beam momentum of 1.637 GeV/c. The use of a transversely-polarized frozen-spin target combined with the self-analyzing property of \Lambda/anti-\Lambda decay allows access to unprecedented information on the spin structure of the interaction. The most general spin-scattering matrix can be written in terms of eleven real parameters for each bin of scattering angle, each of these parameters is determined with reasonable precision. From these results all conceivable spin-correlations are determined with inherent self-consistency. Good agreement is found with the few previously existing measurements of spin observables in anti-proton + proton -> anti-\Lambda + \Lambda near this energy. Existing theoretical models do not give good predictions for those spin-observables that had not been previously measured.Comment: To be published in Phys. Rev. C. Tables of results (i.e. Ref. 24) are available at http://www-meg.phys.cmu.edu/~bquinn/ps185_pub/results.tab 24 pages, 16 figure

    Measurement of Spin Transfer Observables in Antiproton-Proton -> Antilambda-Lambda at 1.637 GeV/c

    Full text link
    Spin transfer observables for the strangeness-production reaction Antiproton-Proton -> Antilambda-Lambda have been measured by the PS185 collaboration using a transversely-polarized frozen-spin target with an antiproton beam momentum of 1.637 GeV/c at the Low Energy Antiproton Ring at CERN. This measurement investigates observables for which current models of the reaction near threshold make significantly differing predictions. Those models are in good agreement with existing measurements performed with unpolarized particles in the initial state. Theoretical attention has focused on the fact that these models produce conflicting predictions for the spin-transfer observables D_{nn} and K_{nn}, which are measurable only with polarized target or beam. Results presented here for D_{nn} and K_{nn} are found to be in disagreement with predictions from existing models. These results also underscore the importance of singlet-state production at backward angles, while current models predict complete or near-complete triplet-state dominance.Comment: 5 pages, 3 figure

    Building collaboration in multi-agent systems using reinforcement learning

    Get PDF
    © Springer Nature Switzerland AG 2018. This paper presents a proof-of concept study for demonstrating the viability of building collaboration among multiple agents through standard Q learning algorithm embedded in particle swarm optimisation. Collaboration is formulated to be achieved among the agents via competition, where the agents are expected to balance their action in such a way that none of them drifts away of the team and none intervene any fellow neighbours territory, either. Particles are devised with Q learning for self training to learn how to act as members of a swarm and how to produce collaborative/collective behaviours. The produced experimental results are supportive to the proposed idea suggesting that a substantive collaboration can be build via proposed learning algorithm
    corecore