98 research outputs found

    Non-Stationary Policy Learning for Multi-Timescale Multi-Agent Reinforcement Learning

    Full text link
    In multi-timescale multi-agent reinforcement learning (MARL), agents interact across different timescales. In general, policies for time-dependent behaviors, such as those induced by multiple timescales, are non-stationary. Learning non-stationary policies is challenging and typically requires sophisticated or inefficient algorithms. Motivated by the prevalence of this control problem in real-world complex systems, we introduce a simple framework for learning non-stationary policies for multi-timescale MARL. Our approach uses available information about agent timescales to define a periodic time encoding. In detail, we theoretically demonstrate that the effects of non-stationarity introduced by multiple timescales can be learned by a periodic multi-agent policy. To learn such policies, we propose a policy gradient algorithm that parameterizes the actor and critic with phase-functioned neural networks, which provide an inductive bias for periodicity. The framework's ability to effectively learn multi-timescale policies is validated on a gridworld and building energy management environment.Comment: Accepted at IEEE CDC'23. 7 pages, 6 figure

    Signatures of exciton coupling in paired nanoemitters

    Get PDF
    An exciton formed by the delocalized electronic excitation of paired nanoemitters is interpreted in terms of the electromagnetic emission of the pair and their mutual coupling with a photodetector. A formulation directly tailored for fluorescence detection is identified, giving results which are strongly dependent on geometry and selection rules. Signature symmetric and antisymmetric combinations are analyzed and their distinctive features identified

    Operational experience on the generation and control of high brightness electron bunch trains at SPARC-LAB

    Get PDF
    Sub-picosecond, high-brightness electron bunch trains are routinely produced at SPARC-LAB via the velocity bunching technique. Such bunch trains can be used to drive multi-color Free Electron Lasers (FELs) and plasma wake field accelerators. In this paper we present recent results at SPARC-LAB on the generation of such beams, highlighting the key points of our scheme. We will discuss also the on-going machine upgrades to allow driving FELs with plasma accelerated beams or with short electron pulses at an increased energy

    Gold remobilisation and formation of high grade ore shoots driven by dissolution-reprecipitation replacement and Ni substitution into auriferous arsenopyrite

    Get PDF
    Both gold-rich sulphides and ultra-high grade native gold oreshoots are common but poorly understood phenomenon in orogenic-type mineral systems, partly because fluids in these systems are considered to have relatively low gold solubilities and are unlikely to generate high gold concentrations. The world-class Obuasi gold deposit, Ghana, has gold-rich arsenopyrite spatially associated with quartz veins, which have extremely high, localised concentrations of native gold, contained in microcrack networks within the quartz veins where they are folded. Here, we examine selected samples from Obuasi using a novel combination of quantitative electron backscatter diffraction analysis, ion microprobe imaging, synchrotron XFM mapping and geochemical modelling to investigate the origin of the unusually high gold concentrations. The auriferous arsenopyrites are shown to have undergone partial replacement (~15%) by Au-poor, nickeliferous arsenopyrite, during localised crystal-plastic deformation, intragranular microfracture and metamorphism (340-460 °C, 2 kbars). Our results show the dominant replacement mechanism was pseudomorphic dissolution-reprecipitation, driven by small volumes of an infiltrating fluid that had relatively low fS2 and carried aqueous NiCl2. We find that arsenopyrite replacement produced strong chemical gradients at crystal-fluid interfaces due to an increase in fS2 during reaction, which enabled efficient removal of gold to the fluid phase and development of anomalously gold-rich fluid (potentially 10 ppm or more depending on sulphur concentration). This process was facilitated by precipitation of ankerite, which removed CO2 from the fluid, increasing the relative proportion of sulphur for gold complexation and inhibited additional quartz precipitation. Gold re-precipitation occurred over distances of 10 µm to several tens of metres and was likely a result of sulphur activity reduction through precipitation of pyrite and other sulphides. We suggest this late remobilisation process may be relatively common in orogenic belts containing abundant mafic/ultramafic rocks, which act as a source of Ni and Co scavenged by chloride-bearing fluids. Both the preference of the arsenopyrite crystal structure for Ni and Co, rather than gold, and the release of sulphur during reaction, can drive gold remobilisation in many deposits across broad regions

    Search for heavy neutral lepton production in K+ decays

    Get PDF
    A search for heavy neutral lepton production in K + decays using a data sample collected with a minimum bias trigger by the NA62 experiment at CERN in 2015 is reported. Upper limits at the 10−7 to 10−6 level are established on the elements of the extended neutrino mixing matrix |Ue4| 2 and |Uμ4| 2 for heavy neutral lepton mass in the ranges 170–448 MeV/c2 and 250–373 MeV/c2, respectively. This improves on the previous limits from HNL production searches over the whole mass range considered for |Ue4|2 and above 300 MeV/c2 for |Uμ4|2
    • …
    corecore