4,411 research outputs found

    Get It in Writing: Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL

    Full text link
    Multi-agent reinforcement learning (MARL) is a powerful tool for training automated systems acting independently in a common environment. However, it can lead to sub-optimal behavior when individual incentives and group incentives diverge. Humans are remarkably capable at solving these social dilemmas. It is an open problem in MARL to replicate such cooperative behaviors in selfish agents. In this work, we draw upon the idea of formal contracting from economics to overcome diverging incentives between agents in MARL. We propose an augmentation to a Markov game where agents voluntarily agree to binding state-dependent transfers of reward, under pre-specified conditions. Our contributions are theoretical and empirical. First, we show that this augmentation makes all subgame-perfect equilibria of all fully observed Markov games exhibit socially optimal behavior, given a sufficiently rich space of contracts. Next, we complement our game-theoretic analysis by showing that state-of-the-art RL algorithms learn socially optimal policies given our augmentation. Our experiments include classic static dilemmas like Stag Hunt, Prisoner's Dilemma and a public goods game, as well as dynamic interactions that simulate traffic, pollution management and common pool resource management.Comment: 12 pages, 8 figures, AAMAS 202

    Stochastic optimization of a cold atom experiment using a genetic algorithm

    Full text link
    We employ an evolutionary algorithm to automatically optimize different stages of a cold atom experiment without human intervention. This approach closes the loop between computer based experimental control systems and automatic real time analysis and can be applied to a wide range of experimental situations. The genetic algorithm quickly and reliably converges to the most performing parameter set independent of the starting population. Especially in many-dimensional or connected parameter spaces the automatic optimization outperforms a manual search.Comment: 4 pages, 3 figure

    A Quantitative Theory of Mechanical Unfolding of a Homopolymer Globule

    Full text link
    We propose the quantitative mean-field theory of mechanical unfolding of a globule formed by long flexible homopolymer chain collapsed in poor solvent and subjected to extensional deformation. We demonstrate that depending on the degree of polymerization and solvent quality (quantified by the Flory-Huggins χ\chi parameter) the mechanical unfolding of the collapsed chain may either occur continuously (by passing a sequence of uniformly elongated configurations) or involves intra-molecular micro-phase coexistence of a collapsed and a stretched segment followed by an abrupt unraveling transition. The force-extension curves are obtained and quantitatively compared to our recent results of numerical self-consistent field (SCF) simulations. The phase diagrams for extended homopolymer chains in poor solvent comprising one- and two-phase regions are calculated for different chain length or/and solvent quality.Comment: 24 pages, 18 figure

    Topotecan-vincristine-doxorubicin in stage 4 high risk neuroblastoma patients failing to achieve a complete metastatic response to rapid COJEC : a SIOPEN study

    Get PDF
    Purpose : Metastatic response to induction therapy for high-risk neuroblastoma is a prognostic factor. In the International Society of Paediatric Oncology Europe Neuroblastoma (SIOPEN) HR-NBL-1 protocol, only patients with metastatic complete response (CR) or partial response (PR) with <= three abnormal skeletal areas on iodine 123-metaiodobenzylguanidine ([I-123] mIBG) scintigraphy and no bone marrow disease proceed to high dose therapy (HDT). In this study, topotecan-vincristine-doxorubicin (TVD) was evaluated in patients failing to achieve these criteria, with the aim of improving the metastatic response rate. Materials and Methods : Patients with metastatic high-risk neuroblastoma who had not achieved the SIOPEN criteria for HDT after induction received two courses of topotecan 1.5 mg/m(2)/day for 5 days, followed by a 48-hour infusion of vincristine, 2 mg/m(2), and doxorubicin, 45 mg/m(2). Results : Sixty-three patients were eligible and evaluable. Following two courses of TVD, four (6.4%) patients had an overall CR, while 28 (44.4%) had a PR with a combined response rate of 50.8% (95% confidence interval [CI], 37.9 to 63.6). Of these, 23 patients achieved a metastatic CR or a PR with <= 3 mIBG skeletal areas and no bone marrow disease (36.5%; 95% CI, 24.7 to 49.6) and were eligible to receive HDT. Toxicity was mostly haematological, affecting 106 of the 126 courses (84.1%; 95% CI, 76.5 to 90.0), and dose reduction was necessary in six patients. Stomatitis was the second most common nonhematological toxicity, occurring in 20 patients (31.7%). Conclusion : TVD was effective in improving the response rate of high-risk neuroblastoma patients after induction with COJEC enabling them to proceed to HDT. However, the long-term benefits of TVD needs to be determined in randomized clinical trials

    Constant-angle surfaces in liquid crystals

    Get PDF
    We discuss some properties of surfaces in R3 whose unit normal has constant angle with an assigned direction field. The constant angle condition can be rewritten as an Hamilton-Jacobi equation correlating the surface and the direction field. We focus on examples motivated by the physics of interfaces in liquid crystals and of layered fluids, and discuss the properties of the constant-angle surfaces when the direction field is singular along a line (disclination) or at a point (hedgehog defect

    Icebergs in the North Atlantic: Modelling circulation changes and glacio-marine deposition

    Get PDF
    In order to investigate meltwater events in the North Atlantic, a simple iceberg generation, drift, and melting routine was implemented in a high-resolution OGCM. Starting from the modelled last glacial state, every 25th day cylindrical model icebergs 300 meters high were released at 32 specific points along the coasts. Icebergs launched at the Barents Shelf margin spread a light meltwater lid over the Norwegian and Greenland Seas, shutting down the deep convection and the anti-clockwise circulation in this area. Due to the constraining ocean circulation, the icebergs produce a tongue of relatively cold and fresh water extending eastward from Hudson Strait that must develop at this location, regardless of iceberg origin. From the total amount of freshwater inferred by the icebergs, the thickness of the deposited IRD could be calculated in dependance of iceberg sediment concentration. In this way, typical extent and thickness of Heinrich layers could be reproduced, running the model for 250 years of steady state with constant iceberg meltwater inflow

    Oppida, agglomerations and suburbia: The Bibracte environs and new perspectives on Late Iron Age urbanism in central-eastern France

    Get PDF
    This paper explores the nature and chronology of La Tène and early Roman unenclosed agglomerations in central-eastern France. It has been prompted by the discovery of a c. 115 ha La Tène D2b/Augustan (c. 50 bc to ad 15) site close to Bibracte in the Morvan, located around the source of the River Yonne. This complex provides a new perspective on the chronology and role of Late La Tène and early Roman unenclosed settlements, adding further complexity to the story of the development of Late La Tène oppida. It indicates that these ‘agglomerations’ followed remarkably varied chronological trajectories, raising important issues concerning the nature of landscape and social change at the end of the Iron Age

    Investigating a genetic link between Alzheimer’s Disease and CADASIL related Cerebral Small Vessel Disease

    Get PDF
    Monogenic forms of Alzheimer’s disease (AD) have been identified through mutations in genes such as APP, PSEN1, and PSEN2, whilst other genetic markers such as the APOE ε carrier allele status have been shown to increase the likelihood of having the disease. Mutations in these genes are not limited to AD, as APP mutations can also cause an amyloid form of cerebral small vessel disease (CSVD) known as cerebral amyloid angiopathy, whilst PSEN1 and PSEN2 are involved in NOTCH3 signalling, a process known to be dysregulated in the monogenic CSVD, cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL). The overlap between AD genes and causes of CSVD led to the hypothesis that mutations in other genes within the PANTHER AD–presenilin pathway may be novel causes of CSVD in a cohort of clinically suspicious CADASIL patients without a pathogenic NOTCH3 mutation. To investigate this, whole exome sequencing was performed on 50 suspected CADASIL patients with no NOTCH3 mutations, and a targeted gene analysis was completed on the PANTHER. ERN1 was identified as a novel candidate CSVD gene following predicted pathogenic gene mutation analysis. Rare variant burden testing failed to identify an association with any gene; however, it did show a nominally significant link with ERN1 and TRPC3. This study provides evidence to support a genetic overlap between CSVD and Alzheimer’s disease.</p

    Far-infrared absorption in parallel quantum wires with weak tunneling

    Full text link
    We study collective and single-particle intersubband excitations in a system of quantum wires coupled via weak tunneling. For an isolated wire with parabolic confinement, the Kohn's theorem guarantees that the absorption spectrum represents a single sharp peak centered at the frequency given by the bare confining potential. We show that the effect of weak tunneling between two parabolic quantum wires is twofold: (i) additional peaks corresponding to single-particle excitations appear in the absorption spectrum, and (ii) the main absorption peak acquires a depolarization shift. We also show that the interplay between tunneling and weak perpendicular magnetic field drastically enhances the dispersion of single-particle excitations. The latter leads to a strong damping of the intersubband plasmon for magnetic fields exceeding a critical value.Comment: 18 pages + 6 postcript figure
    • …
    corecore