128 research outputs found

    Learning Symbolic Models of Stochastic Domains

    In this article, we work towards the goal of developing agents that can learn to act in complex worlds. We develop a probabilistic, relational planning rule representation that compactly models noisy, nondeterministic action effects, and show how such rules can be effectively learned. Through experiments in simple planning domains and a 3D simulated blocks world with realistic physics, we demonstrate that this learning algorithm allows agents to effectively model world dynamics.

    Improving Policy Learning via Language Dynamics Distillation

    Recent work has shown that augmenting environments with language descriptions improves policy learning. However, for environments with complex language abstractions, learning how to ground language to observations is difficult due to sparse, delayed rewards. We propose Language Dynamics Distillation (LDD), which pretrains a model to predict environment dynamics given demonstrations with language descriptions, and then fine-tunes these language-aware pretrained representations via reinforcement learning (RL). In this way, the model is trained to both maximize expected reward and retain knowledge about how language relates to environment dynamics. On SILG, a benchmark of five tasks with language descriptions that evaluate distinct generalization challenges on unseen environments (NetHack, ALFWorld, RTFM, Messenger, and Touchdown), LDD outperforms tabula-rasa RL, VAE pretraining, and methods that learn from unlabeled demonstrations in inverse RL and reward shaping with pretrained experts. In our analyses, we show that language descriptions in demonstrations improve sample-efficiency and generalization across environments, and that dynamics modeling with expert demonstrations is more effective than with non-experts

    Multilingual Autoregressive Entity Linking

    We present mGENRE, a sequence-to-sequence system for the Multilingual Entity Linking (MEL) problem: the task of resolving language-specific mentions to a multilingual Knowledge Base (KB). For a mention in a given language, mGENRE predicts the name of the target entity left-to-right, token-by-token in an autoregressive fashion. The autoregressive formulation allows us to effectively cross-encode mention string and entity names to capture more interactions than the standard dot product between mention and entity vectors. It also enables fast search within a large KB even for mentions that do not appear in mention tables and with no need for large-scale vector indices. While prior MEL works use a single representation for each entity, we match against entity names of as many languages as possible, which allows exploiting language connections between source input and target name. Moreover, in a zero-shot setting on languages with no training data at all, mGENRE treats the target language as a latent variable that is marginalized at prediction time. This leads to over 50% improvements in average accuracy. We show the efficacy of our approach through extensive evaluation including experiments on three popular MEL benchmarks where we establish new state-of-the-art results. Source code available at https://github.com/facebookresearch/GENRE

    Curvature Dependence of Surface Free Energy of Liquid Drops and Bubbles: A Simulation Study

    We study the excess free energy due to phase coexistence of fluids by Monte Carlo simulations using successive umbrella sampling in finite LxLxL boxes with periodic boundary conditions. Both the vapor-liquid phase coexistence of a simple Lennard-Jones fluid and the coexistence between A-rich and B-rich phases of a symmetric binary (AB) Lennard-Jones mixture are studied, varying the density rho in the simple fluid or the relative concentration x_A of A in the binary mixture, respectively. The character of phase coexistence changes from a spherical droplet (or bubble) of the minority phase (near the coexistence curve) to a cylindrical droplet (or bubble) and finally (in the center of the miscibility gap) to a slab-like configuration of two parallel flat interfaces. Extending the analysis of M. Schrader, P. Virnau, and K. Binder [Phys. Rev. E 79, 061104 (2009)], we extract the surface free energy gamma(R) of both spherical and cylindrical droplets and bubbles in the vapor-liquid case, and present evidence that for R -> Infinity the leading order (Tolman) correction for droplets has sign opposite to the case of bubbles, consistent with the Tolman length being independent of the sign of curvature. For the symmetric binary mixture the expected non-existence of the Tolman length is confirmed. In all cases, and for a range of radii R relevant for nucleation theory, gamma(R) deviates strongly from gamma(Infinity), which can be accounted for by a term of order gamma(Infinity)/gamma(R)-1 ~ 1/R^2. Our results for the simple Lennard-Jones fluid are also compared to results from density functional theory and we find qualitative agreement in the behavior of gamma(R) as well as in the sign and magnitude of the Tolman length. Comment: 25 pages, submitted to J. Chem. Phys.

    First Measurement of Coherent Elastic Neutrino-Nucleus Scattering on Argon

    We report the first measurement of coherent elastic neutrino-nucleus scattering (CEvNS) on argon using a liquid argon detector at the Oak Ridge National Laboratory Spallation Neutron Source. Two independent analyses prefer CEvNS over the background-only null hypothesis with greater than $3\sigma$ significance. The measured cross section, averaged over the incident neutrino flux, is $(2.2 \pm 0.7) \times 10^{-39}$ cm$^2$, consistent with the standard model prediction. The neutron-number dependence of this result, together with that from our previous measurement on CsI, confirms the existence of the CEvNS process and provides improved constraints on non-standard neutrino interactions. Comment: 8 pages, 5 figures with 2 pages, 6 figures supplementary material. V3: fixes to figs 3,4; V4: fix typo in table 1; V5: replaced missing appendix; V6: fix Eq 1, new fig 3; V7: final version, updated with final revision

    Search for nucleon decays with EXO-200

    A search for instability of nucleons bound in $^{136}$Xe nuclei is reported with 223 kg$\cdot$yr exposure of $^{136}$Xe in the EXO-200 experiment. Lifetime limits of $3.3\times 10^{23}$ and $1.9\times 10^{23}$ yrs are established for nucleon decay to $^{133}$Sb and $^{133}$Te, respectively. These are the most stringent to date, exceeding the prior decay limits by a factor of 9 and 7, respectively.