128 research outputs found
Learning Symbolic Models of Stochastic Domains
In this article, we work towards the goal of developing agents that can learn
to act in complex worlds. We develop a probabilistic, relational planning rule
representation that compactly models noisy, nondeterministic action effects,
and show how such rules can be effectively learned. Through experiments in
simple planning domains and a 3D simulated blocks world with realistic physics,
we demonstrate that this learning algorithm allows agents to effectively model
world dynamics
Improving Policy Learning via Language Dynamics Distillation
Recent work has shown that augmenting environments with language descriptions improves policy learning. However, for environments with complex language abstractions, learning how to ground language to observations is difficult due to sparse, delayed rewards. We propose Language Dynamics Distillation (LDD), which pretrains a model to predict environment dynamics given demonstrations with language descriptions, and then fine-tunes these language-aware pretrained representations via reinforcement learning (RL). In this way, the model is trained to both maximize expected reward and retain knowledge about how language relates to environment dynamics. On SILG, a benchmark of five tasks with language descriptions that evaluate distinct generalization challenges on unseen environments (NetHack, ALFWorld, RTFM, Messenger, and Touchdown), LDD outperforms tabula-rasa RL, VAE pretraining, and methods that learn from unlabeled demonstrations in inverse RL and reward shaping with pretrained experts. In our analyses, we show that language descriptions in demonstrations improve sample-efficiency and generalization across environments, and that dynamics modeling with expert demonstrations is more effective than with non-experts
Multilingual Autoregressive Entity Linking
We present mGENRE, a sequence-to- sequence system for the Multilingual Entity Linking (MEL) problem—the task of resolving language-specific mentions to a multilingual Knowledge Base (KB). For a mention in a given language, mGENRE predicts the name of the target entity left-to-right, token-by-token in an autoregressive fashion. The autoregressive formulation allows us to effectively cross-encode mention string and entity names to capture more interactions than the standard dot product between mention and entity vectors. It also enables fast search within a large KB even for mentions that do not appear in mention tables and with no need for large-scale vector indices. While prior MEL works use a single representation for each entity, we match against entity names of as many languages as possible, which allows exploiting language connections between source input and target name. Moreover, in a zero-shot setting on languages with no training data at all, mGENRE treats the target language as a latent variable that is marginalized at prediction time. This leads to over 50% improvements in average accuracy. We show the efficacy of our approach through extensive evaluation including experiments on three popular MEL benchmarks where we establish new state-of-the-art results. Source code available at https://github.com/facebookresearch/GENRE
Curvature Dependence of Surface Free Energy of Liquid Drops and Bubbles: A Simulation Study
We study the excess free energy due to phase coexistence of fluids by Monte
Carlo simulations using successive umbrella sampling in finite LxLxL boxes with
periodic boundary conditions. Both the vapor-liquid phase coexistence of a
simple Lennard-Jones fluid and the coexistence between A-rich and B-rich phases
of a symmetric binary (AB) Lennard-Jones mixture are studied, varying the
density rho in the simple fluid or the relative concentration x_A of A in the
binary mixture, respectively. The character of phase coexistence changes from a
spherical droplet (or bubble) of the minority phase (near the coexistence
curve) to a cylindrical droplet (or bubble) and finally (in the center of the
miscibility gap) to a slab-like configuration of two parallel flat interfaces.
Extending the analysis of M. Schrader, P. Virnau, and K. Binder [Phys. Rev. E
79, 061104 (2009)], we extract the surface free energy gamma (R) of both
spherical and cylindrical droplets and bubbles in the vapor-liquid case, and
present evidence that for R -> Infinity the leading order (Tolman) correction
for droplets has sign opposite to the case of bubbles, consistent with the
Tolman length being independent on the sign of curvature. For the symmetric
binary mixture the expected non-existence of the Tolman length is confirmed. In
all cases {and for a range of radii} R relevant for nucleation theory, gamma(R)
deviates strongly from gamma (Infinity) which can be accounted for by a term of
order gamma(Infinity)/gamma(R)-1 ~ 1/R^2. Our results for the simple
Lennard-Jones fluid are also compared to results from density functional theory
and we find qualitative agreement in the behavior of gamma(R) as well as in the
sign and magnitude of the Tolman length.Comment: 25 pages, submitted to J. Chem. Phy
First Measurement of Coherent Elastic Neutrino-Nucleus Scattering on Argon
We report the first measurement of coherent elastic neutrino-nucleus
scattering (\cevns) on argon using a liquid argon detector at the Oak Ridge
National Laboratory Spallation Neutron Source. Two independent analyses prefer
\cevns over the background-only null hypothesis with greater than
significance. The measured cross section, averaged over the incident neutrino
flux, is (2.2 0.7) 10 cm -- consistent with the
standard model prediction. The neutron-number dependence of this result,
together with that from our previous measurement on CsI, confirms the existence
of the \cevns process and provides improved constraints on non-standard
neutrino interactions.Comment: 8 pages, 5 figures with 2 pages, 6 figures supplementary material V3:
fixes to figs 3,4 V4: fix typo in table 1, V5: replaced missing appendix, V6:
fix Eq 1, new fig 3, V7 final version, updated with final revision
Search for nucleon decays with EXO-200
A search for instability of nucleons bound in Xe nuclei is reported
with 223 kgyr exposure of Xe in the EXO-200 experiment. Lifetime
limits of 3.3 and 1.9 yrs are established for
nucleon decay to Sb and Te, respectively. These are the most
stringent to date, exceeding the prior decay limits by a factor of 9 and 7,
respectively
- …