504 research outputs found

    Generation of Policy-Level Explanations for Reinforcement Learning

    Full text link
    Though reinforcement learning has greatly benefited from the incorporation of neural networks, the inability to verify the correctness of such systems limits their use. Current work in explainable deep learning focuses on explaining only a single decision in terms of input features, making it unsuitable for explaining a sequence of decisions. To address this need, we introduce Abstracted Policy Graphs, which are Markov chains of abstract states. This representation concisely summarizes a policy so that individual decisions can be explained in the context of expected future transitions. Additionally, we propose a method to generate these Abstracted Policy Graphs for deterministic policies given a learned value function and a set of observed transitions, potentially off-policy transitions used during training. Since no restrictions are placed on how the value function is generated, our method is compatible with many existing reinforcement learning methods. We prove that the worst-case time complexity of our method is quadratic in the number of features and linear in the number of provided transitions, O(F2tr_samples)O(|F|^2 |tr\_samples|). By applying our method to a family of domains, we show that our method scales well in practice and produces Abstracted Policy Graphs which reliably capture relationships within these domains.Comment: Accepted to Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (2019

    Review of Dance Research in Quebec

    Get PDF

    From liquid to solid bonding in cohesive granular media

    Full text link
    We study the transition of a granular packing from liquid to solid bonding in the course of drying. The particles are initially wetted by a liquid brine and the cohesion of the packing is ensured by capillary forces, but the crystallization of the solute transforms the liquid bonds into partially cemented bonds. This transition is evidenced experimentally by measuring the compressive strength of the samples at regular intervals of times. Our experimental data reveal three regimes: 1) Up to a critical degree of saturation, no solid bonds are formed and the cohesion remains practically constant; 2) The onset of cementation occurs at the surface and a front spreads towards the center of the sample with a nonlinear increase of the cohesion; 3) All bonds are partially cemented when the cementation front reaches the center of the sample, but the cohesion increases rapidly due to the consolidation of cemented bonds. We introduce a model based on a parametric cohesion law at the bonds and a bond crystallization parameter. This model predicts correctly the phase transition and the relation between microscopic and macroscopic cohesion.Comment: 20

    FROM 3D IMAGING OF STRUCTURES TO DIFFUSIVE PROPERTIES OF ANISOTROPIC CELLULAR MATERIALS

    No full text
    International audienceThis paper deals with diffusive properties phenomena in metallic foams. We have developed a 3D morphological tool to extract geometrical characteristics of the media from X-ray images. The anisotropy of the geometry of each phase is observed and the relationship between microstructure and effective properties is analyzed. We emphasize on geometrical tortuosity determination and impact on con¬ductive transport tensor. The conductive heat transfers are computed on a vertex-edge network to determine directional effective conductivities by solving the energy equa¬tion on this network. We realize a systematic study carried on a wide range of different Nickel foam samples. Finally, we propose a simple model of effective diffusion prop¬erties dependence on tortuosity and porosity

    Granular slumping in a fluid : focus on runout distances

    Get PDF
    We investigate the effect of an ambient fluid on the dynamics of collapse and spread of a granular column simulated by means of a recently developed model which takes into account both fluid forces that act on each grain and contacts between grains. The model couples the contact dynamics method for discrete element modeling of the grains and their interactions with the finite element method for the integration of Navier-Stokes equations in 2D. The coupling is based on the fictitious domain approach in which the fluid domain is extended to that of grains, and the rigid-body motion of the grains is imposed by means of distributed Lagrange multipliers. As in similar numerical and experimental works with dry grains, we focus here on the run-out distances and avalanche durations for different column aspect ratios (height vs width). We consider three options for the surrounding fluid: 1) no fluid, 2) water and 3) a viscous fluid that allows us to perform our simulations in the grain-inertial, fluid-inertial and viscous regimes, respectively. The run-out distance is found to increase as a power law with the aspect ratio of the column, and surprisingly, for a given aspect ratio and packing fraction, it may be similar in the grain-inertial regime and fluid inertial regimes but with considerably longer duration in the latter case. We show that the effect of the fluid in viscous and fluid-inertial regimes is both to reduce the kinetic energy during the collapse and enhance the flow by lubrication during the spread. Hence, the run-out distance in a fluid may be below or equal to that in the absence of fluid due to compensation between those effects
    corecore