
    Verifiable Reinforcement Learning via Policy Extraction

    While deep reinforcement learning has successfully solved many challenging control tasks, its real-world applicability has been limited by the inability to ensure the safety of learned policies. We propose an approach to verifiable reinforcement learning by training decision tree policies, which can represent complex policies (since they are nonparametric), yet can be efficiently verified using existing techniques (since they are highly structured). The challenge is that decision tree policies are difficult to train. We propose VIPER, an algorithm that combines ideas from model compression and imitation learning to learn decision tree policies guided by a DNN policy (called the oracle) and its Q-function, and show that it substantially outperforms two baselines. We use VIPER to (i) learn a provably robust decision tree policy for a variant of Atari Pong with a symbolic state space, (ii) learn a decision tree policy for a toy game based on Pong that provably never loses, and (iii) learn a provably stable decision tree policy for cart-pole. In each case, the decision tree policy achieves performance equal to that of the original DNN policy.
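    The training loop is easy to sketch. Below is a minimal, illustrative Python rendition of the VIPER idea (DAgger-style imitation with states weighted by the oracle's Q-value gap); `env`, `oracle_policy`, and `oracle_q` are assumed stand-ins for a Gymnasium-style environment and the DNN oracle, not names from the paper's code.

```python
# Minimal VIPER-style sketch. Assumptions (not from the paper's code):
# `env` is a Gymnasium-style environment, `oracle_policy(s)` returns the
# DNN action for state s, and `oracle_q(s)` returns its vector of Q-values.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def rollout(env, policy, oracle_policy, oracle_q, n_steps=1000):
    """Run `policy`, labelling every visited state with the oracle's action."""
    states, actions, weights = [], [], []
    s, _ = env.reset()
    for _ in range(n_steps):
        states.append(np.asarray(s))
        actions.append(oracle_policy(s))           # always label with the oracle
        q = oracle_q(s)
        weights.append(float(q.max() - q.min()))   # ~ cost of acting badly here
        s, _, terminated, truncated, _ = env.step(policy(s))
        if terminated or truncated:
            s, _ = env.reset()
    return states, actions, weights

def viper(env, oracle_policy, oracle_q, n_iters=10, max_depth=8):
    data_s, data_a, data_w = [], [], []
    policy = oracle_policy                         # first rollout uses the oracle
    tree = None
    for _ in range(n_iters):
        s, a, w = rollout(env, policy, oracle_policy, oracle_q)
        data_s += s; data_a += a; data_w += w      # DAgger-style data aggregation
        tree = DecisionTreeClassifier(max_depth=max_depth)
        tree.fit(np.stack(data_s), data_a, sample_weight=np.array(data_w))
        policy = lambda st, t=tree: int(t.predict(np.asarray(st).reshape(1, -1))[0])
    return tree  # the paper selects the best tree by evaluation; we keep the last
```

    The Q-gap weight concentrates the tree's limited capacity on states where a wrong action is costly, which is what lets a small, verifiable tree match the oracle's performance.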

    Improving WCET Evaluation using Linear Relation Analysis

    The precision of a worst-case execution time (WCET) evaluation tool on a given program is highly dependent on how the tool is able to detect and discard semantically infeasible executions of the program. In this paper, we propose to use the classical abstract-interpretation-based method of linear relation analysis to discover and exploit relations between execution paths. For this purpose, we add auxiliary variables (counters) to the program to trace its execution paths. The results are easily incorporated in the classical workflow of a WCET evaluator when the evaluator is based on the popular implicit path enumeration technique. We use existing tools (a WCET evaluator and a linear relation analyzer) to build and experiment with a prototype implementation of this idea. This work is supported by the French research foundation (ANR) as part of the W-SEPT project (ANR-12-INSE-0001).
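    A toy sketch of the counter idea may help; the program, names, and invariants below are illustrative assumptions, not the paper's prototype. One counter is added per branch, the relations a linear relation analyzer could prove over those counters appear as comments, and they carry over directly as extra linear constraints in the IPET integer linear program.

```python
# Toy illustration of the auxiliary-counter idea (assumed program and names,
# not the paper's prototype). Each branch gets a counter; a linear relation
# over the counters, once proven, becomes an extra IPET constraint.

def instrumented(n):
    c_then = c_else = 0                 # auxiliary counters, one per branch
    for i in range(n):
        if i % 4 == 0:                  # the expensive branch can fire at
            c_then += 1                 # most every 4th iteration
        else:
            c_else += 1
    return c_then, c_else

n = 100
c_then, c_else = instrumented(n)
# A linear relation analyzer would prove invariants such as:
#   c_then + c_else == n        (each iteration takes exactly one branch)
#   4 * c_then <= n + 3         (the 'then' branch fires every 4th iteration)
assert c_then + c_else == n and 4 * c_then <= n + 3

# In IPET, execution counts x_then, x_else are ILP variables; the proven
# invariants carry over as linear constraints on them, e.g.:
#   maximize   t_then * x_then + t_else * x_else
#   subject to x_then + x_else <= n,  4 * x_then <= n + 3
```

    Without the second constraint, the ILP would pessimistically assume the expensive branch executes on every iteration; the proven relation excludes those infeasible paths and tightens the WCET bound.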

    Computer Aided Verification

    This open access two-volume set LNCS 10980 and 10981 constitutes the refereed proceedings of the 30th International Conference on Computer Aided Verification, CAV 2018, held in Oxford, UK, in July 2018. The 52 full and 13 tool papers presented together with 3 invited papers and 2 tutorials were carefully reviewed and selected from 215 submissions. The papers cover a wide range of topics and techniques, from algorithmic and logical foundations of verification to practical applications in distributed, networked, cyber-physical, and autonomous systems. They are organized in topical sections on model checking, program analysis using polyhedra, synthesis, learning, runtime verification, hybrid and timed systems, tools, probabilistic systems, static analysis, theory and security, SAT, SMT and decision procedures, concurrency, and CPS, hardware, industrial applications.

    Formal verification of deep reinforcement learning agents

    Deep reinforcement learning has been successfully applied to many control tasks, but the application of such controllers in safety-critical scenarios has been limited due to safety concerns. Rigorous testing of these controllers is challenging, particularly when they operate in uncertain environments. In this thesis we develop novel verification techniques to give the user stronger guarantees on the performance of the trained agents than they would be able to obtain by testing, under different degrees and sources of uncertainty. In particular, we tackle three different sources of uncertainty affecting the agent and offer different algorithms to provide strong guarantees to the user. The first is input noise: sensors in the real world always provide imperfect data. The second source of uncertainty comes from the actuators: once an agent decides to take a specific action, faulty actuators or hardware problems could still prevent the agent from acting upon the decisions given by the controller. The last source of uncertainty is the policy: the set of decisions the controller takes when operating in the environment. Agents may act probabilistically for a number of reasons, such as dealing with adversaries in a competitive environment or addressing partial observability of the environment. In this thesis, we develop formal models of controllers executing under uncertainty, and propose new verification techniques based on abstract interpretation for their analysis. We cover different horizon lengths, i.e., the number of steps into the future that we analyse, and present methods for both finite-horizon and infinite-horizon verification. We perform both probabilistic and non-probabilistic analysis of the models constructed, depending on the methodology adopted. We implement and evaluate our methods on controllers trained for several benchmark control problems.
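    As a flavour of the non-probabilistic, finite-horizon case, the sketch below propagates a sensor-noise interval through a small ReLU controller using interval arithmetic, a common abstract domain in this line of work. The network weights, sizes, and noise bound are illustrative assumptions, not the models or tools from the thesis.

```python
# Interval abstract interpretation of a (hypothetical) ReLU controller:
# bound the action logits for every sensor reading within +/- eps of the
# nominal state. Weights and sizes are illustrative, not from the thesis.
import numpy as np

def interval_affine(lo, hi, W, b):
    """Sound bounds for W @ x + b when lo <= x <= hi (elementwise)."""
    W_pos, W_neg = np.maximum(W, 0), np.minimum(W, 0)
    return W_pos @ lo + W_neg @ hi + b, W_pos @ hi + W_neg @ lo + b

def interval_forward(lo, hi, layers):
    for W, b in layers[:-1]:
        lo, hi = interval_affine(lo, hi, W, b)
        lo, hi = np.maximum(lo, 0), np.maximum(hi, 0)  # ReLU is monotone
    return interval_affine(lo, hi, *layers[-1])

rng = np.random.default_rng(0)
layers = [(rng.standard_normal((16, 4)), np.zeros(16)),
          (rng.standard_normal((2, 16)), np.zeros(2))]  # 2 action logits

x = np.array([0.1, -0.2, 0.05, 0.0])    # nominal cart-pole-like state
eps = 0.01                               # assumed sensor-noise bound
lo, hi = interval_forward(x - eps, x + eps, layers)
# If lo[a] > hi[b] for every b != a, action a is certified under the noise.
print("logit bounds:", list(zip(lo, hi)))
```

    If the lower bound of one action's logit dominates the upper bounds of all the others, that action is guaranteed for every input in the noise box; otherwise the abstraction is too coarse and a finer domain or input splitting is needed.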