Search CORE

963 research outputs found

Safety-Aware Apprenticeship Learning

Author: A Solar-Lezama
H Hansson
M Kwiatkowska
M Kwiatkowska
M Kwiatkowska
R Bellman
R Wimmer
S Jha
S Junges
T Han
Y-J Kuo
Publication venue
Publication date: 28/04/2018
Field of study

Apprenticeship learning (AL) is a kind of Learning from Demonstration techniques where the reward function of a Markov Decision Process (MDP) is unknown to the learning agent and the agent has to derive a good policy by observing an expert's demonstrations. In this paper, we study the problem of how to make AL algorithms inherently safe while still meeting its learning objective. We consider a setting where the unknown reward function is assumed to be a linear combination of a set of state features, and the safety property is specified in Probabilistic Computation Tree Logic (PCTL). By embedding probabilistic model checking inside AL, we propose a novel counterexample-guided approach that can ensure safety while retaining performance of the learnt policy. We demonstrate the effectiveness of our approach on several challenging AL scenarios where safety is essential.Comment: Accepted by International Conference on Computer Aided Verification (CAV) 201

arXiv.org e-Print Archive

Crossref

Evaluating the reliability of NAND multiplexing with PRISM

Author: Kwiatkowska M.
Norman G.
Parker P.
Shukla S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2005
Field of study

Probabilistic-model checking is a formal verification technique for analyzing the reliability and performance of systems exhibiting stochastic behavior. In this paper, we demonstrate the applicability of this approach and, in particular, the probabilistic-model-checking tool PRISM to the evaluation of reliability and redundancy of defect-tolerant systems in the field of computer-aided design. We illustrate the technique with an example due to von Neumann, namely NAND multiplexing. We show how, having constructed a model of a defect-tolerant system incorporating probabilistic assumptions about its defects, it is straightforward to compute a range of reliability measures and investigate how they are affected by slight variations in the behavior of the system. This allows a designer to evaluate, for example, the tradeoff between redundancy and reliability in the design. We also highlight errors in analytically computed reliability bounds, recently published for the same case study

CiteSeerX

Crossref

University of Birmingham Research Portal

Enlighten

Explicit Model Checking of Very Large MDP using Partitioning and Secondary Storage

Author: A Aggarwal
A Bell
A Hartmanns
C Baier
DD Deavours
EM Clarke
G Norman
GD Penna
H Hermanns
HC Bohnenkamp
J Barnat
L Alfaro de
M Hammer
M Kwiatkowska
M Kwiatkowska
M Timmer
ML Puterman
MZ Kwiatkowska
MZ Kwiatkowska
MZ Kwiatkowska
R Alur
R Mehmood
S Edelkamp
S Edelkamp
S Evangelista
T Bao
U Stern
V Forejt
WJ Stewart
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/05/2016
Field of study

The applicability of model checking is hindered by the state space explosion problem in combination with limited amounts of main memory. To extend its reach, the large available capacities of secondary storage such as hard disks can be exploited. Due to the specific performance characteristics of secondary storage technologies, specialised algorithms are required. In this paper, we present a technique to use secondary storage for probabilistic model checking of Markov decision processes. It combines state space exploration based on partitioning with a block-iterative variant of value iteration over the same partitions for the analysis of probabilistic reachability and expected-reward properties. A sparse matrix-like representation is used to store partitions on secondary storage in a compact format. All file accesses are sequential, and compression can be used without affecting runtime. The technique has been implemented within the Modest Toolset. We evaluate its performance on several benchmark models of up to 3.5 billion states. In the analysis of time-bounded properties on real-time models, our method neutralises the state space explosion induced by the time bound in its entirety.Comment: The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-24953-7_1

arXiv.org e-Print Archive

Crossref

Quantitative Analysis of DoS Attacks and Client Puzzles in IoT Systems

Author: B Aziz
C Baier
J Mirkovic
M Kwiatkowska
S Basagiannis
S Tritilanunt
T Aura
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/10/2017
Field of study

Denial of Service (DoS) attacks constitute a major security threat to today's Internet. This challenge is especially pertinent to the Internet of Things (IoT) as devices have less computing power, memory and security mechanisms to mitigate DoS attacks. This paper presents a model that mimics the unique characteristics of a network of IoT devices, including components of the system implementing `Crypto Puzzles' - a DoS mitigation technique. We created an imitation of a DoS attack on the system, and conducted a quantitative analysis to simulate the impact such an attack may potentially exert upon the system, assessing the trade off between security and throughput in the IoT system. We model this through stochastic model checking in PRISM and provide evidence that supports this as a valuable method to compare the efficiency of different implementations of IoT systems, exemplified by a case study

arXiv.org e-Print Archive

Crossref

Value Iteration for Long-run Average Reward in Markov Decision Processes

Author: A Komuravelli
A McIver
AF Veinott
AK McIver
C Baier
C Courcoubetis
J Filar
K Chatterjee
K Chatterjee
K Chatterjee
K Chatterjee
M Duflot
M Kwiatkowska
M Kwiatkowska
M Kwiatkowska
ML Puterman
O Michael
RA Howard
S Giro
S Haddad
T Brázdil
T Brázdil
T Brázdil
Publication venue
Publication date: 13/07/2017
Field of study

Markov decision processes (MDPs) are standard models for probabilistic systems with non-deterministic behaviours. Long-run average rewards provide a mathematically elegant formalism for expressing long term performance. Value iteration (VI) is one of the simplest and most efficient algorithmic approaches to MDPs with other properties, such as reachability objectives. Unfortunately, a naive extension of VI does not work for MDPs with long-run average rewards, as there is no known stopping criterion. In this work our contributions are threefold. (1) We refute a conjecture related to stopping criteria for MDPs with long-run average rewards. (2) We present two practical algorithms for MDPs with long-run average rewards based on VI. First, we show that a combination of applying VI locally for each maximal end-component (MEC) and VI for reachability objectives can provide approximation guarantees. Second, extending the above approach with a simulation-guided on-demand variant of VI, we present an anytime algorithm that is able to deal with very large models. (3) Finally, we present experimental results showing that our methods significantly outperform the standard approaches on several benchmarks

arXiv.org e-Print Archive

Crossref

Lancaster E-Prints

Giant cardiac tumours in the newborn: an unusual image

Author: Ciemny S.
Kozłowski D.
Kwiatkowska J.
Publication venue: 'VM Media SP. zo.o VM Group SK'
Publication date: 05/09/2018
Field of study

Primary heart tumours in the paediatric population are very rare and they range from 0.01% to 0.04%. Most are benign lesions of which about half are rhabdomyomas. Rhabdomyoma tumour diagnosis is associated with a 75–80% risk of tuberous sclerosis complex (TSC). TSC are characterised with numerous changes of hamartoma-type located in the brain, kidneys, skin and other organs including the heart. More than two-thirds of newborns with TSC present rhabdomyomas in the heart. These changes may be asymptomatic, but in some cases they may cause heart failure, arrhythmias and death. We present a case report of an infant with giant rhabdomyoma tumours in the course of TSC

Via Medica Journals

Fermi surface of an important nano-sized metastable phase: Al $_3$ Li

Author: Alam M. A.
Dugdale S. B.
Kwiatkowska J.
Laverock J.
Roussenova M. V.
Shiotani N.
Wensley J.
Publication venue: 'American Physical Society (APS)'
Publication date: 12/09/2010
Field of study

Nanoscale particles embedded in a metallic matrix are of considerable interest as a route towards identifying and tailoring material properties. We present a detailed investigation of the electronic structure, and in particular the Fermi surface, of a nanoscale phase (

L1_2

_3

Li) that has so far been inaccessible with conventional techniques, despite playing a key role in determining the favorable material properties of the alloy (Al\nobreakdash-9 at. %\nobreakdash-Li). The ordered precipitates only form within the stabilizing Al matrix and do not exist in the bulk; here, we take advantage of the strong positron affinity of Li to directly probe the Fermi surface of Al

_3

Li. Through comparison with band structure calculations, we demonstrate that the positron uniquely probes these precipitates, and present a 'tuned' Fermi surface for this elusive phase

arXiv.org e-Print Archive

Explore Bristol Research

Verification and Control of Partially Observable Probabilistic Real-Time Systems

Author: B Finkbeiner
C Baier
C Baier
F Cassez
G Behrmann
G Norman
G Shani
M Kang
M Kwiatkowska
M Kwiatkowska
O Madani
P Bouyer
P Bouyer
P Černý
R Alur
S Giro
TA Henzinger
W Lovejoy
Publication venue
Publication date: 01/01/2015
Field of study

We propose automated techniques for the verification and control of probabilistic real-time systems that are only partially observable. To formally model such systems, we define an extension of probabilistic timed automata in which local states are partially visible to an observer or controller. We give a probabilistic temporal logic that can express a range of quantitative properties of these models, relating to the probability of an event's occurrence or the expected value of a reward measure. We then propose techniques to either verify that such a property holds or to synthesise a controller for the model which makes it true. Our approach is based on an integer discretisation of the model's dense-time behaviour and a grid-based abstraction of the uncountable belief space induced by partial observability. The latter is necessarily approximate since the underlying problem is undecidable, however we show how both lower and upper bounds on numerical results can be generated. We illustrate the effectiveness of the approach by implementing it in the PRISM model checker and applying it to several case studies, from the domains of computer security and task scheduling

arXiv.org e-Print Archive

CiteSeerX

Crossref

University of Birmingham Research Portal

Enlighten