Search CORE

39,691 research outputs found

A Hierarchical Reinforcement Learning Method for Persistent Time-Sensitive Tasks

Author: Belta Calin
Li Xiao
Publication venue
Publication date: 01/01/2016
Field of study

Reinforcement learning has been applied to many interesting problems such as the famous TD-gammon and the inverted helicopter flight. However, little effort has been put into developing methods to learn policies for complex persistent tasks and tasks that are time-sensitive. In this paper, we take a step towards solving this problem by using signal temporal logic (STL) as task specification, and taking advantage of the temporal abstraction feature that the options framework provide. We show via simulation that a relatively easy to implement algorithm that combines STL and options can learn a satisfactory policy with a small number of training case

arXiv.org e-Print Archive

Boston University Institutional Repository (OpenBU)

A hierarchical reinforcement learning method for persistent time-sensitive tasks

Author: Belta Calin
Li Xiao
Publication venue
Publication date: 01/01/2016
Field of study

Boston University Institutional Repository (OpenBU)

TLA+ Proofs

Author: Damien Doligez
Damien Doligez
Damien Doligez
Daniel Ricketts
Daniel Ricketts
Denis Cousineau
Denis Cousineau
Denis Cousineau
Et Al
Leslie Lamport
Leslie Lamport
Leslie Lamport
Ricketts Hernán Vanzetto
Stephan Merz
Stephan Merz
Stephan Merz
Publication venue
Publication date: 01/01/2012
Field of study

TLA+ is a specification language based on standard set theory and temporal logic that has constructs for hierarchical proofs. We describe how to write TLA+ proofs and check them with TLAPS, the TLA+ Proof System. We use Peterson's mutual exclusion algorithm as a simple example to describe the features of TLAPS and show how it and the Toolbox (an IDE for TLA+) help users to manage large, complex proofs.Comment: A shorter version of this article appeared in the proceedings of the conference Formal Methods 2012 (FM 2012, Paris, France, Springer LNCS 7436, pp. 147-154

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

INRIA a CCSD electronic archive server

HAL-Rennes 1

Reasoning About a Simulated Printer Case Investigation with Forensic Lucid

Author: E.A. Ashcroft
E.A. Ashcroft
E.A. Ashcroft
J. Paquet
J. Plaice
S.A. Mokhov
S.A. Mokhov
T. Rahilly
W.W. Wadge
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/10/2011
Field of study

In this work we model the ACME (a fictitious company name) "printer case incident" and make its specification in Forensic Lucid, a Lucid- and intensional-logic-based programming language for cyberforensic analysis and event reconstruction specification. The printer case involves a dispute between two parties that was previously solved using the finite-state automata (FSA) approach, and is now re-done in a more usable way in Forensic Lucid. Our simulation is based on the said case modeling by encoding concepts like evidence and the related witness accounts as an evidential statement context in a Forensic Lucid program, which is an input to the transition function that models the possible deductions in the case. We then invoke the transition function (actually its reverse) with the evidential statement context to see if the evidence we encoded agrees with one's claims and then attempt to reconstruct the sequence of events that may explain the claim or disprove it.Comment: 18 pages, 3 figures, 7 listings, TOC, index; this article closely relates to arXiv:0906.0049 and arXiv:0904.3789 but to remain stand-alone repeats some of the background and introductory content; abstract presented at HSC'09 and the full updated paper at ICDF2C'11. This is an updated/edited version after ICDF2C proceedings with more references and correction

arXiv.org e-Print Archive

Crossref

Concordia University Research Repository

Automata guided hierarchical reinforcement learning for zero-shot skill composition

Author: Belta Calin
Li Xiao
Ma Yao
Publication venue
Publication date: 01/01/2017
Field of study

An obstacle that prevents the wide adoption of (deep) reinforcement learning (RL) in control systems is its need for a large amount of interactions with the environment in order to master a skill. The learned skill usually generalizes poorly across domains and re-training is often necessary when presented with a new task. We present a framework that combines methods in formal methods with hierarchical reinforcement learning (HRL). The set of techniques we provide allows for convenient specification of tasks with complex logic, learn hierarchical policies (meta-controller and low-level controllers) with well-defined intrinsic rewards using any RL methods and is able to construct new skills from existing ones without additional learning. We evaluate the proposed methods in a simple grid world simulation as well as simulation on a Baxter robot

Boston University Institutional Repository (OpenBU)

Model-based dependability analysis : state-of-the-art, challenges and future outlook

Author: Kabir Sohag
Papadopoulos Yiannis
Sharvia Septavera
Walker Martin
Publication venue: 'Elsevier BV'
Publication date: 30/10/2015
Field of study

Abstract: Over the past two decades, the study of model-based dependability analysis has gathered significant research interest. Different approaches have been developed to automate and address various limitations of classical dependability techniques to contend with the increasing complexity and challenges of modern safety-critical system. Two leading paradigms have emerged, one which constructs predictive system failure models from component failure models compositionally using the topology of the system. The other utilizes design models - typically state automata - to explore system behaviour through fault injection. This paper reviews a number of prominent techniques under these two paradigms, and provides an insight into their working mechanism, applicability, strengths and challenges, as well as recent developments within these fields. We also discuss the emerging trends on integrated approaches and advanced analysis capabilities. Lastly, we outline the future outlook for model-based dependability analysis

Repository@Hull - Worktribe