32,546 research outputs found

Reinforcement learning based local search for grouping problems: A case study on graph coloring

Authors: Duval Béatrice; Hao Jin-Kao; Zhou Yangming
Publication date: 01/01/2016

Grouping problems aim to partition a set of items into multiple mutually disjoint subsets according to some specific criterion and constraints. Grouping problems cover a large class of important combinatorial optimization problems that are generally computationally difficult. In this paper, we propose a general solution approach for grouping problems, i.e., reinforcement learning based local search (RLS), which combines reinforcement learning techniques with descent-based local search. The viability of the proposed approach is verified on a well-known representative grouping problem (graph coloring) where a very simple descent-based coloring algorithm is applied. Experimental studies on popular DIMACS and COLOR02 benchmark graphs indicate that RLS achieves competitive performance compared to a number of well-known coloring algorithms.

arXiv.org e-Print Archive

Okina

Probably Approximately Correct Nash Equilibrium Learning

Authors: Fele Filiberto; Margellos Kostas
Publication date: 01/01/2020

We consider a multi-agent noncooperative game with agents' objective functions being affected by uncertainty. Following a data-driven paradigm, we represent uncertainty by means of scenarios and seek a robust Nash equilibrium solution. We treat the Nash equilibrium computation problem within the realm of probably approximately correct (PAC) learning. Building upon recent developments in scenario-based optimization, we accompany the computed Nash equilibrium with a priori and a posteriori probabilistic robustness certificates, providing confidence that the computed equilibrium remains unaffected (in probabilistic terms) when a new uncertainty realization is encountered. For a wide class of games, we also show that the so-called compression set, a key concept in scenario-based optimization, can be obtained directly as a byproduct of the proposed solution methodology. Finally, we illustrate how to overcome differentiability issues, arising due to the introduction of scenarios, and compute a Nash equilibrium solution in a decentralized manner. We demonstrate the efficacy of the proposed approach on an electric vehicle charging control problem. Comment: Preprint submitted to IEEE Transactions on Automatic Control

arXiv.org e-Print Archive

Oxford University Research Archive

Engineering failure analysis and design optimisation with HiP-HOPS

Authors: Grätz Uwe; Hamann Rainer; Lien Rune; Papadopoulos Yiannis; Parker David; Rüde Erich; Uhlig Andreas; Walker Martin
Publication venue: Elsevier BV
Publication date: 02/10/2010

The scale and complexity of computer-based safety-critical systems, like those used in the transport and manufacturing industries, pose significant challenges for failure analysis. Over the last decade, research has focused on automating this task. In one approach, predictive models of system failure are constructed from the topology of the system and local component failure models using a process of composition. An alternative approach employs model-checking of state automata to study the effects of failure and verify system safety properties. In this paper, we discuss these two approaches to failure analysis. We then focus on Hierarchically Performed Hazard Origin & Propagation Studies (HiP-HOPS), one of the more advanced compositional approaches, and discuss its capabilities for automatic synthesis of fault trees, combinatorial Failure Modes and Effects Analyses, and reliability versus cost optimisation of systems via application of automatic model transformations. We summarise these contributions and demonstrate the application of HiP-HOPS on a simplified fuel oil system for a ship engine. In light of this example, we discuss strengths and limitations of the method in relation to other state-of-the-art techniques. In particular, because HiP-HOPS is deductive in nature, relating system failures back to their causes, it is less prone to combinatorial explosion and can more readily be iterated. For this reason, it enables exhaustive assessment of combinations of failures and design optimisation using computationally expensive meta-heuristics. © 2010 Elsevier Ltd. All rights reserved.

Repository@Hull - Worktribe

Crossref

Equilibria-based Probabilistic Model Checking for Concurrent Stochastic Games

Authors: A Bianco; A Toumi; C Dehnert; C Lemke; D Fernando; D Lozovanu; E Kelmendi; H Hansson; J Gutierrez; J Kemeny; J Pacheco; J von Neumann; K Chatterjee; L de Alfaro; L de Moura; L Shapley; M Kwiatkowska; M Osborne; N Basset; N Nisan; P Čermák; R Alur; R Brenguier; S Haddad; T Chen; U Schwalbe
Publication date: 01/01/2019

Probabilistic model checking for stochastic games enables formal verification of systems that comprise competing or collaborating entities operating in a stochastic environment. Despite good progress in the area, existing approaches focus on zero-sum goals and cannot reason about scenarios where entities are endowed with different objectives. In this paper, we propose probabilistic model checking techniques for concurrent stochastic games based on Nash equilibria. We extend the temporal logic rPATL (probabilistic alternating-time temporal logic with rewards) to allow reasoning about players with distinct quantitative goals, which capture either the probability of an event occurring or a reward measure. We present algorithms to synthesise strategies that are subgame-perfect social welfare optimal Nash equilibria, i.e., where there is no incentive for any player to unilaterally change their strategy in any state of the game, whilst the combined probabilities or rewards are maximised. We implement our techniques in the PRISM-games tool and apply them to several case studies, including network protocols and robot navigation, showing the benefits compared to existing approaches.

arXiv.org e-Print Archive

Crossref

University of Birmingham Research Portal

Oxford University Research Archive

Enlighten