8,705 research outputs found
Learning scalable and transferable multi-robot/machine sequential assignment planning via graph embedding
Can the success of reinforcement learning methods for simple combinatorial
optimization problems be extended to multi-robot sequential assignment
planning? In addition to the challenge of achieving near-optimal performance in
large problems, transferability to an unseen number of robots and tasks is
another key challenge for real-world applications. In this paper, we suggest a
method that achieves the first success in both challenges for robot/machine
scheduling problems.
Our method comprises of three components. First, we show a robot scheduling
problem can be expressed as a random probabilistic graphical model (PGM). We
develop a mean-field inference method for random PGM and use it for Q-function
inference. Second, we show that transferability can be achieved by carefully
designing two-step sequential encoding of problem state. Third, we resolve the
computational scalability issue of fitted Q-iteration by suggesting a heuristic
auction-based Q-iteration fitting method enabled by transferability we
achieved.
We apply our method to discrete-time, discrete space problems (Multi-Robot
Reward Collection (MRRC)) and scalably achieve 97% optimality with
transferability. This optimality is maintained under stochastic contexts. By
extending our method to continuous time, continuous space formulation, we claim
to be the first learning-based method with scalable performance among
multi-machine scheduling problems; our method scalability achieves comparable
performance to popular metaheuristics in Identical parallel machine scheduling
(IPMS) problems
Robust Environmental Mapping by Mobile Sensor Networks
Constructing a spatial map of environmental parameters is a crucial step to
preventing hazardous chemical leakages, forest fires, or while estimating a
spatially distributed physical quantities such as terrain elevation. Although
prior methods can do such mapping tasks efficiently via dispatching a group of
autonomous agents, they are unable to ensure satisfactory convergence to the
underlying ground truth distribution in a decentralized manner when any of the
agents fail. Since the types of agents utilized to perform such mapping are
typically inexpensive and prone to failure, this results in poor overall
mapping performance in real-world applications, which can in certain cases
endanger human safety. This paper presents a Bayesian approach for robust
spatial mapping of environmental parameters by deploying a group of mobile
robots capable of ad-hoc communication equipped with short-range sensors in the
presence of hardware failures. Our approach first utilizes a variant of the
Voronoi diagram to partition the region to be mapped into disjoint regions that
are each associated with at least one robot. These robots are then deployed in
a decentralized manner to maximize the likelihood that at least one robot
detects every target in their associated region despite a non-zero probability
of failure. A suite of simulation results is presented to demonstrate the
effectiveness and robustness of the proposed method when compared to existing
techniques.Comment: accepted to icra 201
Decentralised Coordination in RoboCup Rescue
Emergency responders are faced with a number of significant challenges when managing major disasters. First, the number of rescue tasks posed is usually larger than the number of responders (or agents) and the resources available to them. Second, each task is likely to require a different level of effort in order to be completed by its deadline. Third, new tasks may continually appear or disappear from the environment, thus requiring the responders to quickly recompute their allocation of resources. Fourth, forming teams or coalitions of multiple agents from different agencies is vital since no single agency will have all the resources needed to save victims, unblock roads, and extinguish the ?res which might erupt in the disaster space. Given this, coalitions have to be efficiently selected and scheduled to work across the disaster space so as to maximise the number of lives and the portion of the infrastructure saved. In particular, it is important that the selection of such coalitions should be performed in a decentralised fashion in order to avoid a single point of failure in the system. Moreover, it is critical that responders communicate only locally given they are likely to have limited battery power or minimal access to long range communication devices. Against this background, we provide a novel decentralised solution to the coalition formation process that pervades disaster management. More specifically, we model the emergency management scenario defined in the RoboCup Rescue disaster simulation platform as a Coalition Formation with Spatial and Temporal constraints (CFST) problem where agents form coalitions in order to complete tasks, each with different demands. In order to design a decentralised algorithm for CFST we formulate it as a Distributed Constraint Optimisation problem and show how to solve it using the state-of-the-art Max-Sum algorithm that provides a completely decentralised message-passing solution. We then provide a novel algorithm (F-Max-Sum) that avoids sending redundant messages and efficiently adapts to changes in the environment. In empirical evaluations, our algorithm is shown to generate better solutions than other decentralised algorithms used for this problem
A Decentralized Control Framework for Energy-Optimal Goal Assignment and Trajectory Generation
This paper proposes a decentralized approach for solving the problem of
moving a swarm of agents into a desired formation. We propose a decentralized
assignment algorithm which prescribes goals to each agent using only local
information. The assignment results are then used to generate energy-optimal
trajectories for each agent which have guaranteed collision avoidance through
safety constraints. We present the conditions for optimality and discuss the
robustness of the solution. The efficacy of the proposed approach is validated
through a numerical case study to characterize the framework's performance on a
set of dynamic goals.Comment: 6 pages, 3 figures, to appear at the 2019 Conference on Decision and
Control, Nice, F
Decentralized MPC based Obstacle Avoidance for Multi-Robot Target Tracking Scenarios
In this work, we consider the problem of decentralized multi-robot target
tracking and obstacle avoidance in dynamic environments. Each robot executes a
local motion planning algorithm which is based on model predictive control
(MPC). The planner is designed as a quadratic program, subject to constraints
on robot dynamics and obstacle avoidance. Repulsive potential field functions
are employed to avoid obstacles. The novelty of our approach lies in embedding
these non-linear potential field functions as constraints within a convex
optimization framework. Our method convexifies non-convex constraints and
dependencies, by replacing them as pre-computed external input forces in robot
dynamics. The proposed algorithm additionally incorporates different methods to
avoid field local minima problems associated with using potential field
functions in planning. The motion planner does not enforce predefined
trajectories or any formation geometry on the robots and is a comprehensive
solution for cooperative obstacle avoidance in the context of multi-robot
target tracking. We perform simulation studies in different environmental
scenarios to showcase the convergence and efficacy of the proposed algorithm.
Video of simulation studies: \url{https://youtu.be/umkdm82Tt0M
- …