3,980 research outputs found
Learning for Multi-robot Cooperation in Partially Observable Stochastic Environments with Macro-actions
This paper presents a data-driven approach for multi-robot coordination in
partially-observable domains based on Decentralized Partially Observable Markov
Decision Processes (Dec-POMDPs) and macro-actions (MAs). Dec-POMDPs provide a
general framework for cooperative sequential decision making under uncertainty
and MAs allow temporally extended and asynchronous action execution. To date,
most methods assume the underlying Dec-POMDP model is known a priori or a full
simulator is available during planning time. Previous methods which aim to
address these issues suffer from local optimality and sensitivity to initial
conditions. Additionally, few hardware demonstrations involving a large team of
heterogeneous robots and with long planning horizons exist. This work addresses
these gaps by proposing an iterative sampling based Expectation-Maximization
algorithm (iSEM) to learn polices using only trajectory data containing
observations, MAs, and rewards. Our experiments show the algorithm is able to
achieve better solution quality than the state-of-the-art learning-based
methods. We implement two variants of multi-robot Search and Rescue (SAR)
domains (with and without obstacles) on hardware to demonstrate the learned
policies can effectively control a team of distributed robots to cooperate in a
partially observable stochastic environment.Comment: Accepted to the 2017 IEEE/RSJ International Conference on Intelligent
Robots and Systems (IROS 2017
Adoption of vehicular ad hoc networking protocols by networked robots
This paper focuses on the utilization of wireless networking in the robotics domain. Many researchers have already equipped their robots with wireless communication capabilities, stimulated by the observation that multi-robot systems tend to have several advantages over their single-robot counterparts. Typically, this integration of wireless communication is tackled in a quite pragmatic manner, only a few authors presented novel Robotic Ad Hoc Network (RANET) protocols that were designed specifically with robotic use cases in mind. This is in sharp contrast with the domain of vehicular ad hoc networks (VANET). This observation is the starting point of this paper. If the results of previous efforts focusing on VANET protocols could be reused in the RANET domain, this could lead to rapid progress in the field of networked robots. To investigate this possibility, this paper provides a thorough overview of the related work in the domain of robotic and vehicular ad hoc networks. Based on this information, an exhaustive list of requirements is defined for both types. It is concluded that the most significant difference lies in the fact that VANET protocols are oriented towards low throughput messaging, while RANET protocols have to support high throughput media streaming as well. Although not always with equal importance, all other defined requirements are valid for both protocols. This leads to the conclusion that cross-fertilization between them is an appealing approach for future RANET research. To support such developments, this paper concludes with the definition of an appropriate working plan
A Novel Airborne Self-organising Architecture for 5G+ Networks
Network Flying Platforms (NFPs) such as unmanned aerial vehicles, unmanned
balloons or drones flying at low/medium/high altitude can be employed to
enhance network coverage and capacity by deploying a swarm of flying platforms
that implement novel radio resource management techniques. In this paper, we
propose a novel layered architecture where NFPs, of various types and flying at
low/medium/high layers in a swarm of flying platforms, are considered as an
integrated part of the future cellular networks to inject additional capacity
and expand the coverage for exceptional scenarios (sports events, concerts,
etc.) and hard-to-reach areas (rural or sparsely populated areas). Successful
roll-out of the proposed architecture depends on several factors including, but
are not limited to: network optimisation for NFP placement and association,
safety operations of NFP for network/equipment security, and reliability for
NFP transport and control/signaling mechanisms. In this work, we formulate the
optimum placement of NFP at a Lower Layer (LL) by exploiting the airborne
Self-organising Network (SON) features. Our initial simulations show the NFP-LL
can serve more User Equipment (UE)s using this placement technique.Comment: 5 pages, 2 figures, conference paper in IEEE VTC-Fall 2017, in
Proceedings IEEE Vehicular Technology Conference (VTC-Fall 2017), Toronto,
Canada, Sep. 201
,The Impact of Human-Automation Collaboration in Decentralized Multiple Unmanned Vehicle Control
For future systems that require one or a small team of operators to supervise a network of automated agents, automated planners are critical since they are faster than humans for path planning and resource allocation in multivariate, dynamic, time-pressured environments. However, such planners
can be brittle and unable to respond to emergent events. Human operators can aid such systems by bringing their knowledge-based reasoning and experience to bear. Given a decentralized task planner and a goal-based operator interface for a network of unmanned vehicles in a search, track, and
neutralize mission, we demonstrate with a human-on-the-loop experiment that humans guiding these decentralized planners improved system performance by up to 50%. However, those tasks that required precise and rapid calculations were not significantly improved with human aid. Thus, there is a shared space in such complex missions for human–automation
collaboration
Nonlinear Model Predictive Control for Multi-Micro Aerial Vehicle Robust Collision Avoidance
Multiple multirotor Micro Aerial Vehicles sharing the same airspace require a
reliable and robust collision avoidance technique. In this paper we address the
problem of multi-MAV reactive collision avoidance. A model-based controller is
employed to achieve simultaneously reference trajectory tracking and collision
avoidance. Moreover, we also account for the uncertainty of the state estimator
and the other agents position and velocity uncertainties to achieve a higher
degree of robustness. The proposed approach is decentralized, does not require
collision-free reference trajectory and accounts for the full MAV dynamics. We
validated our approach in simulation and experimentally.Comment: Video available on: https://www.youtube.com/watch?v=Ot76i9p2ZZo&t=40
- …