9,411 research outputs found
Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning
Developing a safe and efficient collision avoidance policy for multiple
robots is challenging in the decentralized scenarios where each robot generate
its paths without observing other robots' states and intents. While other
distributed multi-robot collision avoidance systems exist, they often require
extracting agent-level features to plan a local collision-free action, which
can be computationally prohibitive and not robust. More importantly, in
practice the performance of these methods are much lower than their centralized
counterparts.
We present a decentralized sensor-level collision avoidance policy for
multi-robot systems, which directly maps raw sensor measurements to an agent's
steering commands in terms of movement velocity. As a first step toward
reducing the performance gap between decentralized and centralized methods, we
present a multi-scenario multi-stage training framework to find an optimal
policy which is trained over a large number of robots on rich, complex
environments simultaneously using a policy gradient based reinforcement
learning algorithm. We validate the learned sensor-level collision avoidance
policy in a variety of simulated scenarios with thorough performance
evaluations and show that the final learned policy is able to find time
efficient, collision-free paths for a large-scale robot system. We also
demonstrate that the learned policy can be well generalized to new scenarios
that do not appear in the entire training period, including navigating a
heterogeneous group of robots and a large-scale scenario with 100 robots.
Videos are available at https://sites.google.com/view/drlmac
Human Motion Trajectory Prediction: A Survey
With growing numbers of intelligent autonomous systems in human environments,
the ability of such systems to perceive, understand and anticipate human
behavior becomes increasingly important. Specifically, predicting future
positions of dynamic agents and planning considering such predictions are key
tasks for self-driving vehicles, service robots and advanced surveillance
systems. This paper provides a survey of human motion trajectory prediction. We
review, analyze and structure a large selection of work from different
communities and propose a taxonomy that categorizes existing methods based on
the motion modeling approach and level of contextual information used. We
provide an overview of the existing datasets and performance metrics. We
discuss limitations of the state of the art and outline directions for further
research.Comment: Submitted to the International Journal of Robotics Research (IJRR),
37 page
Decoupled Sampling-Based Motion Planning for Multiple Autonomous Marine Vehicles
There is increasing interest in the deployment and operation of multiple autonomous marine vehicles (AMVs) for a number of challenging scientific and commercial operational mission scenarios. Some of the missions, such as geotechnical surveying and 3D marine habitat mapping, require that a number of heterogeneous vehicles operate simultaneously in small areas, often in close proximity of each other. In these circumstances safety, reliability, and efficient multiple vehicle operation are key ingredients for mission success. Additionally, the deployment and operation of multiple AMVs at sea are extremely costly in terms of the logistics and human resources required for mission supervision, often during extended periods of time. These costs can be greatly minimized by automating the deployment and initial steering of a vehicle fleet to a predetermined configuration, in preparation for the ensuing mission, taking into account operational constraints. This is one of the core issues addressed in the scope of the Widely Scalable Mobile Underwater Sonar Technology project (WiMUST), an EU Horizon 2020 initiative for underwater robotics research. WiMUST uses a team of cooperative autonomous ma- rine robots, some of which towing streamers equipped with hydrophones, acting as intelligent sensing and communicat- ing nodes of a reconfigurable moving acoustic network. In WiMUST, the AMVs maintain a fixed geometric formation through cooperative navigation and motion control. Formation initialization requires that all the AMVs start from scattered positions in the water and maneuver so as to arrive at required target configuration points at the same time in a completely au- tomatic manner. This paper describes the decoupled prioritized vehicle motion planner developed in the scope of WiMUST that, together with an existing system for trajectory tracking, affords a fleet of vehicles the above capabilities, while ensuring inter- vehicle collision and streamer entanglement avoidance. Tests with a fleet of seven marine vehicles show the efficacy of the system planner developed.Peer reviewe
A Real-Time Solver For Time-Optimal Control Of Omnidirectional Robots with Bounded Acceleration
We are interested in the problem of time-optimal control of omnidirectional
robots with bounded acceleration (TOC-ORBA). While there exist approximate
solutions for such robots, and exact solutions with unbounded acceleration,
exact solvers to the TOC-ORBA problem have remained elusive until now. In this
paper, we present a real-time solver for true time-optimal control of
omnidirectional robots with bounded acceleration. We first derive the general
parameterized form of the solution to the TOC-ORBA problem by application of
Pontryagin's maximum principle. We then frame the boundary value problem of
TOC-ORBA as an optimization problem over the parametrized control space. To
overcome local minima and poor initial guesses to the optimization problem, we
introduce a two-stage optimal control solver (TSOCS): The first stage computes
an upper bound to the total time for the TOC-ORBA problem and holds the time
constant while optimizing the parameters of the trajectory to approach the
boundary value conditions. The second stage uses the parameters found by the
first stage, and relaxes the constraint on the total time to solve for the
parameters of the complete TOC-ORBA problem. We further implement TSOCS as a
closed loop controller to overcome actuation errors on real robots in
real-time. We empirically demonstrate the effectiveness of TSOCS in simulation
and on real robots, showing that 1) it runs in real time, generating solutions
in less than 0.5ms on average; 2) it generates faster trajectories compared to
an approximate solver; and 3) it is able to solve TOC-ORBA problems with
non-zero final velocities that were previously unsolvable in real-time
- …