9,411 research outputs found

    Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning

    Full text link
    Developing a safe and efficient collision avoidance policy for multiple robots is challenging in the decentralized scenarios where each robot generate its paths without observing other robots' states and intents. While other distributed multi-robot collision avoidance systems exist, they often require extracting agent-level features to plan a local collision-free action, which can be computationally prohibitive and not robust. More importantly, in practice the performance of these methods are much lower than their centralized counterparts. We present a decentralized sensor-level collision avoidance policy for multi-robot systems, which directly maps raw sensor measurements to an agent's steering commands in terms of movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we present a multi-scenario multi-stage training framework to find an optimal policy which is trained over a large number of robots on rich, complex environments simultaneously using a policy gradient based reinforcement learning algorithm. We validate the learned sensor-level collision avoidance policy in a variety of simulated scenarios with thorough performance evaluations and show that the final learned policy is able to find time efficient, collision-free paths for a large-scale robot system. We also demonstrate that the learned policy can be well generalized to new scenarios that do not appear in the entire training period, including navigating a heterogeneous group of robots and a large-scale scenario with 100 robots. Videos are available at https://sites.google.com/view/drlmac

    Human Motion Trajectory Prediction: A Survey

    Full text link
    With growing numbers of intelligent autonomous systems in human environments, the ability of such systems to perceive, understand and anticipate human behavior becomes increasingly important. Specifically, predicting future positions of dynamic agents and planning considering such predictions are key tasks for self-driving vehicles, service robots and advanced surveillance systems. This paper provides a survey of human motion trajectory prediction. We review, analyze and structure a large selection of work from different communities and propose a taxonomy that categorizes existing methods based on the motion modeling approach and level of contextual information used. We provide an overview of the existing datasets and performance metrics. We discuss limitations of the state of the art and outline directions for further research.Comment: Submitted to the International Journal of Robotics Research (IJRR), 37 page

    Decoupled Sampling-Based Motion Planning for Multiple Autonomous Marine Vehicles

    Get PDF
    There is increasing interest in the deployment and operation of multiple autonomous marine vehicles (AMVs) for a number of challenging scientific and commercial operational mission scenarios. Some of the missions, such as geotechnical surveying and 3D marine habitat mapping, require that a number of heterogeneous vehicles operate simultaneously in small areas, often in close proximity of each other. In these circumstances safety, reliability, and efficient multiple vehicle operation are key ingredients for mission success. Additionally, the deployment and operation of multiple AMVs at sea are extremely costly in terms of the logistics and human resources required for mission supervision, often during extended periods of time. These costs can be greatly minimized by automating the deployment and initial steering of a vehicle fleet to a predetermined configuration, in preparation for the ensuing mission, taking into account operational constraints. This is one of the core issues addressed in the scope of the Widely Scalable Mobile Underwater Sonar Technology project (WiMUST), an EU Horizon 2020 initiative for underwater robotics research. WiMUST uses a team of cooperative autonomous ma- rine robots, some of which towing streamers equipped with hydrophones, acting as intelligent sensing and communicat- ing nodes of a reconfigurable moving acoustic network. In WiMUST, the AMVs maintain a fixed geometric formation through cooperative navigation and motion control. Formation initialization requires that all the AMVs start from scattered positions in the water and maneuver so as to arrive at required target configuration points at the same time in a completely au- tomatic manner. This paper describes the decoupled prioritized vehicle motion planner developed in the scope of WiMUST that, together with an existing system for trajectory tracking, affords a fleet of vehicles the above capabilities, while ensuring inter- vehicle collision and streamer entanglement avoidance. Tests with a fleet of seven marine vehicles show the efficacy of the system planner developed.Peer reviewe

    A Real-Time Solver For Time-Optimal Control Of Omnidirectional Robots with Bounded Acceleration

    Full text link
    We are interested in the problem of time-optimal control of omnidirectional robots with bounded acceleration (TOC-ORBA). While there exist approximate solutions for such robots, and exact solutions with unbounded acceleration, exact solvers to the TOC-ORBA problem have remained elusive until now. In this paper, we present a real-time solver for true time-optimal control of omnidirectional robots with bounded acceleration. We first derive the general parameterized form of the solution to the TOC-ORBA problem by application of Pontryagin's maximum principle. We then frame the boundary value problem of TOC-ORBA as an optimization problem over the parametrized control space. To overcome local minima and poor initial guesses to the optimization problem, we introduce a two-stage optimal control solver (TSOCS): The first stage computes an upper bound to the total time for the TOC-ORBA problem and holds the time constant while optimizing the parameters of the trajectory to approach the boundary value conditions. The second stage uses the parameters found by the first stage, and relaxes the constraint on the total time to solve for the parameters of the complete TOC-ORBA problem. We further implement TSOCS as a closed loop controller to overcome actuation errors on real robots in real-time. We empirically demonstrate the effectiveness of TSOCS in simulation and on real robots, showing that 1) it runs in real time, generating solutions in less than 0.5ms on average; 2) it generates faster trajectories compared to an approximate solver; and 3) it is able to solve TOC-ORBA problems with non-zero final velocities that were previously unsolvable in real-time
    corecore