Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving

Author: Haesaert Sofie
Ma Zhiqiang
Sun Zhiyong
Wu Lin-Chi
Zhang Zengjie
Publication venue: arXiv.org
Publication date: 25/08/2023

Reinforcement learning (RL) is an effective approach to motion planning in autonomous driving, where an optimal driving policy can be learned automatically from interaction data with the environment. Nevertheless, the reward function for an RL agent, which is critical to its performance, is challenging to determine. Conventional work mainly focuses on rewarding safe driving states but does not incorporate awareness of risky driving behaviors. In this paper, we investigate how to use risk-aware reward shaping to improve the training and test performance of RL agents in autonomous driving. Based on the essential requirements that prescribe the safety specifications for general autonomous driving in practice, we propose additional reshaped reward terms that encourage exploration and penalize risky driving behaviors. A simulation study in OpenAI Gym indicates the advantage of risk-aware reward shaping for various RL agents. We also point out that proximal policy optimization (PPO) is likely to be the best RL method to combine with risk-aware reward shaping.
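To make the idea of reshaped reward terms concrete, here is a minimal sketch of how an exploration bonus and a risk penalty might be added to an environment's base reward. It is not the paper's formulation: the abstract does not give the actual terms, so the `DrivingState` fields, the time-gap risk measure, and all weights below are hypothetical choices made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class DrivingState:
    """Hypothetical per-step summary; every field is an illustrative assumption."""
    speed: float            # forward speed in m/s
    gap_to_obstacle: float  # distance to the nearest obstacle ahead in m
    progress: float         # distance advanced along the route this step in m


def shaped_reward(base_reward: float,
                  state: DrivingState,
                  explore_weight: float = 0.1,
                  risk_weight: float = 1.0,
                  safe_time_gap: float = 2.0) -> float:
    """Return base reward + exploration bonus - risk penalty (assumed form)."""
    # Exploration term: reward forward progress so the agent does not
    # learn to stand still just to avoid penalties.
    explore_bonus = explore_weight * max(state.progress, 0.0)

    # Risk term: penalize the shortfall of the time gap to the obstacle
    # below an assumed safe threshold. A stationary vehicle incurs no penalty.
    if state.speed > 0.0:
        time_gap = state.gap_to_obstacle / state.speed
        shortfall = max(safe_time_gap - time_gap, 0.0)
    else:
        shortfall = 0.0
    risk_penalty = risk_weight * shortfall

    return base_reward + explore_bonus - risk_penalty


# Same progress and base reward, but the second state follows too closely.
safe = DrivingState(speed=10.0, gap_to_obstacle=50.0, progress=1.0)
risky = DrivingState(speed=10.0, gap_to_obstacle=10.0, progress=1.0)
r_safe = shaped_reward(1.0, safe)
r_risky = shaped_reward(1.0, risky)
```

With these assumed weights, the risky state earns a lower reward than the safe one although both make the same progress. That ordering is what a risk penalty is meant to produce.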

arXiv.org e-Print Archive

Pure OAI Repository

NeBula: Team CoSTAR's robotic autonomy solution that won phase II of DARPA Subterranean Challenge

Author: Agha-mohammadi Ali-akbar
Alatur Nikhilesh
Anderson Matthew
Bartlett Tara
Beltrame Giovanni
Bergh Chuck
Bouman Amanda
Burdick Joel
Buscicchio Alessandro
Carlone Luca
Cauligi Abhishek
Chang Yung
Choi Hyungho Chris
Chávez Fernando
Correa Gustavo J.
Daftry Shreyansh
Dixit Anushri
Ebadi Kamak
Edlund Jeffrey A.
Fakoorian Seyed
Fan David D.
Feras Micah
Funabiki Nobuhiro
Gao Jay
Ginting Fadhil
Harper Scott
Hatteland Alexander
Heiden Eric
Heywood Tristan
Jung Sunggoo
Kalantari Arash
Kanellakis Christoforos
Kaufmann Marcel
Kim Leon
Kim Sung-Kyun
Kim Taeyeon
Kramer Andrew
Lee Carlyn
Lee Hanseob
Lei Xianmei
Leopold Henry A.
Lew Thomas
López Brett
Maldonado-Contreras Jairo
Mayo John
Melikyan Hov
Merewether Gene
Miles Gregory
Morrell Benjamin
Nash Jeremy
Nikolakopoulos George
Otsu Kyohei
Pailevanian Torkom
Palieri Matteo
Ramtoula Benjamin
Saboia María
Salhotra Gautam
Santamaria-Navarro Àngel
Shim David
Stegun Vaquero Tiago
Stephens Alex
Tagliabue Andrea
Tepsuporn Scott
Terry Edward
Thakker Rohan
Thakur Abhishek
Tordesillas Jesús
Touma Thomas
Toupet Olivier
Walsh William
Wee Inhwan
Wolf Michael
Publication venue: John Wiley & Sons
Publication date: 01/01/2022

This paper presents and discusses algorithms, hardware, and software architecture developed by Team CoSTAR (Collaborative SubTerranean Autonomous Robots), competing in the DARPA Subterranean Challenge. Specifically, it presents the techniques utilized within the Tunnel (2019) and Urban (2020) competitions, where CoSTAR achieved second and first place, respectively. We also discuss CoSTAR's demonstrations in Martian-analog surface and subsurface (lava tubes) exploration. The paper introduces our autonomy solution, referred to as NeBula (Networked Belief-aware Perceptual Autonomy). NeBula is an uncertainty-aware framework that aims at enabling resilient and modular autonomy solutions by performing reasoning and decision making in the belief space (the space of probability distributions over the robot and world states). We discuss various components of the NeBula framework, including (i) geometric and semantic environment mapping, (ii) a multi-modal positioning system, (iii) traversability analysis and local planning, (iv) global motion planning and exploration behavior, (v) risk-aware mission planning, (vi) networking and decentralized reasoning, and (vii) learning-enabled adaptation. We discuss the performance of NeBula on several robot types (e.g., wheeled, legged, flying) in various environments. We discuss the specific results and lessons learned from fielding this solution in the challenging courses of the DARPA Subterranean Challenge competition. The work is partially supported by the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration (80NM0018D0004), and the Defense Advanced Research Projects Agency (DARPA).
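As a small illustration of what reasoning in the belief space means, the sketch below keeps a probability distribution over a discrete world state and updates it with one observation using Bayes' rule. This is a generic textbook update, not NeBula's implementation. The state names, the likelihood values, and the `bayes_update` helper are all hypothetical.

```python
from typing import Dict


def bayes_update(belief: Dict[str, float],
                 likelihood: Dict[str, float]) -> Dict[str, float]:
    """Return the posterior belief given P(observation | state) per state."""
    # Multiply prior by likelihood for each state, then renormalize.
    unnormalized = {s: p * likelihood.get(s, 0.0) for s, p in belief.items()}
    total = sum(unnormalized.values())
    if total == 0.0:
        raise ValueError("observation has zero probability under the belief")
    return {s: p / total for s, p in unnormalized.items()}


# Hypothetical example: is the passage ahead clear or blocked?
prior = {"clear": 0.5, "blocked": 0.5}
# Assumed sensor model: this reading is four times likelier if blocked.
reading_likelihood = {"clear": 0.2, "blocked": 0.8}
posterior = bayes_update(prior, reading_likelihood)
```

A planner working on the belief rather than on a single state estimate can weigh the remaining probability of "clear" against the cost of being wrong, which is the sense in which such a framework is uncertainty-aware.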

PolyPublie

Digital.CSIC

Exploring haptic interfacing with a mobile robot without visual feedback

Author: Jones Peter E.
Penders Jacques
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2012

Search and rescue scenarios are often complicated by low or no visibility conditions. The lack of visual feedback hampers orientation and causes significant stress for human rescue workers. The Guardians project [1] pioneered a group of autonomous mobile robots assisting a human rescue worker operating within close range. Trials were held with firefighters of South Yorkshire Fire and Rescue. It became clear that the subjects were by no means prepared to give up their procedural routines and the feeling of security these provide: they simply ignored instructions that contradicted their routines.

Sheffield Hallam University Research Archive