Physical Deep Reinforcement Learning Towards Safety Guarantee

Caccamo, Marco; Cao, Hongpeng; Mao, Yanbing; Sha, Lui

Physical Deep Reinforcement Learning Towards Safety Guarantee

Authors: Marco Caccamo
Hongpeng Cao
Yanbing Mao
Lui Sha
Publication date: 29 March 2023
Publisher

Abstract

Deep reinforcement learning (DRL) has achieved tremendous success in many complex decision-making tasks of autonomous systems with high-dimensional state and/or action spaces. However, the safety and stability still remain major concerns that hinder the applications of DRL to safety-critical autonomous systems. To address the concerns, we proposed the Phy-DRL: a physical deep reinforcement learning framework. The Phy-DRL is novel in two architectural designs: i) Lyapunov-like reward, and ii) residual control (i.e., integration of physics-model-based control and data-driven control). The concurrent physical reward and residual control empower the Phy-DRL the (mathematically) provable safety and stability guarantees. Through experiments on the inverted pendulum, we show that the Phy-DRL features guaranteed safety and stability and enhanced robustness, while offering remarkably accelerated training and enlarged reward.Comment: Working Pape

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2303.16860

Last time updated on 02/04/2023