Constrained Reinforcement Learning and Formal Verification for Safe
  Colonoscopy Navigation

Casals, Alicia; Corsi, Davide; Dall'Alba, Diego; Farinelli, Alessandro; Fiorini, Paolo; Marzari, Luca; Pore, Ameya

Constrained Reinforcement Learning and Formal Verification for Safe Colonoscopy Navigation

Authors: Alicia Casals
Davide Corsi
Diego Dall'Alba
Alessandro Farinelli
Paolo Fiorini
Luca Marzari
Ameya Pore
Publication date: 16 August 2023
Publisher

Abstract

The field of robotic Flexible Endoscopes (FEs) has progressed significantly, offering a promising solution to reduce patient discomfort. However, the limited autonomy of most robotic FEs results in non-intuitive and challenging manoeuvres, constraining their application in clinical settings. While previous studies have employed lumen tracking for autonomous navigation, they fail to adapt to the presence of obstructions and sharp turns when the endoscope faces the colon wall. In this work, we propose a Deep Reinforcement Learning (DRL)-based navigation strategy that eliminates the need for lumen tracking. However, the use of DRL methods poses safety risks as they do not account for potential hazards associated with the actions taken. To ensure safety, we exploit a Constrained Reinforcement Learning (CRL) method to restrict the policy in a predefined safety regime. Moreover, we present a model selection strategy that utilises Formal Verification (FV) to choose a policy that is entirely safe before deployment. We validate our approach in a virtual colonoscopy environment and report that out of the 300 trained policies, we could identify three policies that are entirely safe. Our work demonstrates that CRL, combined with model selection through FV, can improve the robustness and safety of robotic behaviour in surgical applications.Comment: Accepted in the IEEE International Conference on Intelligent Robots and Systems (IROS), 2023. [Corsi, Marzari and Pore contributed equally

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2303.03207

Last time updated on 28/03/2023