5 research outputs found
Synthetic and Real Inputs for Tool Segmentation in Robotic Surgery
Semantic tool segmentation in surgical videos is important for surgical scene
understanding and computer-assisted interventions as well as for the
development of robotic automation. The problem is challenging because different
illumination conditions, bleeding, smoke and occlusions can reduce algorithm
robustness. At present labelled data for training deep learning models is still
lacking for semantic surgical instrument segmentation and in this paper we show
that it may be possible to use robot kinematic data coupled with laparoscopic
images to alleviate the labelling problem. We propose a new deep learning based
model for parallel processing of both laparoscopic and simulation images for
robust segmentation of surgical tools. Due to the lack of laparoscopic frames
annotated with both segmentation ground truth and kinematic information a new
custom dataset was generated using the da Vinci Research Kit (dVRK) and is made
available
Synthetic and Real Inputs for Tool Segmentation in Robotic Surgery
Semantic tool segmentation in surgical videos is important for surgical scene understanding and computer-assisted interventions as well as for the development of robotic automation. The problem is challenging because different illumination conditions, bleeding, smoke and occlusions can reduce algorithm robustness. At present labelled data for training deep learning models is still lacking for semantic surgical instrument segmentation and in this paper we show that it may be possible to use robot kinematic data coupled with laparoscopic images to alleviate the labelling problem. We propose a new deep learning based model for parallel processing of both laparoscopic and simulation images for robust segmentation of surgical tools. Due to the lack of laparoscopic frames annotated with both segmentation ground truth and kinematic information a new custom dataset was generated using the da Vinci Research Kit (dVRK) and is made available
HAPNet: hierarchically aggregated pyramid network for real-time stereo matching
©Recovering the 3D shape of the surgical site is crucial for multiple computer-assisted interventions. Stereo endoscopes can be used to compute 3D depth but computational stereo is a challenging, non-convex and inherently discontinuous optimisation problem. In this paper, we propose a deep learning architecture which avoids the explicit construction of a cost volume of similarity which is one of the most computationally costly blocks of stereo algorithms. This makes training our network significantly more efficient and avoids the needs for large memory allocation. Our method performs well, especially around regions comprising multiple discontinuities around surgical instrumentation or around complex small structures and instruments. The method compares well to the state-of-the-art techniques while taking a different methodological angle to computational stereo problem in surgical video