382 research outputs found
Surgical Phase Recognition of Short Video Shots Based on Temporal Modeling of Deep Features
Recognizing the phases of a laparoscopic surgery (LS) operation form its
video constitutes a fundamental step for efficient content representation,
indexing and retrieval in surgical video databases. In the literature, most
techniques focus on phase segmentation of the entire LS video using
hand-crafted visual features, instrument usage signals, and recently
convolutional neural networks (CNNs). In this paper we address the problem of
phase recognition of short video shots (10s) of the operation, without
utilizing information about the preceding/forthcoming video frames, their phase
labels or the instruments used. We investigate four state-of-the-art CNN
architectures (Alexnet, VGG19, GoogleNet, and ResNet101), for feature
extraction via transfer learning. Visual saliency was employed for selecting
the most informative region of the image as input to the CNN. Video shot
representation was based on two temporal pooling mechanisms. Most importantly,
we investigate the role of 'elapsed time' (from the beginning of the
operation), and we show that inclusion of this feature can increase performance
dramatically (69% vs. 75% mean accuracy). Finally, a long short-term memory
(LSTM) network was trained for video shot classification based on the fusion of
CNN features with 'elapsed time', increasing the accuracy to 86%. Our results
highlight the prominent role of visual saliency, long-range temporal recursion
and 'elapsed time' (a feature so far ignored), for surgical phase recognition.Comment: 6 pages, 4 figures, 6 table
Clinical Big Data and Deep Learning: Applications, Challenges, and Future Outlooks
The explosion of digital healthcare data has led to a surge of data-driven medical research based on machine learning. In recent years, as a powerful technique for big data, deep learning has gained a central position in machine learning circles for its great advantages in feature representation and pattern recognition. This article presents a comprehensive overview of studies that employ deep learning methods to deal with clinical data. Firstly, based on the analysis of the characteristics of clinical data, various types of clinical data (e.g., medical images, clinical notes, lab results, vital signs and demographic informatics) are discussed and details provided of some public clinical datasets. Secondly, a brief review of common deep learning models and their characteristics is conducted. Then, considering the wide range of clinical research and the diversity of data types, several deep learning applications for clinical data are illustrated: auxiliary diagnosis, prognosis, early warning, and other tasks. Although there are challenges involved in applying deep learning techniques to clinical data, it is still worthwhile to look forward to a promising future for deep learning applications in clinical big data in the direction of precision medicine
Learnable Weight Initialization for Volumetric Medical Image Segmentation
Hybrid volumetric medical image segmentation models, combining the advantages
of local convolution and global attention, have recently received considerable
attention. While mainly focusing on architectural modifications, most existing
hybrid approaches still use conventional data-independent weight initialization
schemes which restrict their performance due to ignoring the inherent
volumetric nature of the medical data. To address this issue, we propose a
learnable weight initialization approach that utilizes the available medical
training data to effectively learn the contextual and structural cues via the
proposed self-supervised objectives. Our approach is easy to integrate into any
hybrid model and requires no external training data. Experiments on multi-organ
and lung cancer segmentation tasks demonstrate the effectiveness of our
approach, leading to state-of-the-art segmentation performance. Our source code
and models are available at: https://github.com/ShahinaKK/LWI-VMS.Comment: Technical Repor
Recommended from our members
Improving an Active Shape Model with Random Classification Forest for Segmentation of Cervical Vertebrae
X-ray is a common modality for diagnosing cervical vertebrae injuries. Many injuries are missed by emergency physicians which later causes life threatening complications. Computer aided analysis of X-ray images has the potential to detect missed injuries. Segmentation of the vertebrae is a crucial step towards automatic injury detection system. Active shape model (ASM) is one of the most successful and popular method for vertebrae segmentation. In this work, we propose a new ASM search method based on random classification forest and a kernel density estimation-based prediction technique. The proposed method have been tested on a dataset of 90 emergency room X-ray images containing 450 vertebrae and outperformed the classical Mahalanobis distancebased ASM search and also the regression forest-based method
- …