Sparse coded handcrafted and deep features for colon capsule video summarization

Alaya Cheikh, Faouzi; Hovde, Øistein; Mohammed, Ahmed Kedir; Pedersen, Marius; Yildirim, Sule

Sparse coded handcrafted and deep features for colon capsule video summarization

Authors: Faouzi Alaya Cheikh
Øistein Hovde
Ahmed Kedir Mohammed
Marius Pedersen
Sule Yildirim
Publication date: 1 January 2017
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'

Abstract

Capsule endoscopy, which uses a wireless camera to take images of the digestive track, is emerging as an alternative to traditional wired colonoscopy. A single examination produces a sequence of approximately 50,000 frames. These sequences are manually reviewed, which is time consuming and typically takes about 45-90 minutes and requires the undivided concentration of the reviewer. In this paper, we propose a novel capsule video summarization framework using sparse coding and dictionary learning in feature space. Video frames are clustered into superframes based on power spectral density, and cluster representative frames are used for video summarization. Handcrafted and deep features that are extracted for representative frames are sparse coded using a learned dictionary. Sparse coded features are later used for training SVM classifier. The proposed method was compared with state-of-the-art methods based on sensitivity and specificity. The achieved results show that our proposed framework provides robust capsule video summarization without losing informative segments

Similar works

Full text

Available Versions

NORA - Norwegian Open Research Archives

oai:ntnuopen.ntnu.no:11250/249...

Last time updated on 14/10/2021