LAPFormer: A Light and Accurate Polyp Segmentation Transformer

Bui, Tung Thanh; Nguyen, Mai; Nguyen, Thanh Tung; Van Nguyen, Quan; Van Pham, Toan

LAPFormer: A Light and Accurate Polyp Segmentation Transformer

Authors: Tung Thanh Bui
Mai Nguyen
Thanh Tung Nguyen
Quan Van Nguyen
Toan Van Pham
Publication date: 9 October 2022
Publisher

Abstract

Polyp segmentation is still known as a difficult problem due to the large variety of polyp shapes, scanning and labeling modalities. This prevents deep learning model to generalize well on unseen data. However, Transformer-based approach recently has achieved some remarkable results on performance with the ability of extracting global context better than CNN-based architecture and yet lead to better generalization. To leverage this strength of Transformer, we propose a new model with encoder-decoder architecture named LAPFormer, which uses a hierarchical Transformer encoder to better extract global feature and combine with our novel CNN (Convolutional Neural Network) decoder for capturing local appearance of the polyps. Our proposed decoder contains a progressive feature fusion module designed for fusing feature from upper scales and lower scales and enable multi-scale features to be more correlative. Besides, we also use feature refinement module and feature selection module for processing feature. We test our model on five popular benchmark datasets for polyp segmentation, including Kvasir, CVC-Clinic DB, CVC-ColonDB, CVC-T, and ETIS-LaribComment: 7 pages, 7 figures, ACL 2023 underrevie

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2210.04393

Last time updated on 24/11/2022