Recent self-supervised computer vision methods have demonstrated performance
equal or superior to that of supervised methods, opening the door for AI
systems to learn visual representations from practically unlimited data.
However, these methods
are classification-based and thus ineffective for learning dense feature maps
required for unsupervised semantic segmentation. This work presents a method to
effectively learn dense semantically rich visual concept embeddings applicable
to high-resolution images. We introduce superpixelization as a means to
decompose images into a small set of visually coherent regions, allowing
efficient learning of dense semantics by swapped prediction. The expressiveness
of our dense embeddings is demonstrated by substantially advancing the state
of the art on representation quality benchmarks on COCO (+16.27 mIoU) and
Cityscapes (+19.24 mIoU), for both low- and high-resolution images.
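The core idea above (pool dense features over superpixels, then train by swapped prediction of cluster assignments, in the spirit of SwAV) can be sketched as follows. This is a minimal NumPy toy, not the paper's implementation: the feature maps, the random "superpixel" label map, the prototype count, and the softmax-based code computation are all illustrative assumptions (a real pipeline would use a CNN/ViT backbone, an actual superpixel algorithm such as SLIC, and an optimal-transport code assignment).

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dense feature maps for two augmented views of one image
# (hypothetical sizes; not the paper's architecture).
H, W, D, K = 8, 8, 16, 4          # spatial size, feature dim, num prototypes
feats_a = rng.normal(size=(H, W, D))
feats_b = rng.normal(size=(H, W, D))

# Stand-in "superpixelization": a label map assigning each pixel to one of
# S visually coherent regions. A real method would compute this from the
# image (e.g. SLIC); here it is random for illustration only.
S = 6
segments = rng.integers(0, S, size=(H, W))

def pool_superpixels(feats, segments, num_regions):
    """Average-pool dense features over each superpixel region,
    then L2-normalize the resulting region embeddings."""
    pooled = np.stack([feats[segments == s].mean(axis=0)
                       for s in range(num_regions)])
    return pooled / np.linalg.norm(pooled, axis=1, keepdims=True)

# Learnable prototypes (here frozen random vectors for the sketch).
prototypes = rng.normal(size=(K, D))
prototypes /= np.linalg.norm(prototypes, axis=1, keepdims=True)

def soft_codes(z, prototypes, temp=0.1):
    """Softmax over prototype similarities: soft cluster-assignment codes."""
    logits = z @ prototypes.T / temp
    logits -= logits.max(axis=1, keepdims=True)   # numerical stability
    p = np.exp(logits)
    return p / p.sum(axis=1, keepdims=True)

za = pool_superpixels(feats_a, segments, S)   # (S, D) regions, view A
zb = pool_superpixels(feats_b, segments, S)   # (S, D) regions, view B
qa, qb = soft_codes(za, prototypes), soft_codes(zb, prototypes)

# Swapped prediction: each view's region embedding must predict the
# OTHER view's code for the same region (symmetric cross-entropy).
loss = -np.mean(np.sum(qa * np.log(qb + 1e-9), axis=1)
                + np.sum(qb * np.log(qa + 1e-9), axis=1))
print(f"swapped-prediction loss: {loss:.4f}")
```

Because pooling reduces thousands of pixels to a handful of region embeddings, the swapped-prediction loss is computed over S regions rather than H*W pixels, which is what makes dense learning tractable at high resolution.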