1st Place Solution of The Robust Vision Challenge 2022 Semantic
  Segmentation Track

Anandkumar, Anima; Lan, Shiyi; Xiao, Junfei; Xu, Zhichao; Yu, Zhiding; Yuille, Alan

1st Place Solution of The Robust Vision Challenge 2022 Semantic Segmentation Track

Authors: Anima Anandkumar
Shiyi Lan
Junfei Xiao
Zhichao Xu
Zhiding Yu
Alan Yuille
Publication date: 7 November 2022
Publisher

Abstract

This report describes the winning solution to the Robust Vision Challenge (RVC) semantic segmentation track at ECCV 2022. Our method adopts the FAN-B-Hybrid model as the encoder and uses SegFormer as the segmentation framework. The model is trained on a composite dataset consisting of images from 9 datasets (ADE20K, Cityscapes, Mapillary Vistas, ScanNet, VIPER, WildDash 2, IDD, BDD, and COCO) with a simple dataset balancing strategy. All the original labels are projected to a 256-class unified label space, and the model is trained using a cross-entropy loss. Without significant hyperparameter tuning or any specific loss weighting, our solution ranks the first place on all the testing semantic segmentation benchmarks from multiple domains (ADE20K, Cityscapes, Mapillary Vistas, ScanNet, VIPER, and WildDash 2). The proposed method can serve as a strong baseline for the multi-domain segmentation task and benefit future works. Code will be available at https://github.com/lambert-x/RVC_Segmentation.Comment: The Winning Solution to The Robust Vision Challenge 2022 Semantic Segmentation Trac

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2210.12852

Last time updated on 12/12/2022