Search CORE

2,764 research outputs found

Learning SO(3) Equivariant Representations with Spherical CNNs

Author: A Frome
A Makadia
A Tatsuma
DM Healy
G Arfken
J Segman
JR Driscoll
MM Bronstein
S Dieleman
WP Thurston
Publication venue
Publication date: 27/09/2018
Field of study

We address the problem of 3D rotation equivariance in convolutional neural networks. 3D rotations have been a challenging nuisance in 3D classification tasks requiring higher capacity and extended data augmentation in order to tackle it. We model 3D data with multi-valued spherical functions and we propose a novel spherical convolutional network that implements exact convolutions on the sphere by realizing them in the spherical harmonic domain. Resulting filters have local symmetry and are localized by enforcing smooth spectra. We apply a novel pooling on the spectral domain and our operations are independent of the underlying spherical resolution throughout the network. We show that networks with much lower capacity and without requiring data augmentation can exhibit performance comparable to the state of the art in standard retrieval and classification benchmarks.Comment: Camera-ready. Accepted to ECCV'18 as oral presentatio

arXiv.org e-Print Archive

Crossref

Group Equivariant BEV for 3D Object Detection

Author: Guo Jielong
Li Shaobo
Liu Hongwei
Shao Dongheng
Tang Xuan
Wei Xian
Yang Jian
Zhang Jianfeng
Publication venue
Publication date: 26/04/2023
Field of study

Recently, 3D object detection has attracted significant attention and achieved continuous improvement in real road scenarios. The environmental information is collected from a single sensor or multi-sensor fusion to detect interested objects. However, most of the current 3D object detection approaches focus on developing advanced network architectures to improve the detection precision of the object rather than considering the dynamic driving scenes, where data collected from sensors equipped in the vehicle contain various perturbation features. As a result, existing work cannot still tackle the perturbation issue. In order to solve this problem, we propose a group equivariant bird's eye view network (GeqBevNet) based on the group equivariant theory, which introduces the concept of group equivariant into the BEV fusion object detection network. The group equivariant network is embedded into the fused BEV feature map to facilitate the BEV-level rotational equivariant feature extraction, thus leading to lower average orientation error. In order to demonstrate the effectiveness of the GeqBevNet, the network is verified on the nuScenes validation dataset in which mAOE can be decreased to 0.325. Experimental results demonstrate that GeqBevNet can extract more rotational equivariant features in the 3D object detection of the actual road scene and improve the performance of object orientation prediction.Comment: 8 pages,3 figures,accepted by International Joint Conference on Neural Networks (IJCNN)202

arXiv.org e-Print Archive