6,099 research outputs found
MS3D++: Ensemble of Experts for Multi-Source Unsupervised Domain Adaptation in 3D Object Detection
Deploying 3D detectors in unfamiliar domains has been demonstrated to result
in a significant 70-90% drop in detection rate due to variations in lidar,
geography, or weather from their training dataset. This domain gap leads to
missing detections for densely observed objects, misaligned confidence scores,
and increased high-confidence false positives, rendering the detector highly
unreliable. To address this, we introduce MS3D++, a self-training framework for
multi-source unsupervised domain adaptation in 3D object detection. MS3D++
generates high-quality pseudo-labels, allowing 3D detectors to achieve high
performance on a range of lidar types, regardless of their density. Our
approach effectively fuses predictions of an ensemble of multi-frame
pre-trained detectors from different source domains to improve domain
generalization. We subsequently refine predictions temporally to ensure
temporal consistency in box localization and object classification.
Furthermore, we present an in-depth study into the performance and
idiosyncrasies of various 3D detector components in a cross-domain context,
providing valuable insights for improved cross-domain detector ensembling.
Experimental results on Waymo, nuScenes and Lyft demonstrate that detectors
trained with MS3D++ pseudo-labels achieve state-of-the-art performance,
comparable to training with human-annotated labels in Bird's Eye View (BEV)
evaluation for both low and high density lidar. Code is available at
https://github.com/darrenjkt/MS3
Track, then Decide: Category-Agnostic Vision-based Multi-Object Tracking
The most common paradigm for vision-based multi-object tracking is
tracking-by-detection, due to the availability of reliable detectors for
several important object categories such as cars and pedestrians. However,
future mobile systems will need a capability to cope with rich human-made
environments, in which obtaining detectors for every possible object category
would be infeasible. In this paper, we propose a model-free multi-object
tracking approach that uses a category-agnostic image segmentation method to
track objects. We present an efficient segmentation mask-based tracker which
associates pixel-precise masks reported by the segmentation. Our approach can
utilize semantic information whenever it is available for classifying objects
at the track level, while retaining the capability to track generic unknown
objects in the absence of such information. We demonstrate experimentally that
our approach achieves performance comparable to state-of-the-art
tracking-by-detection methods for popular object categories such as cars and
pedestrians. Additionally, we show that the proposed method can discover and
robustly track a large variety of other objects.Comment: ICRA'18 submissio
- …