3,825 research outputs found
Multi-Domain Pose Network for Multi-Person Pose Estimation and Tracking
Multi-person human pose estimation and tracking in the wild is important and
challenging. For training a powerful model, large-scale training data are
crucial. While there are several datasets for human pose estimation, the best
practice for training on multi-dataset has not been investigated. In this
paper, we present a simple network called Multi-Domain Pose Network (MDPN) to
address this problem. By treating the task as multi-domain learning, our
methods can learn a better representation for pose prediction. Together with
prediction heads fine-tuning and multi-branch combination, it shows significant
improvement over baselines and achieves the best performance on PoseTrack ECCV
2018 Challenge without additional datasets other than MPII and COCO.Comment: Extended abstract for the ECCV 2018 PoseTrack Worksho
Cascade R-CNN: Delving into High Quality Object Detection
In object detection, an intersection over union (IoU) threshold is required
to define positives and negatives. An object detector, trained with low IoU
threshold, e.g. 0.5, usually produces noisy detections. However, detection
performance tends to degrade with increasing the IoU thresholds. Two main
factors are responsible for this: 1) overfitting during training, due to
exponentially vanishing positive samples, and 2) inference-time mismatch
between the IoUs for which the detector is optimal and those of the input
hypotheses. A multi-stage object detection architecture, the Cascade R-CNN, is
proposed to address these problems. It consists of a sequence of detectors
trained with increasing IoU thresholds, to be sequentially more selective
against close false positives. The detectors are trained stage by stage,
leveraging the observation that the output of a detector is a good distribution
for training the next higher quality detector. The resampling of progressively
improved hypotheses guarantees that all detectors have a positive set of
examples of equivalent size, reducing the overfitting problem. The same cascade
procedure is applied at inference, enabling a closer match between the
hypotheses and the detector quality of each stage. A simple implementation of
the Cascade R-CNN is shown to surpass all single-model object detectors on the
challenging COCO dataset. Experiments also show that the Cascade R-CNN is
widely applicable across detector architectures, achieving consistent gains
independently of the baseline detector strength. The code will be made
available at https://github.com/zhaoweicai/cascade-rcnn
- …