1,275 research outputs found
Cascade R-CNN: Delving into High Quality Object Detection
In object detection, an intersection over union (IoU) threshold is required
to define positives and negatives. An object detector, trained with low IoU
threshold, e.g. 0.5, usually produces noisy detections. However, detection
performance tends to degrade with increasing the IoU thresholds. Two main
factors are responsible for this: 1) overfitting during training, due to
exponentially vanishing positive samples, and 2) inference-time mismatch
between the IoUs for which the detector is optimal and those of the input
hypotheses. A multi-stage object detection architecture, the Cascade R-CNN, is
proposed to address these problems. It consists of a sequence of detectors
trained with increasing IoU thresholds, to be sequentially more selective
against close false positives. The detectors are trained stage by stage,
leveraging the observation that the output of a detector is a good distribution
for training the next higher quality detector. The resampling of progressively
improved hypotheses guarantees that all detectors have a positive set of
examples of equivalent size, reducing the overfitting problem. The same cascade
procedure is applied at inference, enabling a closer match between the
hypotheses and the detector quality of each stage. A simple implementation of
the Cascade R-CNN is shown to surpass all single-model object detectors on the
challenging COCO dataset. Experiments also show that the Cascade R-CNN is
widely applicable across detector architectures, achieving consistent gains
independently of the baseline detector strength. The code will be made
available at https://github.com/zhaoweicai/cascade-rcnn
Etalon Array Reconstructive Spectrometry.
Compact spectrometers are crucial in areas where size and weight may need to be minimized. These types of spectrometers often contain no moving parts, which makes for an instrument that can be highly durable. With the recent proliferation in low-cost and high-resolution cameras, camera-based spectrometry methods have the potential to make portable spectrometers small, ubiquitous, and cheap. Here, we demonstrate a novel method for compact spectrometry that uses an array of etalons to perform spectral encoding, and uses a reconstruction algorithm to recover the incident spectrum. This spectrometer has the unique capability for both high resolution and a large working bandwidth without sacrificing sensitivity, and we anticipate that its simplicity makes it an excellent candidate whenever a compact, robust, and flexible spectrometry solution is needed
Direct observation of plasmonic index ellipsoids on a deep-subwavelength metallic grating
We constructed a metallic grating on a deep-subwavelength scale and tested its plasmonic features in visible frequencies. The deep-subwavelength metallic grating effectively acts as an anisotropic homogeneous uniaxial form-birefringent metal, exhibiting different optical responses for polarizations along different optical axes. Therefore, this form-birefringent metal supports anisotropic surface plasmon polaritons that are characterized by directly imaging the generated plasmonic index ellipsoids in reciprocal space. The observed plasmonic index ellipsoids also show a rainbow effect, where different colors are dispersively distributed in reciprocal space
Learning Complexity-Aware Cascades for Deep Pedestrian Detection
The design of complexity-aware cascaded detectors, combining features of very
different complexities, is considered. A new cascade design procedure is
introduced, by formulating cascade learning as the Lagrangian optimization of a
risk that accounts for both accuracy and complexity. A boosting algorithm,
denoted as complexity aware cascade training (CompACT), is then derived to
solve this optimization. CompACT cascades are shown to seek an optimal
trade-off between accuracy and complexity by pushing features of higher
complexity to the later cascade stages, where only a few difficult candidate
patches remain to be classified. This enables the use of features of vastly
different complexities in a single detector. In result, the feature pool can be
expanded to features previously impractical for cascade design, such as the
responses of a deep convolutional neural network (CNN). This is demonstrated
through the design of a pedestrian detector with a pool of features whose
complexities span orders of magnitude. The resulting cascade generalizes the
combination of a CNN with an object proposal mechanism: rather than a
pre-processing stage, CompACT cascades seamlessly integrate CNNs in their
stages. This enables state of the art performance on the Caltech and KITTI
datasets, at fairly fast speeds
Theory of optical imaging beyond the diffraction limit with a far-field superlens
Recent theoretical and experimental studies have shown that imaging with
resolution well beyond the diffraction limit can be obtained with so-called
superlenses. Images formed by such superlenses are, however, in the near field
only, or a fraction of wavelength away from the lens. In this paper, we propose
a far-field superlens (FSL) device which is composed of a planar superlens with
periodical corrugation. We show in theory that when an object is placed in
close proximity of such a FSL, a unique image can be formed in far-field. As an
example, we demonstrate numerically that images of 40 nm lines with a 30 nm gap
can be obtained from far-field data with properly designed FSL working at 376nm
wavelength.Comment: 6 pages, 3 figure
- …