1,275 research outputs found

    Cascade R-CNN: Delving into High Quality Object Detection

    Full text link
    In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An object detector, trained with low IoU threshold, e.g. 0.5, usually produces noisy detections. However, detection performance tends to degrade with increasing the IoU thresholds. Two main factors are responsible for this: 1) overfitting during training, due to exponentially vanishing positive samples, and 2) inference-time mismatch between the IoUs for which the detector is optimal and those of the input hypotheses. A multi-stage object detection architecture, the Cascade R-CNN, is proposed to address these problems. It consists of a sequence of detectors trained with increasing IoU thresholds, to be sequentially more selective against close false positives. The detectors are trained stage by stage, leveraging the observation that the output of a detector is a good distribution for training the next higher quality detector. The resampling of progressively improved hypotheses guarantees that all detectors have a positive set of examples of equivalent size, reducing the overfitting problem. The same cascade procedure is applied at inference, enabling a closer match between the hypotheses and the detector quality of each stage. A simple implementation of the Cascade R-CNN is shown to surpass all single-model object detectors on the challenging COCO dataset. Experiments also show that the Cascade R-CNN is widely applicable across detector architectures, achieving consistent gains independently of the baseline detector strength. The code will be made available at https://github.com/zhaoweicai/cascade-rcnn

    Etalon Array Reconstructive Spectrometry.

    Get PDF
    Compact spectrometers are crucial in areas where size and weight may need to be minimized. These types of spectrometers often contain no moving parts, which makes for an instrument that can be highly durable. With the recent proliferation in low-cost and high-resolution cameras, camera-based spectrometry methods have the potential to make portable spectrometers small, ubiquitous, and cheap. Here, we demonstrate a novel method for compact spectrometry that uses an array of etalons to perform spectral encoding, and uses a reconstruction algorithm to recover the incident spectrum. This spectrometer has the unique capability for both high resolution and a large working bandwidth without sacrificing sensitivity, and we anticipate that its simplicity makes it an excellent candidate whenever a compact, robust, and flexible spectrometry solution is needed

    Direct observation of plasmonic index ellipsoids on a deep-subwavelength metallic grating

    Get PDF
    We constructed a metallic grating on a deep-subwavelength scale and tested its plasmonic features in visible frequencies. The deep-subwavelength metallic grating effectively acts as an anisotropic homogeneous uniaxial form-birefringent metal, exhibiting different optical responses for polarizations along different optical axes. Therefore, this form-birefringent metal supports anisotropic surface plasmon polaritons that are characterized by directly imaging the generated plasmonic index ellipsoids in reciprocal space. The observed plasmonic index ellipsoids also show a rainbow effect, where different colors are dispersively distributed in reciprocal space

    Learning Complexity-Aware Cascades for Deep Pedestrian Detection

    Full text link
    The design of complexity-aware cascaded detectors, combining features of very different complexities, is considered. A new cascade design procedure is introduced, by formulating cascade learning as the Lagrangian optimization of a risk that accounts for both accuracy and complexity. A boosting algorithm, denoted as complexity aware cascade training (CompACT), is then derived to solve this optimization. CompACT cascades are shown to seek an optimal trade-off between accuracy and complexity by pushing features of higher complexity to the later cascade stages, where only a few difficult candidate patches remain to be classified. This enables the use of features of vastly different complexities in a single detector. In result, the feature pool can be expanded to features previously impractical for cascade design, such as the responses of a deep convolutional neural network (CNN). This is demonstrated through the design of a pedestrian detector with a pool of features whose complexities span orders of magnitude. The resulting cascade generalizes the combination of a CNN with an object proposal mechanism: rather than a pre-processing stage, CompACT cascades seamlessly integrate CNNs in their stages. This enables state of the art performance on the Caltech and KITTI datasets, at fairly fast speeds

    Theory of optical imaging beyond the diffraction limit with a far-field superlens

    Full text link
    Recent theoretical and experimental studies have shown that imaging with resolution well beyond the diffraction limit can be obtained with so-called superlenses. Images formed by such superlenses are, however, in the near field only, or a fraction of wavelength away from the lens. In this paper, we propose a far-field superlens (FSL) device which is composed of a planar superlens with periodical corrugation. We show in theory that when an object is placed in close proximity of such a FSL, a unique image can be formed in far-field. As an example, we demonstrate numerically that images of 40 nm lines with a 30 nm gap can be obtained from far-field data with properly designed FSL working at 376nm wavelength.Comment: 6 pages, 3 figure
    corecore