7,714 research outputs found

    Super-resolution using multiple quantized images

    Get PDF
    In this paper, we study the effect of limited amplitude resolution (pixel depth) in super-resolution problem. The problem we address differs from the standard super-resolution problem in that amplitude resolution is considered as important as spatial resolution. We study the trade-off between the pixel depth and spatial resolution of low resolution (LR) images in order to obtain the best visual quality in the reconstructed high resolution (HR) image. The proposed framework reveals great flexibility in terms of pixel depth and number of LR images in super-resolution problem, and demonstrates that it is possible to obtain target visual qualities with different measurement scenarios including images with different amplitude and spatial resolutions. © 2010 IEEE

    Recurrent Scene Parsing with Perspective Understanding in the Loop

    Full text link
    Objects may appear at arbitrary scales in perspective images of a scene, posing a challenge for recognition systems that process images at a fixed resolution. We propose a depth-aware gating module that adaptively selects the pooling field size in a convolutional network architecture according to the object scale (inversely proportional to the depth) so that small details are preserved for distant objects while larger receptive fields are used for those nearby. The depth gating signal is provided by stereo disparity or estimated directly from monocular input. We integrate this depth-aware gating into a recurrent convolutional neural network to perform semantic segmentation. Our recurrent module iteratively refines the segmentation results, leveraging the depth and semantic predictions from the previous iterations. Through extensive experiments on four popular large-scale RGB-D datasets, we demonstrate this approach achieves competitive semantic segmentation performance with a model which is substantially more compact. We carry out extensive analysis of this architecture including variants that operate on monocular RGB but use depth as side-information during training, unsupervised gating as a generic attentional mechanism, and multi-resolution gating. We find that gated pooling for joint semantic segmentation and depth yields state-of-the-art results for quantitative monocular depth estimation

    BLADE: Filter Learning for General Purpose Computational Photography

    Full text link
    The Rapid and Accurate Image Super Resolution (RAISR) method of Romano, Isidoro, and Milanfar is a computationally efficient image upscaling method using a trained set of filters. We describe a generalization of RAISR, which we name Best Linear Adaptive Enhancement (BLADE). This approach is a trainable edge-adaptive filtering framework that is general, simple, computationally efficient, and useful for a wide range of problems in computational photography. We show applications to operations which may appear in a camera pipeline including denoising, demosaicing, and stylization

    Subdiffractional focusing and guiding of polaritonic rays in a natural hyperbolic material

    Get PDF
    Uniaxial materials whose axial and tangential permittivities have opposite signs are referred to as indefinite or hyperbolic media. In such materials light propagation is unusual, leading to novel and often non-intuitive optical phenomena. Here we report infrared nano-imaging experiments demonstrating that crystals of hexagonal boron nitride (hBN), a natural mid-infrared hyperbolic material, can act as a "hyper-focusing lens" and as a multi-mode waveguide. The lensing is manifested by subdiffractional focusing of phonon-polaritons launched by metallic disks underneath the hBN crystal. The waveguiding is revealed through the modal analysis of the periodic patterns observed around such launchers and near the sample edges. Our work opens new opportunities for anisotropic layered insulators in infrared nanophotonics complementing and potentially surpassing concurrent artificial hyperbolic materials with lower losses and higher optical localization.Comment: 25 pages, 5 figure
    corecore