7,714 research outputs found
Super-resolution using multiple quantized images
In this paper, we study the effect of limited amplitude resolution (pixel depth) in super-resolution problem. The problem we address differs from the standard super-resolution problem in that amplitude resolution is considered as important as spatial resolution. We study the trade-off between the pixel depth and spatial resolution of low resolution (LR) images in order to obtain the best visual quality in the reconstructed high resolution (HR) image. The proposed framework reveals great flexibility in terms of pixel depth and number of LR images in super-resolution problem, and demonstrates that it is possible to obtain target visual qualities with different measurement scenarios including images with different amplitude and spatial resolutions. © 2010 IEEE
Recurrent Scene Parsing with Perspective Understanding in the Loop
Objects may appear at arbitrary scales in perspective images of a scene,
posing a challenge for recognition systems that process images at a fixed
resolution. We propose a depth-aware gating module that adaptively selects the
pooling field size in a convolutional network architecture according to the
object scale (inversely proportional to the depth) so that small details are
preserved for distant objects while larger receptive fields are used for those
nearby. The depth gating signal is provided by stereo disparity or estimated
directly from monocular input. We integrate this depth-aware gating into a
recurrent convolutional neural network to perform semantic segmentation. Our
recurrent module iteratively refines the segmentation results, leveraging the
depth and semantic predictions from the previous iterations.
Through extensive experiments on four popular large-scale RGB-D datasets, we
demonstrate this approach achieves competitive semantic segmentation
performance with a model which is substantially more compact. We carry out
extensive analysis of this architecture including variants that operate on
monocular RGB but use depth as side-information during training, unsupervised
gating as a generic attentional mechanism, and multi-resolution gating. We find
that gated pooling for joint semantic segmentation and depth yields
state-of-the-art results for quantitative monocular depth estimation
BLADE: Filter Learning for General Purpose Computational Photography
The Rapid and Accurate Image Super Resolution (RAISR) method of Romano,
Isidoro, and Milanfar is a computationally efficient image upscaling method
using a trained set of filters. We describe a generalization of RAISR, which we
name Best Linear Adaptive Enhancement (BLADE). This approach is a trainable
edge-adaptive filtering framework that is general, simple, computationally
efficient, and useful for a wide range of problems in computational
photography. We show applications to operations which may appear in a camera
pipeline including denoising, demosaicing, and stylization
Subdiffractional focusing and guiding of polaritonic rays in a natural hyperbolic material
Uniaxial materials whose axial and tangential permittivities have opposite
signs are referred to as indefinite or hyperbolic media. In such materials
light propagation is unusual, leading to novel and often non-intuitive optical
phenomena. Here we report infrared nano-imaging experiments demonstrating that
crystals of hexagonal boron nitride (hBN), a natural mid-infrared hyperbolic
material, can act as a "hyper-focusing lens" and as a multi-mode waveguide. The
lensing is manifested by subdiffractional focusing of phonon-polaritons
launched by metallic disks underneath the hBN crystal. The waveguiding is
revealed through the modal analysis of the periodic patterns observed around
such launchers and near the sample edges. Our work opens new opportunities for
anisotropic layered insulators in infrared nanophotonics complementing and
potentially surpassing concurrent artificial hyperbolic materials with lower
losses and higher optical localization.Comment: 25 pages, 5 figure
- …