4 research outputs found
Pyramidal Edge-maps and Attention based Guided Thermal Super-resolution
Guided super-resolution (GSR) of thermal images using visible range images is
challenging because of the difference in the spectral-range between the images.
This in turn means that there is significant texture-mismatch between the
images, which manifests as blur and ghosting artifacts in the super-resolved
thermal image. To tackle this, we propose a novel algorithm for GSR based on
pyramidal edge-maps extracted from the visible image. Our proposed network has
two sub-networks. The first sub-network super-resolves the low-resolution
thermal image while the second obtains edge-maps from the visible image at a
growing perceptual scale and integrates them into the super-resolution
sub-network with the help of attention-based fusion. Extraction and integration
of multi-level edges allows the super-resolution network to process
texture-to-object level information progressively, enabling more
straightforward identification of overlapping edges between the input images.
Extensive experiments show that our model outperforms the state-of-the-art GSR
methods, both quantitatively and qualitatively
HSCS: Hierarchical Sparsity Based Co-saliency Detection for RGBD Images
Co-saliency detection aims to discover common and salient objects in an image
group containing more than two relevant images. Moreover, depth information has
been demonstrated to be effective for many computer vision tasks. In this
paper, we propose a novel co-saliency detection method for RGBD images based on
hierarchical sparsity reconstruction and energy function refinement. With the
assistance of the intra saliency map, the inter-image correspondence is
formulated as a hierarchical sparsity reconstruction framework. The global
sparsity reconstruction model with a ranking scheme focuses on capturing the
global characteristics among the whole image group through a common foreground
dictionary. The pairwise sparsity reconstruction model aims to explore the
corresponding relationship between pairwise images through a set of pairwise
dictionaries. In order to improve the intra-image smoothness and inter-image
consistency, an energy function refinement model is proposed, which includes
the unary data term, spatial smooth term, and holistic consistency term.
Experiments on two RGBD co-saliency detection benchmarks demonstrate that the
proposed method outperforms the state-of-the-art algorithms both qualitatively
and quantitatively.Comment: 11 pages, 5 figures, Accepted by IEEE Transactions on Multimedia,
https://rmcong.github.io
Single Pair Cross-Modality Super Resolution
Non-visual imaging sensors are widely used in the industry for different
purposes. Those sensors are more expensive than visual (RGB) sensors, and
usually produce images with lower resolution. To this end, Cross-Modality
Super-Resolution methods were introduced, where an RGB image of a
high-resolution assists in increasing the resolution of the low-resolution
modality. However, fusing images from different modalities is not a trivial
task; the output must be artifact-free and remain loyal to the characteristics
of the target modality. Moreover, the input images are never perfectly aligned,
which results in further artifacts during the fusion process.
We present CMSR, a deep network for Cross-Modality Super-Resolution, which
unlike previous methods, is designed to deal with weakly aligned images. The
network is trained on the two input images only, learns their internal
statistics and correlations, and applies them to up-sample the target modality.
CMSR contains an internal transformer that is trained on-the-fly together with
the up-sampling process itself, without explicit supervision. We show that CMSR
succeeds to increase the resolution of the input image, gaining valuable
information from its RGB counterpart, yet in a conservative way, without
introducing artifacts or irrelevant details
Deep Learning for Image Super-resolution: A Survey
Image Super-Resolution (SR) is an important class of image processing
techniques to enhance the resolution of images and videos in computer vision.
Recent years have witnessed remarkable progress of image super-resolution using
deep learning techniques. This article aims to provide a comprehensive survey
on recent advances of image super-resolution using deep learning approaches. In
general, we can roughly group the existing studies of SR techniques into three
major categories: supervised SR, unsupervised SR, and domain-specific SR. In
addition, we also cover some other important issues, such as publicly available
benchmark datasets and performance evaluation metrics. Finally, we conclude
this survey by highlighting several future directions and open issues which
should be further addressed by the community in the future.Comment: Accepted by IEEE TPAM