
    Registration and Fusion of Multi-Spectral Images Using a Novel Edge Descriptor

    In this paper we introduce a fully end-to-end approach for multi-spectral image registration and fusion. Our fusion method combines images from different spectral channels into a single fused image, handling low- and high-frequency signals with separate approaches. A prerequisite of fusion is a stage of geometric alignment between the spectral bands, commonly referred to as registration. Unfortunately, common methods for registering images from a single spectral channel do not yield reasonable results on images from different modalities. To that end, we introduce a new algorithm for multi-spectral image registration, based on a novel edge descriptor of feature points. Our method achieves alignment accurate enough to allow the images to be fused. As our experiments show, we produce high-quality multi-spectral image registration and fusion under many challenging scenarios.
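The low/high-frequency split described in this abstract can be sketched in a few lines of numpy. This is a generic illustration, not the authors' method: it assumes the bands are already registered, approximates the low-frequency component with a separable Gaussian blur, averages the low frequencies, and keeps the strongest per-pixel detail response. The function names and the max-abs selection rule are choices made here for illustration.

```python
import numpy as np

def fuse_bands(bands, sigma=2.0):
    """Fuse pre-registered spectral bands: average the low frequencies,
    keep the strongest high-frequency (detail) response per pixel."""
    def blur(img):
        # separable Gaussian low-pass, numpy only
        radius = int(3 * sigma)
        x = np.arange(-radius, radius + 1)
        k = np.exp(-x ** 2 / (2 * sigma ** 2))
        k /= k.sum()
        rows = np.apply_along_axis(np.convolve, 1, img, k, mode="same")
        return np.apply_along_axis(np.convolve, 0, rows, k, mode="same")

    lows = [blur(b.astype(float)) for b in bands]
    highs = [b.astype(float) - lo for b, lo in zip(bands, lows)]
    stack = np.stack(highs)
    pick = np.abs(stack).argmax(axis=0)             # band with the strongest detail
    high_fused = np.take_along_axis(stack, pick[None], axis=0)[0]
    return np.mean(lows, axis=0) + high_fused       # smooth content + selected detail
```

Fusing a band with itself returns the band unchanged, since low plus high reconstructs each input exactly.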

    Nonparametric image registration of airborne LiDAR, hyperspectral and photographic imagery of wooded landscapes

    There is much current interest in using multisensor airborne remote sensing to monitor the structure and biodiversity of woodlands. This paper addresses the application of nonparametric (NP) image-registration techniques to precisely align images obtained from multisensor imaging, which is critical for the successful identification of individual trees using object recognition approaches. NP image registration, in particular the technique of optimizing an objective function containing similarity and regularization terms, provides a flexible approach for image registration. Here, we develop an NP registration approach in which a normalized gradient field is used to quantify similarity and curvature is used for regularization (the NGF-Curv method). Using a survey of woodlands in southern Spain as an example, we show that NGF-Curv can successfully fuse data sets when there is little prior knowledge about how they are interrelated (i.e., in the absence of ground control points). The validity of NGF-Curv in airborne remote sensing is demonstrated by a series of experiments, which show that NGF-Curv is capable of aligning images precisely, making it a valuable component of algorithms designed to identify objects, such as trees, within multisensor data sets. This work was supported by the Airborne Research and Survey Facility of the U.K.’s Natural Environment Research Council (NERC), which collected and preprocessed the data used in this research project [EU11/03/100], and by grants from King Abdullah University of Science and Technology and the Wellcome Trust (BBSRC). D. Coomes was supported by a grant from NERC (NE/K016377/1) and funding from DEFRA and the BBSRC to develop methods for monitoring ash dieback from aircraft. This is the final version; it was first published by IEEE at http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7116541
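The similarity term of NGF-Curv can be illustrated with a small numpy sketch of the normalized-gradient-field distance: edges align when gradient directions are parallel or anti-parallel, regardless of how the two sensors map intensities. This is a simplified illustration only, without the curvature regularizer or any optimization loop, and the edge parameter `eps` is an arbitrary choice here, not the paper's value.

```python
import numpy as np

def ngf(img, eps=1e-2):
    """Normalized gradient field: per-pixel unit gradient direction,
    with eps damping the response in flat, noisy regions."""
    gy, gx = np.gradient(img.astype(float))
    norm = np.sqrt(gx ** 2 + gy ** 2 + eps ** 2)
    return gx / norm, gy / norm

def ngf_distance(ref, tmpl, eps=1e-2):
    """NGF dissimilarity: 1 minus the squared inner product of gradient
    directions, averaged over the image. Zero where edges align."""
    rx, ry = ngf(ref, eps)
    tx, ty = ngf(tmpl, eps)
    return np.mean(1.0 - (rx * tx + ry * ty) ** 2)
```

Because the inner product is squared, an image and its contrast-inverted counterpart score as perfectly aligned, which is what makes the measure usable across modalities.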

    Deep Multi-Spectral Registration Using Invariant Descriptor Learning

    In this paper, we introduce a novel deep-learning method to align cross-spectral images. Our approach relies on a learned descriptor which is invariant across spectra. Multi-modal images of the same scene capture different signals, so their registration is challenging and is not solved by classic approaches. To that end, we developed a feature-based approach that solves the visible (VIS) to near-infrared (NIR) registration problem. Our algorithm detects corners with the Harris detector and matches them using a patch metric learned on top of a CIFAR-10 network descriptor. As our experiments demonstrate, we achieve high-quality alignment of cross-spectral images with sub-pixel accuracy. Compared to existing methods, our approach is more accurate on the task of VIS-to-NIR registration.
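The matching stage described above, learned descriptors compared by a patch metric, can be approximated generically: given descriptor matrices for two keypoint sets, keep only mutual nearest neighbours under cosine similarity. This numpy sketch stands in for the learned CIFAR-10-based metric, which is not reproduced here; `match_descriptors` is a hypothetical helper, not the authors' code.

```python
import numpy as np

def match_descriptors(desc_a, desc_b):
    """Mutual nearest-neighbour matching of descriptor rows.
    Returns index pairs (i, j) where a[i] and b[j] choose each other."""
    a = desc_a / np.linalg.norm(desc_a, axis=1, keepdims=True)
    b = desc_b / np.linalg.norm(desc_b, axis=1, keepdims=True)
    sim = a @ b.T                       # cosine similarity matrix
    ab = sim.argmax(axis=1)             # best b for each a
    ba = sim.argmax(axis=0)             # best a for each b
    return [(i, j) for i, j in enumerate(ab) if ba[j] == i]
```

The mutual check discards one-sided matches, which is a common way to suppress false correspondences before estimating the alignment.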

    Structural Representation: Reducing Multi-Modal Image Registration to Mono-Modal Problem

    Registration of multi-modal images has been a challenging task due to the complex intensity relationship between images. The standard multi-modal approach tends to use sophisticated similarity measures, such as mutual information, to assess the accuracy of the alignment. Employing such measures increases the computational time and complexity, and makes it highly difficult for the optimization process to converge. The presented registration method works on structural representations of images captured from different modalities, in order to convert the multi-modal problem into a mono-modal one. Two different representation methods are presented: one is based on a combination of phase congruency and gradient information of the input images, and the other utilizes a modified version of entropy images in a patch-based manner. Sample results are illustrated based on experiments performed on brain images from different modalities.
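The patch-based entropy representation mentioned above can be sketched directly: replace each pixel by the Shannon entropy of the quantized intensity histogram in its neighbourhood, so that two modalities with different intensity mappings tend to yield similar structural images. A minimal (and deliberately slow) numpy version follows; the patch size and bin count are arbitrary choices here, not the paper's settings.

```python
import numpy as np

def entropy_image(img, patch=7, bins=16):
    """Structural representation: per-pixel Shannon entropy of the
    quantized intensity histogram in a local patch."""
    img = img.astype(float)
    edges = np.linspace(img.min(), img.max() + 1e-9, bins + 1)
    q = np.digitize(img, edges) - 1          # quantized intensities in [0, bins)
    h, w = img.shape
    r = patch // 2
    padded = np.pad(q, r, mode="edge")
    out = np.zeros((h, w))
    for y in range(h):
        for x in range(w):
            win = padded[y:y + patch, x:x + patch]
            counts = np.bincount(win.ravel(), minlength=bins).astype(float)
            p = counts / counts.sum()
            p = p[p > 0]
            out[y, x] = -(p * np.log2(p)).sum()
    return out
```

Because entropy depends only on the shape of the local histogram, an image and its intensity-inverted counterpart produce essentially the same entropy image, which is the property that lets a mono-modal similarity measure be applied afterwards.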

    Multitemporal Very High Resolution from Space: Outcome of the 2016 IEEE GRSS Data Fusion Contest

    In this paper, the scientific outcomes of the 2016 Data Fusion Contest organized by the Image Analysis and Data Fusion Technical Committee of the IEEE Geoscience and Remote Sensing Society are discussed. The 2016 Contest was an open-topic competition based on a multitemporal and multimodal dataset, which included a temporal pair of very high resolution panchromatic and multispectral Deimos-2 images and a video captured by the Iris camera on board the International Space Station. The problems addressed and the techniques proposed by the participants in the Contest spanned a rather broad range of topics, mixing ideas and methodologies from remote sensing, video processing, and computer vision. In particular, the winning team developed a deep-learning method to jointly address spatial scene labeling and temporal activity modeling using the available image and video data. The second-place team proposed a random field model to simultaneously perform coregistration of multitemporal data, semantic segmentation, and change detection. The methodological key ideas of both approaches and the main results of the corresponding experimental validation are discussed in this paper.

    Joint tracking and video registration by factorial Hidden Markov models (ICASSP)
    Tracking moving objects in image sequences obtained from a moving camera is a difficult problem, since there is apparent motion of the static background. It becomes more difficult when the camera motion between consecutive frames is large. Traditionally, registration is applied before tracking to compensate for the camera motion using parametric motion models, so the tracking result depends heavily on the performance of registration. This raises problems when there are large moving objects in the scene, because the registration algorithm is prone to fail and the tracker then easily drifts away. In this paper, we tackle this problem by registering the frames and tracking the moving objects simultaneously within a factorial Hidden Markov Model framework using particle filters. Under this framework, tracking and registration do not work separately but mutually benefit each other by interacting. Particles are drawn to provide candidate geometric transformation parameters and moving-object parameters. The background is registered according to the geometric transformation parameters by maximizing a joint gradient function, and a state-of-the-art covariance tracker is used to track the moving object. The tracking score is obtained by incorporating both background and foreground information. By using knowledge of the positions of the moving objects, we avoid blindly registering image pairs without taking the moving-object regions into account. We apply our algorithm to moving-object tracking on numerous image sequences with camera motion and show the robustness and effectiveness of our method.
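The particle-filter side of this framework can be illustrated with a toy example. The sketch below keeps only the background-registration factor of the state (integer translations, scored by mean squared error after undoing the candidate shift) and omits the moving-object factor, the covariance tracker, and the paper's joint gradient function; all parameter names and values are choices made here for illustration.

```python
import numpy as np

def estimate_shift(ref, cur, n_particles=800, n_iters=5, max_shift=4,
                   beta=50.0, seed=0):
    """Toy particle filter over integer background shifts (tx, ty).
    In the full FHMM the state would also carry moving-object parameters;
    here only the registration factor is kept."""
    rng = np.random.default_rng(seed)
    parts = rng.integers(-max_shift, max_shift + 1, size=(n_particles, 2))
    best, best_score = (0, 0), -np.inf
    for _ in range(n_iters):
        scores = np.empty(n_particles)
        for i, (tx, ty) in enumerate(parts):
            undone = np.roll(cur, (-ty, -tx), axis=(0, 1))   # undo candidate shift
            scores[i] = -np.mean((undone - ref) ** 2)        # match quality
            if scores[i] > best_score:
                best_score, best = scores[i], (int(tx), int(ty))
        w = np.exp(beta * (scores - scores.max()))           # likelihood weights
        w /= w.sum()
        idx = rng.choice(n_particles, size=n_particles, p=w)  # resample
        parts = parts[idx] + rng.integers(-1, 2, size=(n_particles, 2))  # jitter
        parts = np.clip(parts, -max_shift, max_shift)
    return best
```

In the joint framework, the same weights would also incorporate the foreground tracking score, so that a large moving object cannot pull the registration off the true background motion.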

    BInGo: Bayesian Intrinsic Groupwise Registration via Explicit Hierarchical Disentanglement

    Multimodal groupwise registration aligns internal structures in a group of medical images. Current approaches to this problem involve developing similarity measures over the joint intensity profile of all images, which may be computationally prohibitive for large image groups and unstable under various conditions. To tackle these issues, we propose BInGo, a general unsupervised hierarchical Bayesian framework based on deep learning, which learns intrinsic structural representations to measure the similarity of multimodal images. In particular, a variational auto-encoder with a novel posterior is proposed, which facilitates disentangled learning of structural representations and spatial transformations, and characterizes the imaging process from the common structure through shape transition and appearance variation. Notably, BInGo can be trained on small groups yet applied to large-scale groupwise registration, significantly reducing computational costs. We compared BInGo with five iterative or deep-learning methods on three public intrasubject and intersubject datasets (BraTS, MS-CMR of the heart, and Learn2Reg abdomen MR-CT) and demonstrated its superior accuracy and computational efficiency, even for very large group sizes (e.g., over 1300 2D images per group from MS-CMR).
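One reason structural representations help at scale, as claimed above, is that a groupwise criterion over per-image codes can be evaluated in time linear in the group size, unlike joint-intensity measures over all images. A toy numpy illustration of such a criterion follows; the codes would come from a learned encoder, and this is not BInGo's actual loss.

```python
import numpy as np

def groupwise_dissimilarity(codes):
    """Toy groupwise criterion over per-image structural codes:
    mean squared deviation from the group mean. Cost is linear in the
    number of images, unlike pairwise or joint-intensity measures."""
    codes = np.asarray(codes, dtype=float)
    center = codes.mean(axis=0)
    return np.mean((codes - center) ** 2)
```

The criterion is zero exactly when all images map to the same structural code, i.e. when the group is structurally aligned under the representation.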