3 research outputs found
Aligning Across Large Gaps in Time
We present a method of temporally-invariant image registration for outdoor
scenes, with invariance across time of day, across seasonal variations, and
across decade-long periods, for low- and high-texture scenes. Our method can be
useful for applications in remote sensing, GPS-denied UAV localization, 3D
reconstruction, and many others. Our method leverages a recently proposed
approach to image registration, where fully-convolutional neural networks are
used to create feature maps which can be registered using the
Inverse-Composition Lucas-Kanade algorithm (ICLK). We show that invariance that
is learned from satellite imagery can be transferable to time-lapse data
captured by webcams mounted on buildings near ground-level.Comment: 15 pages, 10 figure
Deep UAV Localization with Reference View Rendering
This paper presents a framework for the localization of Unmanned Aerial
Vehicles (UAVs) in unstructured environments with the help of deep learning. A
real-time rendering engine is introduced that generates optical and depth
images given a six Degrees-of-Freedom (DoF) camera pose, camera model,
geo-referenced orthoimage, and elevation map. The rendering engine is embedded
into a learning-based six-DoF Inverse Compositional Lucas-Kanade (ICLK)
algorithm that is able to robustly align the rendered and real-world image
taken by the UAV. To learn the alignment under environmental changes, the
architecture is trained using maps spanning multiple years at high resolution.
The evaluation shows that the deep 6DoF-ICLK algorithm outperforms its
non-trainable counterparts by a large margin. To further support the research
in this field, the real-time rendering engine and accompanying datasets are
released along with this publication.Comment: Initial submission; 15 pages, 3 figures, 3 table
RegNet: Learning the Optimization of Direct Image-to-Image Pose Registration
Direct image-to-image alignment that relies on the optimization of
photometric error metrics suffers from limited convergence range and
sensitivity to lighting conditions. Deep learning approaches has been applied
to address this problem by learning better feature representations using
convolutional neural networks, yet still require a good initialization. In this
paper, we demonstrate that the inaccurate numerical Jacobian limits the
convergence range which could be improved greatly using learned approaches.
Based on this observation, we propose a novel end-to-end network, RegNet, to
learn the optimization of image-to-image pose registration. By jointly learning
feature representation for each pixel and partial derivatives that replace
handcrafted ones (e.g., numerical differentiation) in the optimization step,
the neural network facilitates end-to-end optimization. The energy landscape is
constrained on both the feature representation and the learned Jacobian, hence
providing more flexibility for the optimization as a consequence leads to more
robust and faster convergence. In a series of experiments, including a broad
ablation study, we demonstrate that RegNet is able to converge for
large-baseline image pairs with fewer iterations.Comment: 8 pages, 6 figure