42,922 research outputs found
Improving Sonar Image Patch Matching via Deep Learning
Matching sonar images with high accuracy has been a problem for a long time,
as sonar images are inherently hard to model due to reflections, noise and
viewpoint dependence. Autonomous Underwater Vehicles require good sonar image
matching capabilities for tasks such as tracking, simultaneous localization and
mapping (SLAM) and some cases of object detection/recognition. We propose the
use of Convolutional Neural Networks (CNN) to learn a matching function that
can be trained from labeled sonar data, after pre-processing to generate
matching and non-matching pairs. In a dataset of 39K training pairs, we obtain
0.91 Area under the ROC Curve (AUC) for a CNN that outputs a binary
classification matching decision, and 0.89 AUC for another CNN that outputs a
matching score. In comparison, classical keypoint matching methods like SIFT,
SURF, ORB and AKAZE obtain AUC 0.61 to 0.68. Alternative learning methods
obtain similar results, with a Random Forest Classifier obtaining AUC 0.79, and
a Support Vector Machine resulting in AUC 0.66.Comment: Author versio
Interpretable and Generalizable Person Re-Identification with Query-Adaptive Convolution and Temporal Lifting
For person re-identification, existing deep networks often focus on
representation learning. However, without transfer learning, the learned model
is fixed as is, which is not adaptable for handling various unseen scenarios.
In this paper, beyond representation learning, we consider how to formulate
person image matching directly in deep feature maps. We treat image matching as
finding local correspondences in feature maps, and construct query-adaptive
convolution kernels on the fly to achieve local matching. In this way, the
matching process and results are interpretable, and this explicit matching is
more generalizable than representation features to unseen scenarios, such as
unknown misalignments, pose or viewpoint changes. To facilitate end-to-end
training of this architecture, we further build a class memory module to cache
feature maps of the most recent samples of each class, so as to compute image
matching losses for metric learning. Through direct cross-dataset evaluation,
the proposed Query-Adaptive Convolution (QAConv) method gains large
improvements over popular learning methods (about 10%+ mAP), and achieves
comparable results to many transfer learning methods. Besides, a model-free
temporal cooccurrence based score weighting method called TLift is proposed,
which improves the performance to a further extent, achieving state-of-the-art
results in cross-dataset person re-identification. Code is available at
https://github.com/ShengcaiLiao/QAConv.Comment: This is the ECCV 2020 version, including the appendi
Representation Learning: A Review and New Perspectives
The success of machine learning algorithms generally depends on data
representation, and we hypothesize that this is because different
representations can entangle and hide more or less the different explanatory
factors of variation behind the data. Although specific domain knowledge can be
used to help design representations, learning with generic priors can also be
used, and the quest for AI is motivating the design of more powerful
representation-learning algorithms implementing such priors. This paper reviews
recent work in the area of unsupervised feature learning and deep learning,
covering advances in probabilistic models, auto-encoders, manifold learning,
and deep networks. This motivates longer-term unanswered questions about the
appropriate objectives for learning good representations, for computing
representations (i.e., inference), and the geometrical connections between
representation learning, density estimation and manifold learning
Score Function Features for Discriminative Learning: Matrix and Tensor Framework
Feature learning forms the cornerstone for tackling challenging learning
problems in domains such as speech, computer vision and natural language
processing. In this paper, we consider a novel class of matrix and
tensor-valued features, which can be pre-trained using unlabeled samples. We
present efficient algorithms for extracting discriminative information, given
these pre-trained features and labeled samples for any related task. Our class
of features are based on higher-order score functions, which capture local
variations in the probability density function of the input. We establish a
theoretical framework to characterize the nature of discriminative information
that can be extracted from score-function features, when used in conjunction
with labeled samples. We employ efficient spectral decomposition algorithms (on
matrices and tensors) for extracting discriminative components. The advantage
of employing tensor-valued features is that we can extract richer
discriminative information in the form of an overcomplete representations.
Thus, we present a novel framework for employing generative models of the input
for discriminative learning.Comment: 29 page
Deep Shape Matching
We cast shape matching as metric learning with convolutional networks. We
break the end-to-end process of image representation into two parts. Firstly,
well established efficient methods are chosen to turn the images into edge
maps. Secondly, the network is trained with edge maps of landmark images, which
are automatically obtained by a structure-from-motion pipeline. The learned
representation is evaluated on a range of different tasks, providing
improvements on challenging cases of domain generalization, generic
sketch-based image retrieval or its fine-grained counterpart. In contrast to
other methods that learn a different model per task, object category, or
domain, we use the same network throughout all our experiments, achieving
state-of-the-art results in multiple benchmarks.Comment: ECCV 201
Who Can Win a Single-Elimination Tournament?
A single-elimination (SE) tournament is a popular way to select a winner in
both sports competitions and in elections. A natural and well-studied question
is the tournament fixing problem (TFP): given the set of all pairwise match
outcomes, can a tournament organizer rig an SE tournament by adjusting the
initial seeding so that their favorite player wins? We prove new sufficient
conditions on the pairwise match outcome information and the favorite player,
under which there is guaranteed to be a seeding where the player wins the
tournament. Our results greatly generalize previous results. We also
investigate the relationship between the set of players that can win an SE
tournament under some seeding (so called SE winners) and other traditional
tournament solutions. In addition, we generalize and strengthen prior work on
probabilistic models for generating tournaments. For instance, we show that
\emph{every} player in an player tournament generated by the Condorcet
Random Model will be an SE winner even when the noise is as small as possible,
; prior work only had such results for . We also establish new results for significantly more
general generative models.Comment: A preliminary version appeared in Proceedings of the 30th AAAI
Conference on Artificial Intelligence (AAAI), 201
- …