2 research outputs found
Deep Unsupervised Learning of Visual Similarities
Exemplar learning of visual similarities in an unsupervised manner is a
problem of paramount importance to Computer Vision. In this context, however,
the recent breakthrough in deep learning could not yet unfold its full
potential. With only a single positive sample, a great imbalance between one
positive and many negatives, and unreliable relationships between most samples,
training of Convolutional Neural networks is impaired. In this paper we use
weak estimates of local similarities and propose a single optimization problem
to extract batches of samples with mutually consistent relations. Conflicting
relations are distributed over different batches and similar samples are
grouped into compact groups. Learning visual similarities is then framed as a
sequence of categorization tasks. The CNN then consolidates transitivity
relations within and between groups and learns a single representation for all
samples without the need for labels. The proposed unsupervised approach has
shown competitive performance on detailed posture analysis and object
classification.Comment: arXiv admin note: text overlap with arXiv:1608.0879
Unsupervised Representation Learning by Discovering Reliable Image Relations
Learning robust representations that allow to reliably establish relations
between images is of paramount importance for virtually all of computer vision.
Annotating the quadratic number of pairwise relations between training images
is simply not feasible, while unsupervised inference is prone to noise, thus
leaving the vast majority of these relations to be unreliable. To nevertheless
find those relations which can be reliably utilized for learning, we follow a
divide-and-conquer strategy: We find reliable similarities by extracting
compact groups of images and reliable dissimilarities by partitioning these
groups into subsets, converting the complicated overall problem into few
reliable local subproblems. For each of the subsets we obtain a representation
by learning a mapping to a target feature space so that their reliable
relations are kept. Transitivity relations between the subsets are then
exploited to consolidate the local solutions into a concerted global
representation. While iterating between grouping, partitioning, and learning,
we can successively use more and more reliable relations which, in turn,
improves our image representation. In experiments, our approach shows
state-of-the-art performance on unsupervised classification on ImageNet with
46.0% and competes favorably on different transfer learning tasks on PASCAL
VOC.Comment: Accepted for Publication in 'Pattern Recognition Journal