26 research outputs found
Low-shot learning with large-scale diffusion
This paper considers the problem of inferring image labels from images when
only a few annotated examples are available at training time. This setup is
often referred to as low-shot learning, where a standard approach is to
re-train the last few layers of a convolutional neural network learned on
separate classes for which training examples are abundant. We consider a
semi-supervised setting based on a large collection of images to support label
propagation. This is possible by leveraging the recent advances on large-scale
similarity graph construction.
We show that despite its conceptual simplicity, scaling label propagation up
to hundred millions of images leads to state of the art accuracy in the
low-shot learning regime
Dual Embedding Expansion for Vehicle Re-identification
Vehicle re-identification plays a crucial role in the management of
transportation infrastructure and traffic flow. However, this is a challenging
task due to the large view-point variations in appearance, environmental and
instance-related factors. Modern systems deploy CNNs to produce unique
representations from the images of each vehicle instance. Most work focuses on
leveraging new losses and network architectures to improve the descriptiveness
of these representations. In contrast, our work concentrates on re-ranking and
embedding expansion techniques. We propose an efficient approach for combining
the outputs of multiple models at various scales while exploiting tracklet and
neighbor information, called dual embedding expansion (DEx). Additionally, a
comparative study of several common image retrieval techniques is presented in
the context of vehicle re-ID. Our system yields competitive performance in the
2020 NVIDIA AI City Challenge with promising results. We demonstrate that DEx
when combined with other re-ranking techniques, can produce an even larger gain
without any additional attribute labels or manual supervision
Reliable Label Bootstrapping for Semi-Supervised Learning
Reducing the amount of labels required to train convolutional neural networks
without performance degradation is key to effectively reduce human annotation
efforts. We propose Reliable Label Bootstrapping (ReLaB), an unsupervised
preprossessing algorithm which improves the performance of semi-supervised
algorithms in extremely low supervision settings. Given a dataset with few
labeled samples, we first learn meaningful self-supervised, latent features for
the data. Second, a label propagation algorithm propagates the known labels on
the unsupervised features, effectively labeling the full dataset in an
automatic fashion. Third, we select a subset of correctly labeled (reliable)
samples using a label noise detection algorithm. Finally, we train a
semi-supervised algorithm on the extended subset. We show that the selection of
the network architecture and the self-supervised algorithm are important
factors to achieve successful label propagation and demonstrate that ReLaB
substantially improves semi-supervised learning in scenarios of very limited
supervision on CIFAR-10, CIFAR-100 and mini-ImageNet. We reach average error
rates of with 1 random labeled sample per class on
CIFAR-10 and lower this error to when the labeled sample in
each class is highly representative. Our work is fully reproducible:
https://github.com/PaulAlbert31/ReLaB.Comment: 10 pages, 3 figure