2 research outputs found
A Two-Stream Siamese Neural Network for Vehicle Re-Identification by Using Non-Overlapping Cameras
We describe in this paper a Two-Stream Siamese Neural Network for vehicle
re-identification. The proposed network is fed simultaneously with small coarse
patches of the vehicle shape's, with 96 x 96 pixels, in one stream, and fine
features extracted from license plate patches, easily readable by humans, with
96 x 48 pixels, in the other one. Then, we combined the strengths of both
streams by merging the Siamese distance descriptors with a sequence of fully
connected layers, as an attempt to tackle a major problem in the field, false
alarms caused by a huge number of car design and models with nearly the same
appearance or by similar license plate strings. In our experiments, with 2
hours of videos containing 2982 vehicles, extracted from two low-cost cameras
in the same roadway, 546 ft away, we achieved a F-measure and accuracy of 92.6%
and 98.7%, respectively. We show that the proposed network, available at
https://github.com/icarofua/siamese-two-stream, outperforms other One-Stream
architectures, even if they use higher resolution image features.Comment: 5 pages, 6 figures, To appear in IEEE International Conference on
Image Processing (ICIP), Sept. 22-25, 2019, Taipei, Taiwa
Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach
Few-shot learning is a challenging problem that has attracted more and more
attention recently since abundant training samples are difficult to obtain in
practical applications. Meta-learning has been proposed to address this issue,
which focuses on quickly adapting a predictor as a base-learner to new tasks,
given limited labeled samples. However, a critical challenge for meta-learning
is the representation deficiency since it is hard to discover common
information from a small number of training samples or even one, as is the
representation of key features from such little information. As a result, a
meta-learner cannot be trained well in a high-dimensional parameter space to
generalize to new tasks. Existing methods mostly resort to extracting less
expressive features so as to avoid the representation deficiency. Aiming at
learning better representations, we propose a meta-learning approach with
complemented representations network (MCRNet) for few-shot image
classification. In particular, we embed a latent space, where latent codes are
reconstructed with extra representation information to complement the
representation deficiency. Furthermore, the latent space is established with
variational inference, collaborating well with different base-learners, and can
be extended to other models. Finally, our end-to-end framework achieves the
state-of-the-art performance in image classification on three standard few-shot
learning datasets.Comment: 25th International Conference on Pattern Recognition (ICPR2020