186,585 research outputs found
Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks
Person re-identification is an open and challenging problem in computer
vision. Existing approaches have concentrated on either designing the best
feature representation or learning optimal matching metrics in a static setting
where the number of cameras are fixed in a network. Most approaches have
neglected the dynamic and open world nature of the re-identification problem,
where a new camera may be temporarily inserted into an existing system to get
additional information. To address such a novel and very practical problem, we
propose an unsupervised adaptation scheme for re-identification models in a
dynamic camera network. First, we formulate a domain perceptive
re-identification method based on geodesic flow kernel that can effectively
find the best source camera (already installed) to adapt with a newly
introduced target camera, without requiring a very expensive training phase.
Second, we introduce a transitive inference algorithm for re-identification
that can exploit the information from best source camera to improve the
accuracy across other camera pairs in a network of multiple cameras. Extensive
experiments on four benchmark datasets demonstrate that the proposed approach
significantly outperforms the state-of-the-art unsupervised learning based
alternatives whilst being extremely efficient to compute.Comment: CVPR 2017 Spotligh
Adaptation of Person Re-identification Models for On-boarding New Camera(s)
Existing approaches for person re-identification have concentrated on either designing the best feature representation or learning optimal matching metrics in a static setting where the number of cameras are fixed in a network. Most approaches have neglected the dynamic and open world nature of the re- identification problem, where one or multiple new cameras may be temporarily on-boarded into an ex- isting system to get additional information or added to expand an existing network. To address such a very practical problem, we propose a novel approach for adapting existing multi-camera re-identification frameworks with limited supervision. First, we formulate a domain perceptive re-identification method based on geodesic flow kernel that can effectively find the best source camera (already installed) to adapt with newly introduced target camera(s), without requiring a very expensive training phase. Second, we introduce a transitive inference algorithm for re-identification that can exploit the information from best source camera to improve the accuracy across other camera pairs in a network of multiple cameras. Third, we develop a target-aware sparse prototype selection strategy for finding an informative subset of source camera data for data-efficient learning in resource constrained environments. Our approach can greatly increase the flexibility and reduce the deployment cost of new cameras in many real-world dy- namic camera networks. Extensive experiments demonstrate that our approach significantly outperforms state-of-the-art unsupervised alternatives whilst being extremely efficient to compute
Learning based forensic techniques for source camera identification
In recent years, multimedia forensics has received rapidly growing attention. One challenging problem of multimedia forensics is source camera identification, the goal of which is to identify the source of a multimedia object, such as digital image and video. Sensor pattern noises, produced by imaging sensors, have been proved to be an effective way for source camera identification. Precisely speaking, the conventional SPN-based source camera identification.has two application models: verification and identification. In the past decade, significant progress has been achieved in the tasks of SPN-based source camera verification and identification. However, there are still many cases requiring solutions beyond the capabilities of the current methods. In this thesis, we considered and addressed two commonly seen but less studied problems.
The first problem is the source camera verification with reference SPNs corrupted by scene details. The most significant limitation of using SPN for source camera identification.is that SPN can be seriously contaminated by scene details. Most existing methods consider the contaminations from scene details only occur in query images but not in reference images. To address this issue, we propose a measurement based on the combination of local image entropy and brightness so as to evaluate the quality of SPN contained by different image blocks. Based on this measurement, a context adaptive reference SPN estimator is proposed to address the problem that reference images are contaminated by scene details.
The second problem that we considered relates to the high computational complexity of using SPN in source camera identification., which is caused by the high dimensionality of SPN. In order to improve identification.efficiency without degrading accuracy, we propose an effective feature extraction algorithm based on the concept of PCA denoising to extract a small set of components from the original noise residual, which tends to carry most of the information of the true SPN signal. To further improve the performance of this framework, two enhancement methods are introduced. The first enhancement method is proposed to take the advantage of the label information of the reference images so as to better separate different classes and further reduce the dimensionality. Secondly, we propose an extension based on Candid Covariance-free Incremental PCA to incrementally update the feature extractor according to the received images so that there is no need to re-conduct training every time when a new image is added to the database. Moreover, an ensemble method based on the random subspace method and majority voting is proposed in the context of source camera identification.to tackle the performance degradation of PCA-based feature extraction method due to the corruption by unwanted interferences in the training set.
The proposed algorithms are evaluated on the challenging Dresden image database and experimental results confirmed their effectiveness
Interpretable and Generalizable Person Re-Identification with Query-Adaptive Convolution and Temporal Lifting
For person re-identification, existing deep networks often focus on
representation learning. However, without transfer learning, the learned model
is fixed as is, which is not adaptable for handling various unseen scenarios.
In this paper, beyond representation learning, we consider how to formulate
person image matching directly in deep feature maps. We treat image matching as
finding local correspondences in feature maps, and construct query-adaptive
convolution kernels on the fly to achieve local matching. In this way, the
matching process and results are interpretable, and this explicit matching is
more generalizable than representation features to unseen scenarios, such as
unknown misalignments, pose or viewpoint changes. To facilitate end-to-end
training of this architecture, we further build a class memory module to cache
feature maps of the most recent samples of each class, so as to compute image
matching losses for metric learning. Through direct cross-dataset evaluation,
the proposed Query-Adaptive Convolution (QAConv) method gains large
improvements over popular learning methods (about 10%+ mAP), and achieves
comparable results to many transfer learning methods. Besides, a model-free
temporal cooccurrence based score weighting method called TLift is proposed,
which improves the performance to a further extent, achieving state-of-the-art
results in cross-dataset person re-identification. Code is available at
https://github.com/ShengcaiLiao/QAConv.Comment: This is the ECCV 2020 version, including the appendi
Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-identification
This paper considers the domain adaptive person re-identification (re-ID)
problem: learning a re-ID model from a labeled source domain and an unlabeled
target domain. Conventional methods are mainly to reduce feature distribution
gap between the source and target domains. However, these studies largely
neglect the intra-domain variations in the target domain, which contain
critical factors influencing the testing performance on the target domain. In
this work, we comprehensively investigate into the intra-domain variations of
the target domain and propose to generalize the re-ID model w.r.t three types
of the underlying invariance, i.e., exemplar-invariance, camera-invariance and
neighborhood-invariance. To achieve this goal, an exemplar memory is introduced
to store features of the target domain and accommodate the three invariance
properties. The memory allows us to enforce the invariance constraints over
global training batch without significantly increasing computation cost.
Experiment demonstrates that the three invariance properties and the proposed
memory are indispensable towards an effective domain adaptation system. Results
on three re-ID domains show that our domain adaptation accuracy outperforms the
state of the art by a large margin. Code is available at:
https://github.com/zhunzhong07/ECNComment: To appear in CVPR 201
CANU-ReID: A Conditional Adversarial Network for Unsupervised person Re-IDentification
Unsupervised person re-ID is the task of identifying people on a target data
set for which the ID labels are unavailable during training. In this paper, we
propose to unify two trends in unsupervised person re-ID: clustering &
fine-tuning and adversarial learning. On one side, clustering groups training
images into pseudo-ID labels, and uses them to fine-tune the feature extractor.
On the other side, adversarial learning is used, inspired by domain adaptation,
to match distributions from different domains. Since target data is distributed
across different camera viewpoints, we propose to model each camera as an
independent domain, and aim to learn domain-independent features.
Straightforward adversarial learning yields negative transfer, we thus
introduce a conditioning vector to mitigate this undesirable effect. In our
framework, the centroid of the cluster to which the visual sample belongs is
used as conditioning vector of our conditional adversarial network, where the
vector is permutation invariant (clusters ordering does not matter) and its
size is independent of the number of clusters. To our knowledge, we are the
first to propose the use of conditional adversarial networks for unsupervised
person re-ID. We evaluate the proposed architecture on top of two
state-of-the-art clustering-based unsupervised person re-identification (re-ID)
methods on four different experimental settings with three different data sets
and set the new state-of-the-art performance on all four of them. Our code and
model will be made publicly available at
https://team.inria.fr/perception/canu-reid/
Source identification for mobile devices, based on wavelet transforms combined with sensor imperfections
One of the most relevant applications of digital image forensics is to accurately identify the device used for taking a given set of images, a problem called source identification. This paper studies recent developments in the field and proposes the mixture of two techniques (Sensor Imperfections and Wavelet Transforms) to get better source identification of images generated with mobile devices. Our results show that Sensor Imperfections and Wavelet Transforms can jointly serve as good forensic features to help trace the source camera of images produced by mobile phones. Furthermore, the model proposed here can also determine with high precision both the brand and model of the device
- …