2,579 research outputs found
Highly Efficient Regression for Scalable Person Re-Identification
Existing person re-identification models are poor for scaling up to large
data required in real-world applications due to: (1) Complexity: They employ
complex models for optimal performance resulting in high computational cost for
training at a large scale; (2) Inadaptability: Once trained, they are
unsuitable for incremental update to incorporate any new data available. This
work proposes a truly scalable solution to re-id by addressing both problems.
Specifically, a Highly Efficient Regression (HER) model is formulated by
embedding the Fisher's criterion to a ridge regression model for very fast
re-id model learning with scalable memory/storage usage. Importantly, this new
HER model supports faster than real-time incremental model updates therefore
making real-time active learning feasible in re-id with human-in-the-loop.
Extensive experiments show that such a simple and fast model not only
outperforms notably the state-of-the-art re-id methods, but also is more
scalable to large data with additional benefits to active learning for reducing
human labelling effort in re-id deployment
Incremental Training of a Detector Using Online Sparse Eigen-decomposition
The ability to efficiently and accurately detect objects plays a very crucial
role for many computer vision tasks. Recently, offline object detectors have
shown a tremendous success. However, one major drawback of offline techniques
is that a complete set of training data has to be collected beforehand. In
addition, once learned, an offline detector can not make use of newly arriving
data. To alleviate these drawbacks, online learning has been adopted with the
following objectives: (1) the technique should be computationally and storage
efficient; (2) the updated classifier must maintain its high classification
accuracy. In this paper, we propose an effective and efficient framework for
learning an adaptive online greedy sparse linear discriminant analysis (GSLDA)
model. Unlike many existing online boosting detectors, which usually apply
exponential or logistic loss, our online algorithm makes use of LDA's learning
criterion that not only aims to maximize the class-separation criterion but
also incorporates the asymmetrical property of training data distributions. We
provide a better alternative for online boosting algorithms in the context of
training a visual object detector. We demonstrate the robustness and efficiency
of our methods on handwriting digit and face data sets. Our results confirm
that object detection tasks benefit significantly when trained in an online
manner.Comment: 14 page
Robust object tracking based on weighted subspace reconstruction error with forward: Backward tracking criterion
© 2015 SPIE and IS & T. It is a challenging task to develop an effective and robust object tracking method due to factors such as severe occlusion, background clutters, abrupt motion, illumination variation, and so on. A tracking algorithm based on weighted subspace reconstruction error is proposed. The discriminative weights are defined based on minimizing reconstruction error with a positive dictionary while maximizing reconstruction error with a negative dictionary. Then a confidence map for candidates is computed through the subspace reconstruction error. Finally, the location of the target object is estimated by maximizing the decision map which combines the discriminative weights and subspace reconstruction error. Furthermore, the new evaluation method based on a forward-backward tracking criterion is used to verify the proposed method and demonstrates its robustness in the updating stage and its effectiveness in the reduction of accumulated errors. Experimental results on 12 challenging video sequences show that the proposed algorithm performs favorably against 12 state-of-the-art methods in terms of accuracy and robustness
State of the Art in Face Recognition
Notwithstanding the tremendous effort to solve the face recognition problem, it is not possible yet to design a face recognition system with a potential close to human performance. New computer vision and pattern recognition approaches need to be investigated. Even new knowledge and perspectives from different fields like, psychology and neuroscience must be incorporated into the current field of face recognition to design a robust face recognition system. Indeed, many more efforts are required to end up with a human like face recognition system. This book tries to make an effort to reduce the gap between the previous face recognition research state and the future state
Semi-supervised tensor-based graph embedding learning and its application to visual discriminant tracking
An appearance model adaptable to changes in object appearance is critical in visual object tracking. In
this paper, we treat an image patch as a 2-order tensor which preserves the original image structure. We design
two graphs for characterizing the intrinsic local geometrical structure of the tensor samples of the object and the
background. Graph embedding is used to reduce the dimensions of the tensors while preserving the structure of
the graphs. Then, a discriminant embedding space is constructed. We prove two propositions for finding the
transformation matrices which are used to map the original tensor samples to the tensor-based graph embedding
space. In order to encode more discriminant information in the embedding space, we propose a transfer-learningbased
semi-supervised strategy to iteratively adjust the embedding space into which discriminative information
obtained from earlier times is transferred. We apply the proposed semi-supervised tensor-based graph
embedding learning algorithm to visual tracking. The new tracking algorithm captures an object’s appearance
characteristics during tracking and uses a particle filter to estimate the optimal object state. Experimental results
on the CVPR 2013 benchmark dataset demonstrate the effectiveness of the proposed tracking algorithm
Temporal Model Adaptation for Person Re-Identification
Person re-identification is an open and challenging problem in computer
vision. Majority of the efforts have been spent either to design the best
feature representation or to learn the optimal matching metric. Most approaches
have neglected the problem of adapting the selected features or the learned
model over time. To address such a problem, we propose a temporal model
adaptation scheme with human in the loop. We first introduce a
similarity-dissimilarity learning method which can be trained in an incremental
fashion by means of a stochastic alternating directions methods of multipliers
optimization procedure. Then, to achieve temporal adaptation with limited human
effort, we exploit a graph-based approach to present the user only the most
informative probe-gallery matches that should be used to update the model.
Results on three datasets have shown that our approach performs on par or even
better than state-of-the-art approaches while reducing the manual pairwise
labeling effort by about 80%
- …