128,786 research outputs found

    Hyp-UML: Hyperbolic Image Retrieval with Uncertainty-aware Metric Learning

    Full text link
    Metric learning plays a critical role in training image retrieval and classification. It is also a key algorithm in representation learning, e.g., for feature learning and its alignment in metric space. Hyperbolic embedding has been recently developed. Compared to the conventional Euclidean embedding in most of the previously developed models, Hyperbolic embedding can be more effective in representing the hierarchical data structure. Second, uncertainty estimation/measurement is a long-lasting challenge in artificial intelligence. Successful uncertainty estimation can improve a machine learning model's performance, robustness, and security. In Hyperbolic space, uncertainty measurement is at least with equivalent, if not more, critical importance. In this paper, we develop a Hyperbolic image embedding with uncertainty-aware metric learning for image retrieval. We call our method Hyp-UML: Hyperbolic Uncertainty-aware Metric Learning. Our contribution are threefold: we propose an image embedding algorithm based on Hyperbolic space, with their corresponding uncertainty value; we propose two types of uncertainty-aware metric learning, for the popular Contrastive learning and conventional margin-based metric learning, respectively. We perform extensive experimental validations to prove that the proposed algorithm can achieve state-of-the-art results among related methods. The comprehensive ablation study validates the effectiveness of each component of the proposed algorithm

    A Geo-Location Context-Aware Mobile Learning System with Adaptive Correlation Computing Methods

    Get PDF
    AbstractThis paper proposes a context-aware mobile learning system with adaptive correlation computing methods. This system enables users to enhance their knowledge by correlating it with daily experiences. The proposed system contains a hybrid metric vector space to define the correlation between heterogeneous metadata vectors of the user context and learning material. The system integrates heterogeneous metric vector spaces with definitions of the semantic relations between the vector spaces. The significant feature of this system is a hybrid adaptation mechanism for the calculation of correlation. The adaptation mechanism has multidirectional adaptation functions for various learning materials, situations, and learners. We propose a revise-localize-personalize (RLP) adaptation model. In the adaptation mechanism, users only have to improve the metadata or the relations just in their relevant field. The advantage of the system is that the system reduces the time-intensive efforts required for describing direct relations between user contexts and learning materials. This paper presents the feasibility of the context-aware heterogeneous information provision with the hybrid metric vector space, by implementing an actual mobile application system and examining real-world experiments on data provision

    Attention-Set based Metric Learning for Video Face Recognition

    Full text link
    Face recognition has made great progress with the development of deep learning. However, video face recognition (VFR) is still an ongoing task due to various illumination, low-resolution, pose variations and motion blur. Most existing CNN-based VFR methods only obtain a feature vector from a single image and simply aggregate the features in a video, which less consider the correlations of face images in one video. In this paper, we propose a novel Attention-Set based Metric Learning (ASML) method to measure the statistical characteristics of image sets. It is a promising and generalized extension of Maximum Mean Discrepancy with memory attention weighting. First, we define an effective distance metric on image sets, which explicitly minimizes the intra-set distance and maximizes the inter-set distance simultaneously. Second, inspired by Neural Turing Machine, a Memory Attention Weighting is proposed to adapt set-aware global contents. Then ASML is naturally integrated into CNNs, resulting in an end-to-end learning scheme. Our method achieves state-of-the-art performance for the task of video face recognition on the three widely used benchmarks including YouTubeFace, YouTube Celebrities and Celebrity-1000.Comment: modify for ACP
    • …
    corecore