66,785 research outputs found

    On Semantic Search Algorithm Optimization

    Get PDF
    In the article we consider, on the example of development of a relational database (RDB) information system for Tatneft oil and gas company, an approach to organization of effective search in large arrays of heterogeneous data, satisfying the following essential requirements. On the one hand, the data is integrated at the semantic level, i.e. the system supports the presentation of data, describing its semantic properties within an unified subject domain ontology. Accordingly, end user's request are formulated exclusively in the subject domain terminology. On the other hand, the system generates unregulated SQL-queries, i.e. the full text of possible SQL-queries, not just values of particular parameters, predefined by the system developers. Considered approach includes both the possibilities of increasing the reactivity of the universal SQL queries generation scheme as well as more specific optimization possibilities, arising from the particular system usage context.472-48

    SADIH: Semantic-Aware DIscrete Hashing

    Full text link
    Due to its low storage cost and fast query speed, hashing has been recognized to accomplish similarity search in large-scale multimedia retrieval applications. Particularly supervised hashing has recently received considerable research attention by leveraging the label information to preserve the pairwise similarities of data points in the Hamming space. However, there still remain two crucial bottlenecks: 1) the learning process of the full pairwise similarity preservation is computationally unaffordable and unscalable to deal with big data; 2) the available category information of data are not well-explored to learn discriminative hash functions. To overcome these challenges, we propose a unified Semantic-Aware DIscrete Hashing (SADIH) framework, which aims to directly embed the transformed semantic information into the asymmetric similarity approximation and discriminative hashing function learning. Specifically, a semantic-aware latent embedding is introduced to asymmetrically preserve the full pairwise similarities while skillfully handle the cumbersome n times n pairwise similarity matrix. Meanwhile, a semantic-aware autoencoder is developed to jointly preserve the data structures in the discriminative latent semantic space and perform data reconstruction. Moreover, an efficient alternating optimization algorithm is proposed to solve the resulting discrete optimization problem. Extensive experimental results on multiple large-scale datasets demonstrate that our SADIH can clearly outperform the state-of-the-art baselines with the additional benefit of lower computational costs.Comment: Accepted by The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19

    Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets

    Full text link
    We are concerned with the vulnerability of computer vision models to distributional shifts. We formulate a combinatorial optimization problem that allows evaluating the regions in the image space where a given model is more vulnerable, in terms of image transformations applied to the input, and face it with standard search algorithms. We further embed this idea in a training procedure, where we define new data augmentation rules according to the image transformations that the current model is most vulnerable to, over iterations. An empirical evaluation on classification and semantic segmentation problems suggests that the devised algorithm allows to train models that are more robust against content-preserving image manipulations and, in general, against distributional shifts.Comment: ICCV 2019 (camera ready
    corecore