66,785 research outputs found
On Semantic Search Algorithm Optimization
In the article we consider, on the example of development of a
relational database (RDB) information system for Tatneft oil and gas company, an approach to organization of effective search in large arrays of heterogeneous data, satisfying the following essential requirements.
On the one hand, the data is integrated at the semantic level, i.e. the system
supports the presentation of data, describing its semantic properties within an
unified subject domain ontology. Accordingly, end user's request are formulated
exclusively in the subject domain terminology.
On the other hand, the system generates unregulated SQL-queries, i.e. the full
text of possible SQL-queries, not just values of particular parameters, predefined
by the system developers.
Considered approach includes both the possibilities of increasing the reactivity
of the universal SQL queries generation scheme as well as more specific
optimization possibilities, arising from the particular system usage context.472-48
SADIH: Semantic-Aware DIscrete Hashing
Due to its low storage cost and fast query speed, hashing has been recognized
to accomplish similarity search in large-scale multimedia retrieval
applications. Particularly supervised hashing has recently received
considerable research attention by leveraging the label information to preserve
the pairwise similarities of data points in the Hamming space. However, there
still remain two crucial bottlenecks: 1) the learning process of the full
pairwise similarity preservation is computationally unaffordable and unscalable
to deal with big data; 2) the available category information of data are not
well-explored to learn discriminative hash functions. To overcome these
challenges, we propose a unified Semantic-Aware DIscrete Hashing (SADIH)
framework, which aims to directly embed the transformed semantic information
into the asymmetric similarity approximation and discriminative hashing
function learning. Specifically, a semantic-aware latent embedding is
introduced to asymmetrically preserve the full pairwise similarities while
skillfully handle the cumbersome n times n pairwise similarity matrix.
Meanwhile, a semantic-aware autoencoder is developed to jointly preserve the
data structures in the discriminative latent semantic space and perform data
reconstruction. Moreover, an efficient alternating optimization algorithm is
proposed to solve the resulting discrete optimization problem. Extensive
experimental results on multiple large-scale datasets demonstrate that our
SADIH can clearly outperform the state-of-the-art baselines with the additional
benefit of lower computational costs.Comment: Accepted by The Thirty-Third AAAI Conference on Artificial
Intelligence (AAAI-19
Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets
We are concerned with the vulnerability of computer vision models to
distributional shifts. We formulate a combinatorial optimization problem that
allows evaluating the regions in the image space where a given model is more
vulnerable, in terms of image transformations applied to the input, and face it
with standard search algorithms. We further embed this idea in a training
procedure, where we define new data augmentation rules according to the image
transformations that the current model is most vulnerable to, over iterations.
An empirical evaluation on classification and semantic segmentation problems
suggests that the devised algorithm allows to train models that are more robust
against content-preserving image manipulations and, in general, against
distributional shifts.Comment: ICCV 2019 (camera ready
- …