Online Product Quantization
Approximate nearest neighbor (ANN) search has achieved great success in many
tasks. However, existing popular methods for ANN search, such as hashing and
quantization methods, are designed for static databases only. They cannot handle well databases whose data distribution evolves dynamically, owing to the high computational cost of retraining the model on the new database. In this paper, we address the problem by developing an online product quantization (online PQ) model that incrementally updates the quantization codebook to accommodate the incoming streaming data. Moreover, to further alleviate the computational burden of the online PQ update at large scale, we design two budget constraints that allow the model to update only part of the PQ codebook instead of all of it. We derive a loss bound which guarantees the performance of our
online PQ model. Furthermore, we develop an online PQ model over a sliding
window with both data insertion and deletion supported, to reflect the
real-time behaviour of the data. The experiments demonstrate that, compared with baseline methods, our online PQ model is both time-efficient and effective for ANN search in dynamic large-scale databases, and that the partial PQ codebook update further reduces the update cost.
Comment: To appear in IEEE Transactions on Knowledge and Data Engineering (DOI: 10.1109/TKDE.2018.2817526)
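To make the incremental codebook update concrete, below is a minimal Python sketch, assuming an online k-means style per-centroid update and a hypothetical `budget` parameter that restricts each update to the worst-quantized subspaces; the class name `OnlinePQ` and the 1/count learning rate are illustrative assumptions, not the paper's exact algorithm.

```python
import numpy as np

class OnlinePQ:
    """Illustrative online PQ: M subspaces, K centroids per subspace."""

    def __init__(self, dim, n_subspaces=4, n_centroids=256, seed=0):
        assert dim % n_subspaces == 0
        self.m = n_subspaces
        self.k = n_centroids
        self.d_sub = dim // n_subspaces
        rng = np.random.default_rng(seed)
        # One codebook of shape (K, d_sub) per subspace.
        self.codebooks = rng.normal(size=(self.m, self.k, self.d_sub))
        # Per-centroid hit counts drive the 1/count learning rate (assumed).
        self.counts = np.ones((self.m, self.k))

    def encode(self, x):
        """Return the M-dimensional PQ code (one centroid id per subspace)."""
        code = np.empty(self.m, dtype=np.int64)
        for j in range(self.m):
            sub = x[j * self.d_sub:(j + 1) * self.d_sub]
            dists = np.linalg.norm(self.codebooks[j] - sub, axis=1)
            code[j] = np.argmin(dists)
        return code

    def partial_update(self, x, budget=None):
        """Assign x, then move the winning centroids toward it.

        If `budget` is set, only the `budget` subspaces with the largest
        quantization error are updated (a partial-codebook update).
        """
        code = self.encode(x)
        subs = x.reshape(self.m, self.d_sub)
        errors = np.array([np.linalg.norm(subs[j] - self.codebooks[j, code[j]])
                           for j in range(self.m)])
        update_set = (np.argsort(errors)[-budget:] if budget is not None
                      else range(self.m))
        for j in update_set:
            c = code[j]
            self.counts[j, c] += 1
            lr = 1.0 / self.counts[j, c]
            # Online mean update: pull the centroid toward the new point.
            self.codebooks[j, c] += lr * (subs[j] - self.codebooks[j, c])
        return code
```

A stream would then be processed by calling `partial_update` once per arriving vector; sliding-window deletion would additionally require reversing the centroid contribution of expired points, which this sketch omits.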
Scalable Image Retrieval by Sparse Product Quantization
Fast Approximate Nearest Neighbor (ANN) search technique for high-dimensional
feature indexing and retrieval is the crux of large-scale image retrieval. A
recent promising technique is Product Quantization, which attempts to index
high-dimensional image features by decomposing the feature space into a
Cartesian product of low dimensional subspaces and quantizing each of them
separately. Despite the promising results reported, this quantization step follows the typical hard assignment of traditional quantization methods, which may result in large quantization errors and thus inferior search performance.
Unlike the existing approaches, in this paper we propose a novel approach called Sparse Product Quantization (SPQ) that encodes the high-dimensional feature vectors into sparse representations. We optimize the sparse representations of the feature vectors by minimizing their quantization errors, making the resulting representation essentially close to the original data in practice. Experiments show that the proposed SPQ technique not only compresses data well but also serves as an effective encoding scheme. We obtain state-of-the-art results for ANN search on four public image datasets, and the promising results on content-based image retrieval further validate the efficacy of our proposed method.
Comment: 12 pages
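As a rough illustration of replacing hard assignment with a sparse soft assignment, here is a minimal Python sketch, assuming each subvector is approximated by a least-squares weighted combination of its `s` nearest codewords; the support-selection heuristic and function names are assumptions for illustration, not the paper's optimization procedure.

```python
import numpy as np

def sparse_quantize(x, codebooks, s=2):
    """Encode x as per-subspace (indices, weights) pairs.

    codebooks: array of shape (M, K, d_sub); x: vector of length M * d_sub.
    """
    m, _, d_sub = codebooks.shape
    subs = x.reshape(m, d_sub)
    codes = []
    for j in range(m):
        dists = np.linalg.norm(codebooks[j] - subs[j], axis=1)
        support = np.argsort(dists)[:s]        # the s nearest codewords
        B = codebooks[j, support].T            # (d_sub, s) basis
        # Least-squares weights on the chosen support minimize the
        # quantization error ||sub - B w||^2 for this subspace.
        w, *_ = np.linalg.lstsq(B, subs[j], rcond=None)
        codes.append((support, w))
    return codes

def reconstruct(codes, codebooks):
    """Rebuild the approximate vector from its sparse per-subspace codes."""
    parts = [codebooks[j, idx].T @ w for j, (idx, w) in enumerate(codes)]
    return np.concatenate(parts)
```

With s = 1 this reduces to ordinary hard-assignment PQ, which is why the sparse weights can only lower the quantization error relative to the hard-assignment baseline.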
Cycle-Consistent Deep Generative Hashing for Cross-Modal Retrieval
In this paper, we propose a novel deep generative approach to cross-modal
retrieval to learn hash functions in the absence of paired training samples
through a cycle consistency loss. Our proposed approach employs an adversarial training scheme to learn a pair of hash functions that enable translation between modalities while assuming an underlying semantic relationship. To endow the hash codes of each input-output pair with semantics, a cycle consistency loss is further imposed on top of the adversarial training to strengthen the correlations between inputs and their corresponding outputs. Our approach learns hash functions generatively, such that the learned hash codes maximally correlate each input-output correspondence while also regenerating the inputs so as to minimize information loss. The learning-to-hash embedding is thus performed
to jointly optimize the parameters of the hash functions across modalities as
well as the associated generative models. Extensive experiments on a variety of
large-scale cross-modal data sets demonstrate that our proposed method achieves
better retrieval results than the state of the art.
Comment: To appear in IEEE Trans. Image Processing. arXiv admin note: text overlap with arXiv:1703.10593 by other authors
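The interplay of the reconstruction and cycle-consistency terms described above can be sketched in Python with PyTorch, assuming a hypothetical encoder/decoder module `Coder` for each modality and a tanh relaxation of the binary codes; the adversarial discriminators are omitted, and none of these names reflect the paper's actual architecture.

```python
import torch.nn as nn

class Coder(nn.Module):
    """Assumed per-modality hash encoder plus decoder back to the input."""

    def __init__(self, in_dim, code_bits):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(in_dim, 512), nn.ReLU(),
                                 nn.Linear(512, code_bits), nn.Tanh())
        self.dec = nn.Sequential(nn.Linear(code_bits, 512), nn.ReLU(),
                                 nn.Linear(512, in_dim))

    def forward(self, x):
        code = self.enc(x)      # relaxed binary code in (-1, 1)
        recon = self.dec(code)  # regenerate the input from its code
        return code, recon

def cycle_hash_loss(img, txt, f_img, g_txt):
    """Reconstruction + cross-modal cycle terms; adversarial terms omitted."""
    c_img, rec_img = f_img(img)
    _, rec_txt = g_txt(txt)
    l1 = nn.L1Loss()
    # Regenerating each input from its own code limits information loss.
    recon = l1(rec_img, img) + l1(rec_txt, txt)
    # Translate image -> code -> text -> code -> image and require the
    # round trip to return the original (the cycle-consistency term).
    txt_from_img = g_txt.dec(c_img)
    img_back = f_img.dec(g_txt.enc(txt_from_img))
    cycle = l1(img_back, img)
    return recon + cycle

# Usage sketch (dimensions are illustrative):
#   f_img, g_txt = Coder(4096, 64), Coder(300, 64)
#   loss = cycle_hash_loss(img_batch, txt_batch, f_img, g_txt)
#   loss.backward()
```

Because the cycle term never needs an image and a text that describe each other in the same batch, this style of objective is what permits training without paired samples.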