
    Optimized Cartesian K-Means

    Product quantization-based approaches are effective for encoding high-dimensional data points for approximate nearest neighbor search. The space is decomposed into a Cartesian product of low-dimensional subspaces, each of which generates a sub codebook. Data points are encoded as compact binary codes using these sub codebooks, and the distance between two data points can be approximated efficiently from their codes using precomputed lookup tables. Traditionally, to encode a subvector of a data point in a subspace, only one sub codeword in the corresponding sub codebook is selected, which may severely limit search accuracy. In this paper, we propose a novel approach, named Optimized Cartesian K-Means (OCKM), to better encode the data points for more accurate approximate nearest neighbor search. In OCKM, multiple sub codewords are used to encode the subvector of a data point in a subspace. The sub codewords stem from different sub codebooks in each subspace, which are generated optimally so as to minimize the distortion errors. The high-dimensional data point is then encoded as the concatenation of the indices of multiple sub codewords from all the subspaces. This provides more flexibility and lower distortion errors than traditional methods. Experimental results on standard real-life datasets demonstrate superiority over state-of-the-art approaches for approximate nearest neighbor search. Comment: to appear in IEEE TKDE, accepted in Apr. 201
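    The OCKM encoding step lends itself to a short illustration. The Python sketch below is not the paper's implementation: it assumes two already-trained sub codebooks per subspace and exhaustively searches for the pair of sub codewords whose sum best approximates each subvector; the function name is ours.

    import numpy as np

    def encode_ockm_style(x, codebooks):
        """Encode x as one sub codeword index per sub codebook per subspace.

        codebooks: list over subspaces; each entry holds two sub codebooks,
        each an array of shape (K, d_sub). (Two books per subspace is an
        illustrative assumption.)
        """
        d_sub = codebooks[0][0].shape[1]
        codes = []
        for m, (book_a, book_b) in enumerate(codebooks):
            sub = x[m * d_sub:(m + 1) * d_sub]
            # Search the Cartesian product of the two sub codebooks for the
            # pair whose *sum* best approximates the subvector.
            best, best_err = None, np.inf
            for i, ci in enumerate(book_a):
                for j, cj in enumerate(book_b):
                    err = float(np.sum((sub - ci - cj) ** 2))
                    if err < best_err:
                        best, best_err = (i, j), err
            codes.append(best)
        return codes

    # Toy usage: 2 subspaces of dimension 4, 16 codewords per sub codebook.
    rng = np.random.default_rng(0)
    books = [[rng.normal(size=(16, 4)) for _ in range(2)] for _ in range(2)]
    print(encode_ockm_style(rng.normal(size=8), books))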

    Bit Allocation Law for Multi-Antenna Channel Feedback Quantization: Single-User Case

    This paper studies the design and optimization of a limited-feedback single-user system with a multiple-antenna transmitter and a single-antenna receiver. The design problem is cast as minimizing the average transmission power at the base station subject to the user's outage probability constraint. The optimization is over the user's channel quantization codebook and the transmission power control function at the base station. Our approach is based on fixing the outage scenarios in advance and transforming the design problem into a robust system design problem. We start by showing that uniformly quantizing the channel magnitude in the dB scale is asymptotically optimal, regardless of the magnitude distribution function. We derive the optimal uniform (in dB) channel magnitude codebook and combine it with a spatially uniform channel direction codebook to arrive at a product channel quantization codebook. We then optimize this product structure in the asymptotic regime $B \rightarrow \infty$, where $B$ is the total number of quantization feedback bits. The paper shows that for channels in the real space, the asymptotically optimal number of direction quantization bits should be $(M-1)/2$ times the number of magnitude quantization bits, where $M$ is the number of base station antennas. We also show that the performance of the designed system approaches the performance of the perfect channel state information system as $2^{-\frac{2B}{M+1}}$. For complex channels, the numbers of magnitude and direction quantization bits are related by a factor of $(M-1)$, and the system performance scales as $2^{-\frac{B}{M}}$ as $B \rightarrow \infty$. Comment: Submitted to IEEE Transactions on Signal Processing, March 201
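    The real-channel allocation law above pins down the split of the feedback budget: with $B_{dir} = \frac{M-1}{2} B_{mag}$ and $B = B_{mag} + B_{dir}$, one gets $B_{mag} = 2B/(M+1)$ and $B_{dir} = (M-1)B/(M+1)$, so the quoted gap $2^{-\frac{2B}{M+1}}$ is exactly $2^{-B_{mag}}$. The snippet below is only an arithmetic illustration of this split; the function name is ours.

    def split_feedback_bits(B, M):
        """Asymptotically optimal (magnitude, direction) bit split, real channel."""
        b_mag = 2 * B / (M + 1)          # magnitude quantization bits
        b_dir = (M - 1) * B / (M + 1)    # direction quantization bits
        return b_mag, b_dir

    b_mag, b_dir = split_feedback_bits(B=12, M=3)   # -> (6.0, 6.0)
    gap = 2 ** (-b_mag)                             # performance gap 2^{-2B/(M+1)}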

    Weighted universal image compression

    We describe a general coding strategy leading to a family of universal image compression systems designed to give good performance in applications where the statistics of the source to be compressed are not available at design time or vary over time or space. The basic approach uses a two-stage structure in which the single source code of traditional image compression systems is replaced with a family of codes designed to cover a large class of possible sources. To illustrate this approach, we consider the optimal design and use of two-stage codes containing collections of vector quantizers (weighted universal vector quantization), bit allocations for JPEG-style coding (weighted universal bit allocation), and transform codes (weighted universal transform coding). Further, we demonstrate the benefits gained from the inclusion of perceptual distortion measures and optimal parsing. The strategy yields two-stage codes that significantly outperform their single-stage predecessors. On a sequence of medical images, weighted universal vector quantization outperforms entropy-coded vector quantization by over 9 dB. On the same data sequence, weighted universal bit allocation outperforms a JPEG-style code by over 2.5 dB. On a collection of mixed text and image data, weighted universal transform coding outperforms a single, data-optimized transform code (which gives performance almost identical to that of JPEG) by over 6 dB.
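    The two-stage structure can be sketched compactly. In the Python sketch below, plain vector quantizers stand in for the paper's collections of quantizers, bit allocations, and transform codes; the first stage picks whichever code in the family matches the block best, and its index is transmitted along with the second-stage output. Names and the squared-error criterion are illustrative; code design and entropy coding are omitted.

    import numpy as np

    def two_stage_encode(block, quantizers):
        """quantizers: list of (K, d) codeword arrays, one per first-stage code.

        Returns (first-stage code index, second-stage codeword index).
        """
        best_q, best_c, best_err = None, None, np.inf
        for q_idx, Q in enumerate(quantizers):
            errs = np.sum((Q - block) ** 2, axis=1)  # distortion per codeword
            c_idx = int(np.argmin(errs))
            if errs[c_idx] < best_err:
                best_q, best_c, best_err = q_idx, c_idx, errs[c_idx]
        return best_q, best_c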

    Packing and Padding: Coupled Multi-index for Accurate Image Retrieval

    In Bag-of-Words (BoW) based image retrieval, the SIFT visual word has low discriminative power, so false positive matches are prevalent. Apart from the information loss during quantization, another cause is that the SIFT feature describes only the local gradient distribution. To address this problem, this paper proposes a coupled Multi-Index (c-MI) framework to perform feature fusion at the indexing level. Complementary features are coupled into a multi-dimensional inverted index. Each dimension of c-MI corresponds to one kind of feature, and the retrieval process votes for images that are similar in both the SIFT and the other feature spaces. Specifically, we fuse a local color feature into c-MI. While coupling greatly enhances the precision of visual matching, we adopt Multiple Assignment to improve recall. The joint cooperation of SIFT and color features significantly reduces the impact of false positive matches. Extensive experiments on several benchmark datasets demonstrate that c-MI improves retrieval accuracy significantly, while consuming only half of the query time of the baseline. Importantly, we show that c-MI is well complementary to many prior techniques. Assembling these methods, we obtain an mAP of 85.8% and an N-S score of 3.85 on the Holidays and Ukbench datasets, respectively, which compare favorably with the state of the art. Comment: 8 pages, 7 figures, 6 tables. Accepted to CVPR 201
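    The coupling itself is a small data-structure idea and is easy to sketch. The Python below is illustrative, not the authors' code: each local feature is quantized to a (SIFT word, color word) pair, the inverted index is keyed on the pair, and a database feature therefore votes only when both dimensions match. Multiple Assignment, weighting, and the quantizers themselves are omitted; all names are ours.

    from collections import defaultdict

    index = defaultdict(list)  # (sift_word, color_word) -> list of image ids

    def add_feature(image_id, sift_word, color_word):
        index[(sift_word, color_word)].append(image_id)

    def query(features):
        """features: iterable of (sift_word, color_word) pairs from the query."""
        votes = defaultdict(int)
        for key in features:
            for image_id in index.get(key, ()):
                votes[image_id] += 1  # a vote requires SIFT and color to agree
        return sorted(votes, key=votes.get, reverse=True)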

    Using Content Analysis for Video Compression

    This paper suggests modeling video information as a concatenation of different recurring sources. For each source, a tailored compressed representation can be optimally designed to best match the intrinsic characteristics of the viewed scene. Since a shot or scene with similar visual content recurs more than once in a video, even at distant intervals in time, this makes it possible to build a more compact representation of the information. In a specific implementation of this idea, we suggest a content-based approach that structures video sequences into hierarchical summaries and represents each summary by a tailored set of dictionaries of codewords. Vector quantization techniques, formerly employed only for compression, are used here first to represent the visual content of video shots and then to exploit visual-content redundancy inside the video. The depth in the hierarchy determines the precision of the representation, both structurally and in the quality with which the video sequence is reproduced. The effectiveness of the proposed method is demonstrated by preliminary tests on a limited collection of video data excerpted from a feature movie. Additional functionalities such as video skimming may benefit considerably from this type of representation.
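    A rough sketch of the shot-level representation, under stated assumptions: frames are grouped into shots, a small codeword dictionary is trained per recurring shot (scikit-learn's KMeans stands in for the paper's codebook design, and the hierarchy of summaries is omitted), and each image block is stored as the index of its nearest codeword in its shot's dictionary. All names are illustrative.

    import numpy as np
    from sklearn.cluster import KMeans

    def train_shot_codebook(blocks, n_codewords=256):
        """blocks: (N, d) array of image blocks taken from one shot."""
        km = KMeans(n_clusters=n_codewords, n_init=4).fit(blocks)
        return km.cluster_centers_

    def encode_shot(blocks, codebook):
        # Replace each block by the index of its nearest codeword.
        d2 = ((blocks[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
        return np.argmin(d2, axis=1)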