Search CORE

697 research outputs found

A Review of Codebook Models in Patch-Based Visual Object Recognition

Author: Niranjan Mahesan
Ramanan Amirthalingam
Publication venue
Publication date: 22/09/2011
Field of study

The codebook model-based approach, while ignoring any structural aspect in vision, nonetheless provides state-of-the-art performances on current datasets. The key role of a visual codebook is to provide a way to map the low-level features into a fixed-length vector in histogram space to which standard classifiers can be directly applied. The discriminative power of such a visual codebook determines the quality of the codebook model, whereas the size of the codebook controls the complexity of the model. Thus, the construction of a codebook is an important step which is usually done by cluster analysis. However, clustering is a process that retains regions of high density in a distribution and it follows that the resulting codebook need not have discriminant properties. This is also recognised as a computational bottleneck of such systems. In our recent work, we proposed a resource-allocating codebook, to constructing a discriminant codebook in a one-pass design procedure that slightly outperforms more traditional approaches at drastically reduced computing times. In this review we survey several approaches that have been proposed over the last decade with their use of feature detectors, descriptors, codebook construction schemes, choice of classifiers in recognising objects, and datasets that were used in evaluating the proposed methods

Southampton (e-Prints Soton)

A Review of Codebook Models in Patch-Based Visual Object Recognition

Author: A Agarwal
AK Jain
Amirthalingam Ramanan
B Leibe
D Comaniciu
D Lowe
D Nister
DM Blei
EB Sudderth
F Perronnin
G Peterson
H Bay
J Canny
J Zhang
JC Platt
JC Platt
JDR Farquhar
JR Quinlan
K Mikolajczyk
K Mikolajczyk
L Breiman
L Juan
Mahesan Niranjan
N Larios
P Quelhas
S Agarwal
S Lazebnik
T Kadir
X Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Enhanced spatial pyramid matching using log-polar-based image subdivision and representation

Author: Mayo Michael
Zhang Edmond Yiwen
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2010
Field of study

This paper presents a new model for capturing spatial information for object categorization with bag-of-words (BOW). BOW models have recently become popular for the task of object recognition, owing to their good performance and simplicity. Much work has been proposed over the years to improve the BOW model, where the Spatial Pyramid Matching (SPM) technique is the most notable. We propose a new method to exploit spatial relationships between image features, based on binned log-polar grids. Our model works by partitioning the image into grids of different scales and orientations and computing histogram of local features within each grid. Experimental results show that our approach improves the results on three diverse datasets over the SPM technique

Research Commons@Waikato