Search CORE

69,291 research outputs found

Supervised learning with hybrid global optimisation methods

Author: Georgieva A.
Jordanov Ivan
Publication venue: I-Tech Education and Publishing
Publication date: 01/01/2009
Field of study

Portsmouth University Research Portal (Pure)

Compositional Model based Fisher Vector Coding for Image Classification

Author: Hengel Anton van den
Liu Lingqiao
Shen Chunhua
Shen Heng Tao
Wang Chao
Wang Lei
Wang Peng
Publication venue
Publication date: 01/01/2017
Field of study

Deriving from the gradient vector of a generative model of local features, Fisher vector coding (FVC) has been identified as an effective coding method for image classification. Most, if not all, FVC implementations employ the Gaussian mixture model (GMM) to depict the generation process of local features. However, the representative power of the GMM could be limited because it essentially assumes that local features can be characterized by a fixed number of feature prototypes and the number of prototypes is usually small in FVC. To handle this limitation, in this paper we break the convention which assumes that a local feature is drawn from one of few Gaussian distributions. Instead, we adopt a compositional mechanism which assumes that a local feature is drawn from a Gaussian distribution whose mean vector is composed as the linear combination of multiple key components and the combination weight is a latent random variable. In this way, we can greatly enhance the representative power of the generative model of FVC. To implement our idea, we designed two particular generative models with such a compositional mechanism.Comment: Fixed typos. 16 pages. Appearing in IEEE T. Pattern Analysis and Machine Intelligence (TPAMI

arXiv.org e-Print Archive

Adelaide Research & Scholarship

Research Online

A Discriminative Representation of Convolutional Features for Indoor Scene Recognition

Author: Bennamoun Mohammed
Hayat Munawar
Khan Salman H.
Sohel Ferdous
Togneri Roberto
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/06/2015
Field of study

Indoor scene recognition is a multi-faceted and challenging problem due to the diverse intra-class variations and the confusing inter-class similarities. This paper presents a novel approach which exploits rich mid-level convolutional features to categorize indoor scenes. Traditionally used convolutional features preserve the global spatial structure, which is a desirable property for general object recognition. However, we argue that this structuredness is not much helpful when we have large variations in scene layouts, e.g., in indoor scenes. We propose to transform the structured convolutional activations to another highly discriminative feature space. The representation in the transformed space not only incorporates the discriminative aspects of the target dataset, but it also encodes the features in terms of the general object categories that are present in indoor scenes. To this end, we introduce a new large-scale dataset of 1300 object categories which are commonly present in indoor scenes. Our proposed approach achieves a significant performance boost over previous state of the art approaches on five major scene classification datasets

arXiv.org e-Print Archive

University of Canberra Research Repository

Research Repository

Harvesting Discriminative Meta Objects with Deep CNN Features for Scene Classification

Author: Wang Baoyuan
Wang Wenping
Wu Ruobing
Yu Yizhou
Publication venue
Publication date: 01/01/2015
Field of study

Recent work on scene classification still makes use of generic CNN features in a rudimentary manner. In this ICCV 2015 paper, we present a novel pipeline built upon deep CNN features to harvest discriminative visual objects and parts for scene classification. We first use a region proposal technique to generate a set of high-quality patches potentially containing objects, and apply a pre-trained CNN to extract generic deep features from these patches. Then we perform both unsupervised and weakly supervised learning to screen these patches and discover discriminative ones representing category-specific objects and parts. We further apply discriminative clustering enhanced with local CNN fine-tuning to aggregate similar objects and parts into groups, called meta objects. A scene image representation is constructed by pooling the feature response maps of all the learned meta objects at multiple spatial scales. We have confirmed that the scene image representation obtained using this new pipeline is capable of delivering state-of-the-art performance on two popular scene benchmark datasets, MIT Indoor 67~\cite{MITIndoor67} and Sun397~\cite{Sun397}Comment: To Appear in ICCV 201

arXiv.org e-Print Archive

HKU Scholars Hub