Search CORE

3,113 research outputs found

Action detection in office scene based on deep convolutional neural networks

Author: An Yu-Di
Smith Jeremy S
Yan Shi-Yang
Zhang Bai-Ling
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2016
Field of study

What Can I Do Around Here? Deep Functional Scene Understanding for Cognitive Robots

Author: Aloimonos Yiannis
Fermuller Cornelia
Yang Yezhou
Ye Chengxi
Publication venue
Publication date: 02/02/2016
Field of study

For robots that have the capability to interact with the physical environment through their end effectors, understanding the surrounding scenes is not merely a task of image classification or object recognition. To perform actual tasks, it is critical for the robot to have a functional understanding of the visual scene. Here, we address the problem of localizing and recognition of functional areas from an arbitrary indoor scene, formulated as a two-stage deep learning based detection pipeline. A new scene functionality testing-bed, which is complied from two publicly available indoor scene datasets, is used for evaluation. Our method is evaluated quantitatively on the new dataset, demonstrating the ability to perform efficient recognition of functional areas from arbitrary indoor scenes. We also demonstrate that our detection model can be generalized onto novel indoor scenes by cross validating it with the images from two different datasets

arXiv.org e-Print Archive

Crossref

A Discriminative Representation of Convolutional Features for Indoor Scene Recognition

Author: Bennamoun Mohammed
Hayat Munawar
Khan Salman H.
Sohel Ferdous
Togneri Roberto
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/06/2015
Field of study

Indoor scene recognition is a multi-faceted and challenging problem due to the diverse intra-class variations and the confusing inter-class similarities. This paper presents a novel approach which exploits rich mid-level convolutional features to categorize indoor scenes. Traditionally used convolutional features preserve the global spatial structure, which is a desirable property for general object recognition. However, we argue that this structuredness is not much helpful when we have large variations in scene layouts, e.g., in indoor scenes. We propose to transform the structured convolutional activations to another highly discriminative feature space. The representation in the transformed space not only incorporates the discriminative aspects of the target dataset, but it also encodes the features in terms of the general object categories that are present in indoor scenes. To this end, we introduce a new large-scale dataset of 1300 object categories which are commonly present in indoor scenes. Our proposed approach achieves a significant performance boost over previous state of the art approaches on five major scene classification datasets

arXiv.org e-Print Archive

University of Canberra Research Repository

Research Repository

No Spare Parts: Sharing Part Detectors for Image Categorization

Author: Mettes Pascal
Snoek Cees G. M.
van Gemert Jan C.
Publication venue
Publication date: 01/01/2016
Field of study

This work aims for image categorization using a representation of distinctive parts. Different from existing part-based work, we argue that parts are naturally shared between image categories and should be modeled as such. We motivate our approach with a quantitative and qualitative analysis by backtracking where selected parts come from. Our analysis shows that in addition to the category parts defining the class, the parts coming from the background context and parts from other image categories improve categorization performance. Part selection should not be done separately for each category, but instead be shared and optimized over all categories. To incorporate part sharing between categories, we present an algorithm based on AdaBoost to jointly optimize part sharing and selection, as well as fusion with the global image representation. We achieve results competitive to the state-of-the-art on object, scene, and action categories, further improving over deep convolutional neural networks

arXiv.org e-Print Archive

International Migration, Integration and Social Cohesion online publications

UvA-DARE