Search CORE

12 research outputs found

How Can Selection of Biologically Inspired Features Improve the Performance of a Robust Object Recognition Model?

Author: A Opelt
AC Berg
BW Mel
D Gabor
D Shi
Daniel Durstewitz
DH Hubel
DH Hubel
E Krupka
E Meyers
ET Rolls
F Wilcoxon
FJ Massey Jr
G Rudolph
G Schneider
G Wallis
H Zhang
J Holland
J Mutch
J Ponce
J Yang
JE Rowe
JP Jones
K Fukushima
K Fukushima
K Tanaka
Karim Rajaei
L Fei-Fei
M Riesenhuber
M Varma
M Weber
M Weber
Masoud Ghodrati
ML Raymer
Mohammad Pooyan
MY Teo
N Logothetis
N Pinto
N Pinto
R Fergus
Reza Ebrahimpour
S Grossberg
S Grossberg
S Lazebnik
S Salcedo-Sanz
S Sivanandam
Seyed-Mahdi Khaligh-Razavi
SM Stringer
T Serre
T Serre
T Serre
T Serre
TA Polk
W Siedlecki
X He
Y Huang
Z Pan
Publication venue: Public Library of Science
Publication date: 27/02/2012
Field of study

Humans can effectively and swiftly recognize objects in complex natural scenes. This outstanding ability has motivated many computational object recognition models. Most of these models try to emulate the behavior of this remarkable system. The human visual system hierarchically recognizes objects in several processing stages. Along these stages a set of features with increasing complexity is extracted by different parts of visual system. Elementary features like bars and edges are processed in earlier levels of visual pathway and as far as one goes upper in this pathway more complex features will be spotted. It is an important interrogation in the field of visual processing to see which features of an object are selected and represented by the visual cortex. To address this issue, we extended a hierarchical model, which is motivated by biology, for different object recognition tasks. In this model, a set of object parts, named patches, extracted in the intermediate stages. These object parts are used for training procedure in the model and have an important role in object recognition. These patches are selected indiscriminately from different positions of an image and this can lead to the extraction of non-discriminating patches which eventually may reduce the performance. In the proposed model we used an evolutionary algorithm approach to select a set of informative patches. Our reported results indicate that these patches are more informative than usual random patches. We demonstrate the strength of the proposed model on a range of object recognition tasks. The proposed model outperforms the original model in diverse object recognition tasks. It can be seen from the experiments that selected features are generally particular parts of target images. Our results suggest that selected features which are parts of target objects provide an efficient set for robust object recognition

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

A Stable Biologically Motivated Learning Mechanism for Visual Feature Extraction to Handle Facial Categorization

The brain mechanism of extracting visual features for recognizing various objects has consistently been a controversial issue in computational models of object recognition. To extract visual features, we introduce a new, biologically motivated model for facial categorization, which is an extension of the Hubel and Wiesel simple-to-complex cell hierarchy. To address the synaptic stability versus plasticity dilemma, we apply the Adaptive Resonance Theory (ART) for extracting informative intermediate level visual features during the learning process, which also makes this model stable against the destruction of previously learned information while learning new information. Such a mechanism has been suggested to be embedded within known laminar microcircuits of the cerebral cortex. To reveal the strength of the proposed visual feature learning mechanism, we show that when we use this mechanism in the training process of a well-known biologically motivated object recognition model (the HMAX model), it performs better than the HMAX model in face/non-face classification tasks. Furthermore, we demonstrate that our proposed mechanism is capable of following similar trends in performance as humans in a psychophysical experiment using a face versus non-face rapid categorization task

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Semantic Model Vectors for Complex Video Event Recognition

Author: Apostol Natsev
Bert Huang
Gang Hua
Lexing Xie
Michele Merler
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Recommended from our members

Automatic Multilevel Feature Abstraction in Adaptable Machine Vision Systems

Author: Rose Valerie
Publication venue
Publication date: 01/01/2010
Field of study

Vision is a complex task which can be accomplished with apparent ease by biological systems, but for which the design of artificial systems is difficult. Although machine vision systems can be successfully designed for a specific task, under certain conditions, they are likely to fail if circumstances change. This was the motivation for the research into ways in which systems can be self-designing and adaptable to new visual tasks. The research was conducted in three vital areas of concern for machine vision systems. The first area is finding a suitable architecture for forming an appropriate representation for the current task. The research investigated the application of Hypernetworks theory to building a multilevel, generally-applicable representation, through repeated application of a fundamental 'self-similarity' principle, that parts of objects assembled under a particular relation at one level, form whole objects at the next. Results show that this is potentially a powerful approach for autonomously generating an adaptable system-architecture suitable for multiple visual tasks. The second area is the autonomous extraction of suitable low-level features, which the research investigated through random generation of minimally-constrained pixel-configurations and algorithmic generation of homogeneous and heterogeneous polygons. The results suggest that, despite the simplicity of the features making them vulnerable to image transformations, these are promising approaches worth developing further. The third area is automatic feature selection. The research explored management of 'dimensionality' and of 'combinatorial explosion', as well as how to locate relevant features at multiple representation levels, in the context of 'emergence' of structure. Results indicate that this approach can find useful 'intermediate-level' constructs through analysis of the connectivity of the simplices representing objects at higher levels. The research concludes that the proposed novel approaches to tackling the above issues, in particular the application of hypernetworks to the formation of multilevel representations and the resulting emergence of higher-level structure, is fruitful

Open Research Online (The Open University)

Enhanced biologically inspired model

Author: Huang K.
Huang Y.
Li Xuelong
Tan T.
Tao D.
Wang L.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008
Field of study

It has been demonstrated by Serre et al. that the biologically inspired model (BIM) is effective for object recognition. It outperforms many state-of-the-art methods in challenging databases. However, BIM has the following three problems: a very heavy computational cost due to dense input, a disputable pooling operation in modeling relations of the visual cortex, and blind feature selection in a feed-forward framework. To solve these problems, we develop an enhanced BIM (EBIM), which removes uninformative input by imposing sparsity constraints, utilizes a novel local weighted pooling operation with stronger physiological motivations, and applies a feedback procedure that selects effective features for combination. Empirical studies on the CalTech5 database and CalTech101 database show that EBIM is more effective and efficient than BIM. We also apply EBIM to the MIT-CBCL street scene database to show it achieves comparable performance in comparison with the current best performance. Moreover, the new system can process images with resolution 128 times 128 at a rate of 50 frames per second and enhances the speed 20 times at least in comparison with BIM in common applications

CiteSeerX

The Hong Kong Polytechnic University Pao Yue-kong Library

PolyU Institutional Repository

OPUS - University of Technology Sydney

Birkbeck Institutional Research Online