Search CORE

42,860 research outputs found

Analysis of a biologically-inspired system for real-time object recognition

Author: Aboutalib Sarah
Murphy-Chutorian Erik
Triesch Jochen
Publication venue
Publication date: 12/05/2006
Field of study

We present a biologically-inspired system for real-time, feed-forward object recognition in cluttered scenes. Our system utilizes a vocabulary of very sparse features that are shared between and within different object models. To detect objects in a novel scene, these features are located in the image, and each detected feature votes for all objects that are consistent with its presence. Due to the sharing of features between object models our approach is more scalable to large object databases than traditional methods. To demonstrate the utility of this approach, we train our system to recognize any of 50 objects in everyday cluttered scenes with substantial occlusion. Without further optimization we also demonstrate near-perfect recognition on a standard 3-D recognition problem. Our system has an interpretation as a sparsely connected feed-forward neural network, making it a viable model for fast, feed-forward object recognition in the primate visual system

Hochschulschriftenserver - Universität Frankfurt am Main

Describing Textures in the Wild

Author: Cimpoi Mircea
Kokkinos Iasonas
Maji Subhransu
Mohamed Sammy
Vedaldi Andrea
Publication venue
Publication date: 01/01/2013
Field of study

Patterns and textures are defining characteristics of many natural objects: a shirt can be striped, the wings of a butterfly can be veined, and the skin of an animal can be scaly. Aiming at supporting this analytical dimension in image understanding, we address the challenging problem of describing textures with semantic attributes. We identify a rich vocabulary of forty-seven texture terms and use them to describe a large dataset of patterns collected in the wild.The resulting Describable Textures Dataset (DTD) is the basis to seek for the best texture representation for recognizing describable texture attributes in images. We port from object recognition to texture recognition the Improved Fisher Vector (IFV) and show that, surprisingly, it outperforms specialized texture descriptors not only on our problem, but also in established material recognition datasets. We also show that the describable attributes are excellent texture descriptors, transferring between datasets and tasks; in particular, combined with IFV, they significantly outperform the state-of-the-art by more than 8 percent on both FMD and KTHTIPS-2b benchmarks. We also demonstrate that they produce intuitive descriptions of materials and Internet images.Comment: 13 pages; 12 figures Fixed misplaced affiliatio

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

Grounding semantics in robots for Visual Question Answering

Author: Wahle Björn
Publication venue: Universitat Politècnica de Catalunya
Publication date: 01/01/2019
Field of study

In this thesis I describe an operational implementation of an object detection and description system that incorporates in an end-to-end Visual Question Answering system and evaluated it on two visual question answering datasets for compositional language and elementary visual reasoning

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

An integrated cognitive architecture for autonomous agents

Author: Allen John A.
Gennari John H.
Iba Wayne
Langley Pat
Thompson Kevin
Publication venue: eScholarship, University of California
Publication date: 15/09/1989
Field of study

Crossref

eScholarship - University of California

Vision, Action, and Make-Perceive

Author: Briscoe Professor Robert
Publication venue: Blackwell
Publication date: 01/01/2008
Field of study

In this paper, I critically assess the enactive account of visual perception recently defended by Alva Noë (2004). I argue inter alia that the enactive account falsely identifies an object’s apparent shape with its 2D perspectival shape; that it mistakenly assimilates visual shape perception and volumetric object recognition; and that it seriously misrepresents the constitutive role of bodily action in visual awareness. I argue further that noticing an object’s perspectival shape involves a hybrid experience combining both perceptual and imaginative elements – an act of what I call ‘make-perceive.

PhilPapers

CiteSeerX

CogPrints Cognitive Sciences Eprint Archive

Time course and robustness of ERP object and face differences

Author: Bennett P.J.
Husk J.S.
Rousselet G.A.
Sekuler A.B.
Publication venue: 'Association for Research in Vision and Ophthalmology (ARVO)'
Publication date: 01/09/2008
Field of study

Conflicting results have been reported about the earliest “true” ERP differences related to face processing, with the bulk of the literature focusing on the signal in the first 200 ms after stimulus onset. Part of the discrepancy might be explained by uncontrolled low-level differences between images used to assess the timing of face processing. In the present experiment, we used a set of faces, houses, and noise textures with identical amplitude spectra to equate energy in each spatial frequency band. The timing of face processing was evaluated using face–house and face–noise contrasts, as well as upright-inverted stimulus contrasts. ERP differences were evaluated systematically at all electrodes, across subjects, and in each subject individually, using trimmed means and bootstrap tests. Different strategies were employed to assess the robustness of ERP differential activities in individual subjects and group comparisons. We report results showing that the most conspicuous and reliable effects were systematically observed in the N170 latency range, starting at about 130–150 ms after stimulus onset

Crossref

Enlighten

Texture Segregation By Visual Cortex: Perceptual Grouping, Attention, and Learning

Author: Ahissar
Arivazhagan
Beck
Beck
Ben-Shahar
Bergen
Bergen
Biederman
Biederman
Blaser
Bovik
Bradski
Brodatz
Bullier
Caelli
Caelli
Callaway
Cao
Carpenter
Carpenter
Carpenter
Carpenter
Carpenter
Carpenter
Carpenter
Carpenter
Carpenter
Carpenter
Carpenter
Cavanagh
Cavanagh
Chellappa
Cohen
Colby
Connor
Connor
Corbetta
Cross
Desimone
Deubel
Duncan
Elder
Fazl
Felleman
Ferster
Field
Fogel
Gail A. Carpenter
Gove
Graham
Greenspan
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Grossberg
Guillery
Gurnsey
Hirsch
Hochstein
Hodgkin
Hubel
Hubel
Hubel
Hupé
Jain
Johnson
Julesz
Kapadia
Kellman
Kellman
Kelly
Knierim
Krumm
Lamme
Lamme
Lee
Malik
Malik
Manjunath
Mao
McGuire
Mirmehdi
Mitchell
Munoz
Murphy
Nothdurft
Nothdurft
Nothdurft
Nothdurft
Nothdurft
Olson
O’Craven
Paragios
Posner
Przybyszewski
Pylyshyn
Pylyshyn
Raizada
Raizada
Randen
Rao
Renninger
Reynolds
Reynolds
Reynolds
Roelfsema
Roska
Ross
Rushi Bhatt
Sagi
Salin
Shaw
Sigman
Sillito
Sillito
Sillito
Stephen Grossberg
Sutter
Thielscher
Treisman
Tse
Tyler
von der Heydt
von der Heydt
Watanabe
Wilkinson
Williamson
Wiser
Wolfe
Wolfson
Wolfson
Yeshurun
Zhu
Zipser
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 01/01/2006
Field of study

A neural model is proposed of how laminar interactions in the visual cortex may learn and recognize object texture and form boundaries. The model brings together five interacting processes: region-based texture classification, contour-based boundary grouping, surface filling-in, spatial attention, and object attention. The model shows how form boundaries can determine regions in which surface filling-in occurs; how surface filling-in interacts with spatial attention to generate a form-fitting distribution of spatial attention, or attentional shroud; how the strongest shroud can inhibit weaker shrouds; and how the winning shroud regulates learning of texture categories, and thus the allocation of object attention. The model can discriminate abutted textures with blurred boundaries and is sensitive to texture boundary attributes like discontinuities in orientation and texture flow curvature as well as to relative orientations of texture elements. The model quantitatively fits a large set of human psychophysical data on orientation-based textures. Object boundar output of the model is compared to computer vision algorithms using a set of human segmented photographic images. The model classifies textures and suppresses noise using a multiple scale oriented filterbank and a distributed Adaptive Resonance Theory (dART) classifier. The matched signal between the bottom-up texture inputs and top-down learned texture categories is utilized by oriented competitive and cooperative grouping processes to generate texture boundaries that control surface filling-in and spatial attention. Topdown modulatory attentional feedback from boundary and surface representations to early filtering stages results in enhanced texture boundaries and more efficient learning of texture within attended surface regions. Surface-based attention also provides a self-supervising training signal for learning new textures. Importance of the surface-based attentional feedback in texture learning and classification is tested using a set of textured images from the Brodatz micro-texture album. Benchmark studies vary from 95.1% to 98.6% with attention, and from 90.6% to 93.2% without attention.Air Force Office of Scientific Research (F49620-01-1-0397, F49620-01-1-0423); National Science Foundation (SBE-0354378); Office of Naval Research (N00014-01-1-0624

CiteSeerX

Elsevier - Publisher Connector

Crossref

Boston University Institutional Repository (OpenBU)

Vision systems with the human in the loop

Author: Bauckhage Christian
Hanheide Marc
Kaster Thomas
Pfeiffer Michael
Sagerer Gerhard
Wrede Sebastian
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2005
Field of study

The emerging cognitive vision paradigm deals with vision systems that apply machine learning and automatic reasoning in order to learn from what they perceive. Cognitive vision systems can rate the relevance and consistency of newly acquired knowledge, they can adapt to their environment and thus will exhibit high robustness. This contribution presents vision systems that aim at flexibility and robustness. One is tailored for content-based image retrieval, the others are cognitive vision systems that constitute prototypes of visual active memories which evaluate, gather, and integrate contextual knowledge for visual analysis. All three systems are designed to interact with human users. After we will have discussed adaptive content-based image retrieval and object and action recognition in an office environment, the issue of assessing cognitive systems will be raised. Experiences from psychologically evaluated human-machine interactions will be reported and the promising potential of psychologically-based usability experiments will be stressed

University of Lincoln Institutional Repository

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Publications at Bielefeld University