33,597 research outputs found

Removing the texture feature response to object boundaries

Author: Corcoran Padraig
Winstanley Adam C.
Publication venue: 'Scitepress'
Publication date: 01/01/2007

Texture is a spatial property and thus any features used to describe it must be calculated within a neighbourhood. This process of integrating information over a neighbourhood leads to what we will refer to as the texture boundary response problem, where an unwanted response is observed at object boundaries. This response is due to features being extracted from a mixture of textures and/or an intensity edge between objects. If segmentation is performed using these raw features, this will lead to the generation of unwanted classes along object boundaries. To overcome this, post-processing of feature images must be performed to remove this response before a classification algorithm can be applied. To date this problem has received little attention, and we are aware of no evaluation in the literature of the alternative solutions available. In this work we perform an evaluation of known solutions to the boundary response problem and find separable median filtering to be the current best choice. An in-depth evaluation of the separable median filtering approach shows that it fails to remove certain parts or types of object boundary response. To overcome this failing we propose two alternative techniques, which involve either post-processing of the separable median filtered result or an alternative filtering technique.
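The separable median filtering mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the usual meaning of "separable" (a 1-D median along each row, followed by a 1-D median along each column). The window size, the edge replication, and the names `median_1d` and `separable_median` are illustrative assumptions, not details taken from the paper.

```python
from statistics import median


def median_1d(values, size):
    """Median-filter a 1-D sequence with an odd window, replicating edge values."""
    half = size // 2
    n = len(values)
    out = []
    for i in range(n):
        # Clamp indices so the window stays inside the sequence (assumed edge handling).
        window = [values[min(max(j, 0), n - 1)] for j in range(i - half, i + half + 1)]
        out.append(median(window))
    return out


def separable_median(image, size=3):
    """Apply a 1-D median along every row, then along every column."""
    rows = [median_1d(row, size) for row in image]
    cols = [median_1d(list(col), size) for col in zip(*rows)]
    # zip(*cols) transposes the column-wise result back to row-major order.
    return [list(row) for row in zip(*cols)]


# Toy feature image: two regions (feature values 1 and 9) separated by a
# one-pixel-wide spurious boundary response (value 20).
feature_image = [[1, 1, 1, 20, 9, 9, 9] for _ in range(3)]
filtered = separable_median(feature_image, size=3)
```

In this toy case the one-pixel spike is replaced by a neighbouring region value. Wider or differently shaped boundary responses may survive, which is consistent with the failure cases the abstract reports.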

MURAL - Maynooth University Research Archive Library

On Using Physical Analogies for Feature and Shape Extraction in Computer Vision

Author: Direkoglu Cem
Hurley David
Liu Xin
Nixon Mark
Publication date: 01/09/2008

There is a rich literature of approaches to image feature extraction in computer vision. Many sophisticated approaches exist for low- and high-level feature extraction, but they can be complex to implement, with parameter choice guided by experimentation yet impeded by speed of computation. We have developed new ways to extract features based on notional use of physical paradigms, with parameterisation that is more familiar to a scientifically trained user, aiming to make best use of computational resources. We describe how analogies based on gravitational force can be used for low-level analysis, whilst analogies of water flow and heat can be deployed to achieve high-level smooth shape detection. These new approaches to arbitrary shape extraction are compared with standard state-of-the-art approaches by curve evolution. There is no comparator operator to our use of gravitational force. We also aim to show that the implementation is consistent with the original motivations for these techniques, and so contend that the exploration of physical paradigms offers a promising avenue for new approaches to feature extraction in computer vision.

Southampton (e-Prints Soton)

Crossref

A Self-Organizing Neural System for Learning to Recognize Textured Scenes

Author: Grossberg Stephen
Williamson James
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 01/01/1997

A self-organizing ARTEX model is developed to categorize and classify textured image regions. ARTEX specializes the FACADE model of how the visual cortex sees, and the ART model of how temporal and prefrontal cortices interact with the hippocampal system to learn visual recognition categories and their names. FACADE processing generates a vector of boundary and surface properties, notably texture and brightness properties, by utilizing multi-scale filtering, competition, and diffusive filling-in. Its context-sensitive local measures of textured scenes can be used to recognize scenic properties that gradually change across space, as well as abrupt texture boundaries. ART incrementally learns recognition categories that classify FACADE output vectors, class names of these categories, and their probabilities. Top-down expectations within ART encode learned prototypes that pay attention to expected visual features. When novel visual information creates a poor match with the best existing category prototype, a memory search selects a new category with which to classify the novel data. ARTEX is compared with psychophysical data, and is benchmarked on classification of natural textures and synthetic aperture radar images. It outperforms state-of-the-art systems that use rule-based, backpropagation, and K-nearest neighbor classifiers. Defense Advanced Research Projects Agency; Office of Naval Research (N00014-95-1-0409, N00014-95-1-0657)
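The abstract above says that a poor match with the best existing prototype leads to a new category. A heavily simplified sketch can illustrate that one step. It is not the ARTEX or ART implementation: there is no prototype learning, no ordered memory search with reset, and no class-name mapping. The match measure follows the fuzzy-ART style (component-wise minimum), and the `vigilance` value and the function names are assumptions chosen for illustration.

```python
def match(x, w):
    """Fuzzy-ART-style match: fraction of input x covered by prototype w.

    Assumes x has non-negative components with a non-zero sum.
    """
    return sum(min(a, b) for a, b in zip(x, w)) / sum(x)


def classify(x, prototypes, vigilance=0.8):
    """Return the category index for x; create a new category on a poor match."""
    best, best_score = None, -1.0
    for i, w in enumerate(prototypes):
        score = match(x, w)
        if score > best_score:
            best, best_score = i, score
    if best is None or best_score < vigilance:
        # Poor match with the best existing prototype: commit a new category.
        prototypes.append(list(x))
        return len(prototypes) - 1
    return best


prototypes = []
first = classify([1.0, 0.0], prototypes)    # no categories yet, so one is created
similar = classify([0.9, 0.1], prototypes)  # close to the first prototype
novel = classify([0.0, 1.0], prototypes)    # poor match, so a second category is created
```

A higher `vigilance` makes the sketch split inputs into more, narrower categories, while a lower value merges them into fewer, broader ones.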

Boston University Institutional Repository (OpenBU)

Texture Segregation By Visual Cortex: Perceptual Grouping, Attention, and Learning

Author: Rushi Bhatt
Gail A. Carpenter
Stephen Grossberg
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 01/01/2006

A neural model is proposed of how laminar interactions in the visual cortex may learn and recognize object texture and form boundaries. The model brings together five interacting processes: region-based texture classification, contour-based boundary grouping, surface filling-in, spatial attention, and object attention. The model shows how form boundaries can determine regions in which surface filling-in occurs; how surface filling-in interacts with spatial attention to generate a form-fitting distribution of spatial attention, or attentional shroud; how the strongest shroud can inhibit weaker shrouds; and how the winning shroud regulates learning of texture categories, and thus the allocation of object attention. The model can discriminate abutted textures with blurred boundaries and is sensitive to texture boundary attributes like discontinuities in orientation and texture flow curvature as well as to relative orientations of texture elements. The model quantitatively fits a large set of human psychophysical data on orientation-based textures. Object boundary output of the model is compared to computer vision algorithms using a set of human-segmented photographic images. The model classifies textures and suppresses noise using a multiple-scale oriented filterbank and a distributed Adaptive Resonance Theory (dART) classifier. The matched signal between the bottom-up texture inputs and top-down learned texture categories is utilized by oriented competitive and cooperative grouping processes to generate texture boundaries that control surface filling-in and spatial attention. Top-down modulatory attentional feedback from boundary and surface representations to early filtering stages results in enhanced texture boundaries and more efficient learning of texture within attended surface regions. Surface-based attention also provides a self-supervising training signal for learning new textures.
The importance of the surface-based attentional feedback in texture learning and classification is tested using a set of textured images from the Brodatz micro-texture album. Benchmark results range from 95.1% to 98.6% with attention, and from 90.6% to 93.2% without attention. Air Force Office of Scientific Research (F49620-01-1-0397, F49620-01-1-0423); National Science Foundation (SBE-0354378); Office of Naval Research (N00014-01-1-0624)

CiteSeerX

Elsevier - Publisher Connector

Crossref

Boston University Institutional Repository (OpenBU)

Dense semantic labeling of sub-decimeter resolution images with convolutional neural networks

Author: Tuia Devis
Volpi Michele
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 10/10/2016

Semantic labeling (or pixel-level land-cover classification) in ultra-high resolution imagery (< 10cm) requires statistical models able to learn high-level concepts from spatial data with large appearance variations. Convolutional Neural Networks (CNNs) achieve this goal by discriminatively learning a hierarchy of representations of increasing abstraction. In this paper we present a CNN-based system relying on a downsample-then-upsample architecture. Specifically, it first learns a rough spatial map of high-level representations by means of convolutions and then learns to upsample them back to the original resolution by deconvolutions. By doing so, the CNN learns to densely label every pixel at the original resolution of the image. This results in many advantages, including i) state-of-the-art numerical accuracy, ii) improved geometric accuracy of predictions and iii) high efficiency at inference time. We test the proposed system on the Vaihingen and Potsdam sub-decimeter resolution datasets, involving semantic labeling of aerial images of 9cm and 5cm resolution, respectively. These datasets are composed of many large and fully annotated tiles, allowing an unbiased evaluation of models making use of spatial information. We do so by comparing two standard CNN architectures to the proposed one: standard patch classification, prediction of local label patches by employing only convolutions, and full patch labeling by employing deconvolutions. All the systems compare favorably with or outperform a state-of-the-art baseline relying on superpixels and powerful appearance descriptors. The proposed full patch labeling CNN outperforms these models by a large margin, also showing a very appealing inference time. Comment: Accepted in IEEE Transactions on Geoscience and Remote Sensing, 201
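The downsample-then-upsample idea described above can be checked with the standard output-size formulas for strided convolution and transposed convolution ("deconvolution"). The sketch below only does the shape arithmetic. The kernel size, stride, padding, number of layers, and 64-pixel patch size are hypothetical values chosen for illustration and are not taken from the paper.

```python
def conv_out(size, kernel, stride, pad):
    """Spatial output size of a strided convolution."""
    return (size + 2 * pad - kernel) // stride + 1


def deconv_out(size, kernel, stride, pad):
    """Spatial output size of a transposed convolution (no output padding)."""
    return (size - 1) * stride - 2 * pad + kernel


def trace_sizes(size, n_layers=3, kernel=4, stride=2, pad=1):
    """Follow a patch through n_layers downsampling, then n_layers upsampling steps."""
    sizes = [size]
    for _ in range(n_layers):
        sizes.append(conv_out(sizes[-1], kernel, stride, pad))
    for _ in range(n_layers):
        sizes.append(deconv_out(sizes[-1], kernel, stride, pad))
    return sizes


# Hypothetical 64-pixel patch: halved three times, then doubled three times.
sizes = trace_sizes(64)
```

With these assumed settings the final size equals the input size, which is the property that lets such a network emit one label per input pixel.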

arXiv.org e-Print Archive

Wageningen University & Research Publications

ZORA