Pushing the Boundaries of Boundary Detection using Deep Learning
In this work we show that adapting Deep Convolutional Neural Network training
to the task of boundary detection yields substantial improvements over the
current state of the art.
Our contributions consist, firstly, in combining a careful design of the
training loss for boundary detection, a multi-resolution architecture, and
training with external data to improve detection accuracy. When measured on
the standard Berkeley Segmentation Dataset, we
improve the optimal dataset scale F-measure from 0.780 to 0.808, while human
performance is at 0.803. We further improve performance to 0.813 by combining
deep learning with grouping, integrating the Normalized Cuts technique within a
deep network.
We also examine the potential of our boundary detector in conjunction with
the task of semantic segmentation and demonstrate clear improvements over
state-of-the-art systems. Our detector is fully integrated in the popular Caffe
framework and processes a 320x420 image in less than a second.
Comment: The previous version reported large improvements w.r.t. the LPO region proposal baseline, which turned out to be due to a wrong computation for the baseline. The improvements are currently less important, and are omitted. We are sorry if the reported results caused any confusion. We have also integrated reviewer feedback regarding human performance on the BSD benchmark.
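The abstract highlights a careful design of the training loss. Boundary pixels are vastly outnumbered by non-boundary pixels, so deep boundary detectors commonly use a class-balanced cross-entropy; the PyTorch sketch below illustrates that general technique under stated assumptions, not the paper's exact formulation.

```python
# Minimal sketch of a class-balanced binary cross-entropy for boundary
# detection: boundary (positive) pixels are rare, so each pixel is
# weighted by the inverse frequency of its class. Illustrative only,
# not the loss from the paper above.
import torch
import torch.nn.functional as F

def balanced_bce(logits: torch.Tensor, targets: torch.Tensor) -> torch.Tensor:
    """logits, targets: (N, 1, H, W); targets are float 0/1 boundary maps."""
    pos = targets.sum()
    total = targets.numel()
    neg = total - pos
    w_pos = neg / total   # up-weight the rare boundary pixels
    w_neg = pos / total   # down-weight the abundant background pixels
    weights = torch.where(targets > 0.5, w_pos, w_neg)
    return F.binary_cross_entropy_with_logits(logits, targets, weight=weights)
```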
Texture Segregation By Visual Cortex: Perceptual Grouping, Attention, and Learning
A neural model is proposed of how laminar interactions in the visual cortex may learn and recognize object texture and form boundaries. The model brings together five interacting processes: region-based texture classification, contour-based boundary grouping, surface filling-in, spatial attention, and object attention. The model shows how form boundaries can determine regions in which surface filling-in occurs; how surface filling-in interacts with spatial attention to generate a form-fitting distribution of spatial attention, or attentional shroud; how the strongest shroud can inhibit weaker shrouds; and how the winning shroud regulates learning of texture categories, and thus the allocation of object attention. The model can discriminate abutted textures with blurred boundaries and is sensitive to texture boundary attributes like discontinuities in orientation and texture flow curvature as well as to relative orientations of texture elements. The model quantitatively fits a large set of human psychophysical data on orientation-based textures. Object boundary output of the model is compared to computer vision algorithms using a set of human-segmented photographic images. The model classifies textures and suppresses noise using a multiple-scale oriented filterbank and a distributed Adaptive Resonance Theory (dART) classifier. The matched signal between the bottom-up texture inputs and top-down learned texture categories is utilized by oriented competitive and cooperative grouping processes to generate texture boundaries that control surface filling-in and spatial attention. Top-down modulatory attentional feedback from boundary and surface representations to early filtering stages results in enhanced texture boundaries and more efficient learning of texture within attended surface regions. Surface-based attention also provides a self-supervising training signal for learning new textures. The importance of the surface-based attentional feedback in texture learning and classification is tested using a set of textured images from the Brodatz micro-texture album. Benchmark classification rates vary from 95.1% to 98.6% with attention, and from 90.6% to 93.2% without attention.
Air Force Office of Scientific Research (F49620-01-1-0397, F49620-01-1-0423); National Science Foundation (SBE-0354378); Office of Naval Research (N00014-01-1-0624)
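The model's front end classifies textures with a multiple-scale oriented filterbank. The sketch below builds a small Gabor filterbank with scikit-image and returns oriented energy maps; the frequencies and orientation count are illustrative assumptions, and the dART classifier and grouping stages are not reproduced.

```python
# Hedged sketch of a multi-scale oriented (Gabor) filterbank of the
# kind used as a texture front end. Parameters are illustrative.
import numpy as np
from scipy import ndimage
from skimage.filters import gabor_kernel

def oriented_responses(image: np.ndarray,
                       frequencies=(0.1, 0.2, 0.4),
                       n_orientations: int = 4) -> np.ndarray:
    """Return a stack of |response| maps, one per (scale, orientation)."""
    image = np.asarray(image, dtype=float)
    responses = []
    for f in frequencies:
        for k in range(n_orientations):
            theta = k * np.pi / n_orientations
            kern = gabor_kernel(frequency=f, theta=theta)
            real = ndimage.convolve(image, np.real(kern), mode='reflect')
            imag = ndimage.convolve(image, np.imag(kern), mode='reflect')
            responses.append(np.hypot(real, imag))  # oriented energy
    return np.stack(responses)
```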
Learning long-range spatial dependencies with horizontal gated-recurrent units
Progress in deep learning has spawned great successes in many engineering
applications. As a prime example, convolutional neural networks, a type of
feedforward neural network, are now approaching -- and sometimes even
surpassing -- human accuracy on a variety of visual recognition tasks. Here,
however, we show that these neural networks and their recent extensions
struggle in recognition tasks where co-dependent visual features must be
detected over long spatial ranges. We introduce the horizontal gated-recurrent
unit (hGRU) to learn intrinsic horizontal connections -- both within and across
feature columns. We demonstrate that a single hGRU layer matches or outperforms
all tested feedforward hierarchical baselines including state-of-the-art
architectures which have orders of magnitude more free parameters. We further
discuss the biological plausibility of the hGRU in comparison to anatomical
data from the visual cortex as well as human behavioral data on a classic
contour detection task.
Comment: Published at NeurIPS 2018.
https://papers.nips.cc/paper/7300-learning-long-range-spatial-dependencies-with-horizontal-gated-recurrent-unit
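To give a feel for the horizontal recurrence, here is a minimal convolutional GRU cell in PyTorch whose hidden state is updated repeatedly within a single layer, letting units integrate evidence over growing spatial ranges. This is a simplified structural sketch: the published hGRU adds further horizontal gain and mix terms not shown here.

```python
# Simplified stand-in for the hGRU: a convolutional GRU iterated
# within one layer, so units exchange information "horizontally".
import torch
import torch.nn as nn

class ConvGRUCell(nn.Module):
    def __init__(self, channels: int, ksize: int = 5):
        super().__init__()
        pad = ksize // 2
        self.gates = nn.Conv2d(2 * channels, 2 * channels, ksize, padding=pad)
        self.cand = nn.Conv2d(2 * channels, channels, ksize, padding=pad)

    def forward(self, x: torch.Tensor, steps: int = 8) -> torch.Tensor:
        h = torch.zeros_like(x)
        for _ in range(steps):  # recurrent horizontal updates
            zr = torch.sigmoid(self.gates(torch.cat([x, h], dim=1)))
            z, r = zr.chunk(2, dim=1)          # update and reset gates
            h_tilde = torch.tanh(self.cand(torch.cat([x, r * h], dim=1)))
            h = (1 - z) * h + z * h_tilde      # gated state update
        return h
```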
A Framework for Symmetric Part Detection in Cluttered Scenes
The role of symmetry in computer vision has waxed and waned in importance
during the evolution of the field from its earliest days. At first figuring
prominently in support of bottom-up indexing, it fell out of favor as shape
gave way to appearance and recognition gave way to detection. With a strong
prior in the form of a target object, the role of the weaker priors offered by
perceptual grouping was greatly diminished. However, as the field returns to
the problem of recognition from a large database, the bottom-up recovery of the
parts that make up the objects in a cluttered scene is critical for their
recognition. The medial axis community has long exploited the ubiquitous
regularity of symmetry as a basis for the decomposition of a closed contour
into medial parts. However, today's recognition systems are faced with
cluttered scenes, and the assumption that a closed contour exists, i.e. that
figure-ground segmentation has been solved, renders much of the medial axis
community's work inapplicable. In this article, we review a computational
framework, previously reported in Lee et al. (2013), Levinshtein et al. (2009,
2013), that bridges the representation power of the medial axis and the need to
recover and group an object's parts in a cluttered scene. Our framework is
rooted in the idea that a maximally inscribed disc, the building block of a
medial axis, can be modeled as a compact superpixel in the image. We evaluate
the method on images of cluttered scenes.
Comment: 10 pages, 8 figures
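The framework's building block, a maximally inscribed disc modeled as a compact superpixel, can be approximated with off-the-shelf tools. A minimal sketch with scikit-image's SLIC follows; the segment count and compactness are illustrative, and the subsequent grouping of superpixels into medial parts is omitted.

```python
# Hedged sketch: compact superpixels as proxies for maximally
# inscribed discs. Parameters are illustrative assumptions.
import numpy as np
from skimage.segmentation import slic
from skimage.data import astronaut

image = astronaut()  # any RGB image of a cluttered scene
labels = slic(image, n_segments=300, compactness=20.0, start_label=1)

# Each compact superpixel approximates a maximally inscribed disc;
# its centroid is a candidate medial-axis (skeletal) point.
centers = [np.mean(np.argwhere(labels == i), axis=0)
           for i in range(1, labels.max() + 1)]
```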
Improving Spatial Codification in Semantic Segmentation
This paper explores novel approaches for improving the spatial codification
for the pooling of local descriptors to solve the semantic segmentation
problem. We propose to partition the image into three regions for each object
to be described: Figure, Border and Ground. This partition aims at minimizing
the influence of the image context on the object description and vice versa by
introducing an intermediate zone around the object contour. Furthermore, we
also propose a richer visual descriptor of the object by applying a Spatial
Pyramid over the Figure region. Two novel Spatial Pyramid configurations are
explored: Cartesian-based and crown-based Spatial Pyramids. We test these
approaches with state-of-the-art techniques and show that they improve the
Figure-Ground based pooling in the Pascal VOC 2011 and 2012 semantic
segmentation challenges.
Comment: Paper accepted at the IEEE International Conference on Image Processing, ICIP 2015, Quebec City, 27-30 September. Project page:
https://imatge.upc.edu/web/publications/improving-spatial-codification-semantic-segmentatio
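The Figure/Border/Ground partition can be illustrated with simple morphology: dilating and eroding the object mask leaves an intermediate band straddling the contour. A minimal sketch, with an assumed band width; local descriptors would then be pooled separately over each region.

```python
# Hedged sketch of the Figure/Border/Ground partition via binary
# morphology. The band width is an illustrative assumption.
import numpy as np
from scipy import ndimage

def figure_border_ground(mask: np.ndarray, band: int = 5):
    """mask: boolean object mask. Returns three disjoint boolean masks."""
    dilated = ndimage.binary_dilation(mask, iterations=band)
    eroded = ndimage.binary_erosion(mask, iterations=band)
    figure = eroded                 # object interior, away from contour
    border = dilated & ~eroded      # intermediate zone around the contour
    ground = ~dilated               # context, away from the object
    return figure, border, ground
```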
Instance-Level Salient Object Segmentation
Image saliency detection has recently witnessed rapid progress due to deep
convolutional neural networks. However, none of the existing methods is able to
identify object instances in the detected salient regions. In this paper, we
present a salient instance segmentation method that produces a saliency mask
with distinct object instance labels for an input image. Our method consists of
three steps: estimating a saliency map, detecting salient object contours, and
identifying salient object instances. For the first two steps, we propose a
multiscale saliency refinement network, which generates high-quality salient
region masks and salient object contours. Once integrated with multiscale
combinatorial grouping and a MAP-based subset optimization framework, our
method can generate very promising salient object instance segmentation
results. To promote further research and evaluation of salient instance
segmentation, we also construct a new database of 1000 images and their
pixelwise salient instance annotations. Experimental results demonstrate that
our proposed method is capable of achieving state-of-the-art performance on all
public benchmarks for salient region detection as well as on our new dataset
for salient instance segmentation.
Comment: To appear in CVPR 2017
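As a rough illustration of the third step, the sketch below thresholds a saliency map, cuts it along strong contours, and labels the remaining connected components as instances, assigning contour pixels to the nearest component. The paper instead scores multiscale combinatorial grouping proposals with MAP-based subset optimization; this stand-in only conveys the idea.

```python
# Much-simplified stand-in for instance identification: split a salient
# mask along strong contours and label the resulting components.
import numpy as np
from scipy import ndimage

def label_instances(saliency: np.ndarray, contours: np.ndarray,
                    s_thr: float = 0.5, c_thr: float = 0.5) -> np.ndarray:
    mask = saliency > s_thr
    interior = mask & (contours < c_thr)    # remove contour pixels
    labels, _ = ndimage.label(interior)     # connected components
    # Assign removed contour pixels to the nearest labeled instance.
    nearest = ndimage.distance_transform_edt(
        labels == 0, return_distances=False, return_indices=True)
    return labels[tuple(nearest)] * mask
```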
Object Contour and Edge Detection with RefineContourNet
A ResNet-based multi-path refinement CNN is used for object contour
detection. For this task, we prioritise the effective utilization of the
high-level abstraction capability of a ResNet, which leads to state-of-the-art
results for edge detection. Keeping our focus in mind, we fuse the high-, mid-
and low-level features in that specific order, which differs from many other
approaches. The network uses the tensor with the highest-level features as the
starting point and combines it layer by layer with features of a lower
abstraction level until it reaches the lowest level. We train this network on a
modified PASCAL VOC 2012 dataset for object contour detection and evaluate on a
refined PASCAL-val dataset reaching an excellent performance and an Optimal
Dataset Scale (ODS) of 0.752. Furthermore, by fine-tuning on the BSDS500
dataset we reach state-of-the-art results for edge detection with an ODS of
0.824.
Comment: Keywords: Object Contour Detection, Edge Detection, Multi-Path Refinement CNN
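The high-to-low fusion order can be sketched as a small top-down refinement module: start from the coarsest ResNet feature map, then repeatedly upsample, add a lateral projection of the next-lower map, and smooth. The channel widths and 1x1/3x3 convolution choices below are illustrative assumptions, not the paper's exact RefineContourNet configuration.

```python
# Hedged structural sketch of top-down (high-to-low) feature fusion
# for contour prediction. Channel sizes are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopDownFusion(nn.Module):
    def __init__(self, channels=(2048, 1024, 512, 256), out_ch: int = 256):
        super().__init__()
        self.lateral = nn.ModuleList(nn.Conv2d(c, out_ch, 1) for c in channels)
        self.smooth = nn.ModuleList(
            nn.Conv2d(out_ch, out_ch, 3, padding=1) for _ in channels[1:])
        self.head = nn.Conv2d(out_ch, 1, 1)   # contour logits

    def forward(self, feats):
        """feats: list of maps ordered highest (coarsest) to lowest level."""
        x = self.lateral[0](feats[0])
        for lat, smooth, f in zip(self.lateral[1:], self.smooth, feats[1:]):
            x = F.interpolate(x, size=f.shape[-2:], mode='bilinear',
                              align_corners=False)
            x = smooth(x + lat(f))   # fuse one lower abstraction level
        return self.head(x)
```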
Species identification of family Lutjanidae and separation of populations of John’s snapper (Lutjanus johnii) in Persian Gulf and Oman Sea based on otolith shape analysis
The anatomical and morphometric (shape indices, contour descriptors and otolith weight) characterizations of sagittal otoliths were investigated in 13 species of Lutjanus spp. inhabiting the Persian Gulf. This is the first study that compares the efficiency of three different image analysis techniques for discriminating species based on the shape of the outer otolith contour: elliptical Fourier descriptors (EFD), fast Fourier transform (FFT) and wavelet transform (WT). Sagittal otoliths of snappers are morphologically similar, with some small species-specific variations. The otolith contour based on wavelets (WT) provided the best results of the three methods, but only the combination of all three (EFD, FFT and WT) yielded a robust classification of species. Species prediction improved when otolith weight was included. Among the shape indices, only the aspect ratio provided a clear grouping of species.
A second study tested the applicability of otolith shape analysis for stock identification by comparing otolith contours of Lutjanus johnii from the Persian Gulf and the Oman Sea. The results showed that the otoliths differ in contour shape and can be attributed to two different stocks.
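Of the three contour descriptors compared (EFD, FFT, WT), the FFT variant is the simplest to sketch: treat the outline as a complex signal x + iy and keep the magnitudes of the first harmonics, normalized for translation and scale. A minimal NumPy sketch under those assumptions; the EFD and wavelet descriptors would be computed analogously from the same outline.

```python
# Hedged sketch of an FFT-based otolith contour descriptor: harmonic
# magnitudes of the outline, invariant to translation and scale.
import numpy as np

def fft_contour_descriptor(xy: np.ndarray, n_harmonics: int = 20) -> np.ndarray:
    """xy: (N, 2) array of outline points ordered along the contour."""
    z = xy[:, 0] + 1j * xy[:, 1]     # outline as a complex signal
    z = z - z.mean()                 # translation invariance
    coeffs = np.fft.fft(z)
    mags = np.abs(coeffs[1:n_harmonics + 1])
    return mags / mags[0]            # scale invariance
```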