Search CORE

3,623 research outputs found

Graph-based analysis of textured images for hierarchical segmentation

Author: Gaetano Raffaele
Scarpa Giuseppe
Szirányi Tamás
Publication venue: BMVA Pr.
Publication date: 01/01/2010
Field of study

International audienceThe Texture Fragmentation and Reconstruction (TFR) algorithm has been recently introduced to address the problem of image segmentation by textural properties, based on a suitable image description tool known as the Hierarchical Multiple Markov Chain (H-MMC) model. TFR provides a hierarchical set of nested segmentation maps by first identifying the elementary image patterns, and then merging them sequentially to identify complete textures at different scales of observation. In this work, we propose a major modification to the TFR by resorting to a graph based description of the image content and a graph clustering technique for the enhancement and extraction of image patterns. A procedure based on mathematical morphology will be introduced that allows for the construction of a color-wise image representation by means of multiple graph structures, along with a simple clustering technique aimed at cutting the graphs and correspondingly segment groups of connected components with a similar spatial context. The performance assessment, realized both on synthetic compositions of real-world textures and images from the remote sensing domain, confirm the effectiveness and potential of the proposed method

Archivio della ricerca - Università degli studi di Napoli "Parthenope"

Archivio della ricerca - Università degli studi di Napoli Federico II

Crossref

SZTAKI Publication Repository

HAL-UNICE

INRIA a CCSD electronic archive server

ImageSpirit: Verbal Guided Image Parsing

Author: Cheng Ming-Ming
Crook Nigel
Lin Wen-Yan
Mitra Niloy
Sturgess Paul
Torr Philip
Vineet Vibhav
Warrell Jonathan
Zheng Shuai
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2014
Field of study

Humans describe images in terms of nouns and adjectives while algorithms operate on images represented as sets of pixels. Bridging this gap between how humans would like to access images versus their typical representation is the goal of image parsing, which involves assigning object and attribute labels to pixel. In this paper we propose treating nouns as object labels and adjectives as visual attribute labels. This allows us to formulate the image parsing problem as one of jointly estimating per-pixel object and attribute labels from a set of training images. We propose an efficient (interactive time) solution. Using the extracted labels as handles, our system empowers a user to verbally refine the results. This enables hands-free parsing of an image into pixel-wise object/attribute labels that correspond to human semantics. Verbally selecting objects of interests enables a novel and natural interaction modality that can possibly be used to interact with new generation devices (e.g. smart phones, Google Glass, living room devices). We demonstrate our system on a large number of real-world images with varying complexity. To help understand the tradeoffs compared to traditional mouse based interactions, results are reported for both a large scale quantitative evaluation and a user study.Comment: http://mmcheng.net/imagespirit

arXiv.org e-Print Archive

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

UCL Discovery

Oxford University Research Archive

Oxford Brookes University: RADAR

Image segmentation with adaptive region growing based on a polynomial surface model

Author: Deboeverie Francis
Philips Wilfried
Veelaert Peter
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 01/01/2013
Field of study

A new method for segmenting intensity images into smooth surface segments is presented. The main idea is to divide the image into flat, planar, convex, concave, and saddle patches that coincide as well as possible with meaningful object features in the image. Therefore, we propose an adaptive region growing algorithm based on low-degree polynomial fitting. The algorithm uses a new adaptive thresholding technique with the L∞ fitting cost as a segmentation criterion. The polynomial degree and the fitting error are automatically adapted during the region growing process. The main contribution is that the algorithm detects outliers and edges, distinguishes between strong and smooth intensity transitions and finds surface segments that are bent in a certain way. As a result, the surface segments corresponding to meaningful object features and the contours separating the surface segments coincide with real-image object edges. Moreover, the curvature-based surface shape information facilitates many tasks in image analysis, such as object recognition performed on the polynomial representation. The polynomial representation provides good image approximation while preserving all the necessary details of the objects in the reconstructed images. The method outperforms existing techniques when segmenting images of objects with diffuse reflecting surfaces

Ghent University Academic Bibliography

DepthCut: Improved Depth Edge Estimation Using Multiple Unreliable Channels

Author: Guerrero Paul
Li Wilmot
Mitra Niloy J.
Winnemöller Holger
Publication venue
Publication date: 26/05/2017
Field of study

In the context of scene understanding, a variety of methods exists to estimate different information channels from mono or stereo images, including disparity, depth, and normals. Although several advances have been reported in the recent years for these tasks, the estimated information is often imprecise particularly near depth discontinuities or creases. Studies have however shown that precisely such depth edges carry critical cues for the perception of shape, and play important roles in tasks like depth-based segmentation or foreground selection. Unfortunately, the currently extracted channels often carry conflicting signals, making it difficult for subsequent applications to effectively use them. In this paper, we focus on the problem of obtaining high-precision depth edges (i.e., depth contours and creases) by jointly analyzing such unreliable information channels. We propose DepthCut, a data-driven fusion of the channels using a convolutional neural network trained on a large dataset with known depth. The resulting depth edges can be used for segmentation, decomposing a scene into depth layers with relatively flat depth, or improving the accuracy of the depth estimate near depth edges by constraining its gradients to agree with these edges. Quantitatively, we compare against 15 variants of baselines and demonstrate that our depth edges result in an improved segmentation performance and an improved depth estimate near depth edges compared to data-agnostic channel fusion. Qualitatively, we demonstrate that the depth edges result in superior segmentation and depth orderings.Comment: 12 page

arXiv.org e-Print Archive

UCL Discovery