
    Interactive Cosegmentation Using Global and Local Energy Optimization

    We propose a novel interactive cosegmentation method using global and local energy optimization. The global energy includes two terms: 1) the global scribbled energy and 2) the inter-image energy. The first utilizes the user scribbles to build a Gaussian mixture model and improve the cosegmentation performance. The second is a global constraint that attempts to match the histograms of the common objects. To minimize the local energy, we apply spline regression to learn the smoothness in a local neighborhood. This energy optimization can be converted into a constrained quadratic programming problem; to reduce the computational complexity, we propose an iterative optimization algorithm that decomposes it into several subproblems. Experimental results show that our method outperforms state-of-the-art unsupervised and interactive cosegmentation methods on the iCoseg and MSRC benchmark data sets.
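    To make the scribble-driven term concrete, here is a minimal sketch: it fits color GMMs to the user-scribbled foreground and background pixels and scores every pixel, alongside a toy inter-image histogram term. This is an illustration only, not the authors' implementation; the function names, the component count, and the L2 histogram distance are all assumptions.

```python
# Illustrative sketch of a scribble-based unary term (NOT the paper's code).
import numpy as np
from sklearn.mixture import GaussianMixture

def scribble_unary_energy(image, fg_mask, bg_mask, n_components=5):
    """Per-pixel negative log-likelihood under fg/bg color GMMs.

    image:   (H, W, 3) float array of pixel colors
    fg_mask: (H, W) bool array marking foreground scribbles
    bg_mask: (H, W) bool array marking background scribbles
    """
    pixels = image.reshape(-1, 3)
    # Fit one GMM to the scribbled foreground pixels, one to the background.
    fg_gmm = GaussianMixture(n_components).fit(pixels[fg_mask.ravel()])
    bg_gmm = GaussianMixture(n_components).fit(pixels[bg_mask.ravel()])
    # Lower energy = better fit; a full cosegmentation objective would
    # combine these unaries with inter-image and local smoothness terms.
    fg_energy = -fg_gmm.score_samples(pixels).reshape(image.shape[:2])
    bg_energy = -bg_gmm.score_samples(pixels).reshape(image.shape[:2])
    return fg_energy, bg_energy

def histogram_match_energy(hist_a, hist_b):
    """Toy inter-image term: L2 distance between normalized color
    histograms of the two images' current foreground estimates."""
    a = hist_a / max(hist_a.sum(), 1e-9)
    b = hist_b / max(hist_b.sum(), 1e-9)
    return float(np.sum((a - b) ** 2))
```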

    Deep Interactive Region Segmentation and Captioning

    With recent innovations in dense image captioning, it is now possible to describe every object in a scene with a caption, where objects are localized by bounding boxes. However, interpreting such output is not trivial because many of the bounding boxes overlap. Furthermore, current captioning frameworks do not let the user apply personal preferences to exclude areas that are not of interest. In this paper, we propose a novel hybrid deep learning architecture for interactive region segmentation and captioning in which the user can specify an arbitrary region of the image to be processed. To this end, a dedicated Fully Convolutional Network (FCN), named Lyncean FCN (LFCN), is trained on our special training data to isolate the User Intention Region (UIR) as the output of an efficient segmentation. In parallel, a dense image captioning model provides a wide variety of captions for that region. The UIR is then explained with the caption of the best-matching bounding box. To the best of our knowledge, this is the first work that provides such a comprehensive output. Our experiments show the superiority of the proposed approach over state-of-the-art interactive segmentation methods on several well-known datasets. In addition, replacing the bounding boxes with the result of the interactive segmentation leads to a better understanding of the dense image captioning output, as well as an accuracy improvement for object detection in terms of Intersection over Union (IoU).
    Comment: 17 pages, 9 figures
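    As a rough illustration of the matching step described above, the sketch below converts a UIR mask to a tight bounding box and picks the dense caption whose box overlaps it most by IoU. It is not the paper's code; `mask_to_box`, `caption_for_uir`, and the greedy argmax matching are illustrative assumptions.

```python
# Hedged sketch: match the segmented UIR to the best dense-caption box by IoU.
import numpy as np

def mask_to_box(mask):
    """Tight bounding box (x0, y0, x1, y1) around a binary mask."""
    ys, xs = np.nonzero(mask)
    return xs.min(), ys.min(), xs.max() + 1, ys.max() + 1

def iou(box_a, box_b):
    """Intersection over Union of two (x0, y0, x1, y1) boxes."""
    x0 = max(box_a[0], box_b[0]); y0 = max(box_a[1], box_b[1])
    x1 = min(box_a[2], box_b[2]); y1 = min(box_a[3], box_b[3])
    inter = max(0, x1 - x0) * max(0, y1 - y0)
    area = lambda b: (b[2] - b[0]) * (b[3] - b[1])
    union = area(box_a) + area(box_b) - inter
    return inter / union if union else 0.0

def caption_for_uir(uir_mask, captioned_boxes):
    """captioned_boxes: list of (box, caption) pairs from a dense
    captioner; returns the caption of the best-overlapping box."""
    uir_box = mask_to_box(uir_mask)
    return max(captioned_boxes, key=lambda bc: iou(uir_box, bc[0]))[1]
```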