Search CORE

1,369 research outputs found

SFNet: Learning Object-aware Semantic Correspondence

Author: Ham Bumsub
Kim Dohyung
Lee Junghyup
Ponce Jean
Publication venue
Publication date: 04/04/2019
Field of study

We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks and subjected to synthetic geometric deformations to train a convolutional neural network (CNN) for this task. Using these masks as part of the supervisory signal offers a good compromise between semantic flow methods, where the amount of training data is limited by the cost of manually selecting point correspondences, and semantic alignment ones, where the regression of a single global geometric transformation between images may be sensitive to image-specific details such as background clutter. We propose a new CNN architecture, dubbed SFNet, which implements this idea. It leverages a new and differentiable version of the argmax function for end-to-end training, with a loss that combines mask and flow consistency with smoothness terms. Experimental results demonstrate the effectiveness of our approach, which significantly outperforms the state of the art on standard benchmarks.Comment: cvpr 2019 oral pape

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

SCNet: Learning Semantic Correspondence

Author: Cho Minsu
Ham Bumsub
Han Kai
Ponce Jean
Rezende Rafael S.
Schmid Cordelia
Wong Kwan-Yee K.
Publication venue
Publication date: 01/01/2017
Field of study

This paper addresses the problem of establishing semantic correspondences between images depicting different instances of the same object or scene category. Previous approaches focus on either combining a spatial regularizer with hand-crafted features, or learning a correspondence model for appearance only. We propose instead a convolutional neural network architecture, called SCNet, for learning a geometrically plausible model for semantic correspondence. SCNet uses region proposals as matching primitives, and explicitly incorporates geometric consistency in its loss function. It is trained on image pairs obtained from the PASCAL VOC 2007 keypoint dataset, and a comparative evaluation on several standard benchmarks demonstrates that the proposed approach substantially outperforms both recent deep learning architectures and previous methods based on hand-crafted features.Comment: ICCV 201

arXiv.org e-Print Archive

Crossref

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

포항공과대학교

HKU Scholars Hub

2D Reconstruction of Small Intestine's Interior Wall

Author: Attar Rahman
Wang Zhihua
Xie Xiang
Yue Shigang
Publication venue
Publication date: 15/03/2018
Field of study

Examining and interpreting of a large number of wireless endoscopic images from the gastrointestinal tract is a tiresome task for physicians. A practical solution is to automatically construct a two dimensional representation of the gastrointestinal tract for easy inspection. However, little has been done on wireless endoscopic image stitching, let alone systematic investigation. The proposed new wireless endoscopic image stitching method consists of two main steps to improve the accuracy and efficiency of image registration. First, the keypoints are extracted by Principle Component Analysis and Scale Invariant Feature Transform (PCA-SIFT) algorithm and refined with Maximum Likelihood Estimation SAmple Consensus (MLESAC) outlier removal to find the most reliable keypoints. Second, the optimal transformation parameters obtained from first step are fed to the Normalised Mutual Information (NMI) algorithm as an initial solution. With modified Marquardt-Levenberg search strategy in a multiscale framework, the NMI can find the optimal transformation parameters in the shortest time. The proposed methodology has been tested on two different datasets - one with real wireless endoscopic images and another with images obtained from Micro-Ball (a new wireless cubic endoscopy system with six image sensors). The results have demonstrated the accuracy and robustness of the proposed methodology both visually and quantitatively.Comment: Journal draf

arXiv.org e-Print Archive

Southampton (e-Prints Soton)

Keypoint Transfer for Fast Whole-Body Segmentation

Author: Golland Polina
Langs Georg
Toews Matthew
Wachinger Christian
Wells William
Publication venue
Publication date: 22/06/2018
Field of study

We introduce an approach for image segmentation based on sparse correspondences between keypoints in testing and training images. Keypoints represent automatically identified distinctive image locations, where each keypoint correspondence suggests a transformation between images. We use these correspondences to transfer label maps of entire organs from the training images to the test image. The keypoint transfer algorithm includes three steps: (i) keypoint matching, (ii) voting-based keypoint labeling, and (iii) keypoint-based probabilistic transfer of organ segmentations. We report segmentation results for abdominal organs in whole-body CT and MRI, as well as in contrast-enhanced CT and MRI. Our method offers a speed-up of about three orders of magnitude in comparison to common multi-atlas segmentation, while achieving an accuracy that compares favorably. Moreover, keypoint transfer does not require the registration to an atlas or a training phase. Finally, the method allows for the segmentation of scans with highly variable field-of-view.Comment: Accepted for publication at IEEE Transactions on Medical Imagin

arXiv.org e-Print Archive

DSpace@MIT

Assessment of surface topography modifications through feature-based registration of areal topography data

Author: Gambucci G.
Leach R.K.
Moretti M.
Senin N.
Publication venue: 'IOP Publishing'
Publication date: 01/06/2019
Field of study

Surface topography modifications due to wear or other factors are usually investigated by visual and microscopic inspection, and – when quantitative assessment is required – through the computation of surface texture parameters. However, the current state-of-the-art areal topography measuring instruments produce detailed, areal reconstructions of surface topography which, in principle, may allow accurate comparison of the individual topographic formations before and after the modification event. The main obstacle to such an approach is registration, i.e. being able to accurately relocate the two topography datasets (measured before and after modification) in the same coordinate system. The challenge is related to the measurements being performed in independent coordinate systems, and on a surface which, having undergone modifications, may not feature easily-identifiable landmarks suitable for alignment. In this work, an algorithmic registration solution is proposed, based on the automated identification and alignment of matching topographic features. A shape descriptor (adapted from the scale invariant feature transform) is used to identify landmarks. Pairs of matching landmarks are identified by similarity of shape descriptor values. Registration is implemented by resolving the absolute orientation problem to align matched landmarks. The registration method is validated and discussed through application to simulated and real topographies selected as test cases

Repository@Nottingham

Learning Semantic Correspondence Exploiting an Object-level Prior

Author: Ham Bumsub
Kim DoHyung
Lee Junghyup
Lee Wonkyung
Ponce Jean
Publication venue: HAL CCSD
Publication date: 21/07/2020
Field of study

We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks and subjected to synthetic geometric deformations to train a convolutional neural network (CNN) for this task. Using these masks as part of the supervisory signal provides an object-level prior for the semantic correspondence task and offers a good compromise between semantic flow methods, where the amount of training data is limited by the cost of manually selecting point correspondences, and semantic alignment ones, where the regression of a single global geometric transformation between images may be sensitive to image-specific details such as background clutter. We propose a new CNN architecture, dubbed SFNet, which implements this idea. It leverages a new and differentiable version of the argmax function for end-to-end training, with a loss that combines mask and flow consistency with smoothness terms. Experimental results demonstrate the effectiveness of our approach, which significantly outperforms the state of the art on standard benchmarks

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server