Search CORE

66 research outputs found

Feedforward semantic segmentation with zoom-out features

Author: Mostajabi Mohammadreza
Shakhnarovich Gregory
Yadollahpour Payman
Publication venue
Publication date: 01/12/2014
Field of study

We introduce a purely feed-forward architecture for semantic segmentation. We map small image elements (superpixels) to rich feature representations extracted from a sequence of nested regions of increasing extent. These regions are obtained by "zooming out" from the superpixel all the way to scene-level resolution. This approach exploits statistical structure in the image and in the label space without setting up explicit structured prediction mechanisms, and thus avoids complex and expensive inference. Instead superpixels are classified by a feedforward multilayer network. Our architecture achieves new state of the art performance in semantic segmentation, obtaining 64.4% average accuracy on the PASCAL VOC 2012 test set

arXiv.org e-Print Archive

Crossref

DeepOrgan: Multi-level Deep Convolutional Networks for Automated Pancreas Segmentation

Author: Farag Amal
Liu Jiamin
Lu Le
Roth Holger R.
Shin Hoo-Chang
Summers Ronald M.
Turkbey Evrim
Publication venue
Publication date: 21/06/2015
Field of study

Automatic organ segmentation is an important yet challenging problem for medical image analysis. The pancreas is an abdominal organ with very high anatomical variability. This inhibits previous segmentation methods from achieving high accuracies, especially compared to other organs such as the liver, heart or kidneys. In this paper, we present a probabilistic bottom-up approach for pancreas segmentation in abdominal computed tomography (CT) scans, using multi-level deep convolutional networks (ConvNets). We propose and evaluate several variations of deep ConvNets in the context of hierarchical, coarse-to-fine classification on image patches and regions, i.e. superpixels. We first present a dense labeling of local image patches via

P{-}\mathrm{ConvNet}

and nearest neighbor fusion. Then we describe a regional ConvNet (

R_1{-}\mathrm{ConvNet}

) that samples a set of bounding boxes around each image superpixel at different scales of contexts in a "zoom-out" fashion. Our ConvNets learn to assign class probabilities for each superpixel region of being pancreas. Last, we study a stacked

R_2{-}\mathrm{ConvNet}

leveraging the joint space of CT intensities and the

P{-}\mathrm{ConvNet}

dense probability maps. Both 3D Gaussian smoothing and 2D conditional random fields are exploited as structured predictions for post-processing. We evaluate on CT images of 82 patients in 4-fold cross-validation. We achieve a Dice Similarity Coefficient of 83.6

\pm

6.3% in training and 71.8

\pm

10.7% in testing.Comment: To be presented at MICCAI 2015 - 18th International Conference on Medical Computing and Computer Assisted Interventions, Munich, German

arXiv.org e-Print Archive

CiteSeerX

Regularizing Deep Networks by Modeling and Predicting Label Structure

Author: Maire Michael
Mostajabi Mohammadreza
Shakhnarovich Gregory
Publication venue
Publication date: 05/04/2018
Field of study

We construct custom regularization functions for use in supervised training of deep neural networks. Our technique is applicable when the ground-truth labels themselves exhibit internal structure; we derive a regularizer by learning an autoencoder over the set of annotations. Training thereby becomes a two-phase procedure. The first phase models labels with an autoencoder. The second phase trains the actual network of interest by attaching an auxiliary branch that must predict output via a hidden layer of the autoencoder. After training, we discard this auxiliary branch. We experiment in the context of semantic segmentation, demonstrating this regularization strategy leads to consistent accuracy boosts over baselines, both when training from scratch, or in combination with ImageNet pretraining. Gains are also consistent over different choices of convolutional network architecture. As our regularizer is discarded after training, our method has zero cost at test time; the performance improvements are essentially free. We are simply able to learn better network weights by building an abstract model of the label space, and then training the network to understand this abstraction alongside the original task.Comment: to appear at CVPR 201

arXiv.org e-Print Archive

Crossref