3,921 research outputs found
Double Refinement Network for Efficient Indoor Monocular Depth Estimation
Monocular depth estimation is the task of obtaining a measure of distance for
each pixel using a single image. It is an important problem in computer vision
and is usually solved using neural networks. Though recent works in this area
have shown significant improvement in accuracy, the state-of-the-art methods
tend to require massive amounts of memory and time to process an image. The
main purpose of this work is to improve the performance of the latest solutions
with no decrease in accuracy. To this end, we introduce the Double Refinement
Network architecture. The proposed method achieves state-of-the-art results on
the standard benchmark RGB-D dataset NYU Depth v2, while its frames per second
rate is significantly higher (up to 18 times speedup per image at batch size 1)
and the RAM usage per image is lower
Fully Automatic Segmentation of Lumbar Vertebrae from CT Images using Cascaded 3D Fully Convolutional Networks
We present a method to address the challenging problem of segmentation of
lumbar vertebrae from CT images acquired with varying fields of view. Our
method is based on cascaded 3D Fully Convolutional Networks (FCNs) consisting
of a localization FCN and a segmentation FCN. More specifically, in the first
step we train a regression 3D FCN (we call it "LocalizationNet") to find the
bounding box of the lumbar region. After that, a 3D U-net like FCN (we call it
"SegmentationNet") is then developed, which after training, can perform a
pixel-wise multi-class segmentation to map a cropped lumber region volumetric
data to its volume-wise labels. Evaluated on publicly available datasets, our
method achieved an average Dice coefficient of 95.77 0.81% and an average
symmetric surface distance of 0.37 0.06 mm.Comment: 5 pages and 5 figure
- …