381 research outputs found
A Novel Weakly-supervised approach for RGB-D-based Nuclear Waste Object Detection and Categorization
This paper addresses the problem of RGBD-based detection and categorization of waste objects for nuclear decommissioning. To enable autonomous robotic manipulation for nuclear decommissioning, nuclear waste objects must be detected and categorized. However, as a novel industrial application, large amounts of annotated waste object data are currently unavailable. To overcome this problem, we propose a weakly-supervised learning approach which is able to learn a deep convolutional neural network (DCNN) from unlabelled RGBD videos while requiring very few annotations. The proposed method also has the potential to be applied to other household or industrial applications. We evaluate our approach on the Washington RGBD object recognition benchmark, achieving the state-of-the-art performance among semi-supervised methods. More importantly, we introduce a novel dataset, i.e. Birmingham nuclear waste simulants dataset, and evaluate our proposed approach on this novel industrial object recognition challenge. We further propose a complete real-time pipeline for RGBD-based detection and categorization of nuclear waste simulants. Our weakly-supervised approach has demonstrated to be highly effective in solving a novel RGB-D object detection and recognition application with limited human annotations
Unsupervised domain adaptation semantic segmentation of high-resolution remote sensing imagery with invariant domain-level prototype memory
Semantic segmentation is a key technique involved in automatic interpretation
of high-resolution remote sensing (HRS) imagery and has drawn much attention in
the remote sensing community. Deep convolutional neural networks (DCNNs) have
been successfully applied to the HRS imagery semantic segmentation task due to
their hierarchical representation ability. However, the heavy dependency on a
large number of training data with dense annotation and the sensitiveness to
the variation of data distribution severely restrict the potential application
of DCNNs for the semantic segmentation of HRS imagery. This study proposes a
novel unsupervised domain adaptation semantic segmentation network
(MemoryAdaptNet) for the semantic segmentation of HRS imagery. MemoryAdaptNet
constructs an output space adversarial learning scheme to bridge the domain
distribution discrepancy between source domain and target domain and to narrow
the influence of domain shift. Specifically, we embed an invariant feature
memory module to store invariant domain-level context information because the
features obtained from adversarial learning only tend to represent the variant
feature of current limited inputs. This module is integrated by a category
attention-driven invariant domain-level context aggregation module to current
pseudo invariant feature for further augmenting the pixel representations. An
entropy-based pseudo label filtering strategy is used to update the memory
module with high-confident pseudo invariant feature of current target images.
Extensive experiments under three cross-domain tasks indicate that our proposed
MemoryAdaptNet is remarkably superior to the state-of-the-art methods.Comment: 17 pages, 12 figures and 8 table
A Comprehensive Literature Review on Convolutional Neural Networks
The fields of computer vision and image processing from their initial days have been dealing with the problems of visual recognition. Convolutional Neural Networks (CNNs) in machine learning are deep architectures built as feed-forward neural networks or perceptrons, which are inspired by the research done in the fields of visual analysis by the visual cortex of mammals like cats. This work gives a detailed analysis of CNNs for the computer vision tasks, natural language processing, fundamental sciences and engineering problems along with other miscellaneous tasks. The general CNN structure along with its mathematical intuition and working, a brief critical commentary on the advantages and disadvantages, which leads researchers to search for alternatives to CNN’s are also mentioned. The paper also serves as an appreciation of the brain-child of past researchers for the existence of such a fecund architecture for handling multidimensional data and approaches to improve their performance further
- …