18 research outputs found
Introducing Geometry in Active Learning for Image Segmentation
We propose an Active Learning approach to training a segmentation classifier
that exploits geometric priors to streamline the annotation process in 3D image
volumes. To this end, we use these priors not only to select voxels most in
need of annotation but to guarantee that they lie on 2D planar patch, which
makes it much easier to annotate than if they were randomly distributed in the
volume. A simplified version of this approach is effective in natural 2D
images. We evaluated our approach on Electron Microscopy and Magnetic Resonance
image volumes, as well as on natural images. Comparing our approach against
several accepted baselines demonstrates a marked performance increase
Expected exponential loss for gaze-based video and volume ground truth annotation
Many recent machine learning approaches used in medical imaging are highly
reliant on large amounts of image and ground truth data. In the context of
object segmentation, pixel-wise annotations are extremely expensive to collect,
especially in video and 3D volumes. To reduce this annotation burden, we
propose a novel framework to allow annotators to simply observe the object to
segment and record where they have looked at with a \$200 eye gaze tracker. Our
method then estimates pixel-wise probabilities for the presence of the object
throughout the sequence from which we train a classifier in semi-supervised
setting using a novel Expected Exponential loss function. We show that our
framework provides superior performances on a wide range of medical image
settings compared to existing strategies and that our method can be combined
with current crowd-sourcing paradigms as well.Comment: 9 pages, 5 figues, MICCAI 2017 - LABELS Worksho
Bag-Level Aggregation for Multiple Instance Active Learning in Instance Classification Problems
A growing number of applications, e.g. video surveillance and medical image
analysis, require training recognition systems from large amounts of weakly
annotated data while some targeted interactions with a domain expert are
allowed to improve the training process. In such cases, active learning (AL)
can reduce labeling costs for training a classifier by querying the expert to
provide the labels of most informative instances. This paper focuses on AL
methods for instance classification problems in multiple instance learning
(MIL), where data is arranged into sets, called bags, that are weakly labeled.
Most AL methods focus on single instance learning problems. These methods are
not suitable for MIL problems because they cannot account for the bag structure
of data. In this paper, new methods for bag-level aggregation of instance
informativeness are proposed for multiple instance active learning (MIAL). The
\textit{aggregated informativeness} method identifies the most informative
instances based on classifier uncertainty, and queries bags incorporating the
most information. The other proposed method, called \textit{cluster-based
aggregative sampling}, clusters data hierarchically in the instance space. The
informativeness of instances is assessed by considering bag labels, inferred
instance labels, and the proportion of labels that remain to be discovered in
clusters. Both proposed methods significantly outperform reference methods in
extensive experiments using benchmark data from several application domains.
Results indicate that using an appropriate strategy to address MIAL problems
yields a significant reduction in the number of queries needed to achieve the
same level of performance as single instance AL methods
Minimum Cost Active Labeling
Labeling a data set completely is important for groundtruth generation. In
this paper, we consider the problem of minimum-cost labeling: classifying all
images in a large data set with a target accuracy bound at minimum dollar cost.
Human labeling can be prohibitive, so we train a classifier to accurately label
part of the data set. However, training the classifier can be expensive too,
particularly with active learning. Our min-cost labeling uses a variant of
active learning to learn a model to predict the optimal training set size for
the classifier that minimizes overall cost, then uses active learning to train
the classifier to maximize the number of samples the classifier can correctly
label. We validate our approach on well-known public data sets such as Fashion,
CIFAR-10, and CIFAR-100. In some cases, our approach has 6X lower overall cost
relative to human labeling, and is always cheaper than the cheapest active
learning strategy
Semantic Segmentation for Fully Automated Macrofouling Analysis on Coatings after Field Exposure
Biofouling is a major challenge for sustainable shipping, filter membranes,
heat exchangers, and medical devices. The development of fouling-resistant
coatings requires the evaluation of their effectiveness. Such an evaluation is
usually based on the assessment of fouling progression after different exposure
times to the target medium (e.g., salt water). The manual assessment of
macrofouling requires expert knowledge about local fouling communities due to
high variances in phenotypical appearance, has single-image sampling
inaccuracies for certain species, and lacks spatial information. Here we
present an approach for automatic image-based macrofouling analysis. We created
a dataset with dense labels prepared from field panel images and propose a
convolutional network (adapted U-Net) for the semantic segmentation of
different macrofouling classes. The establishment of macrofouling localization
allows for the generation of a successional model which enables the
determination of direct surface attachment and in-depth epibiotic studies.Comment: 33 pages, 10 figure
Crowdsourcing in Computer Vision
Computer vision systems require large amounts of manually annotated data to
properly learn challenging visual concepts. Crowdsourcing platforms offer an
inexpensive method to capture human knowledge and understanding, for a vast
number of visual perception tasks. In this survey, we describe the types of
annotations computer vision researchers have collected using crowdsourcing, and
how they have ensured that this data is of high quality while annotation effort
is minimized. We begin by discussing data collection on both classic (e.g.,
object recognition) and recent (e.g., visual story-telling) vision tasks. We
then summarize key design decisions for creating effective data collection
interfaces and workflows, and present strategies for intelligently selecting
the most important data instances to annotate. Finally, we conclude with some
thoughts on the future of crowdsourcing in computer vision.Comment: A 69-page meta review of the field, Foundations and Trends in
Computer Graphics and Vision, 201