28,483 research outputs found
Overlooked Video Classification in Weakly Supervised Video Anomaly Detection
Current weakly supervised video anomaly detection algorithms mostly use
multiple instance learning (MIL) or their varieties. Almost all recent
approaches focus on how to select the correct snippets for training to improve
the performance. They overlook or do not realize the power of video
classification in boosting the performance of anomaly detection. In this paper,
we study explicitly the power of video classification supervision using a BERT
or LSTM. With this BERT or LSTM, CNN features of all snippets of a video can be
aggregated into a single feature which can be used for video classification.
This simple yet powerful video classification supervision, combined into the
MIL framework, brings extraordinary performance improvement on all three major
video anomaly detection datasets. Particularly it improves the mean average
precision (mAP) on the XD-Violence from SOTA 78.84\% to new 82.10\%. The source
code is available at
https://github.com/wjtan99/BERT_Anomaly_Video_Classification.Comment: arXiv admin note: text overlap with arXiv:2101.10030 by other author
Multiple Instance Curriculum Learning for Weakly Supervised Object Detection
When supervising an object detector with weakly labeled data, most existing
approaches are prone to trapping in the discriminative object parts, e.g.,
finding the face of a cat instead of the full body, due to lacking the
supervision on the extent of full objects. To address this challenge, we
incorporate object segmentation into the detector training, which guides the
model to correctly localize the full objects. We propose the multiple instance
curriculum learning (MICL) method, which injects curriculum learning (CL) into
the multiple instance learning (MIL) framework. The MICL method starts by
automatically picking the easy training examples, where the extent of the
segmentation masks agree with detection bounding boxes. The training set is
gradually expanded to include harder examples to train strong detectors that
handle complex images. The proposed MICL method with segmentation in the loop
outperforms the state-of-the-art weakly supervised object detectors by a
substantial margin on the PASCAL VOC datasets.Comment: Published in BMVC 201
Bag-Level Aggregation for Multiple Instance Active Learning in Instance Classification Problems
A growing number of applications, e.g. video surveillance and medical image
analysis, require training recognition systems from large amounts of weakly
annotated data while some targeted interactions with a domain expert are
allowed to improve the training process. In such cases, active learning (AL)
can reduce labeling costs for training a classifier by querying the expert to
provide the labels of most informative instances. This paper focuses on AL
methods for instance classification problems in multiple instance learning
(MIL), where data is arranged into sets, called bags, that are weakly labeled.
Most AL methods focus on single instance learning problems. These methods are
not suitable for MIL problems because they cannot account for the bag structure
of data. In this paper, new methods for bag-level aggregation of instance
informativeness are proposed for multiple instance active learning (MIAL). The
\textit{aggregated informativeness} method identifies the most informative
instances based on classifier uncertainty, and queries bags incorporating the
most information. The other proposed method, called \textit{cluster-based
aggregative sampling}, clusters data hierarchically in the instance space. The
informativeness of instances is assessed by considering bag labels, inferred
instance labels, and the proportion of labels that remain to be discovered in
clusters. Both proposed methods significantly outperform reference methods in
extensive experiments using benchmark data from several application domains.
Results indicate that using an appropriate strategy to address MIAL problems
yields a significant reduction in the number of queries needed to achieve the
same level of performance as single instance AL methods
- …