463 research outputs found
GenDet: Meta Learning to Generate Detectors From Few Shots
Object detection has made enormous progress and has been widely used in many applications. However, it performs poorly when only limited training data is available for novel classes that the model has never seen before. Most existing approaches solve few-shot detection tasks implicitly without directly modeling the detectors for novel classes. In this article, we propose GenDet, a new meta-learning-based framework that can effectively generate object detectors for novel classes from few shots and, thus, conducts few-shot detection tasks explicitly. The detector generator is trained by numerous few-shot detection tasks sampled from base classes each with sufficient samples, and thus, it is expected to generalize well on novel classes. An adaptive pooling module is further introduced to suppress distracting samples and aggregate the detectors generated from multiple shots. Moreover, we propose to train a reference detector for each base class in the conventional way, with which to guide the training of the detector generator. The reference detectors and the detector generator can be trained simultaneously. Finally, the generated detectors of different classes are encouraged to be orthogonal to each other for better generalization. The proposed approach is extensively evaluated on the ImageNet, VOC, and COCO data sets under various few-shot detection settings, and it achieves new state-of-the-art results
A Survey of Imbalanced Learning on Graphs: Problems, Techniques, and Future Directions
Graphs represent interconnected structures prevalent in a myriad of
real-world scenarios. Effective graph analytics, such as graph learning
methods, enables users to gain profound insights from graph data, underpinning
various tasks including node classification and link prediction. However, these
methods often suffer from data imbalance, a common issue in graph data where
certain segments possess abundant data while others are scarce, thereby leading
to biased learning outcomes. This necessitates the emerging field of imbalanced
learning on graphs, which aims to correct these data distribution skews for
more accurate and representative learning outcomes. In this survey, we embark
on a comprehensive review of the literature on imbalanced learning on graphs.
We begin by providing a definitive understanding of the concept and related
terminologies, establishing a strong foundational understanding for readers.
Following this, we propose two comprehensive taxonomies: (1) the problem
taxonomy, which describes the forms of imbalance we consider, the associated
tasks, and potential solutions; (2) the technique taxonomy, which details key
strategies for addressing these imbalances, and aids readers in their method
selection process. Finally, we suggest prospective future directions for both
problems and techniques within the sphere of imbalanced learning on graphs,
fostering further innovation in this critical area.Comment: The collection of awesome literature on imbalanced learning on
graphs: https://github.com/Xtra-Computing/Awesome-Literature-ILoG
Few-shot Object Detection on Remote Sensing Images
In this paper, we deal with the problem of object detection on remote sensing
images. Previous methods have developed numerous deep CNN-based methods for
object detection on remote sensing images and the report remarkable
achievements in detection performance and efficiency. However, current
CNN-based methods mostly require a large number of annotated samples to train
deep neural networks and tend to have limited generalization abilities for
unseen object categories. In this paper, we introduce a few-shot learning-based
method for object detection on remote sensing images where only a few annotated
samples are provided for the unseen object categories. More specifically, our
model contains three main components: a meta feature extractor that learns to
extract feature representations from input images, a reweighting module that
learn to adaptively assign different weights for each feature representation
from the support images, and a bounding box prediction module that carries out
object detection on the reweighted feature maps. We build our few-shot object
detection model upon YOLOv3 architecture and develop a multi-scale object
detection framework. Experiments on two benchmark datasets demonstrate that
with only a few annotated samples our model can still achieve a satisfying
detection performance on remote sensing images and the performance of our model
is significantly better than the well-established baseline models.Comment: 12pages, 7 figure
A Survey on Negative Transfer
Transfer learning (TL) tries to utilize data or knowledge from one or more
source domains to facilitate the learning in a target domain. It is
particularly useful when the target domain has few or no labeled data, due to
annotation expense, privacy concerns, etc. Unfortunately, the effectiveness of
TL is not always guaranteed. Negative transfer (NT), i.e., the source domain
data/knowledge cause reduced learning performance in the target domain, has
been a long-standing and challenging problem in TL. Various approaches to
handle NT have been proposed in the literature. However, this filed lacks a
systematic survey on the formalization of NT, their factors and the algorithms
that handle NT. This paper proposes to fill this gap. First, the definition of
negative transfer is considered and a taxonomy of the factors are discussed.
Then, near fifty representative approaches for handling NT are categorized and
reviewed, from four perspectives: secure transfer, domain similarity
estimation, distant transfer and negative transfer mitigation. NT in related
fields, e.g., multi-task learning, lifelong learning, and adversarial attacks
are also discussed
Fine-grained Few-shot Recognition by Deep Object Parsing
In our framework, an object is made up of K distinct parts or units, and we
parse a test instance by inferring the K parts, where each part occupies a
distinct location in the feature space, and the instance features at this
location, manifest as an active subset of part templates shared across all
instances. We recognize test instances by comparing its active templates and
the relative geometry of its part locations against those of the presented
few-shot instances. We propose an end-to-end training method to learn part
templates on-top of a convolutional backbone. To combat visual distortions such
as orientation, pose and size, we learn multi-scale templates, and at test-time
parse and match instances across these scales. We show that our method is
competitive with the state-of-the-art, and by virtue of parsing enjoys
interpretability as well
- …