167,161 research outputs found
A Hybrid Instance-based Transfer Learning Method
In recent years, supervised machine learning models have demonstrated
tremendous success in a variety of application domains. Despite the promising
results, these successful models are data hungry and their performance relies
heavily on the size of training data. However, in many healthcare applications
it is difficult to collect sufficiently large training datasets. Transfer
learning can help overcome this issue by transferring the knowledge from
readily available datasets (source) to a new dataset (target). In this work, we
propose a hybrid instance-based transfer learning method that outperforms a set
of baselines including state-of-the-art instance-based transfer learning
approaches. Our method uses a probabilistic weighting strategy to fuse
information from the source domain to the model learned in the target domain.
Our method is generic, applicable to multiple source domains, and robust with
respect to negative transfer. We demonstrate the effectiveness of our approach
through extensive experiments for two different applications.Comment: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018
arXiv:cs/010120
Improvements to context based self-supervised learning
We develop a set of methods to improve on the results of self-supervised
learning using context. We start with a baseline of patch based arrangement
context learning and go from there. Our methods address some overt problems
such as chromatic aberration as well as other potential problems such as
spatial skew and mid-level feature neglect. We prevent problems with testing
generalization on common self-supervised benchmark tests by using different
datasets during our development. The results of our methods combined yield top
scores on all standard self-supervised benchmarks, including classification and
detection on PASCAL VOC 2007, segmentation on PASCAL VOC 2012, and "linear
tests" on the ImageNet and CSAIL Places datasets. We obtain an improvement over
our baseline method of between 4.0 to 7.1 percentage points on transfer
learning classification tests. We also show results on different standard
network architectures to demonstrate generalization as well as portability. All
data, models and programs are available at:
https://gdo-datasci.llnl.gov/selfsupervised/.Comment: Accepted paper at CVPR 201
Transfer Learning for Speech and Language Processing
Transfer learning is a vital technique that generalizes models trained for
one setting or task to other settings or tasks. For example in speech
recognition, an acoustic model trained for one language can be used to
recognize speech in another language, with little or no re-training data.
Transfer learning is closely related to multi-task learning (cross-lingual vs.
multilingual), and is traditionally studied in the name of `model adaptation'.
Recent advance in deep learning shows that transfer learning becomes much
easier and more effective with high-level abstract features learned by deep
models, and the `transfer' can be conducted not only between data distributions
and data types, but also between model structures (e.g., shallow nets and deep
nets) or even model types (e.g., Bayesian models and neural models). This
review paper summarizes some recent prominent research towards this direction,
particularly for speech and language processing. We also report some results
from our group and highlight the potential of this very interesting research
field.Comment: 13 pages, APSIPA 201
Instance-based Deep Transfer Learning
Deep transfer learning recently has acquired significant research interest.
It makes use of pre-trained models that are learned from a source domain, and
utilizes these models for the tasks in a target domain. Model-based deep
transfer learning is probably the most frequently used method. However, very
little research work has been devoted to enhancing deep transfer learning by
focusing on the influence of data. In this paper, we propose an instance-based
approach to improve deep transfer learning in a target domain. Specifically, we
choose a pre-trained model from a source domain and apply this model to
estimate the influence of training samples in a target domain. Then we optimize
the training data of the target domain by removing the training samples that
will lower the performance of the pre-trained model. We later either fine-tune
the pre-trained model with the optimized training data in the target domain, or
build a new model which is initialized partially based on the pre-trained
model, and fine-tune it with the optimized training data in the target domain.
Using this approach, transfer learning can help deep learning models to capture
more useful features. Extensive experiments demonstrate the effectiveness of
our approach on boosting the quality of deep learning models for some common
computer vision tasks, such as image classification.Comment: Accepted to WACV 2019. This is a preprint versio
- …