1,344 research outputs found
Deep Divergence-Based Approach to Clustering
A promising direction in deep learning research consists in learning
representations and simultaneously discovering cluster structure in unlabeled
data by optimizing a discriminative loss function. As opposed to supervised
deep learning, this line of research is in its infancy, and how to design and
optimize suitable loss functions to train deep neural networks for clustering
is still an open question. Our contribution to this emerging field is a new
deep clustering network that leverages the discriminative power of
information-theoretic divergence measures, which have been shown to be
effective in traditional clustering. We propose a novel loss function that
incorporates geometric regularization constraints, thus avoiding degenerate
structures of the resulting clustering partition. Experiments on synthetic
benchmarks and real datasets show that the proposed network achieves
competitive performance with respect to other state-of-the-art methods, scales
well to large datasets, and does not require pre-training steps
Algorithm-Dependent Bounds for Representation Learning of Multi-Source Domain Adaptation
We use information-theoretic tools to derive a novel analysis of Multi-source
Domain Adaptation (MDA) from the representation learning perspective.
Concretely, we study joint distribution alignment for supervised MDA with few
target labels and unsupervised MDA with pseudo labels, where the latter is
relatively hard and less commonly studied. We further provide
algorithm-dependent generalization bounds for these two settings, where the
generalization is characterized by the mutual information between the
parameters and the data. Then we propose a novel deep MDA algorithm, implicitly
addressing the target shift through joint alignment. Finally, the mutual
information bounds are extended to this algorithm providing a non-vacuous
gradient-norm estimation. The proposed algorithm has comparable performance to
the state-of-the-art on target-shifted MDA benchmark with improved memory
efficiency
- …