Generalised Mutual Information: a Framework for Discriminative
  Clustering

Bouveyron, Charles; Droit, Arnaud; Harchaoui, Warith; Leclercq, Mickaël; Mattei, Pierre-Alexandre; Ohl, Louis; Precioso, Frédéric

Generalised Mutual Information: a Framework for Discriminative Clustering

Authors: Charles Bouveyron
Arnaud Droit
Warith Harchaoui
Mickaël Leclercq
Pierre-Alexandre Mattei
Louis Ohl
Frédéric Precioso
Publication date: 6 September 2023
Publisher

Abstract

In the last decade, recent successes in deep clustering majorly involved the Mutual Information (MI) as an unsupervised objective for training neural networks with increasing regularisations. While the quality of the regularisations have been largely discussed for improvements, little attention has been dedicated to the relevance of MI as a clustering objective. In this paper, we first highlight how the maximisation of MI does not lead to satisfying clusters. We identified the Kullback-Leibler divergence as the main reason of this behaviour. Hence, we generalise the mutual information by changing its core distance, introducing the Generalised Mutual Information (GEMINI): a set of metrics for unsupervised neural network training. Unlike MI, some GEMINIs do not require regularisations when training as they are geometry-aware thanks to distances or kernels in the data space. Finally, we highlight that GEMINIs can automatically select a relevant number of clusters, a property that has been little studied in deep discriminative clustering context where the number of clusters is a priori unknown.Comment: Submitted for review at the IEEE Transactions on Pattern Analysis and Machine Intelligence. This article is an extension of an original NeurIPS 2022 article [arXiv:2210.06300

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2309.02858

Last time updated on 12/09/2023