2 research outputs found
A Framework for Deep Constrained Clustering -- Algorithms and Advances
The area of constrained clustering has been extensively explored by
researchers and used by practitioners. Constrained clustering formulations
exist for popular algorithms such as k-means, mixture models, and spectral
clustering but have several limitations. A fundamental strength of deep
learning is its flexibility, and here we explore a deep learning framework for
constrained clustering and in particular explore how it can extend the field of
constrained clustering. We show that our framework can not only handle standard
together/apart constraints (without the well documented negative effects
reported earlier) generated from labeled side information but more complex
constraints generated from new types of side information such as continuous
values and high-level domain knowledge.Comment: Updated for ECML/PKDD 201
A Framework for Deep Constrained Clustering
The area of constrained clustering has been extensively explored by
researchers and used by practitioners. Constrained clustering formulations
exist for popular algorithms such as k-means, mixture models, and spectral
clustering but have several limitations. A fundamental strength of deep
learning is its flexibility, and here we explore a deep learning framework for
constrained clustering and in particular explore how it can extend the field of
constrained clustering. We show that our framework can not only handle standard
together/apart constraints (without the well documented negative effects
reported earlier) generated from labeled side information but more complex
constraints generated from new types of side information such as continuous
values and high-level domain knowledge. Furthermore, we propose an efficient
training paradigm that is generally applicable to these four types of
constraints. We validate the effectiveness of our approach by empirical results
on both image and text datasets. We also study the robustness of our framework
when learning with noisy constraints and show how different components of our
framework contribute to the final performance. Our source code is available at
.Comment: Data Mining and Knowledge Discovery, 2021. arXiv admin note:
substantial text overlap with arXiv:1901.1006