1,661 research outputs found
A survey of kernel and spectral methods for clustering
Clustering algorithms are a useful tool to explore data structures and have been employed in many disciplines. The focus of this paper is the partitioning clustering problem with a special interest in two recent approaches: kernel and spectral methods. The aim of this paper is to present a survey of kernel and spectral clustering methods, two approaches able to produce nonlinear separating hypersurfaces between clusters. The presented kernel clustering methods are the kernel version of many classical clustering algorithms, e.g., K-means, SOM and neural gas. Spectral clustering arise from concepts in spectral graph theory and the clustering problem is configured as a graph cut problem where an appropriate objective function has to be optimized. An explicit proof of the fact that these two paradigms have the same objective is reported since it has been proven that these two seemingly different approaches have the same mathematical foundation. Besides, fuzzy kernel clustering methods are presented as extensions of kernel K-means clustering algorithm. (C) 2007 Pattem Recognition Society. Published by Elsevier Ltd. All rights reserved
Conceptual Spaces in Object-Oriented Framework
The aim of this paper is to show that the middle level of
mental representations in a conceptual spaces framework is consistent
with the OOP paradigm. We argue that conceptual spaces framework
together with vague prototype theory of categorization appears to be
the most suitable solution for modeling the cognitive apparatus of
humans, and that the OOP paradigm can be easily and intuitively
reconciled with this framework. First, we show that the prototypebased
OOP approach is consistent with GƤrdenforsā model in terms
of structural coherence. Second, we argue that the product of cloning
process in a prototype-based model is in line with the structure of
categories in GƤrdenforsā proposal. Finally, in order to make the fuzzy
object-oriented model consistent with conceptual space, we
demonstrate how to define membership function in a more cognitive
manner, i.e. in terms of similarity to prototype
Clustering Via Nonparametric Density Estimation: the R Package pdfCluster
The R package pdfCluster performs cluster analysis based on a nonparametric
estimate of the density of the observed variables. After summarizing the main
aspects of the methodology, we describe the features and the usage of the
package, and finally illustrate its working with the aid of two datasets
-MLE: A fast algorithm for learning statistical mixture models
We describe -MLE, a fast and efficient local search algorithm for learning
finite statistical mixtures of exponential families such as Gaussian mixture
models. Mixture models are traditionally learned using the
expectation-maximization (EM) soft clustering technique that monotonically
increases the incomplete (expected complete) likelihood. Given prescribed
mixture weights, the hard clustering -MLE algorithm iteratively assigns data
to the most likely weighted component and update the component models using
Maximum Likelihood Estimators (MLEs). Using the duality between exponential
families and Bregman divergences, we prove that the local convergence of the
complete likelihood of -MLE follows directly from the convergence of a dual
additively weighted Bregman hard clustering. The inner loop of -MLE can be
implemented using any -means heuristic like the celebrated Lloyd's batched
or Hartigan's greedy swap updates. We then show how to update the mixture
weights by minimizing a cross-entropy criterion that implies to update weights
by taking the relative proportion of cluster points, and reiterate the mixture
parameter update and mixture weight update processes until convergence. Hard EM
is interpreted as a special case of -MLE when both the component update and
the weight update are performed successively in the inner loop. To initialize
-MLE, we propose -MLE++, a careful initialization of -MLE guaranteeing
probabilistically a global bound on the best possible complete likelihood.Comment: 31 pages, Extend preliminary paper presented at IEEE ICASSP 201
- ā¦