An optimising approach to alternative clustering schemes

By A. Jessop

Abstract

Objects are usually clustered into groups using either a statistical heuristic or an optimisation, the choice depending on the size of the problem and its purpose. Several partitions may exist which do not differ significantly, but some of them may be preferable (or equally good) once aspects of the problem not formally contained in the model are considered when interpreting the result. To decide between a number of good partitions, they must first be enumerated, and this may be done by using a number of different heuristics. This paper describes an alternative method which uses an integer linear programming model with the number and size distribution of groups as objectives and the criteria for group membership as constraints. The model is applied to three problems, each having a different measure of dissimilarity between objects and so different membership criteria. In each case a number of optimal solutions are found and expressed in two parts: a core of groups, the membership of which does not change, and the remaining objects, which augment the core. The core is found to contain over three quarters of the objects and so provides a stable base for cluster definition.
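
The split into a core and remaining objects can be illustrated with a minimal Python sketch. It is not the paper's ILP model: it only assumes that several alternative partitions have already been found, and it reads "core" as the sets of objects that share a group in every one of them, which is an interpretation of the abstract rather than the author's stated definition. The function name `core_groups` and the toy data are hypothetical.

```python
from collections import defaultdict


def core_groups(partitions):
    """Split objects into core groups and remaining objects.

    partitions: list of dicts, each mapping object -> group label.
    Assumption (not from the paper): two objects belong to the same
    core group if they share a label in every partition.
    """
    by_signature = defaultdict(list)
    for obj in partitions[0]:
        # The tuple of labels across all partitions identifies which
        # objects are always grouped together.
        signature = tuple(p[obj] for p in partitions)
        by_signature[signature].append(obj)

    core = sorted(sorted(g) for g in by_signature.values() if len(g) >= 2)
    rest = sorted(g[0] for g in by_signature.values() if len(g) == 1)
    return core, rest


# Hypothetical toy data: two alternative partitions of five objects.
example = [
    {"a": 0, "b": 0, "c": 1, "d": 1, "e": 1},
    {"a": 0, "b": 0, "c": 1, "d": 1, "e": 0},
]
core, rest = core_groups(example)
print(core, rest)
```

Here objects "a" and "b" stay together in both partitions, as do "c" and "d", so they form the core, while "e" changes group and is left to augment it.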

Topics: ILP, Multicriteria, Statistics, Cluster.
Publisher: Springer
Year: 2010
DOI identifier: 10.1007/s10100-009-0111-5
OAI identifier: oai:dro.dur.ac.uk.OAI2:8180
