
    S-TREE: Self-Organizing Trees for Data Clustering and Online Vector Quantization

    This paper introduces S-TREE (Self-Organizing Tree), a family of models that use unsupervised learning to construct hierarchical representations of data and online tree-structured vector quantizers. The S-TREE1 model, which features a new tree-building algorithm, can be implemented with various cost functions. An alternative implementation, S-TREE2, which uses a new double-path search procedure, is also developed. S-TREE2 implements an online procedure that approximates an optimal (unstructured) clustering solution while imposing a tree-structure constraint. The performance of the S-TREE algorithms is illustrated with data clustering and vector quantization examples, including a Gauss-Markov source benchmark and an image compression application. S-TREE performance on these tasks is compared with the standard tree-structured vector quantizer (TSVQ) and the generalized Lloyd algorithm (GLA). The image reconstruction quality with S-TREE2 approaches that of GLA while taking less than 10% of the computing time. S-TREE1 and S-TREE2 also compare favorably with the standard TSVQ in both the time needed to create the codebook and the quality of image reconstruction. (Office of Naval Research N00014-95-10409, N00014-95-0G57)
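    To make the tree-structured search concrete, here is a minimal sketch of the greedy single-path descent used by a standard TSVQ, the baseline the paper compares against; S-TREE2's double-path search keeps the two best candidate paths per level instead of one. The Node class and the toy two-level codebook are illustrative assumptions, not the paper's construction.

```python
import numpy as np

class Node:
    """Binary TSVQ tree node; 'code' is the codeword stored at this node."""
    def __init__(self, code, left=None, right=None):
        self.code = np.asarray(code, dtype=float)
        self.left = left
        self.right = right

def tsvq_encode(node, x):
    """Greedy single-path search: descend toward the nearer child codeword.

    S-TREE2's double-path search (per the abstract) instead tracks the two
    best paths at each level; this sketch shows only the standard baseline.
    """
    while node.left is not None and node.right is not None:
        d_left = np.linalg.norm(x - node.left.code)
        d_right = np.linalg.norm(x - node.right.code)
        node = node.left if d_left <= d_right else node.right
    return node.code  # leaf codeword used to quantize x

# Hypothetical two-level codebook for a 2-D source.
root = Node([0.0, 0.0],
            left=Node([-1.0, 0.0], Node([-1.5, 0.5]), Node([-0.5, -0.5])),
            right=Node([1.0, 0.0], Node([0.5, 0.5]), Node([1.5, -0.5])))
print(tsvq_encode(root, np.array([0.6, 0.4])))  # -> [0.5 0.5]
```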

    Magnification Control in Self-Organizing Maps and Neural Gas

    We consider different ways to control the magnification in self-organizing maps (SOM) and neural gas (NG). Starting from early approaches to magnification control in vector quantization, we then concentrate on different approaches for SOM and NG. We show that three structurally similar approaches can be applied to both algorithms: localized learning, concave-convex learning, and winner-relaxing learning. In doing so, the approach of concave-convex learning in SOM is extended to a more general description, whereas the concave-convex learning for NG is new. In general, the control mechanisms produce only slightly different behavior in the two neural algorithms. However, we emphasize that the NG results are valid for any data dimension, whereas in the SOM case the results hold only for the one-dimensional case. Comment: 24 pages, 4 figures.
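    As a rough illustration of how such control mechanisms enter the update rule, below is a one-dimensional SOM step in which the winner's step size is additionally scaled by a power of the local quantization error, in the spirit of localized learning. The scaling law, the parameter names, and the lattice setup are our own assumptions, not the paper's derivation.

```python
import numpy as np

def som_step(weights, x, sigma, eps0, m_exponent=0.0):
    """One SOM update with a localized-learning-style magnification knob.

    The winner's step size is scaled by a power of its quantization error;
    varying m_exponent shifts the map's magnification behavior. The exact
    scaling used here is an illustrative assumption.
    """
    n = weights.shape[0]
    winner = np.argmin(np.linalg.norm(weights - x, axis=1))
    # Gaussian neighborhood on a 1-D lattice (the case the SOM results cover).
    h = np.exp(-((np.arange(n) - winner) ** 2) / (2 * sigma ** 2))
    # Local modulation: larger quantization error -> larger step.
    local = np.linalg.norm(x - weights[winner]) ** m_exponent
    weights += (eps0 * local) * h[:, None] * (x - weights)
    return weights

rng = np.random.default_rng(0)
w = rng.uniform(0, 1, size=(20, 1))          # 1-D map, 1-D data
for _ in range(2000):
    w = som_step(w, rng.uniform(0, 1, size=1), sigma=1.5, eps0=0.05,
                 m_exponent=0.5)
```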

    Fast Color Quantization Using Weighted Sort-Means Clustering

    Color quantization is an important operation with numerous applications in graphics and image processing. Most quantization methods are essentially based on data clustering algorithms. However, despite its popularity as a general-purpose clustering algorithm, k-means has received little attention in the color quantization literature because of its high computational requirements and sensitivity to initialization. In this paper, a fast color quantization method based on k-means is presented. The method involves several modifications to the conventional (batch) k-means algorithm, including data reduction, sample weighting, and the use of the triangle inequality to speed up the nearest-neighbor search. Experiments on a diverse set of images demonstrate that, with the proposed modifications, k-means becomes very competitive with state-of-the-art color quantization methods in terms of both effectiveness and efficiency. Comment: 30 pages, 2 figures, 4 tables.
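    The triangle-inequality speedup mentioned in the abstract can be sketched as follows: if a point is already within half the distance between its current best center and another center, that other center cannot win and its distance need not be computed. This Elkan-style pruning is a minimal sketch under our own naming; the paper's data reduction and sample weighting are not shown.

```python
import numpy as np

def nearest_center(x, centers, center_dists):
    """Assign x to its nearest center, pruning with the triangle inequality.

    If d(x, best) <= d(best, c_j) / 2, then d(x, c_j) >= d(best, c_j) - d(x, best)
    >= d(x, best), so c_j can be skipped without computing its distance.
    """
    best = 0
    best_d = np.linalg.norm(x - centers[0])
    for j in range(1, len(centers)):
        if best_d <= center_dists[best, j] / 2:
            continue  # pruned: triangle inequality rules c_j out
        d = np.linalg.norm(x - centers[j])
        if d < best_d:
            best, best_d = j, d
    return best

centers = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
# Pairwise inter-center distances, precomputed once per k-means iteration.
center_dists = np.linalg.norm(centers[:, None] - centers[None, :], axis=2)
print(nearest_center(np.array([1.0, 0.5]), centers, center_dists))  # -> 0
```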

    Medical imaging analysis with artificial neural networks

    Given that neural networks have been widely reported on in the medical imaging research community, we provide a focused literature survey of recent neural network developments in computer-aided diagnosis, medical image segmentation and edge detection towards visual content analysis, and medical image registration for its pre-processing and post-processing, with the aims of increasing awareness of how neural networks can be applied to these areas and of providing a foundation for further research and practical development. Representative techniques and algorithms are explained in detail to provide inspiring examples illustrating: (i) how a known neural network with a fixed structure and training procedure can be applied to resolve a medical imaging problem; (ii) how medical images can be analysed, processed, and characterised by neural networks; and (iii) how neural networks can be expanded further to resolve problems relevant to medical imaging. In the concluding section, comparisons among many neural network applications are highlighted to provide a global view on computational intelligence with neural networks in medical imaging.

    Pointwise convergence of the Lloyd algorithm in higher dimension

    We establish the pointwise convergence of the iterative Lloyd algorithm, also known as the k-means algorithm, when the quadratic quantization error of the starting grid (of size N ≥ 2) is lower than the minimal quantization error with respect to the input distribution at level N−1. Such a protocol is known as the splitting method and allows for convergence even when the input distribution has unbounded support. We also show, under a very light assumption, that the resulting limiting grid still has full size N. These results are obtained without any continuity assumption on the input distribution. A variant of the procedure that takes advantage of the asymptotics of the optimal quantizer radius is proposed, which always guarantees the boundedness of the iterated grids.
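    A minimal sketch of the protocol the abstract describes: run Lloyd iterations at size N−1, then start the size-N run from the converged grid plus one extra point (the splitting method). Using a random sample as the new point, and the fixed iteration counts, are illustrative assumptions, not the paper's prescription.

```python
import numpy as np

def lloyd_step(samples, grid):
    """One Lloyd / k-means iteration: assign samples, then recompute centroids."""
    d = np.linalg.norm(samples[:, None] - grid[None, :], axis=2)
    labels = d.argmin(axis=1)
    return np.array([samples[labels == i].mean(axis=0) if np.any(labels == i)
                     else grid[i] for i in range(len(grid))])

def splitting_init(grid_prev, new_point):
    """Splitting protocol: start the size-N run from the converged size N-1
    grid plus one extra point (here a random sample, an illustrative choice)."""
    return np.vstack([grid_prev, new_point])

rng = np.random.default_rng(1)
X = rng.normal(size=(5000, 2))
grid = X[rng.choice(len(X), 2, replace=False)]      # start at N = 2
for N in range(3, 8):
    grid = splitting_init(grid, X[rng.integers(len(X))])
    for _ in range(20):                              # Lloyd iterations at size N
        grid = lloyd_step(X, grid)
```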

    Weighted Mahalanobis Distance for Hyper-Ellipsoidal Clustering

    Cluster analysis is widely used in many applications, ranging from image and speech coding to pattern recognition. A new method that uses the weighted Mahalanobis distance (WMD), via the covariance matrix of the individual clusters, as the basis for grouping is presented in this thesis. In this algorithm, the Mahalanobis distance is used as a measure of similarity between the samples in each cluster. This thesis discusses some difficulties associated with using the Mahalanobis distance in clustering, and the proposed method provides solutions to these problems. The new algorithm is an approximation to the well-known expectation-maximization (EM) procedure used to find the maximum likelihood estimates in a Gaussian mixture model. Unlike the EM procedure, WMD eliminates the requirement of having initial parameters such as the cluster means and variances, as it starts from the raw data set. Properties of the new clustering method are presented by examining the clustering quality for codebooks designed with the proposed method and competing methods on a variety of data sets. The competing methods are the Linde-Buzo-Gray (LBG) algorithm and the fuzzy c-means (FCM) algorithm, both of which use the Euclidean distance. The neural network for hyper-ellipsoidal clustering (HEC), which uses the Mahalanobis distance, is also studied and compared to the WMD method and the other techniques. The new method provides better results than the competing methods, making it another useful tool for clustering.
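    To see why a covariance-aware distance matters for hyper-ellipsoidal clusters, here is a minimal sketch: a point can be closer to one cluster mean in Euclidean terms yet better explained by another cluster's shape under the Mahalanobis distance. The toy clusters are hypothetical, and the thesis's cluster-specific weighting is omitted from this sketch.

```python
import numpy as np

def mahalanobis_sq(x, mean, cov):
    """Squared Mahalanobis distance of x from a cluster with the given mean
    and covariance (the cluster-specific weighting the thesis adds on top
    is left out of this minimal sketch)."""
    diff = x - mean
    return diff @ np.linalg.inv(cov) @ diff

# Cluster A is elongated along the first axis; cluster B is spherical.
mean_a, cov_a = np.zeros(2), np.array([[4.0, 0.0], [0.0, 0.25]])
mean_b, cov_b = np.array([3.0, 0.0]), np.eye(2)

# Euclidean distances: 1.8 to A's mean, 1.2 to B's mean -> B looks nearer.
x = np.array([1.8, 0.0])
print(mahalanobis_sq(x, mean_a, cov_a))  # 0.81 -> A's shape explains x better
print(mahalanobis_sq(x, mean_b, cov_b))  # 1.44
```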

    Improving the Performance of K-Means for Color Quantization

    Color quantization is an important operation with many applications in graphics and image processing. Most quantization methods are essentially based on data clustering algorithms. However, despite its popularity as a general-purpose clustering algorithm, k-means has received little attention in the color quantization literature because of its high computational requirements and sensitivity to initialization. In this paper, we investigate the performance of k-means as a color quantizer. We implement fast and exact variants of k-means with several initialization schemes and then compare the resulting quantizers to some of the most popular quantizers in the literature. Experiments on a diverse set of images demonstrate that an efficient implementation of k-means with an appropriate initialization strategy can in fact serve as a very effective color quantizer. Comment: 26 pages, 4 figures, 13 tables.
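    For context, a bare-bones version of k-means used as a color quantizer looks like the following: flatten the image to a pixel list, cluster in RGB space, and map every pixel to its palette color. This is plain batch k-means with random initialization, not the fast and exact variants or the initialization schemes the paper studies.

```python
import numpy as np

def quantize_colors(image, k, iters=10, seed=0):
    """Quantize an H x W x 3 image to k palette colors with plain batch
    k-means (a minimal sketch only; the paper's variants are not reproduced)."""
    pixels = image.reshape(-1, 3).astype(float)
    rng = np.random.default_rng(seed)
    palette = pixels[rng.choice(len(pixels), k, replace=False)]  # random init
    for _ in range(iters):
        d = np.linalg.norm(pixels[:, None] - palette[None, :], axis=2)
        labels = d.argmin(axis=1)                 # nearest palette color
        for i in range(k):
            if np.any(labels == i):
                palette[i] = pixels[labels == i].mean(axis=0)
    return palette[labels].reshape(image.shape).astype(np.uint8)

img = np.random.default_rng(1).integers(0, 256, size=(64, 64, 3), dtype=np.uint8)
print(quantize_colors(img, k=16).shape)  # (64, 64, 3), at most 16 distinct colors
```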

    A Comparative Study of Efficient Initialization Methods for the K-Means Clustering Algorithm

    K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately, due to its gradient-descent nature, this algorithm is highly sensitive to the initial placement of the cluster centers. Numerous initialization methods have been proposed to address this problem. In this paper, we first present an overview of these methods with an emphasis on their computational efficiency. We then compare eight commonly used linear-time-complexity initialization methods on a large and diverse collection of data sets using various performance criteria. Finally, we analyze the experimental results using non-parametric statistical tests and provide recommendations for practitioners. We demonstrate that popular initialization methods often perform poorly and that there are in fact strong alternatives to these methods. Comment: 17 pages, 1 figure, 7 tables.
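    As one concrete example of the kind of initialization method such a comparison covers, here is a sketch of k-means++ seeding, which draws each new center with probability proportional to its squared distance from the nearest center already chosen. Whether k-means++ is among the paper's eight methods is our assumption, not a claim taken from the abstract.

```python
import numpy as np

def kmeans_pp_init(X, k, rng):
    """k-means++ seeding: pick the first center uniformly at random, then
    draw each subsequent center with probability proportional to its squared
    distance from the nearest center chosen so far."""
    centers = [X[rng.integers(len(X))]]
    for _ in range(k - 1):
        d2 = np.min([np.sum((X - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(X[rng.choice(len(X), p=d2 / d2.sum())])
    return np.array(centers)

rng = np.random.default_rng(42)
X = np.vstack([rng.normal(m, 0.3, size=(100, 2)) for m in (0, 5, 10)])
print(kmeans_pp_init(X, 3, rng))  # three well-separated seeds
```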