Data Reduction Method for Categorical Data Clustering

Abundez, Itzel; García, Rene A.; Gasca, Eduardo; Gutiérrez, Citlalih; Rendón, Eréndira; Sánchez Garreta, Josep Salvador

Publication date: 1 January 2008
Publisher: Springer Verlag

Abstract

Categorical data clustering is an important part of data mining, and it has recently drawn attention from several researchers. As a step in data mining, however, clustering faces the problem of the large amount of data to be processed. This article offers a solution for categorical clustering algorithms working with high volumes of data: a method that summarizes the database using a structure called the CM-tree. To test our method, the KModes and Click clustering algorithms were run on several databases. The experiments demonstrate that the proposed summarization method improves execution time without losing clustering quality.

Available Versions

Repositori Institucional de la Universitat Jaume I

oai:repositori.uji.es:10234/18...

Last time updated on 17/11/2016

Repositori UJI

oai:repositori.uji.es:10234/18...

Last time updated on 05/04/2020