Search CORE

1 research outputs found

Approximate Data Mining in Very Large Relational Data

Author: Christopher Leckie
James C. Bezdek
Ramamohanarao Kotagiri
Richard J. Hathaway
Publication venue
Publication date
Field of study

In this paper we discuss eNERF, an extended version of non-Euclidean relational fuzzy c-means (NERFCM) for approximate clustering in very large (unloadable) relational data. The eNERF procedure consists of four parts: (i) selection of distinguished features by algorithm DF to be monitored during progressive sampling; (ii) progressively sampling a square N × N relation matrix RN by algorithm PS until an n × n sample relation Rn passes a goodness of fit test; (iii) Clustering Rn using algorithm LNERF; and (iv), extension of the LNERF results to RN-Rn by algorithm xNERF, which uses an iterative procedure based on LNERF to compute fuzzy membership values for all of the objects remaining after LNERF clustering of the accepted sample. Three of the four algorithms are new- only LNERF (called NERFCM in the original literature) precedes this article

CiteSeerX