ROCK algorithm parallelization with TOREADOR primitives
We present the benefits of applying the "code once, deploy everywhere" approach to clustering categorical data in large datasets. The paper makes two main contributions: a step-by-step application of the code-based approach and an enhancement of the ROCK algorithm for clustering categorical data.