A Swarm Based Approach to Improve Traditional Document Clustering Approach

Abstract

Clustering, an extremely important technique in Data Mining is an automatic learning technique aimed at grouping a set of objects into subsets or clusters. The goal is to create clusters that are coherent internally, but substantially different from each other. Text Document Clustering refers to the clustering of related text documents into groups based upon their content. Document clustering is a fundamental operation used in unsupervised document organization, text data mining, automatic topic extraction, and information retrieval. Fast and high - quality document clustering algorithms play an important role in effectively navigating, summarizing, and organizing information. The documents to be clustered can be web news articles, abstracts of research papers etc. The aim of this paper is to provide efficient document clustering technique involving the application of soft computing approach and the use of swarm intelligence based algorithm

    Similar works