Feature selection, optimization and clustering strategies of text documents

Nikhath, A. Kousar; Subrahmanyam, K.

Feature selection, optimization and clustering strategies of text documents

Authors: A. Kousar Nikhath
K. Subrahmanyam
Publication date: 1 April 2019
Publisher: 'Institute of Advanced Engineering and Science'
Doi

Abstract

Clustering is one of the most researched areas of data mining applications in the contemporary literature. The need for efficient clustering is observed across wide sectors including consumer segmentation, categorization, shared filtering, document management, and indexing. The research of clustering task is to be performed prior to its adaptation in the text environment. Conventional approaches typically emphasized on the quantitative information where the selected features are numbers. Efforts also have been put forward for achieving efficient clustering in the context of categorical information where the selected features can assume nominal values. This manuscript presents an in-depth analysis of challenges of clustering in the text environment. Further, this paper also details prominent models proposed for clustering along with the pros and cons of each model. In addition, it also focuses on various latest developments in the clustering task in the social network and associated environments

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Crossref

Last time updated on 31/10/2020

ZENODO

oai:zenodo.org:4066002

Last time updated on 08/08/2023

Institute of Advanced Engineering and Science

oai:ojs.www.iaescore.com:artic...

Last time updated on 20/10/2020