Search CORE

1,087 research outputs found

Text Classification Aided by Clustering: a Literature Review

Author: Kyriakopoulou Antonia
Publication venue: 'IntechOpen'
Publication date: 01/08/2008
Field of study

IntechOpen

Crossref

Personalized Document Clustering: A Collaborative-Filtering-Based Approach

Author: Hsiao Han-Wie
Wei Chih-Ping
Yang Chin-Sheng
Publication venue: AIS Electronic Library (AISeL)
Publication date: 31/12/2004
Field of study

AIS Electronic Library (AISeL)

Text Classification using Unsupervised Learning techniques

Author: Ricardo Henrique Teixeira Duarte
Publication venue
Publication date: 16/07/2018
Field of study

Repositório Aberto da Universidade do Porto

A survey of data mining techniques for social media analysis

Author: Adedoyin-Olowe Mariam
Gaber Mohamed Medhat
Stahl Frederic
Publication venue: Episciences
Publication date: 16/04/2014
Field of study

Social network has gained remarkable attention in the last decade. Accessing social network sites such as Twitter, Facebook LinkedIn and Google+ through the internet and the web 2.0 technologies has become more affordable. People are becoming more interested in and relying on social network for information, news and opinion of other users on diverse subject matters. The heavy reliance on social network sites causes them to generate massive data characterised by three computational issues namely; size, noise and dynamism. These issues often make social network data very complex to analyse manually, resulting in the pertinent use of computational means of analysing them. Data mining provides a wide range of techniques for detecting useful knowledge from massive datasets like trends, patterns and rules [44]. Data mining techniques are used for information retrieval, statistical modelling and machine learning. These techniques employ data pre-processing, data analysis, and data interpretation processes in the course of data analysis. This survey discusses different data mining techniques used in mining diverse aspects of the social network over decades going from the historical techniques to the up-to-date models, including our novel technique named TRCM. All the techniques covered in this survey are listed in the Table.1 including the tools employed as well as names of their authors

arXiv.org e-Print Archive

Central Archive at the University of Reading

Crossref

Episciences.org

Directory of Open Access Journals

Feature selection, optimization and clustering strategies of text documents

Author: Nikhath A. Kousar
Subrahmanyam K.
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/04/2019
Field of study

Clustering is one of the most researched areas of data mining applications in the contemporary literature. The need for efficient clustering is observed across wide sectors including consumer segmentation, categorization, shared filtering, document management, and indexing. The research of clustering task is to be performed prior to its adaptation in the text environment. Conventional approaches typically emphasized on the quantitative information where the selected features are numbers. Efforts also have been put forward for achieving efficient clustering in the context of categorical information where the selected features can assume nominal values. This manuscript presents an in-depth analysis of challenges of clustering in the text environment. Further, this paper also details prominent models proposed for clustering along with the pros and cons of each model. In addition, it also focuses on various latest developments in the clustering task in the social network and associated environments

Crossref

ZENODO

Institute of Advanced Engineering and Science

Automated subject classification of textual web documents

Author: Koraljka Golub
Publication venue: 'Emerald'
Publication date
Field of study

Crossref