Fast k-NN classifier for documents based on a graph structure

D.L. Lewis; H. Ferhatosmanoglu; J. Yu; J.P. Myles; K. Figueroa; S. Hernández-Rodríguez

research

Fast k-NN classifier for documents based on a graph structure

Authors: D.L. Lewis
H. Ferhatosmanoglu
J. Yu
J.P. Myles
K. Figueroa
S. Hernández-Rodríguez
Publication date: 1 January 2010
Publisher: 'Springer Science and Business Media LLC'
Doi

Abstract

In this paper, a fast k nearest neighbors (k-NN) classifier for documents is presented. Documents are usually represented in a high-dimensional feature space, where terms appeared on it are treated as features and the weight of each term reflects its importance in the document. There are many approaches to find the vicinity of an object, but their performance drastically decreases as the number of dimensions grows. This problem prevents its application for documents. The proposed method is based on a graph index structure with a fast search algorithm. It’s high selectivity permits to obtain a similar classification quality than exhaustive classifier, with a few number of computed distances. Our experimental results show that it is feasible the use of the proposed method in problems of very high dimensionality, such as Text Mining

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Repositori UJI

oai:repositori.uji.es:10234/31...

Last time updated on 05/04/2020

Repositori Institucional de la Universitat Jaume I

oai:repositori.uji.es:10234/31...

Last time updated on 17/11/2016

Crossref

Last time updated on 01/04/2019