Concept learning of text documents

An, Jiyuan; Chen, Yi-Ping Phoebe

research

Concept learning of text documents

Authors: Jiyuan An
Yi-Ping Phoebe Chen
Publication date: 1 January 2004
Publisher: IEEE Xplore

Abstract

Concept learning of text documents can be viewed as the problem of acquiring the definition of a general category of documents. To definite the category of a text document, the Conjunctive of keywords is usually be used. These keywords should be fewer and comprehensible. A naïve method is enumerating all combinations of keywords to extract suitable ones. However, because of the enormous number of keyword combinations, it is impossible to extract the most relevant keywords to describe the categories of documents by enumerating all possible combinations of keywords. Many heuristic methods are proposed, such as GA-base, immune based algorithm. In this work, we introduce pruning power technique and propose a robust enumeration-based concept learning algorithm. Experimental results show that the rules produce by our approach has more comprehensible and simplicity than by other methods. <br /

Similar works

Full text

Available Versions

Deakin Research Online

oai:dro.deakin.edu.au:DU:30005...

Last time updated on 22/08/2013