Search CORE

5 research outputs found

Evolving rules for document classification

Author: A. Bergström
C. Apté
C.M. Tan
D. Montana
D.R. Tauritz
F. Sebastiani
G. Salton
H. Lodhi
J.R. Koza
K. Bennet
M. Damashek
T. Joachims
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

We describe a novel method for using Genetic Programming to create compact classification rules based on combinations of N-Grams (character strings). Genetic programs acquire fitness by producing rules that are effective classifiers in terms of precision and recall when evaluated against a set of training documents. We describe a set of functions and terminals and provide results from a classification task using the Reuters 21578 dataset. We also suggest that because the induced rules are meaningful to a human analyst they may have a number of other uses beyond classification and provide a basis for text mining applications

CiteSeerX

Crossref

Sheffield Hallam University Research Archive

UCL Discovery

SPAM detection: Naïve bayesian classification and RPN expression-based LGP approaches compared

Author: A Guven
A Khorsi
AH Gandomi
AW Burks
C Sangeetha
Carlton Downey
CL Hamblin
E Stamatatos
GV Cormack
I Kononenko
J Pearl
L Hirsch
Lorrie Faith Cranor
M Basavaraju
M Brameier
M Matsumoto
M Zhang
PE Bennett
S Mukkamala
VA Yatsko
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/07/2016
Field of study

An investigation is performed of a machine learning algorithm and the Bayesian classifier in the spam-filtering context. The paper shows the advantage of the use of Reverse Polish Notation (RPN) expressions with feature extraction compared to the traditional Naïve Bayesian classifier used for spam detection assuming the same features. The performance of the two is investigated using a public corpus and a recent private spam collection, concluding that the system based on RPN LGP (Linear Genetic Programming) gave better results compared to two popularly used open source Bayesian spam filters. © Springer International Publishing Switzerland 2016

Crossref

Institutional repository of Tomas Bata University Library

Application of Context Aware Systems to Support Knowledge Work in the Aerospace

Author: Xie Yifan
Publication venue
Publication date: 09/12/2013
Field of study

OPUS