Relaxing Feature Selection in Spam Filtering by Using Case-Based Reasoning Systems

Abstract

This paper presents a comparison between two alternative strategies for addressing feature selection on a well known case-based reasoning spam filtering system called SpamHunting. We present the usage of the k more predictive features and a percentage-based strategy for the exploitation of our amount of information measure. Finally, we confirm the idea that the percentage feature selection method is more adequate for spam filtering domain

    Similar works