A Clustering based Discretization for Supervised Learning

Gupta, Ankit; Mehrotra, Kishan; Mohan, Chilukuri K.

Authors: Ankit Gupta, Kishan Mehrotra, Chilukuri K. Mohan
Publication date: 2 November 2009
Publisher: SURFACE at Syracuse University

Abstract

We address the problem of discretizing continuous variables for machine learning classification algorithms. Existing procedures do not use the interdependence between variables for this purpose. Our proposed method uses clustering to exploit this interdependence. Numerical results show that this improves classification performance in almost all cases. Even when an existing algorithm can operate successfully on continuous variables, it performs better if the variables are first discretized. An additional advantage of discretization is that it reduces the overall time complexity.

Available Versions

Syracuse University Research Facility and Collaborative Environment

oai:surface.syr.edu:eecs-1002

Last time updated on 09/07/2019