Ranking instances by maximizing the area under ROC curve

Guvenir, H. A.; Kurtcephe, M.

research

Ranking instances by maximizing the area under ROC curve

Authors: H. A. Guvenir
M. Kurtcephe
Publication date: 1 January 2013
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'
Doi

Abstract

Cataloged from PDF version of article.In recent years, the problem of learning a real-valued function that induces a ranking over an instance space has gained importance in machine learning literature. Here, we propose a supervised algorithm that learns a ranking function, called ranking instances by maximizing the area under the ROC curve (RIMARC). Since the area under the ROC curve (AUC) is a widely accepted performance measure for evaluating the quality of ranking, the algorithm aims to maximize the AUC value directly. For a single categorical feature, we show the necessary and sufficient condition that any ranking function must satisfy to achieve the maximum AUC. We also sketch a method to discretize a continuous feature in a way to reach the maximum AUC as well. RIMARC uses a heuristic to extend this maximization to all features of a data set. The ranking function learned by the RIMARC algorithm is in a humanreadable form; therefore, it provides valuable information to domain experts for decision making. Performance of RIMARC is evaluated on many real-life data sets by using different state-of-the-art algorithms. Evaluations of the AUC metric show that RIMARC achieves significantly better performance compared to other similar methods

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Crossref

info:doi/10.1109%2Ftkde.2012.2...

Last time updated on 01/04/2019

Bilkent University Institutional Repository

oai:repository.bilkent.edu.tr:...

Last time updated on 12/11/2016