
4 research outputs found

On the selection of the globally optimal prototype subset for nearest-neighbor classification

Author: Carrizosa E.
Martín-Barragán B.
Morales D.R.
Plastria F.
Publication venue: 'Institute for Operations Research and the Management Sciences (INFORMS)'
Publication date: 01/01/2007

The nearest-neighbor classifier has been shown to be a powerful tool for multiclass classification. We explore both theoretical properties and empirical behavior of a variant method, in which the nearest-neighbor rule is applied to a reduced set of prototypes. This set is selected a priori by fixing its cardinality and minimizing the empirical misclassification cost. In this way we alleviate the two serious drawbacks of the nearest-neighbor method: high storage requirements and time-consuming queries. Finding this reduced set is shown to be NP-hard. We provide mixed integer programming (MIP) formulations, which are theoretically compared and solved by a standard MIP solver for small problem instances. We show that the classifiers derived from these formulations are comparable to benchmark procedures. We solve large problem instances by a metaheuristic that yields good classification rules in reasonable time. Additional experiments indicate that prototype-based nearest-neighbor classifiers remain quite stable in the presence of missing values.
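
The selection problem stated in this abstract (fix the number of prototypes, then pick the subset with the lowest empirical misclassification cost) can be illustrated on a toy instance by exhaustive search. The sketch below is not the paper's MIP formulation or its metaheuristic. It assumes unit misclassification costs, squared Euclidean distance, and prototypes drawn from the training points themselves, and all function names and data are hypothetical. Exhaustive enumeration is only feasible for very small instances, which is consistent with the NP-hardness noted above.

```python
from itertools import combinations


def nearest_label(x, prototypes):
    """Return the label of the prototype closest to x (squared Euclidean distance)."""
    closest = min(
        prototypes,
        key=lambda p: sum((a - b) ** 2 for a, b in zip(x, p[0])),
    )
    return closest[1]


def misclassification_count(data, prototypes):
    """Count training points whose nearest prototype has a different label.

    Assumption: every error costs 1 (unit misclassification cost).
    """
    return sum(1 for x, y in data if nearest_label(x, prototypes) != y)


def best_prototype_subset(data, k):
    """Enumerate every size-k subset of the training data and keep the best one.

    This is brute force for illustration only; the number of subsets
    grows combinatorially with the size of the data set.
    """
    best_subset, best_err = None, None
    for subset in combinations(data, k):
        err = misclassification_count(data, subset)
        if best_err is None or err < best_err:
            best_subset, best_err = subset, err
    return list(best_subset), best_err


# Hypothetical toy data: two well-separated one-dimensional classes.
toy_data = [
    ((0.0,), "a"), ((0.2,), "a"), ((0.4,), "a"),
    ((2.0,), "b"), ((2.2,), "b"), ((2.4,), "b"),
]

chosen, errors = best_prototype_subset(toy_data, k=2)
print(chosen, errors)
```

On this toy data, two prototypes (one per class) are enough to classify every training point correctly, so six stored points shrink to two without any loss on the training set.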

Edinburgh Research Explorer

Oxford University Research Archive

Supervised Classification and Mathematical Optimization

Author: Carrizosa Emilio
Romero-Morales Dolores
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013

Data Mining techniques often ask for the resolution of optimization problems. Supervised Classification, and, in particular, Support Vector Machines, can be seen as a paradigmatic instance. In this paper, some links between Mathematical Optimization methods and Supervised Classification are emphasized. It is shown that many different areas of Mathematical Optimization play a central role in off-the-shelf Supervised Classification methods. Moreover, Mathematical Optimization turns out to be extremely useful to address important issues in Classification, such as identifying relevant variables, improving the interpretability of classifiers or dealing with vagueness/noise in the data.

On the selection of the globally optimal prototype subset for Nearest-Neighbor classification

Author: Carrizosa E
Martin-Barragan B
Plastria F
Romero-Morales D
Publication date: 01/01/2007

Oxford University Research Archive

On the Selection of the Globally Optimal Prototype Subset for Nearest-Neighbor Classification

Author: Carrizosa Priego Emilio José
Martín Barragán Belén
Plastria Frank
Romero Morales María Dolores
Publication venue: 'Institute for Operations Research and the Management Sciences (INFORMS)'
Publication date: 20/07/2007

idUS. Depósito de Investigación Universidad de Sevilla