Search CORE

69 research outputs found

CSNL: A cost-sensitive non-linear decision tree algorithm

Author: Allwein E. L.
Bennett K. P.
Bradford J.
Breslow L.
Brown G.
Elkan C.
Fan W.
Kanani P.
Knoll U.
Martin A.
Masnadi-Shirazi H.
Pazzani M.
Provost F. J.
Sunil Vadera
Ting K.
Ting K.
Turney P.
Vadera S.
Vadera S.
Zadrozny B.
Zhu X.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010
Field of study

This article presents a new decision tree learning algorithm called CSNL that induces Cost-Sensitive Non-Linear decision trees. The algorithm is based on the hypothesis that nonlinear decision nodes provide a better basis than axis-parallel decision nodes and utilizes discriminant analysis to construct nonlinear decision trees that take account of costs of misclassification. The performance of the algorithm is evaluated by applying it to seventeen datasets and the results are compared with those obtained by two well known cost-sensitive algorithms, ICET and MetaCost, which generate multiple trees to obtain some of the best results to date. The results show that CSNL performs at least as well, if not better than these algorithms, in more than twelve of the datasets and is considerably faster. The use of bagging with CSNL further enhances its performance showing the significant benefits of using nonlinear decision nodes. The performance of the algorithm is evaluated by applying it to seventeen data sets and the results are compared with those obtained by two well known cost-sensitive algorithms, ICET and MetaCost, which generate multiple trees to obtain some of the best results to date. The results show that CSNL performs at least as well, if not better than these algorithms, in more than twelve of the data sets and is considerably faster. The use of bagging with CSNL further enhances its performance showing the significant benefits of using non-linear decision nodes

CiteSeerX

University of Salford Institutional Repository

Crossref

Max-Margin Dictionary Learning for Multiclass Image Categorization

Author: B. Fulkerson
E. Allwein
F. Moosmann
F. Perronnin
J. Platt
K. Huang
L. Fei-Fei
N. Shor
S. Lazebnik
Y.G. Jiang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Abstract. Visual dictionary learning and base (binary) classifier train-ing are two basic problems for the recently most popular image cate-gorization framework, which is based on the bag-of-visual-terms (BOV) models and multiclass SVM classifiers. In this paper, we study new algo-rithms to improve performance of this framework from these two aspects. Typically SVM classifiers are trained with dictionaries fixed, and as a re-sult the traditional loss function can only be minimized with respect to hyperplane parameters (w and b). We propose a novel loss function for a binary classifier, which links the hinge-loss term with dictionary learning. By doing so, we can further optimize the loss function with respect to the dictionary parameters. Thus, this framework is able to further increase margins of binary classifiers, and consequently decrease the error bound of the aggregated classifier. On two benchmark dataset

CiteSeerX

Crossref

Building multiclass classifiers for remote homology detection and fold recognition

Author: A Heger
A Krogh
A Sun
AG Murzin
B Taskar
C Leslie
C Leslie
CA Orengo
CD Huang
CH Ding
D Mittelman
E le
E Lindahl
EL Allwein
F Aiolli
F Rosenblatt
George Karypis
H Rangwala
H Saigo
Huzefa Rangwala
I Tsochantaridis
J Rousu
J Shi
J Weston
K Crammer
K Crammer
L Holm
L Liao
M Collins
M Collins
M Marti-Renom
P Baldi
R Kuang
R Rifkin
S Altschul
SB Needleman
SE Brenner
T Jaakkola
T Jaakkola
T Joachims
TF Smith
TG Dietterich
V Vapnik
W Pearson
Y Guermeur
Y Guermeur
Y Hou
Y Hou
Z Barutcuoglu
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Protein remote homology detection and fold recognition are central problems in computational biology. Supervised learning algorithms based on support vector machines are currently one of the most effective methods for solving these problems. These methods are primarily used to solve binary classification problems and they have not been extensively used to solve the more general multiclass remote homology prediction and fold recognition problems. RESULTS: We present a comprehensive evaluation of a number of methods for building SVM-based multiclass classification schemes in the context of the SCOP protein classification. These methods include schemes that directly build an SVM-based multiclass model, schemes that employ a second-level learning approach to combine the predictions generated by a set of binary SVM-based classifiers, and schemes that build and combine binary classifiers for various levels of the SCOP hierarchy beyond those defining the target classes. CONCLUSION: Analyzing the performance achieved by the different approaches on four different datasets we show that most of the proposed multiclass SVM-based classification approaches are quite effective in solving the remote homology prediction and fold recognition problems and that the schemes that use predictions from binary models constructed for ancestral categories within the SCOP hierarchy tend to not only lead to lower error rates but also reduce the number of errors in which a superfamily is assigned to an entirely different fold and a fold is predicted as being from a different SCOP class. Our results also show that the limited size of the training data makes it hard to learn complex second-level models, and that models of moderate complexity lead to consistently better results

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Minnesota Digital Conservancy

Multiclass classification of microarray data samples with a reduced number of genes

Author: A Alizadeh
A Berger
A Dupuy
A Statnikov
A Statnikov
AI Su
C Ambroise
C Furlanello
CE Shannon
CF Aliferis
DJC Mackay
DK Slonim
E Tapia
EL Allwein
Elizabeth Tapia
F Azuaje
F Masulli
FR Kschischang
G James
G Salton
I Guyon
I Shmulevich
I Tsamardinos
I Witten
J Fan
J Hadar
J Khan
J Zhu
JE Staunton
K Yeung
KH Liu
L Breiman
Laura Angelone
Leonardo Ornella
M Dettling
M Hollander
MA Delgado
N Cristianini
Pilar Bulacio
R Rifkin
R Rifkin
RM Fano
S Dudoit
S Huang
S Lee
S Pomeroy
T Abeel
T Furey
T Li
TG Dietterich
TM Cover
V Guruswami
V Vapnik
X Qiu
Y Lin
Y Saeys
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the performance of most classifiers is tightly linked to the effectiveness of mandatory gene selection methods. Critical to gene selection is the availability of estimates about the maximum number of genes that can be handled by any classification algorithm. Lack of such estimates may lead to either computationally demanding explorations of a search space with thousands of dimensions or classification models based on gene sets of unrestricted size. In the former case, unbiased but possibly overfitted classification models may arise. In the latter case, biased classification models unable to support statistically significant findings may be obtained. Results A novel bound on the maximum number of genes that can be handled by binary classifiers in binary mediated multiclass classification algorithms of microarray data samples is presented. The bound suggests that high-dimensional binary output domains might favor the existence of accurate and sparse binary mediated multiclass classifiers for microarray data samples. Conclusions A comprehensive experimental work shows that the bound is indeed useful to induce accurate and sparse multiclass classifiers for microarray data samples.</p

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

CONICET Digital

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Maximizing upgrading and downgrading margins for ordinal regression

Author: A Shashua
AP Bradley
Belen Martin-Barragan
C Cortes
DJ Hand
E Bredensteiner
E Carrizosa
E Carrizosa
E Carrizosa
E Carrizosa
E Grigoroudis
EL Allwein
Emilio Carrizosa
F Plastria
G Ballarino
H Nakayama
J Mercer
J Shawe-Taylor
JC Platt
JP Pedroso
JS Cardoso
L Li
MA Kupinski
N Cristianini
NM Adams
OL Mangasarian
R Herbrich
R Lall
RM Everson
T Hastie
T Jiao
V Vapnik
V Vapnik
W Chu
W Waegeman
Y Guermeur
Y Jin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2011
Field of study

In ordinal regression, a score function and threshold values are sought to classify a set of objects into a set of ranked classes. Classifying an individual in a class with higher (respectively lower) rank than its actual rank is called an upgrading (respectively downgrading) error. Since upgrading and downgrading errors may not have the same importance, they should be considered as two different criteria to be taken into account when measuring the quality of a classifier. In Support Vector Machines, margin maximization is used as an effective and computationally tractable surrogate of the minimization of misclassification errors. As an extension, we consider in this paper the maximization of upgrading and downgrading margins as a surrogate of the minimization of upgrading and downgrading errors, and we address the biobjective problem of finding a classifier maximizing simultaneously the two margins. The whole set of Pareto-optimal solutions of such biobjective problem is described as translations of the optimal solutions of a scalar optimization problem. For the most popular case in which the Euclidean norm is considered, the scalar problem has a unique solution, yielding that all the Pareto-optimal solutions of the biobjective problem are translations of each other. Hence, the Pareto-optimal solutions can easily be provided to the analyst, who, after inspection of the misclassification errors caused, should choose in a later stage the most convenient classifier. The consequence of this analysis is that it provides a theoretical foundation for a popular strategy among practitioners, based on the so-called ROC curve, which is shown here to equal the set of Pareto-optimal solutions of maximizing simultaneously the downgrading and upgrading margins

Crossref

Edinburgh Research Explorer

idUS. Depósito de Investigación Universidad de Sevilla

Boosting algorithms: a review of methods, theory, and applications

Author: A Blumer
A Demiriz
A Torralba
A Vezhnevets
A. Dempster
B Efron
B Ripley
C Blake
C Zhang
C Zhang
E Allwein
E Bauer
F Fleuret
G Eibl
G Jun
H Grabner
H Schwenk
J Friedman
J Friedman
J Rodriguez
J Webb
J Zhu
L Breiman
L Breiman
L Kuncheva
L Mason
L Valiant
N Christiani
N Duffy
O Chapelle
P Bühlmann
P Mallapragada
P Viola
R Avnimelech
R Schapire
R Schapire
S Li
S Li
T Dietterich
T Pham
V Gómez-Verdejo
V Vapnik
V Vapnik
W.-C. Chang
X Zhu
Y Freund
Y Freund
Y Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/01/2012
Field of study

Repositório Científico do Instituto Politécnico de Lisboa

Crossref

Ternary Bradley-Terry model-based decoding for multi-class classification and its extensions

Author: A. Bhattacharjee
A. W. Vaart Van der
B. Zadrozny
C. Angulo
C. Angulo
E. L. Allwein
F. Cutzu
J. Weston
K. Crammer
M. Moreira
N. Murata
N. Yukinawa
O. Dekel
P. D. Allison
R Development Core Team
R. A. Bradley
R. E. Schapire
R. E. Schapire
R. Tibshirani
Shin Ishii
T. G. Dietterich
T. Hastie
T. Hastie
T. Takenouchi
T. Takenouchi
T. Windeatt
Takashi Takenouchi
V. Vapnik
Y. Freund
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Statistical topic models for multi-label document classification

Author: A. C. P. L. F. Carvalho de
A. K. McCallum
America Chambers
D. Blei
D. D. Lewis
D. M. Blei
D. M. Blei
D. M. Blei
D. Mimno
D. Mimno
D. Ramage
E. L. Allwein
E. Loza Mencía
E. Loza Mencía
F. Sebastiani
G. Druck
G. Forman
G. Tsoumakas
G. Tsoumakas
J. Davis
J. Fürnkranz
J. Read
J. Zhu
K. Crammer
K.-M. Schneider
L. Cao
M. Ioannou
M. Rosen-Zvi
M.-L. Zhang
M.-L. Zhang
Mark Steyvers
N. Ghamrawi
N. Japkowicz
N. Ueda
O. Dekel
Padhraic Smyth
R. Rak
R. Rifkin
R.-E. Fan
S. Ji
S. Lacoste-Julien
T. L. Griffiths
T.-Y. Liu
Timothy N. Rubin
W. Hersh
Y. W. Teh
Y. Wang
Y. Yang
Y. Yang
Y. Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers

Author: Erin L. Allwein
Robert E. Schapire
Yoram Singer
Publication venue: Morgan Kaufmann
Publication date: 01/01/2000
Field of study

We present a unifying framework for studying the solution of multiclass categorization problems by reducing them to multiple binary problems that are then solved using a margin-based binary learning algorithm. The proposed framework unifies some of the most popular approaches in which each class is compared against all others, or in which all pairs of classes are compared to each other, or in which output codes with error-correcting properties are used. We propose a general method for combining the classifiers generated on the binary problems, and we prove a general empirical multiclass loss bound given the empirical loss of the individual binary learning algorithms. The scheme and the corresponding bounds apply to many popular classification learning algorithms including support-vector machines, AdaBoost, regression, logistic regression and decision-tree algorithms. We also give a multiclass generalization error analysis for general output codes with AdaBoost as the binary learner. Experimental results with SVM and AdaBoost show that our scheme provides a viable alternative to the most commonly used multiclass algorithms

CiteSeerX

Polychotomous Classification with Pairwise Classifiers: A New Voting Principle

Author: E. L. Allwein
J. Schürmann
V. Roth
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref