39,572 research outputs found

A Simple Iterative Algorithm for Parsimonious Binary Kernel Fisher Discrimination

Author: B Chien
B Efron
B Krishnapuram
B Schölkopf
CM Bishop
D Hunter
D Masip
E Andelić
G Baudat
G Rätsch
J Lu
J Yang
J Zhu
K Fukunaga
K Lange
Kitsuchart Pasupa
M Figueiredo
M Last
M Osborne
N Hsieh
R Duda
R Dutter
R Harrison
Robert F. Harrison
S Abe
S Billings
S Keerthi
S Mika
T Hastie
V Roth
Y Park
Y Sun
Y Washizawa
Y Xu
Z Liang
Publication venue: Springer Science and Business Media LLC
Publication date: 01/02/2010

By applying recent results in optimization theory variously known as optimization transfer or majorize/minimize algorithms, an algorithm for binary, kernel, Fisher discriminant analysis is introduced that makes use of a non-smooth penalty on the coefficients to provide a parsimonious solution. The problem is converted into a smooth optimization that can be solved iteratively with no greater overhead than iteratively re-weighted least-squares. The result is simple, easily programmed and is shown to perform, in terms of both accuracy and parsimony, as well as or better than a number of leading machine learning algorithms on two well-studied and substantial benchmarks.
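
The majorize/minimize idea mentioned in this abstract can be illustrated on a much simpler problem than the paper's. The sketch below is not the authors' algorithm: it assumes a one-dimensional objective 0.5*(b - z)^2 + lam*|b|, and the names `mm_soft_threshold`, `soft_threshold`, `z`, `lam`, `iters` and `eps` are chosen here for illustration only. It shows how replacing the non-smooth |b| with a quadratic upper bound turns each step into a re-weighted least-squares update.

```python
def mm_soft_threshold(z, lam, iters=200, eps=1e-12):
    """Minimize 0.5*(b - z)**2 + lam*abs(b) by majorize/minimize steps.

    Illustrative toy problem only; not the method of the listed paper.
    """
    b = z  # start from the unpenalized solution
    for _ in range(iters):
        # Majorize |b| at the current iterate b_k by the quadratic
        # b**2 / (2*|b_k|) + |b_k| / 2. The surrogate
        # 0.5*(b - z)**2 + lam * b**2 / (2*|b_k|) is smooth and has a
        # closed-form minimizer, which is a re-weighted ridge update.
        w = lam / (abs(b) + eps)  # eps guards against division by zero
        b = z / (1.0 + w)
    return b


def soft_threshold(z, lam):
    """Closed-form minimizer of the same objective, used as a reference."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


if __name__ == "__main__":
    for z in (3.0, -3.0, 0.5):
        print(z, mm_soft_threshold(z, 1.0), soft_threshold(z, 1.0))
```

In this toy setting the iterates approach the soft-thresholding solution, so small inputs are driven toward zero, which is the parsimony effect the abstract describes. The guard `eps` means the result is zero only up to a small tolerance.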

Southampton (e-Prints Soton)

Crossref

White Rose Research Online

Over-optimism in bioinformatics: an illustration

Author: Anne-Laure Boulesteix
Arthur Tenenhaus
Korbinian Strimmer
Monika Jelizarow
Vincent Guillemot
Publication date: 03/05/2010

In statistical bioinformatics research, different optimization mechanisms potentially lead to "over-optimism" in published papers. The present empirical study illustrates these mechanisms through a concrete example from an active research field. The investigated sources of over-optimism include the optimization of the data sets, of the settings, of the competing methods and, most importantly, of the method's characteristics. We consider a "promising" new classification algorithm that turns out to yield disappointing results in terms of error rate, namely linear discriminant analysis incorporating prior knowledge on gene functional groups through an appropriate shrinkage of the within-group covariance matrix. We quantitatively demonstrate that this disappointing method can artificially seem superior to existing approaches if we "fish for significance". We conclude that, if the improvement of a quantitative criterion such as the error rate is the main contribution of a paper, the superiority of new algorithms should be validated using "fresh" validation data sets.

HAL-CentraleSupelec

Open Access LMU

HAL Descartes

The University of Manchester - Institutional Repository

HAL-CEA

HAL-Rennes 1

Parsimonious Kernel Fisher Discrimination

Author: A. Leach
B. Chen
B. Krishnapuram
B. Schölkopf
D.R. Hunter
G. Harper
G. Rätsch
K. Lange
K.C. Kiwiel
R. Dutter
R.O. Duda
S.A. Billings
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2007

By applying recent results in optimization transfer, a new algorithm for kernel Fisher Discriminant Analysis is provided that makes use of a non-smooth penalty on the coefficients to provide a parsimonious solution. The algorithm is simple, easily programmed and is shown to perform as well as or better than a number of leading machine learning algorithms on a substantial benchmark. It is then applied to a set of extreme small-sample-size problems in virtual screening where it is found to be less accurate than a currently leading approach but is still comparable in a number of cases.

Southampton (e-Prints Soton)

Crossref

Detecting single-trial EEG evoked potential using a wavelet domain linear mixed model: application to error potentials classification

Author: Burle B
Roubaud M-C
Spinnato J
Torrésani B
Publication venue: IOP Publishing
Publication date: 01/06/2015

Objective. The main goal of this work is to develop a model for multi-sensor signals such as MEG or EEG signals, that accounts for the inter-trial variability, suitable for corresponding binary classification problems. An important constraint is that the model be simple enough to handle small size and unbalanced datasets, as often encountered in BCI-type experiments. Approach. The method involves a linear mixed-effects statistical model, wavelet transform and spatial filtering, and aims at the characterization of localized discriminant features in multi-sensor signals. After discrete wavelet transform and spatial filtering, a projection onto the relevant wavelet and spatial channels subspaces is used for dimension reduction. The projected signals are then decomposed as the sum of a signal of interest (i.e. discriminant) and background noise, using a very simple Gaussian linear mixed model. Main results. Thanks to the simplicity of the model, the corresponding parameter estimation problem is simplified. Robust estimates of class-covariance matrices are obtained from small sample sizes and an effective Bayes plug-in classifier is derived. The approach is applied to the detection of error potentials in multichannel EEG data, in a very unbalanced situation (detection of rare events). Classification results prove the relevance of the proposed approach in such a context. Significance. The combination of linear mixed model, wavelet transform and spatial filtering for EEG classification is, to the best of our knowledge, an original approach, which is proven to be effective. This paper improves on earlier results on similar problems, and the three main ingredients all play an important role.

arXiv.org e-Print Archive

HAL AMU

Supervised Classification Using Sparse Fisher's LDA

Author: Booth James G.
Gaynanova Irina
Wells Martin T.
Publication date: 16/09/2014

It is well known that in a supervised classification setting when the number of features is smaller than the number of observations, Fisher's linear discriminant rule is asymptotically Bayes. However, there are numerous modern applications where classification is needed in the high-dimensional setting. Naive implementation of Fisher's rule in this case fails to provide good results because the sample covariance matrix is singular. Moreover, by constructing a classifier that relies on all features the interpretation of the results is challenging. Our goal is to provide robust classification that relies only on a small subset of important features and accounts for the underlying correlation structure. We apply a lasso-type penalty to the discriminant vector to ensure sparsity of the solution and use a shrinkage-type estimator for the covariance matrix. The resulting optimization problem is solved using an iterative coordinate ascent algorithm. Furthermore, we analyze the effect of nonconvexity on the sparsity level of the solution and highlight the difference between the penalized and the constrained versions of the problem. The simulation results show that the proposed method performs favorably in comparison to alternatives. The method is used to classify leukemia patients based on DNA methylation features.
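
The combination of a lasso-type penalty, a shrunken covariance estimate and coordinate-wise updates can be sketched on a small example. This is not the formulation analyzed in the paper (which discusses a nonconvex problem); it swaps in a simpler convex surrogate, min over v of 0.5*v'Sv - delta'v + lam*||v||_1, where `delta` stands for the difference of class means. The shrinkage rule (blending `S` with its diagonal) and all names (`shrink_covariance`, `sparse_direction`, `alpha`, `lam`, `sweeps`) are assumptions made for illustration.

```python
def soft_threshold(x, lam):
    """Shrink x toward zero by lam; values within [-lam, lam] become 0."""
    if x > lam:
        return x - lam
    if x < -lam:
        return x + lam
    return 0.0


def shrink_covariance(S, alpha):
    """Blend S with its diagonal: (1 - alpha) * S + alpha * diag(S).

    Hypothetical shrinkage rule; it leaves the diagonal unchanged and
    scales off-diagonal entries by (1 - alpha).
    """
    p = len(S)
    return [[S[i][j] if i == j else (1.0 - alpha) * S[i][j]
             for j in range(p)] for i in range(p)]


def sparse_direction(S, delta, lam, sweeps=500):
    """Coordinate descent for min_v 0.5*v'Sv - delta'v + lam*||v||_1.

    Assumes S is symmetric with a strictly positive diagonal.
    """
    p = len(delta)
    v = [0.0] * p
    for _ in range(sweeps):
        for j in range(p):
            # Partial residual: contribution of all other coordinates removed.
            r = delta[j] - sum(S[j][k] * v[k] for k in range(p) if k != j)
            # Exact minimizer of the objective in coordinate j alone.
            v[j] = soft_threshold(r, lam) / S[j][j]
    return v


if __name__ == "__main__":
    S = shrink_covariance([[1.0, 0.8], [0.8, 1.0]], alpha=0.5)
    print(sparse_direction(S, delta=[2.0, 0.3], lam=0.5))
```

In this toy example the second feature has a small mean difference relative to the penalty, so its coefficient is set exactly to zero while the first feature is retained, which is the kind of feature selection the abstract aims for.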

arXiv.org e-Print Archive

CiteSeerX