Search CORE

585 research outputs found

Fast Cross-Validation via Sequential Testing

Author: Braun Mikio
Krueger Tammo
Panknin Danny
Publication venue
Publication date: 01/01/2015
Field of study

With the increasing size of today's data sets, finding the right parameter configuration in model selection via cross-validation can be an extremely time-consuming task. In this paper we propose an improved cross-validation procedure which uses nonparametric testing coupled with sequential analysis to determine the best parameter set on linearly increasing subsets of the data. By eliminating underperforming candidates quickly and keeping promising candidates as long as possible, the method speeds up the computation while preserving the capability of the full cross-validation. Theoretical considerations underline the statistical power of our procedure. The experimental evaluation shows that our method reduces the computation time by a factor of up to 120 compared to a full cross-validation with a negligible impact on the accuracy

arXiv.org e-Print Archive

CiteSeerX

Aspects of Credit Risk Modeling

Author: Swamy Murali
Swamy Murali
Publication venue
Publication date: 01/01/2009
Field of study

Imperial Users onl

Spiral - Imperial College Digital Repository

Block-Sparse Recovery via Convex Optimization

Author: Ehsan Elhamifar
Rene ́ Vidal
Senior Member
Student Member
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 13/04/2012
Field of study

Given a dictionary that consists of multiple blocks and a signal that lives in the range space of only a few blocks, we study the problem of finding a block-sparse representation of the signal, i.e., a representation that uses the minimum number of blocks. Motivated by signal/image processing and computer vision applications, such as face recognition, we consider the block-sparse recovery problem in the case where the number of atoms in each block is arbitrary, possibly much larger than the dimension of the underlying subspace. To find a block-sparse representation of a signal, we propose two classes of non-convex optimization programs, which aim to minimize the number of nonzero coefficient blocks and the number of nonzero reconstructed vectors from the blocks, respectively. Since both classes of problems are NP-hard, we propose convex relaxations and derive conditions under which each class of the convex programs is equivalent to the original non-convex formulation. Our conditions depend on the notions of mutual and cumulative subspace coherence of a dictionary, which are natural generalizations of existing notions of mutual and cumulative coherence. We evaluate the performance of the proposed convex programs through simulations as well as real experiments on face recognition. We show that treating the face recognition problem as a block-sparse recovery problem improves the state-of-the-art results by 10% with only 25% of the training data.Comment: IEEE Transactions on Signal Processin

arXiv.org e-Print Archive

CiteSeerX

Crossref

Inhibition in multiclass classification

Author: Bottou L.
Chang Y.-W.
Charles Elkan
Dempster A. P.
José M. Amigó
Kivinen J.
LeCun Y.
Lugosi G.
Platt J. C.
Platt J. C.
Ramón Huerta
Rifkin R.
Shankar Vembu
Smith B. H.
Tewari A.
Thomas Nowotny
Tsochantaridis I.
Weston J.
Publication venue: 'MIT Press - Journals'
Publication date: 01/09/2012
Field of study

The role of inhibition is investigated in a multiclass support vector machine formalism inspired by the brain structure of insects. The so-called mushroom bodies have a set of output neurons, or classification functions, that compete with each other to encode a particular input. Strongly active output neurons depress or inhibit the remaining outputs without knowing which is correct or incorrect. Accordingly, we propose to use a classification function that embodies unselective inhibition and train it in the large margin classifier framework. Inhibition leads to more robust classifiers in the sense that they perform better on larger areas of appropriate hyperparameters when assessed with leave-one-out strategies. We also show that the classifier with inhibition is a tight bound to probabilistic exponential models and is Bayes consistent for 3-class problems. These properties make this approach useful for data sets with a limited number of labeled examples. For larger data sets, there is no significant comparative advantage to other multiclass SVM approaches

Crossref

PubMed Central

Sussex Research Online