Search CORE

7,753 research outputs found

Worst-Case Linear Discriminant Analysis as Scalable Semidefinite Feasibility Problems

Author: Hengel Anton van den
Li Hui
Shen Chunhua
Shi Qinfeng
Publication venue
Publication date: 26/11/2014
Field of study

In this paper, we propose an efficient semidefinite programming (SDP) approach to worst-case linear discriminant analysis (WLDA). Compared with the traditional LDA, WLDA considers the dimensionality reduction problem from the worst-case viewpoint, which is in general more robust for classification. However, the original problem of WLDA is non-convex and difficult to optimize. In this paper, we reformulate the optimization problem of WLDA into a sequence of semidefinite feasibility problems. To efficiently solve the semidefinite feasibility problems, we design a new scalable optimization method with quasi-Newton methods and eigen-decomposition being the core components. The proposed method is orders of magnitude faster than standard interior-point based SDP solvers. Experiments on a variety of classification problems demonstrate that our approach achieves better performance than standard LDA. Our method is also much faster and more scalable than standard interior-point SDP solvers based WLDA. The computational complexity for an SDP with

m

constraints and matrices of size

d

d

is roughly reduced from

\mathcal{O}(m^3+md^3+m^2d^2)

\mathcal{O}(d^3)

(

m>d

in our case).Comment: 14 page

arXiv.org e-Print Archive

Sparse multinomial kernel discriminant analysis (sMKDA)

Author: Abe
Baudat
Billings
Bo
Cawley
Cawley
Centeno
Chen
Chen
Chen
Chen
Crownover
Duda
Gonzalez-Abril
Hastie
Hastie
Hastie
Hong
Hsu
Kitsuchart Pasupa
Krishnapuram
Li
Liang
Liang
Lu
Mayoraz
Mika
Rifkin
Robert F. Harrison
Schölkopf
Shawe-Taylor
Similä
Suykens
Tipping
Van Gestel
Weston
Xu
Xu
Xu
Yu
Zheng
Publication venue: 'Elsevier BV'
Publication date: 01/09/2009
Field of study

Dimensionality reduction via canonical variate analysis (CVA) is important for pattern recognition and has been extended variously to permit more flexibility, e.g. by "kernelizing" the formulation. This can lead to over-fitting, usually ameliorated by regularization. Here, a method for sparse, multinomial kernel discriminant analysis (sMKDA) is proposed, using a sparse basis to control complexity. It is based on the connection between CVA and least-squares, and uses forward selection via orthogonal least-squares to approximate a basis, generalizing a similar approach for binomial problems. Classification can be performed directly via minimum Mahalanobis distance in the canonical variates. sMKDA achieves state-of-the-art performance in terms of accuracy and sparseness on 11 benchmark datasets

Southampton (e-Prints Soton)

Crossref

White Rose Research Online

Asymptotic Generalization Bound of Fisher's Linear Discriminant Analysis

Author: Bian Wei
Tao Dacheng
Publication venue
Publication date: 22/04/2013
Field of study

Fisher's linear discriminant analysis (FLDA) is an important dimension reduction method in statistical pattern recognition. It has been shown that FLDA is asymptotically Bayes optimal under the homoscedastic Gaussian assumption. However, this classical result has the following two major limitations: 1) it holds only for a fixed dimensionality

D

, and thus does not apply when

D

and the training sample size

N

are proportionally large; 2) it does not provide a quantitative description on how the generalization ability of FLDA is affected by

D

and

N

. In this paper, we present an asymptotic generalization analysis of FLDA based on random matrix theory, in a setting where both

D

and

N

increase and

D/N\longrightarrow\gamma\in[0,1)

. The obtained lower bound of the generalization discrimination power overcomes both limitations of the classical result, i.e., it is applicable when

D

and

N

are proportionally large and provides a quantitative description of the generalization ability of FLDA in terms of the ratio

\gamma=D/N

and the population discrimination power. Besides, the discrimination power bound also leads to an upper bound on the generalization error of binary-classification with FLDA

arXiv.org e-Print Archive

OPUS - University of Technology Sydney

How to Solve Classification and Regression Problems on High-Dimensional Data with a Supervised Extension of Slow Feature Analysis

Author: Escalante-B. Alberto-N.
Wiskott Prof. Dr. Laurenz
Publication venue
Publication date: 01/01/2013
Field of study

Supervised learning from high-dimensional data, e.g., multimedia data, is a challenging task. We propose an extension of slow feature analysis (SFA) for supervised dimensionality reduction called graph-based SFA (GSFA). The algorithm extracts a label-predictive low-dimensional set of features that can be post-processed by typical supervised algorithms to generate the ﬁnal label or class estimation. GSFA is trained with a so-called training graph, in which the vertices are the samples and the edges represent similarities of the corresponding labels. A new weighted SFA optimization problem is introduced, generalizing the notion of slowness from sequences of samples to such training graphs. We show that GSFA computes an optimal solution to this problem in the considered function space, and propose several types of training graphs. For classiﬁcation, the most straightforward graph yields features equivalent to those of (nonlinear) Fisher discriminant analysis. Emphasis is on regression, where four different graphs were evaluated experimentally with a subproblem of face detection on photographs. The method proposed is promising particularly when linear models are insufficient, as well as when feature selection is difficult

CiteSeerX

CogPrints Cognitive Sciences Eprint Archive

Metric learning with convex optimization

Author: Shen Chunhua
Publication venue
Publication date: 10/09/2018
Field of study

The Australian National University

Novel Deep Learning Techniques For Computer Vision and Structure Health Monitoring

Author: Pathirage Chathurdara Sri Nadith
Publication venue: Curtin University
Publication date: 01/01/2018
Field of study

This thesis proposes novel techniques in building a generic framework for both the regression and classification tasks in vastly different applications domains such as computer vision and civil engineering. Many frameworks have been proposed and combined into a complex deep network design to provide a complete solution to a wide variety of problems. The experiment results demonstrate significant improvements of all the proposed techniques towards accuracy and efficiency

espace@Curtin

Positive Semidefinite Metric Learning Using Boosting-like Algorithms

Author: Hengel Anton van den
Kim Junae
Shen Chunhua
Wang Lei
Publication venue
Publication date: 01/01/2012
Field of study

The success of many machine learning and pattern recognition methods relies heavily upon the identification of an appropriate distance metric on the input data. It is often beneficial to learn such a metric from the input training data, instead of using a default one such as the Euclidean distance. In this work, we propose a boosting-based technique, termed BoostMetric, for learning a quadratic Mahalanobis distance metric. Learning a valid Mahalanobis distance metric requires enforcing the constraint that the matrix parameter to the metric remains positive definite. Semidefinite programming is often used to enforce this constraint, but does not scale well and easy to implement. BoostMetric is instead based on the observation that any positive semidefinite matrix can be decomposed into a linear combination of trace-one rank-one matrices. BoostMetric thus uses rank-one positive semidefinite matrices as weak learners within an efficient and scalable boosting-based learning process. The resulting methods are easy to implement, efficient, and can accommodate various types of constraints. We extend traditional boosting algorithms in that its weak learner is a positive semidefinite matrix with trace and rank being one rather than a classifier or regressor. Experiments on various datasets demonstrate that the proposed algorithms compare favorably to those state-of-the-art methods in terms of classification accuracy and running time.Comment: 30 pages, appearing in Journal of Machine Learning Researc

arXiv.org e-Print Archive

CiteSeerX

Adelaide Research & Scholarship

Research Online