Search CORE

9,454 research outputs found

Semi-Supervised Discriminant Analysis Using Robust Path-Based Similarity

Author: Dit-yan Yeung
Yu Zhang
Publication venue
Publication date: 01/01/2008
Field of study

Linear Discriminant Analysis (LDA), which works by maximizing the within-class similarity and minimizing the between-class similarity simultaneously, is a popular dimensionality reduction technique in pattern recognition and machine learning. In real-world applications when labeled data are limited, LDA does not work well. Under many situations, however, it is easy to obtain unlabeled data in large quantities. In this paper, we propose a novel dimensionality reduction method, called Semi-Supervised Discriminant Analysis (SSDA), which can utilize both labeled and unlabeled data to perform dimensionality reduction in the semisupervised setting. Our method uses a robust path-based similarity measure to capture the manifold structure of the data and then uses the obtained similarity to maximize the separability between different classes. A kernel extension of the proposed method for nonlinear dimensionality reduction in the semi-supervised setting is also presented. Experiments on face recognition demonstrate the effectiveness of the proposed method. 1

CiteSeerX

Crossref

Non-Redundant Spectral Dimensionality Reduction

Author: A Brun
A Hyvärinen
A Hyvärinen
A Singer
B Schölkopf
C Jutten
CC Chang
DL Donoho
EA Nadaraya
G Guo
GS Watson
JB Tenenbaum
L Maaten Van Der
M Belkin
M Belkin
M Rubinstein
MS Bartlett
N Halko
P Isola
RR Coifman
ST Roweis
X Geng
X He
Y Goldberg
Y LeCun
Z Zhang
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/04/2017
Field of study

Spectral dimensionality reduction algorithms are widely used in numerous domains, including for recognition, segmentation, tracking and visualization. However, despite their popularity, these algorithms suffer from a major limitation known as the "repeated Eigen-directions" phenomenon. That is, many of the embedding coordinates they produce typically capture the same direction along the data manifold. This leads to redundant and inefficient representations that do not reveal the true intrinsic dimensionality of the data. In this paper, we propose a general method for avoiding redundancy in spectral algorithms. Our approach relies on replacing the orthogonality constraints underlying those methods by unpredictability constraints. Specifically, we require that each embedding coordinate be unpredictable (in the statistical sense) from all previous ones. We prove that these constraints necessarily prevent redundancy, and provide a simple technique to incorporate them into existing methods. As we illustrate on challenging high-dimensional scenarios, our approach produces significantly more informative and compact representations, which improve visualization and classification tasks

arXiv.org e-Print Archive

Crossref

A scale-based approach to finding effective dimensionality in manifold learning

Author: Marron J. S.
Wang Xiaohui
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2007
Field of study

The discovering of low-dimensional manifolds in high-dimensional data is one of the main goals in manifold learning. We propose a new approach to identify the effective dimension (intrinsic dimension) of low-dimensional manifolds. The scale space viewpoint is the key to our approach enabling us to meet the challenge of noisy data. Our approach finds the effective dimensionality of the data over all scale without any prior knowledge. It has better performance compared with other methods especially in the presence of relatively large noise and is computationally efficient.Comment: Published in at http://dx.doi.org/10.1214/07-EJS137 the Electronic Journal of Statistics (http://www.i-journals.org/ejs/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Crossref

Carolina Digital Repository

Joint & Progressive Learning from High-Dimensional Data for Multi-Label Classification

Author: Hong Danfeng
Xu Jian
Yokoya Naoto
Zhu Xiaoxiang
Publication venue
Publication date: 01/01/2018
Field of study

Despite the fact that nonlinear subspace learning techniques (e.g. manifold learning) have successfully applied to data representation, there is still room for improvement in explainability (explicit mapping), generalization (out-of-samples), and cost-effectiveness (linearization). To this end, a novel linearized subspace learning technique is developed in a joint and progressive way, called \textbf{j}oint and \textbf{p}rogressive \textbf{l}earning str\textbf{a}teg\textbf{y} (J-Play), with its application to multi-label classification. The J-Play learns high-level and semantically meaningful feature representation from high-dimensional data by 1) jointly performing multiple subspace learning and classification to find a latent subspace where samples are expected to be better classified; 2) progressively learning multi-coupled projections to linearly approach the optimal mapping bridging the original space with the most discriminative subspace; 3) locally embedding manifold structure in each learnable latent subspace. Extensive experiments are performed to demonstrate the superiority and effectiveness of the proposed method in comparison with previous state-of-the-art methods.Comment: accepted in ECCV 201

arXiv.org e-Print Archive

Institute of Transport Research:Publications