Representing complex data using localized principal components with application to astronomical data
Often the relation between the variables constituting a multivariate data
space might be characterized by one or more of the terms: ``nonlinear'',
``branched'', ``disconnected'', ``bent'', ``curved'', ``heterogeneous'', or,
more generally, ``complex''. In these cases, simple principal component analysis
(PCA) as a tool for dimension reduction can fail badly. Of the many alternative
approaches proposed so far, local approximations of PCA are among the most
promising. This paper will give a short review of localized versions of PCA,
focusing on local principal curves and local partitioning algorithms.
Furthermore, we discuss projections other than the local principal components.
When performing local dimension reduction for regression or classification
problems it is important to focus not only on the manifold structure of the
covariates, but also on the response variable(s). Local principal components
only achieve the former, whereas localized regression approaches concentrate on
the latter. Local projection directions derived from the partial least squares
(PLS) algorithm offer an interesting trade-off between these two objectives. We
apply these methods to several real data sets. In particular, we consider
simulated astrophysical data from the future Galactic survey mission Gaia.
Comment: 25 pages. In "Principal Manifolds for Data Visualization and
Dimension Reduction", A. Gorban, B. Kegl, D. Wunsch, and A. Zinovyev (eds),
Lecture Notes in Computational Science and Engineering, Springer, 2007, pp.
180--204,
http://www.springer.com/dal/home/generic/search/results?SGWID=1-40109-22-173750210-
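As a toy illustration of why localized PCA helps on ``complex'' data, the following sketch (hypothetical, not taken from the paper: a crude k-means partition followed by per-cluster PCA via SVD) recovers two local principal directions that disagree, something a single global PCA cannot represent:

```python
# Minimal localized-PCA sketch (illustrative assumptions: k = 2 patches,
# a hand-picked k-means initialization, synthetic two-segment data).
import numpy as np

rng = np.random.default_rng(0)

# Synthetic "complex" data: two line segments with different orientations.
a = np.c_[np.linspace(0, 1, 100), np.linspace(0, 1, 100)] + 0.01 * rng.normal(size=(100, 2))
b = np.c_[np.linspace(2, 3, 100), np.linspace(1, 0, 100)] + 0.01 * rng.normal(size=(100, 2))
X = np.vstack([a, b])

# Crude k-means with k = 2; init with one point from each end of the cloud.
centers = X[[0, len(X) - 1]].copy()
for _ in range(20):
    labels = np.argmin(((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1), axis=1)
    centers = np.array([X[labels == k].mean(axis=0) for k in range(2)])

# Leading principal component of each local patch.
local_pcs = []
for k in range(2):
    Xi = X[labels == k] - X[labels == k].mean(axis=0)
    _, _, vt = np.linalg.svd(Xi, full_matrices=False)
    local_pcs.append(vt[0])

# The two local directions are nearly orthogonal here; a single global PCA
# would average them into one misleading axis.
cos_angle = abs(np.dot(local_pcs[0], local_pcs[1]))
```

The two segments run at roughly +45 and -45 degrees, so the local leading directions are close to orthogonal.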
Data compression and regression based on local principal curves
Frequently the predictor space of a multivariate regression problem of the type y = m(x_1, …, x_p) + ε is intrinsically one-dimensional, or at least of far lower dimension than p. Usual modeling attempts such as the additive model y = m_1(x_1) + … + m_p(x_p) + ε, which try to reduce the complexity of the regression problem by making additional structural assumptions, are then inefficient as they ignore the inherent structure of the predictor space and involve complicated model and variable selection stages. In a fundamentally different approach, one may consider first approximating the predictor space by a (usually nonlinear) curve passing through it, and then regressing the response only against the one-dimensional projections onto this curve. This entails the reduction from a p- to a one-dimensional regression problem.
As a tool for the compression of the predictor space we apply local principal curves. Building on the results presented in Einbeck et al. (Classification – The Ubiquitous Challenge. Springer, Heidelberg, 2005, pp. 256–263), we show how local principal curves can be parametrized and how the projections are obtained. The regression step can then be carried out using any nonparametric smoother. We illustrate the technique using data from the physical sciences.
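The compress-then-regress idea can be sketched as follows. Here the curve is a fixed half circle supplied by hand rather than a fitted local principal curve, and a Nadaraya-Watson kernel smoother stands in for "any nonparametric smoother"; all names and constants are illustrative assumptions:

```python
# Sketch of curve-based data compression for regression: project each
# observation onto a polyline approximating the predictor cloud, use arc
# length as the one-dimensional coordinate, then smooth y against it.
import numpy as np

rng = np.random.default_rng(1)
n = 300
t_true = np.sort(rng.uniform(0, np.pi, n))
# Two predictors lying near a one-dimensional curve (a half circle).
X = np.c_[np.cos(t_true), np.sin(t_true)] + 0.02 * rng.normal(size=(n, 2))
y = np.sin(2 * t_true) + 0.1 * rng.normal(size=n)

# Assumed curve approximation: a dense polyline along the half circle
# (a real application would fit a local principal curve to the data).
s = np.linspace(0, np.pi, 200)
curve = np.c_[np.cos(s), np.sin(s)]
seg = np.r_[0.0, np.cumsum(np.linalg.norm(np.diff(curve, axis=0), axis=1))]

# Projection: nearest polyline vertex -> arc-length coordinate.
idx = np.argmin(((X[:, None, :] - curve[None, :, :]) ** 2).sum(-1), axis=1)
t_hat = seg[idx]

# One-dimensional Nadaraya-Watson smoother as the regression step.
def kernel_smooth(t_query, t_obs, y_obs, h=0.15):
    w = np.exp(-0.5 * ((t_query[:, None] - t_obs[None, :]) / h) ** 2)
    return (w @ y_obs) / w.sum(axis=1)

y_hat = kernel_smooth(t_hat, t_hat, y)
rmse = np.sqrt(np.mean((y_hat - np.sin(2 * t_true)) ** 2))
```

On the unit circle arc length coincides with the angle, so the recovered one-dimensional coordinate closely tracks the generating parameter, and the p = 2 problem reduces to an ordinary one-dimensional smoothing problem.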
Elastic principal manifolds and their practical applications
Principal manifolds serve as a useful tool for many practical applications.
These manifolds are defined as lines or surfaces passing through "the middle"
of a data distribution. We propose an algorithm for the fast construction of
grid approximations of principal manifolds with a given topology. It is based
on an analogy between a principal manifold and an elastic membrane. The first
advantage of this method is the form of the functional to be minimized, which
becomes quadratic at the vertex-position refinement step. This makes the
algorithm very efficient, especially for parallel implementations. Another
advantage is that the same algorithmic kernel is applied to construct
principal manifolds of different dimensions and topologies. We demonstrate
how the flexibility of the approach enables numerous adaptive strategies,
such as principal graph construction. The algorithm is implemented as a C++
package, elmap, and as part of the stand-alone data visualization tool
VidaExpert, available on the web. We describe the approach and provide
several examples of its application with
speed performance characteristics.
Comment: 26 pages, 10 figures, edited final version
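The key point that the functional becomes quadratic at the vertex-refinement step can be sketched for a one-dimensional "net" (a simplified stand-in for the elmap package, with an assumed second-difference bending penalty and stiffness value): with data-to-node assignments held fixed, each refinement step is a single linear solve.

```python
# Elastic-net sketch: alternate nearest-node assignment with a quadratic
# vertex-refinement step (illustrative assumptions: 1-D net of 20 nodes,
# bending stiffness lam = 1e-3, synthetic data near a sine curve).
import numpy as np

rng = np.random.default_rng(2)
m = 20                                    # number of grid nodes
t = rng.uniform(0, 1, 400)
X = np.c_[t, np.sin(2 * np.pi * t)] + 0.05 * rng.normal(size=(400, 2))

# Initialize nodes along the x-range of the data.
nodes = np.c_[np.linspace(0, 1, m), np.zeros(m)]

lam = 1e-3                                # assumed bending stiffness
# Second-difference matrix for the bending penalty.
D = np.zeros((m - 2, m))
for j in range(m - 2):
    D[j, j:j + 3] = [1.0, -2.0, 1.0]

n = len(X)
for _ in range(30):
    # Assignment step: attach each datum to its nearest node.
    labels = np.argmin(((X[:, None, :] - nodes[None, :, :]) ** 2).sum(-1), axis=1)
    counts = np.bincount(labels, minlength=m).astype(float)
    sums = np.zeros((m, 2))
    np.add.at(sums, labels, X)
    # Refinement step: the energy
    #   (1/n) * sum_i ||x_i - node(i)||^2 + lam * ||D nodes||^2
    # is quadratic in the node positions, so the minimizer solves
    #   (diag(counts)/n + lam * D'D) nodes = sums/n.
    A = np.diag(counts / n) + lam * (D.T @ D)
    nodes = np.linalg.solve(A, sums / n)

# The fitted net should lie close to the generating curve.
resid = np.sqrt(np.mean((nodes[:, 1] - np.sin(2 * np.pi * nodes[:, 0])) ** 2))
```

Because the refinement step is a fixed sparse linear system, it parallelizes well and the same kernel extends to 2-D grids by swapping in the corresponding difference operator.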
Elastic Maps and Nets for Approximating Principal Manifolds and Their Application to Microarray Data Visualization
Principal manifolds are defined as lines or surfaces passing through ``the
middle'' of data distribution. Linear principal manifolds (Principal Components
Analysis) are routinely used for dimension reduction, noise filtering and data
visualization. Recently, methods for constructing non-linear principal
manifolds were proposed, including our elastic maps approach which is based on
a physical analogy with elastic membranes. We have developed a general
geometric framework for constructing ``principal objects'' of various
dimensions and topologies with the simplest quadratic form of the smoothness
penalty which allows very effective parallel implementations. Our approach is
implemented in three programming languages (C++, Java and Delphi) with two
graphical user interfaces (VidaExpert,
http://bioinfo.curie.fr/projects/vidaexpert, and ViMiDa,
http://bioinfo-out.curie.fr/projects/vimida). In this paper we give an
overview of the elastic maps method and present in detail one of its major
applications: the visualization of microarray data in bioinformatics. We show
that the method of elastic maps outperforms linear PCA in terms of data
approximation, representation of between-point distance structure, preservation
of local point neighborhood and representing point classes in low-dimensional
spaces.
Comment: 35 pages, 10 figures
Statistical model based 3D shape prediction of postoperative trunks for non-invasive scoliosis surgery planning
One of the major concerns of scoliosis patients undergoing surgical treatment is the aesthetic aspect of the surgery outcome. It would be useful to predict the postoperative appearance of the patient's trunk during surgery planning in order to take the patient's expectations into account. In this paper, we propose to use least squares support vector regression for the prediction of the postoperative trunk 3D shape after spine surgery for adolescent idiopathic scoliosis. Five dimensionality reduction techniques used in conjunction with the support vector machine are compared. The methods are evaluated in terms of their accuracy, based on leave-one-out cross-validation performed on a database of 141 cases. The results indicate that the 3D shape predictions using a dimensionality reduction obtained by simultaneous decomposition of the predictors and response variables have the best accuracy.
CIHR / IRS
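A dimensionality reduction driven by the response, as opposed to PCA's unsupervised projection, can be illustrated with a one-component partial least squares step (a hypothetical simplification, not the paper's exact decomposition): the projection direction weights predictors by their covariance with y.

```python
# One-component PLS sketch (illustrative data: only the first of five
# predictors drives the response).
import numpy as np

rng = np.random.default_rng(3)
n, p = 200, 5
X = rng.normal(size=(n, p))
y = 2.0 * X[:, 0] + 0.1 * rng.normal(size=n)

Xc = X - X.mean(axis=0)
yc = y - y.mean()

# First PLS weight vector: direction of maximal covariance with y.
w = Xc.T @ yc
w /= np.linalg.norm(w)
t_score = Xc @ w                      # one-dimensional compressed predictor
beta = (t_score @ yc) / (t_score @ t_score)
y_hat = beta * t_score + y.mean()

r2 = 1 - np.sum((y - y_hat) ** 2) / np.sum(yc ** 2)
```

Because the weight vector is proportional to the predictor-response covariance, the single score direction concentrates on the informative predictor, whereas the leading principal component of X alone would ignore the response entirely.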
A Study on Nonlinear Principal Component Analysis Using Neural Networks
Degree system: new; MEXT report number: Ko 2050; degree type: Doctor of Engineering; date conferred: 2005/3/3; Waseda University degree number: Shin 400