Search CORE

30,648 research outputs found

On landmark selection and sampling in high-dimensional data analysis

Author: Blackburn J.
Deshpande A.
Drineas P.
Elgammal A.
Fowlkes C.
Lee K.-C.
Lee K.-C.
Liu R.
Ouimet M.
Platt J. C.
Smola A. J.
Talwalkar A.
Williams C. K. I.
Publication venue: 'The Royal Society'
Publication date: 24/06/2009
Field of study

In recent years, the spectral analysis of appropriately defined kernel matrices has emerged as a principled way to extract the low-dimensional structure often prevalent in high-dimensional data. Here we provide an introduction to spectral methods for linear and nonlinear dimension reduction, emphasizing ways to overcome the computational limitations currently faced by practitioners with massive datasets. In particular, a data subsampling or landmark selection process is often employed to construct a kernel based on partial information, followed by an approximate spectral analysis termed the Nystrom extension. We provide a quantitative framework to analyse this procedure, and use it to demonstrate algorithmic performance bounds on a range of practical approaches designed to optimize the landmark selection process. We compare the practical implications of these bounds by way of real-world examples drawn from the field of computer vision, whereby low-dimensional manifold structure is shown to emerge from high-dimensional video data streams.Comment: 18 pages, 6 figures, submitted for publicatio

arXiv.org e-Print Archive

Crossref

PubMed Central

UCL Discovery

DIMAL: Deep Isometric Manifold Learning Using Sparse Geodesic Sampling

Author: Bronstein Alex
Kimmel Ron
Pai Gautam
Talmon Ronen
Publication venue
Publication date: 13/11/2018
Field of study

This paper explores a fully unsupervised deep learning approach for computing distance-preserving maps that generate low-dimensional embeddings for a certain class of manifolds. We use the Siamese configuration to train a neural network to solve the problem of least squares multidimensional scaling for generating maps that approximately preserve geodesic distances. By training with only a few landmarks, we show a significantly improved local and nonlocal generalization of the isometric mapping as compared to analogous non-parametric counterparts. Importantly, the combination of a deep-learning framework with a multidimensional scaling objective enables a numerical analysis of network architectures to aid in understanding their representation power. This provides a geometric perspective to the generalizability of deep learning.Comment: 10 pages, 11 Figure

arXiv.org e-Print Archive

Crossref