Non-Redundant Spectral Dimensionality Reduction
Spectral dimensionality reduction algorithms are widely used in numerous
domains, including for recognition, segmentation, tracking and visualization.
However, despite their popularity, these algorithms suffer from a major
limitation known as the "repeated eigen-directions" phenomenon. That is, many
of the embedding coordinates they produce typically capture the same direction
along the data manifold. This leads to redundant and inefficient
representations that do not reveal the true intrinsic dimensionality of the
data. In this paper, we propose a general method for avoiding redundancy in
spectral algorithms. Our approach relies on replacing the orthogonality
constraints underlying those methods by unpredictability constraints.
Specifically, we require that each embedding coordinate be unpredictable (in
the statistical sense) from all previous ones. We prove that these constraints
necessarily prevent redundancy, and provide a simple technique to incorporate
them into existing methods. As we illustrate on challenging high-dimensional
scenarios, our approach produces significantly more informative and compact
representations, which improve performance on visualization and classification tasks.
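Below is a minimal sketch of the unpredictability idea, assuming the embedding coordinates come from a standard spectral method (e.g., Laplacian eigenmaps): a candidate coordinate is kept only if it cannot be regressed well from the coordinates already selected. The greedy post-hoc filter, the kernel-ridge regressor, and the R² threshold are illustrative assumptions, not the constrained formulation proposed in the paper.

```python
import numpy as np
from sklearn.kernel_ridge import KernelRidge
from sklearn.metrics import r2_score

def select_nonredundant(eigvecs, r2_threshold=0.9):
    """Greedily keep embedding coordinates that cannot be predicted from
    the coordinates selected so far (illustrative stand-in for the
    unpredictability constraint; not the paper's exact formulation)."""
    selected = [eigvecs[:, 0]]
    for j in range(1, eigvecs.shape[1]):
        candidate = eigvecs[:, j]
        X = np.column_stack(selected)
        # Nonlinear regression of the candidate on the selected coordinates.
        model = KernelRidge(kernel="rbf", alpha=1e-3).fit(X, candidate)
        # Keep the candidate only if it is poorly predictable (low R^2).
        if r2_score(candidate, model.predict(X)) < r2_threshold:
            selected.append(candidate)
    return np.column_stack(selected)
```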
Capture of manufacturing uncertainty in turbine blades through probabilistic techniques
Efficient designing of the turbine blades is critical to the performance of an aircraft engine.
An area of significant research interest is the capture of manufacturing uncertainty in the
shapes of these turbine blades. The available data used for estimation of this manufacturing
uncertainty inevitably contains the effects of measurement error/noise. In the present work,
we propose the application of Principal Component Analysis (PCA) for de-noising the
measurement data and quantifying the underlying manufacturing uncertainty. Once the
PCA is performed, we propose a dimensionality-reduction method that utilizes
prior information available on the variance of measurement error for different
measurement types. Numerical studies indicate that approximately 82% of the variation in
the measurements from their design values is accounted for by the manufacturing
uncertainty, while the remaining 18% of the variation is filtered out as measurement error.
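A minimal sketch of the PCA-based split is shown below, assuming the measured deviations are stacked row-wise and the measurement-error variance is known. The retention rule (keep modes whose eigenvalue exceeds the noise variance) is an illustrative assumption standing in for the paper's criterion based on prior error-variance information.

```python
import numpy as np

def pca_denoise(measurements, noise_variance):
    """Split measured deviations into retained PCA modes (attributed to
    manufacturing uncertainty) and a discarded residual (attributed to
    measurement noise). The retention rule below is an illustrative
    assumption, not the paper's exact criterion."""
    mean = measurements.mean(axis=0)
    X = measurements - mean                               # deviations
    cov = np.cov(X, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]                     # sort descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    keep = eigvals > noise_variance                       # modes above the noise floor
    denoised = X @ eigvecs[:, keep] @ eigvecs[:, keep].T + mean
    explained = eigvals[keep].sum() / eigvals.sum()       # fraction of variance retained
    return denoised, explained
```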
SCANN: Synthesis of Compact and Accurate Neural Networks
Deep neural networks (DNNs) have become the driving force behind recent
artificial intelligence (AI) research. An important problem with implementing a
neural network is the design of its architecture. Typically, such an
architecture is obtained manually by exploring its hyperparameter space and
kept fixed during training. This approach is time-consuming and inefficient.
Another issue is that modern neural networks often contain millions of
parameters, whereas many applications and devices require small inference
models. However, efforts to migrate DNNs to such devices typically entail a
significant loss of classification accuracy. To address these challenges, we
propose a two-step neural network synthesis methodology, called DR+SCANN, that
combines two complementary approaches to design compact and accurate DNNs. At
the core of our framework is the SCANN methodology that uses three basic
architecture-changing operations, namely connection growth, neuron growth, and
connection pruning, to synthesize feed-forward architectures with arbitrary
structure. SCANN encapsulates three synthesis methodologies that apply a
repeated grow-and-prune paradigm to three architectural starting points.
DR+SCANN combines the SCANN methodology with dataset dimensionality reduction
to alleviate the curse of dimensionality. We demonstrate the efficacy of SCANN
and DR+SCANN on various image and non-image datasets. We evaluate SCANN on
MNIST and ImageNet benchmarks. In addition, we evaluate the efficacy of
using dimensionality reduction alongside SCANN (DR+SCANN) on nine small to
medium-size datasets. We also show that our synthesis methodology yields neural
networks that are much better at navigating the accuracy vs. energy efficiency
space. This would enable neural network-based inference even on
Internet-of-Things sensors.
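As an illustration of the grow-and-prune primitives mentioned above, the sketch below implements magnitude-based connection pruning and gradient-based connection growth on a masked weight matrix. The mask-based formulation and the pruning/growth fractions are assumptions for illustration; SCANN's full methodology also includes neuron growth and a specific synthesis schedule that are omitted here.

```python
import numpy as np

def prune_connections(weights, mask, prune_frac=0.1):
    """Magnitude-based connection pruning: zero out the smallest-magnitude
    active weights. One primitive of a grow-and-prune loop; neuron growth
    and the overall schedule used by SCANN are omitted."""
    active = np.flatnonzero(mask)
    k = int(prune_frac * active.size)
    drop = active[np.argsort(np.abs(weights.flat[active]))[:k]]
    mask.flat[drop] = False
    weights.flat[drop] = 0.0
    return weights, mask

def grow_connections(weights, mask, grad, grow_frac=0.1):
    """Gradient-based connection growth: re-activate currently pruned
    connections whose loss gradient has the largest magnitude."""
    inactive = np.flatnonzero(~mask)
    k = int(grow_frac * inactive.size)
    add = inactive[np.argsort(-np.abs(grad.flat[inactive]))[:k]]
    mask.flat[add] = True
    return weights, mask
```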
A Comparison of Hand-Geometry Recognition Methods Based on Low- and High-Level Features
This paper compares the performance of hand-geometry recognition based on high-level features and on low-level features. The difference between high- and low-level features is that the former are based on interpreting the biometric data, e.g., by locating a finger and measuring its dimensions, whereas the latter are not. The low-level features used here are landmarks on the contour of the hand. The high-level features are a standard set of geometrical features, such as widths and lengths of fingers and angles, measured at preselected locations.
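A toy example of the low-level/high-level distinction: the function below turns contour landmarks (low-level features) into a finger-length measurement (a high-level geometric feature). The landmark indexing scheme is hypothetical; a real system would locate fingertips and finger valleys by interpreting the hand contour.

```python
import numpy as np

def finger_length(landmarks, tip_idx, valley_left_idx, valley_right_idx):
    """Derive a high-level feature (finger length) from low-level features
    (contour landmarks): distance from the fingertip to the midpoint of the
    two adjacent finger valleys. The landmark indices are hypothetical."""
    tip = np.asarray(landmarks[tip_idx], dtype=float)
    base = (np.asarray(landmarks[valley_left_idx], dtype=float) +
            np.asarray(landmarks[valley_right_idx], dtype=float)) / 2.0
    return float(np.linalg.norm(tip - base))
```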
Time Series Cluster Kernel for Learning Similarities between Multivariate Time Series with Missing Data
Similarity-based approaches represent a promising direction for time series
analysis. However, many such methods rely on parameter tuning, and some have
shortcomings if the time series are multivariate (MTS), due to dependencies
between attributes, or if the time series contain missing data. In this paper, we
address these challenges within the powerful context of kernel methods by
proposing the robust \emph{time series cluster kernel} (TCK). The approach
taken leverages the missing data handling properties of Gaussian mixture models
(GMM) augmented with informative prior distributions. An ensemble learning
approach is exploited to ensure robustness to parameters by combining the
clustering results of many GMMs to form the final kernel.
We evaluate the TCK on synthetic and real data and compare to other
state-of-the-art techniques. The experimental results demonstrate that the TCK
is robust to parameter choices, provides competitive results for MTS without
missing data, and achieves outstanding results in the presence of missing data.
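The sketch below illustrates the ensemble-kernel idea in the spirit of TCK, assuming fully observed, flattened time series: many GMMs with randomized numbers of components are fit, and the kernel entry for a pair of samples counts how often they are assigned to the same component. The hyperparameter randomization and the hard co-assignment count are simplifying assumptions; the actual TCK uses posterior probabilities, informative priors, and the GMMs' missing-data handling.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def ensemble_cluster_kernel(X, n_models=30, seed=0):
    """Illustrative ensemble kernel in the spirit of TCK: fit many GMMs with
    randomized numbers of components and let K[i, j] count how often samples
    i and j fall in the same component. Assumes fully observed, flattened
    series; the actual TCK handles missing values within the GMMs."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    K = np.zeros((n, n))
    for _ in range(n_models):
        n_components = int(rng.integers(2, 10))       # randomized hyperparameter
        gmm = GaussianMixture(n_components=n_components,
                              random_state=int(rng.integers(1_000_000)))
        labels = gmm.fit_predict(X)
        K += labels[:, None] == labels[None, :]       # co-assignment agreement
    return K / n_models
```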