Search CORE

15,052 research outputs found

Estimates on compressed neural networks regression

Author: Aad
Achlioptas
Barron
Bishop
Blum
Brauer
Cao
Chen
Cucker
Cybenko
Davenport
Funahashi
Hamers
Hastie
Haykin
Hritonenko
Jiabing Ji
Jianyong Sun
Kohler
Leonardis
Musavi
Pontil
Tibshirani
Tikhonov
Tsaig
Williamson
Xu
Yongquan Zhang
Youmei Li
Yuan
Publication venue: 'Elsevier BV'
Publication date: 01/03/2015
Field of study

When the neural element number nn of neural networks is larger than the sample size mm, the overfitting problem arises since there are more parameters than actual data (more variable than constraints). In order to overcome the overfitting problem, we propose to reduce the number of neural elements by using compressed projection AA which does not need to satisfy the condition of Restricted Isometric Property (RIP). By applying probability inequalities and approximation properties of the feedforward neural networks (FNNs), we prove that solving the FNNs regression learning algorithm in the compressed domain instead of the original domain reduces the sample error at the price of an increased (but controlled) approximation error, where the covering number theory is used to estimate the excess error, and an upper bound of the excess error is given

Crossref

Greenwich Academic Literature Archive

FastDeepIoT: Towards Understanding and Optimizing Neural Network Execution Time on Mobile and Embedded Devices

Author: Abdelzaher Tarek
Liu Dongxin
Liu Shengzhong
Shao Huajie
Su Lu
Yao Shuochao
Zhao Yiran
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 18/09/2018
Field of study

Deep neural networks show great potential as solutions to many sensing application problems, but their excessive resource demand slows down execution time, pausing a serious impediment to deployment on low-end devices. To address this challenge, recent literature focused on compressing neural network size to improve performance. We show that changing neural network size does not proportionally affect performance attributes of interest, such as execution time. Rather, extreme run-time nonlinearities exist over the network configuration space. Hence, we propose a novel framework, called FastDeepIoT, that uncovers the non-linear relation between neural network structure and execution time, then exploits that understanding to find network configurations that significantly improve the trade-off between execution time and accuracy on mobile and embedded devices. FastDeepIoT makes two key contributions. First, FastDeepIoT automatically learns an accurate and highly interpretable execution time model for deep neural networks on the target device. This is done without prior knowledge of either the hardware specifications or the detailed implementation of the used deep learning library. Second, FastDeepIoT informs a compression algorithm how to minimize execution time on the profiled device without impacting accuracy. We evaluate FastDeepIoT using three different sensing-related tasks on two mobile devices: Nexus 5 and Galaxy Nexus. FastDeepIoT further reduces the neural network execution time by

48\%

78\%

and energy consumption by

37\%

69\%

compared with the state-of-the-art compression algorithms.Comment: Accepted by SenSys '1

arXiv.org e-Print Archive

Crossref

Deep learning cardiac motion analysis for human survival prediction

Author: Bello Ghalib A.
Biffi Carlo
Cook Stuart A.
Dawes Timothy J. W.
de Marvao Antonio
Duan Jinming
Gibbs J. Simon R.
Howard Luke S. G. E.
O'Regan Declan P.
Rueckert Daniel
Wilkins Martin R.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 08/10/2018
Field of study

Motion analysis is used in computer vision to understand the behaviour of moving objects in sequences of images. Optimising the interpretation of dynamic biological systems requires accurate and precise motion tracking as well as efficient representations of high-dimensional motion trajectories so that these can be used for prediction tasks. Here we use image sequences of the heart, acquired using cardiac magnetic resonance imaging, to create time-resolved three-dimensional segmentations using a fully convolutional network trained on anatomical shape priors. This dense motion model formed the input to a supervised denoising autoencoder (4Dsurvival), which is a hybrid network consisting of an autoencoder that learns a task-specific latent code representation trained on observed outcome data, yielding a latent representation optimised for survival prediction. To handle right-censored survival outcomes, our network used a Cox partial likelihood loss function. In a study of 302 patients the predictive accuracy (quantified by Harrell's C-index) was significantly higher (p < .0001) for our model C=0.73 (95

\%

CI: 0.68 - 0.78) than the human benchmark of C=0.59 (95

\%

CI: 0.53 - 0.65). This work demonstrates how a complex computer vision task using high-dimensional medical image data can efficiently predict human survival

arXiv.org e-Print Archive

University of Birmingham Research Portal

Overlearning in marginal distribution-based ICA: analysis and solutions

Author: Särelä Mr Jaakko
Publication venue: MIT press
Publication date: 01/12/2003
Field of study

The present paper is written as a word of caution, with users of independent component analysis (ICA) in mind, to overlearning phenomena that are often observed.\\ We consider two types of overlearning, typical to high-order statistics based ICA. These algorithms can be seen to maximise the negentropy of the source estimates. The first kind of overlearning results in the generation of spike-like signals, if there are not enough samples in the data or there is a considerable amount of noise present. It is argued that, if the data has power spectrum characterised by

1/f

curve, we face a more severe problem, which cannot be solved inside the strict ICA model. This overlearning is better characterised by bumps instead of spikes. Both overlearning types are demonstrated in the case of artificial signals as well as magnetoencephalograms (MEG). Several methods are suggested to circumvent both types, either by making the estimation of the ICA model more robust or by including further modelling of the data