Learning activation functions from data using cubic spline interpolation
Neural networks require a careful design in order to perform properly on a
given task. In particular, selecting a good activation function (possibly in a
data-dependent fashion) is a crucial step, which remains an open problem in the
research community. Despite a large number of investigations, most current
implementations simply select one fixed function from a small set of
candidates, which is not adapted during training, and is shared among all
neurons throughout the different layers. However, neither of these two
assumptions can be considered optimal in practice. In this paper, we present a
principled way to have data-dependent adaptation of the activation functions,
which is performed independently for each neuron. This is achieved by
leveraging past and present advances in cubic spline interpolation,
allowing for local adaptation of the functions around their regions of use. The
resulting algorithm is relatively cheap to implement, and overfitting is
counterbalanced by the inclusion of a novel damping criterion, which penalizes
unwanted oscillations from a predefined shape. Experimental results validate
the proposal over two well-known benchmarks.
Comment: Submitted to the 27th Italian Workshop on Neural Networks (WIRN 2017).
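A minimal PyTorch sketch of the idea described in this abstract, assuming per-neuron control points initialized from a reference shape (tanh here) and a simple quadratic damping penalty toward that shape; the class, the interpolation scheme (piecewise-linear rather than cubic, for brevity), and all names are illustrative assumptions, not the authors' code:

```python
import torch
import torch.nn as nn


class SplineActivation(nn.Module):
    """Per-neuron activation defined by trainable control points on a fixed grid."""

    def __init__(self, num_neurons, num_points=21, limit=3.0):
        super().__init__()
        self.limit = limit
        grid = torch.linspace(-limit, limit, num_points)
        ref = torch.tanh(grid)                      # predefined reference shape
        self.register_buffer("ref", ref)
        # independent control points for every neuron
        self.points = nn.Parameter(ref.repeat(num_neurons, 1).clone())

    def forward(self, x):
        # x: (batch, num_neurons); piecewise-linear interpolation between
        # control points (a simplified stand-in for cubic interpolation)
        n = self.points.size(1)
        step = 2 * self.limit / (n - 1)
        pos = (x.clamp(-self.limit, self.limit - 1e-6) + self.limit) / step
        idx = pos.floor().long()
        t = pos - idx.float()
        table = self.points.unsqueeze(0).expand(x.size(0), -1, -1)
        left = table.gather(2, idx.unsqueeze(-1)).squeeze(-1)
        right = table.gather(2, (idx + 1).unsqueeze(-1)).squeeze(-1)
        return (1 - t) * left + t * right

    def damping_penalty(self):
        # penalize deviations from the reference shape to limit oscillations
        return ((self.points - self.ref) ** 2).mean()
```

In training, the task loss would be augmented with a weighted sum of `damping_penalty()` over all layers, which is one hedged reading of the "damping criterion" mentioned in the abstract.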
Deep Neural Networks with Trainable Activations and Controlled Lipschitz Constant
We introduce a variational framework to learn the activation functions of
deep neural networks. Our aim is to increase the capacity of the network while
controlling an upper-bound of the actual Lipschitz constant of the input-output
relation. To that end, we first establish a global bound for the Lipschitz
constant of neural networks. Based on the obtained bound, we then formulate a
variational problem for learning activation functions. Our variational problem
is infinite-dimensional and is not computationally tractable. However, we prove
that there always exists a solution that has continuous and piecewise-linear
(linear-spline) activations. This reduces the original problem to a
finite-dimensional minimization where an l1 penalty on the parameters of the
activations favors the learning of sparse nonlinearities. We numerically
compare our scheme with standard ReLU networks and their variations, PReLU and
LeakyReLU, and we empirically demonstrate the practical aspects of our
framework.
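A minimal sketch (in PyTorch, assumed here) of a learnable linear-spline activation built as an affine term plus a sparse sum of shifted ReLUs, with an l1 penalty on the ReLU coefficients; the parameterization, knot placement, and names are illustrative, not the authors' implementation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class LinearSplineActivation(nn.Module):
    """Learnable continuous piecewise-linear activation: affine part plus shifted ReLUs."""

    def __init__(self, num_knots=11, x_range=3.0):
        super().__init__()
        self.register_buffer("knots", torch.linspace(-x_range, x_range, num_knots))
        self.coeffs = nn.Parameter(torch.zeros(num_knots))  # ReLU coefficients a_k
        self.slope = nn.Parameter(torch.ones(1))             # affine slope
        self.bias = nn.Parameter(torch.zeros(1))             # affine offset

    def forward(self, x):
        out = self.bias + self.slope * x
        # add a_k * relu(x - tau_k) summed over the knots tau_k
        out = out + (F.relu(x.unsqueeze(-1) - self.knots) * self.coeffs).sum(dim=-1)
        return out

    def l1_penalty(self):
        # sparsity-promoting regularizer on the spline coefficients
        return self.coeffs.abs().sum()
```

In this parameterization the task loss would be augmented with a weighted sum of `l1_penalty()` over all activations; since the spline's slope is bounded by |slope| plus the sum of the |coefficients|, the same penalty also helps keep the per-activation Lipschitz constant under control.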
Rational neural networks
We consider neural networks with rational activation functions. The choice of
the nonlinear activation function in deep learning architectures is crucial and
heavily impacts the performance of a neural network. We establish optimal
bounds in terms of network complexity and prove that rational neural networks
approximate smooth functions more efficiently than ReLU networks with
exponentially smaller depth. The flexibility and smoothness of rational
activation functions make them an attractive alternative to ReLU, as we
demonstrate with numerical experiments.
Comment: 21 pages, 7 figures.
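A minimal sketch of a trainable rational activation in PyTorch; the degree choice (cubic numerator over quadratic denominator) and the positivity trick for the denominator are illustrative assumptions, not the specific construction from the paper:

```python
import torch
import torch.nn as nn


class RationalActivation(nn.Module):
    """Trainable rational activation r(x) = P(x) / Q(x) applied elementwise."""

    def __init__(self):
        super().__init__()
        # type (3, 2) rational: cubic numerator, quadratic denominator
        self.p = nn.Parameter(torch.tensor([0.0, 1.0, 0.0, 0.1]))  # numerator coefficients
        self.q = nn.Parameter(torch.tensor([0.1, 0.1]))            # denominator coefficients

    def forward(self, x):
        num = self.p[0] + self.p[1] * x + self.p[2] * x ** 2 + self.p[3] * x ** 3
        # fix the constant term of Q at 1 and take an absolute value so the
        # denominator cannot vanish on the real line
        den = 1.0 + torch.abs(self.q[0] * x + self.q[1] * x ** 2)
        return num / den
```

The coefficients are shared across a layer and learned jointly with the weights; a common practice is to initialize them so that r(x) starts close to a standard activation such as ReLU.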
Multilayer feedforward networks with adaptive spline activation function
In this paper, a new adaptive spline activation function neural network (ASNN) is presented. Due to the ASNN's high representation capability, networks with a small number of interconnections can be trained to solve both pattern recognition and data processing real-time problems. The main idea is to use a Catmull-Rom cubic spline as the neuron's activation function, which ensures a simple structure suitable for both software and hardware implementation. Experimental results demonstrate improvements in generalization capability and learning speed on both pattern recognition and data processing tasks.
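A minimal NumPy sketch of a Catmull-Rom cubic spline evaluated through equally spaced control points, which could serve as such a trainable per-neuron activation; the function name, input range, and boundary handling are assumptions made for illustration:

```python
import numpy as np


def catmull_rom_activation(x, control_points, x_min=-2.0, x_max=2.0):
    """Evaluate a Catmull-Rom cubic spline through equally spaced control points."""
    points = np.asarray(control_points, dtype=float)
    n = len(points)
    step = (x_max - x_min) / (n - 1)
    x = np.clip(x, x_min, x_max - 1e-9)
    i = np.floor((x - x_min) / step).astype(int)
    t = (x - x_min) / step - i
    # neighbouring control points, clamped at the boundaries
    p0 = points[np.clip(i - 1, 0, n - 1)]
    p1 = points[i]
    p2 = points[np.clip(i + 1, 0, n - 1)]
    p3 = points[np.clip(i + 2, 0, n - 1)]
    # standard Catmull-Rom basis in the local coordinate t in [0, 1)
    return 0.5 * ((2 * p1)
                  + (-p0 + p2) * t
                  + (2 * p0 - 5 * p1 + 4 * p2 - p3) * t ** 2
                  + (-p0 + 3 * p1 - 3 * p2 + p3) * t ** 3)
```

For example, the control points could be initialized as np.tanh(np.linspace(-2, 2, 9)) and then updated together with the network weights during training.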