3,822 research outputs found

    New acceleration technique for the backpropagation algorithm

    Artificial neural networks have been studied for many years in the hope of achieving human-like performance in pattern recognition, speech synthesis, and higher-level cognitive processes. In the connectionist model there are several interconnected processing elements, called neurons, each with limited processing capability. Even though the rate of information transmitted between these elements is limited, their complex interconnection and cooperative interaction result in vastly increased computing power. Neural network models are specified by an organized network topology of interconnected neurons, and these networks have to be trained before they can be used for a specific purpose. Backpropagation is one of the most popular methods of training neural networks, and the convergence speed of the standard backpropagation algorithm has improved considerably in the recent past. Here we present a new technique for accelerating the existing backpropagation algorithm without modifying it. We use a fourth-order interpolation method to estimate the dominant eigenvalues and use these estimates to change the slope of the activation function, thereby increasing the speed of convergence of the backpropagation algorithm. Our experiments show significant improvement in convergence time on problems widely used in benchmarking: a three- to ten-fold decrease in convergence time is achieved, and the reduction grows as the complexity of the problem increases. The technique adjusts the energy state of the system so as to escape from local minima.
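    The abstract does not spell out the fourth-order interpolation scheme, so the sketch below only illustrates the general idea in Python: estimate a dominant eigenvalue of a weight matrix (power iteration stands in for the interpolation method, which is an assumption on our part) and rescale the slope of a sigmoid activation accordingly; the update_slope rule is likewise a hypothetical stand-in, not the authors' formula.

    ```python
    import numpy as np

    def sigmoid(x, beta):
        # Logistic activation with an adjustable slope (gain) parameter beta.
        return 1.0 / (1.0 + np.exp(-beta * x))

    def dominant_eigenvalue(W, iters=50):
        # Power iteration on W^T W: estimates its largest eigenvalue.
        # (Stand-in for the paper's fourth-order interpolation estimator.)
        v = np.random.default_rng(0).standard_normal(W.shape[1])
        for _ in range(iters):
            v = W.T @ (W @ v)
            v /= np.linalg.norm(v)
        return float(v @ (W.T @ (W @ v)))  # Rayleigh quotient

    def update_slope(beta, lam, target=1.0):
        # Hypothetical rule: rescale beta so that the product of the slope
        # and the dominant eigenvalue stays near a target, keeping the
        # effective step dynamics of backpropagation stable.
        return beta * np.sqrt(target / max(lam, 1e-12))
    ```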

    Development of self-adaptive back propagation and derivative free training algorithms in artificial neural networks

    Three new iterative, dynamically self-adaptive, derivative-free and training-parameter-free artificial neural network (ANN) training algorithms are developed. They are defined as the self-adaptive back propagation, multi-directional, and restart ANN training algorithms. The descent direction in self-adaptive back propagation training is determined implicitly by a central difference approximation scheme, which chooses its step size according to the convergence behavior of the error function. This approach trains an ANN when the gradient information of the corresponding error function is not readily available. The self-adaptive variable learning rates per epoch are determined dynamically using a constrained interpolation search. As a result, appropriate descent on the error function is achieved. The multi-directional training algorithm is self-adaptive and derivative free. It orients an initial search vector in a descent location at the early stage of training. Individual learning rates and momentum terms for all the ANN weights are determined optimally. The search directions are derived from rectilinear and Euclidean paths, which explore stiff ridges and valleys of the error surface to improve training. The restart training algorithm is derivative free. It redefines a degenerate simplex at a rescale phase. This multi-parameter training algorithm updates ANN weights simultaneously instead of individually. The descent directions are derived from the centroid of a simplex along a reflection point opposite the worst vertex. The algorithm is robust and has the ability to improve local search. These ANN training methods are appropriate when there is a discontinuity in the corresponding ANN error function or when the Hessian matrix is ill-conditioned or singular. The convergence properties of the algorithms are proved where possible. All the training algorithms successfully train exclusive-OR (XOR), parity, character-recognition, and forecasting problems. The simulation results on the XOR, parity, and character-recognition problems suggest that all the training algorithms improve significantly over the standard back propagation algorithm in average number of epochs, function evaluations, and terminal function values. The multivariate ANN calibration problem, as a regression model with a small data set, is relatively difficult to train. In forecasting problems, an ANN is trained to extrapolate the data in the validation period. The extrapolation results are compared with the actual data. The trained ANN performs better than the statistical regression method in mean absolute deviation, mean squared error, and relative percentage error. The restart training algorithm succeeds in training a problem where the other training algorithms face difficulty. It is shown that a seasonal time-series problem possesses a Hessian matrix with a high condition number; convergence difficulties as well as slow training are therefore not atypical. The research exploits the geometry of the error surface to identify self-adaptive, optimized learning rates and momentum terms. Consequently, the algorithms converge with a high success rate. These attributes brand the training algorithms as self-adaptive, automatic, parameter-free, efficient, and easy to use.
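    As a rough illustration of the central-difference idea behind the self-adaptive back propagation algorithm, here is a minimal Python sketch. It assumes a scalar error function f over a flat weight vector, and it replaces the paper's self-adaptive step-size selection and constrained interpolation search with a fixed perturbation h and learning rate lr for brevity.

    ```python
    import numpy as np

    def central_difference_gradient(f, w, h=1e-4):
        # Approximate the gradient of error function f at weights w by
        # central differences: 2 * len(w) function evaluations, no
        # analytic derivatives required.
        g = np.zeros_like(w)
        for i in range(w.size):
            e = np.zeros_like(w)
            e[i] = h
            g[i] = (f(w + e) - f(w - e)) / (2.0 * h)
        return g

    def train_step(f, w, lr=0.1, h=1e-4):
        # One descent step along the approximated direction.
        return w - lr * central_difference_gradient(f, w, h)

    # Toy quadratic error surface: iterates approach the minimizer [1, 1, 1, 1].
    f = lambda w: float(np.sum((w - 1.0) ** 2))
    w = np.zeros(4)
    for _ in range(100):
        w = train_step(f, w)
    ```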

    Brain Tumor Segmentation with Deep Neural Networks

    In this paper, we present a fully automatic brain tumor segmentation method based on Deep Neural Networks (DNNs). The proposed networks are tailored to glioblastomas (both low and high grade) pictured in MR images. By their very nature, these tumors can appear anywhere in the brain and have almost any kind of shape, size, and contrast. These reasons motivate our exploration of a machine learning solution that exploits a flexible, high-capacity DNN while being extremely efficient. Here, we give a description of the different model choices that we have found necessary for obtaining competitive performance. We explore in particular different architectures based on Convolutional Neural Networks (CNNs), i.e., DNNs specifically adapted to image data. We present a novel CNN architecture which differs from those traditionally used in computer vision: our CNN exploits both local features and more global contextual features simultaneously. Also, unlike most traditional uses of CNNs, our networks use a final layer that is a convolutional implementation of a fully connected layer, which allows a 40-fold speed-up. We also describe a two-phase training procedure that allows us to tackle difficulties related to the imbalance of tumor labels. Finally, we explore a cascade architecture in which the output of a basic CNN is treated as an additional source of information for a subsequent CNN. Results reported on the 2013 BRATS test dataset reveal that our architecture improves over the currently published state of the art while being over 30 times faster.
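    To see why the convolutional implementation of a fully connected layer yields such a speed-up: a fully connected layer over a k x k feature patch is equivalent to a convolution with kernel size k, and sliding that convolution over the whole feature map produces a prediction at every patch location in a single pass instead of re-running the network once per patch. A minimal PyTorch sketch (the sizes here are illustrative assumptions, not the paper's):

    ```python
    import torch
    import torch.nn as nn

    k, c_in, n_classes = 7, 64, 5            # illustrative sizes
    fc_as_conv = nn.Conv2d(c_in, n_classes, kernel_size=k)

    patch = torch.randn(1, c_in, k, k)       # one patch -> one prediction
    print(fc_as_conv(patch).shape)           # torch.Size([1, 5, 1, 1])

    image = torch.randn(1, c_in, 64, 64)     # full map -> dense predictions
    print(fc_as_conv(image).shape)           # torch.Size([1, 5, 58, 58])
    ```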

    Coronal rain in magnetic arcades: Rebound shocks, limit cycles, and shear flows

    We extend our earlier multidimensional, magnetohydrodynamic simulations of coronal rain occurring in magnetic arcades with higher-resolution, grid-adaptive computations covering a much longer ($>6$ hour) timespan. We quantify how in-situ forming, blob-like condensations grow along and across field lines and show that rain showers can occur in limit cycles, here demonstrated for the first time in 2.5D setups. We discuss dynamical, multi-dimensional aspects of the rebound shocks generated by the siphon inflows and quantify the thermodynamics of a prominence-corona-transition-region-like structure surrounding the blobs. We point out the correlation between condensation rates and the cross-sectional size of the loop systems where catastrophic cooling takes place. We also study the variations of the typical number density, kinetic energy, and temperature while the blobs descend, impact, and sink into the transition region. In addition, we explain the mechanisms leading to concurrent upflows while the blobs descend. As a result, plenty of shear flows are generated in our simulations, with relative velocity differences of around 80 km s$^{-1}$. These shear flows are siphon flows set up by multiple blob dynamics, and they in turn affect the deformation of the falling blobs. In particular, we show how shear flows can break apart blobs into smaller fragments within minutes.