
    Distributed computing methodology for training neural networks in an image-guided diagnostic application

    Distributed computing is a process through which a set of computers connected by a network is used collectively to solve a single problem. In this paper, we propose a distributed computing methodology for training neural networks for the detection of lesions in colonoscopy. Our approach is based on partitioning the training set across multiple processors using a parallel virtual machine. In this way, interconnected computers of varied architectures can be used for the distributed evaluation of the error function and gradient values, and thus for training neural networks with various learning methods. The proposed methodology has large granularity and low synchronization, and has been implemented and tested. Our results indicate that the parallel virtual machine implementation of the training algorithms developed leads to considerable speedup, especially when large network architectures and training sets are used.
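
    A minimal sketch of the data-partitioning idea described in the abstract: each worker evaluates the batch error and gradient on its own shard of the training set, and a master process sums the partial results before applying one update. The paper uses a parallel virtual machine (PVM); Python's multiprocessing and the single-layer tanh model below are stand-ins for illustration only.

    ```python
    import numpy as np
    from multiprocessing import Pool

    def partial_error_and_grad(args):
        """Each worker evaluates error and gradient on its shard only."""
        w, X, y = args
        pred = np.tanh(X @ w)
        err = pred - y
        grad = X.T @ ((1.0 - pred**2) * err)   # chain rule through tanh
        return 0.5 * np.sum(err**2), grad

    def distributed_step(w, shards, lr=0.01, pool=None):
        """Master sums partial errors/gradients and applies one batch update."""
        results = pool.map(partial_error_and_grad, [(w, X, y) for X, y in shards])
        total_err = sum(e for e, _ in results)
        total_grad = sum(g for _, g in results)
        return w - lr * total_grad, total_err

    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        X, y = rng.normal(size=(1000, 8)), rng.normal(size=1000)
        # Partition the training set across 4 workers (large-granularity tasks).
        shards = [(X[i::4], y[i::4]) for i in range(4)]
        w = rng.normal(size=8)
        with Pool(4) as pool:
            for _ in range(100):
                w, err = distributed_step(w, shards, pool=pool)
        print("final batch error:", err)
    ```

    Because each task is one full pass over a shard, communication happens only once per epoch, which reflects the large granularity and low synchronization the abstract highlights.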

    Improved sign-based learning algorithm derived by the composite nonlinear Jacobi process

    In this paper, a globally convergent first-order training algorithm is proposed that uses sign-based information from the batch error measure in the framework of the nonlinear Jacobi process. This approach allows us to equip the recently proposed Jacobi–Rprop method with the global convergence property, i.e. convergence to a local minimizer from any initial starting point. We also propose a strategy that ensures the search direction of the globally convergent Jacobi–Rprop is a descent one. The behaviour of the algorithm is empirically investigated on eight benchmark problems. Simulation results verify that there are indeed improvements in the convergence success of the algorithm.
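
    To make the class of algorithm concrete, here is a generic Rprop-style sign-based update: step sizes grow where the gradient keeps its sign and shrink where it flips, and only the sign of the gradient sets the direction. This is the standard scheme the paper builds on, not the globally convergent Jacobi–Rprop itself, whose descent safeguards are not reproduced here.

    ```python
    import numpy as np

    def rprop_step(w, grad, prev_grad, step, eta_plus=1.2, eta_minus=0.5,
                   step_min=1e-6, step_max=1.0):
        sign_change = np.sign(grad) * np.sign(prev_grad)
        # Grow the step where the gradient kept its sign, shrink where it flipped.
        step = np.where(sign_change > 0, np.minimum(step * eta_plus, step_max), step)
        step = np.where(sign_change < 0, np.maximum(step * eta_minus, step_min), step)
        # Only the sign of the gradient determines the direction of the move.
        return w - np.sign(grad) * step, step

    # Usage on a toy batch error E(w) = 0.5 * ||w||^2, whose gradient is w:
    w = np.array([2.0, -3.0])
    step = np.full_like(w, 0.1)
    prev_grad = np.zeros_like(w)
    for _ in range(50):
        grad = w
        w, step = rprop_step(w, grad, prev_grad, step)
        prev_grad = grad
    print(w)  # close to the minimizer at the origin
    ```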

    Bounding the search space for global optimization of neural networks learning error: an interval analysis approach

    Training a multilayer perceptron (MLP) with algorithms employing global search strategies has been an important research direction in the field of neural networks. Despite a number of significant results, an important matter concerning the bounds of the search region (typically defined as a box) where a global optimization method has to search for a potential global minimizer seems to be unresolved. The approach presented in this paper builds on interval analysis and attempts to define guaranteed bounds in the search space prior to applying a global search algorithm for training an MLP. These bounds depend on the machine precision, and the term guaranteed denotes that the region defined surely encloses weight sets that are global minimizers of the neural network's error function. Although the solution set to the bounding problem for an MLP is in general non-convex, the paper presents theoretical results that help in deriving a box which is a convex set. This box is an outer approximation of the algebraic solutions to the interval equations resulting from the function implemented by the network nodes. An experimental study using well-known benchmarks is presented in support of the theoretical results.
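
    The following sketch illustrates one way a machine-precision argument can bound the weight box: beyond some pre-activation magnitude, tanh is indistinguishable from +/-1 in floating point, so weights that drive a node past that point cannot change the error function. This is an illustrative outer bound under that assumption, not the paper's exact construction.

    ```python
    import numpy as np

    eps = np.finfo(float).eps
    # Largest pre-activation that still matters: tanh(a) = 1 - eps => a = atanh(1 - eps)
    a_max = np.arctanh(1.0 - eps)

    def weight_box(input_bounds):
        """Outer-approximate the useful weight interval for one node, given
        a componentwise input interval [lo, hi] (a convex box, as in the paper)."""
        lo, hi = input_bounds
        scale = max(abs(lo), abs(hi))       # worst-case input magnitude
        bound = a_max / scale               # |w| * |x| <= a_max keeps tanh unsaturated
        return (-bound, bound)

    print("saturation threshold:", a_max)            # about 18.4 for float64
    print("weight box for inputs in [-1, 1]:", weight_box((-1.0, 1.0)))
    ```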

    Solving the linear interval tolerance problem for weight initialization of neural networks

    Determining good initial conditions for an algorithm used to train a neural network is considered a parameter estimation problem dealing with uncertainty about the initial weights. Interval analysis approaches model uncertainty in parameter estimation problems using intervals and formulating tolerance problems. Solving a tolerance problem means defining lower and upper bounds of the intervals so that the system functionality is guaranteed within predefined limits. The aim of this paper is to show how the problem of determining the initial weight intervals of a neural network can be defined in terms of solving a linear interval tolerance problem. The proposed Linear Interval Tolerance Approach copes with uncertainty about the initial weights without any previous knowledge or specific assumptions on the input data, as required by approaches such as fuzzy sets or rough sets. The proposed method is tested on a number of well-known benchmarks for neural networks trained with the back-propagation family of algorithms. Its efficiency is evaluated with regard to standard performance measures, and the results obtained are compared against those of a number of well-known and established initialization methods. These results provide credible evidence that the proposed method outperforms classical weight initialization methods.
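
    A minimal sketch of the tolerance idea: choose initial weight intervals so that each node's net input is guaranteed to stay in the sigmoid's active region for every pattern in the data. The closed-form worst-case bound below (via the l1 norm of input magnitudes, with an assumed active region of [-4, 4]) is an illustrative simplification of the linear interval tolerance solution.

    ```python
    import numpy as np

    def init_weight_interval(X, active=4.0):
        """Symmetric interval [-w0, w0] per weight such that |w . x| <= active
        for all rows x of X (worst case via the l1 norm of input magnitudes)."""
        worst = np.max(np.sum(np.abs(X), axis=1))   # max over patterns of sum |x_i|
        w0 = active / worst
        return -w0, w0

    rng = np.random.default_rng(1)
    X = rng.uniform(-1, 1, size=(500, 10))
    lo, hi = init_weight_interval(X)
    W = rng.uniform(lo, hi, size=(10, 5))   # draw initial weights from the interval
    print("initial weight interval:", (lo, hi))
    ```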

    Reliable estimation of a neural network’s domain of validity through interval analysis based inversion

    Reliable estimation of a neural network’s domain of validity is important for a number of reasons, such as assessing its ability to cope with a given problem, evaluating the consistency of its generalization, etc. In this paper we introduce a new approach to estimate the domain of validity of a neural network based on Set Inversion Via Interval Analysis (SIVIA), the methodology established by Jaulin and Walter [1]. This approach was originally introduced in order to solve nonlinear parameter estimation problems in a bounded-error context and has proved effective in tackling several types of problems dealing with nonlinear systems analysis. The dependence of a neural network’s output on the pattern data is a nonlinear function, and hence deriving the impact of the input data on the neural network function can be addressed as a nonlinear parameter estimation problem that can be tackled by SIVIA. We present concrete application examples and show how the proposed method allows us to delimit the domain of validity of a trained neural network. We discuss advantages, pitfalls, and potential improvements the method offers to neural networks.
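
    A minimal SIVIA sketch for a toy one-input "network" f(x) = tanh(w*x + b), with assumed weights: input boxes are bisected, kept when their interval image certainly lies in the target output range, discarded when it certainly lies outside, and split otherwise. The paper applies the same inside/outside/undetermined scheme to full trained networks.

    ```python
    import numpy as np

    w, b = 2.0, -0.5          # assumed toy weights
    Y = (0.0, 0.9)            # target output range defining the domain of validity

    def image(box):
        """Interval extension of f over box = (lo, hi); tanh is monotone."""
        lo, hi = box
        vals = sorted((np.tanh(w*lo + b), np.tanh(w*hi + b)))
        return vals[0], vals[1]

    def sivia(box, eps=1e-3, inside=None):
        inside = [] if inside is None else inside
        ylo, yhi = image(box)
        if ylo >= Y[0] and yhi <= Y[1]:
            inside.append(box)                # image certainly inside the target
        elif yhi < Y[0] or ylo > Y[1]:
            pass                              # image certainly outside: discard
        elif box[1] - box[0] > eps:
            mid = 0.5 * (box[0] + box[1])     # undetermined: bisect and recurse
            sivia((box[0], mid), eps, inside)
            sivia((mid, box[1]), eps, inside)
        return inside

    boxes = sivia((-5.0, 5.0))
    print("validity domain approx:", (boxes[0][0], boxes[-1][1]))
    ```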

    A new ghost cell/level set method for moving boundary problems: application to tumor growth

    In this paper, we present a ghost cell/level set method for the evolution of interfaces whose normal velocity depends upon the solutions of linear and nonlinear quasi-steady reaction-diffusion equations with curvature-dependent boundary conditions. Our technique includes a ghost cell method that accurately discretizes normal derivative jump boundary conditions without smearing jumps in the tangential derivative; a new iterative method for solving linear and nonlinear quasi-steady reaction-diffusion equations; an adaptive discretization to compute the curvature and normal vectors; and a new discrete approximation to the Heaviside function. We present numerical examples that demonstrate better than 1.5-order convergence for problems where traditional ghost cell methods either fail to converge or attain at best sub-linear accuracy. We apply our techniques to a model of tumor growth in complex, heterogeneous tissues that consists of a nonlinear nutrient equation and a pressure equation with geometry-dependent jump boundary conditions. We simulate the growth of glioblastoma (an aggressive brain tumor) into a large, 1 cm square of brain tissue that includes heterogeneous nutrient delivery and varied biomechanical characteristics (white matter, gray matter, cerebrospinal fluid, and bone), and we observe growth morphologies that are highly dependent upon the variations of the tissue characteristics, an effect observed in real tumor growth.
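
    A minimal 1D sketch of the ghost-cell idea: solve u'' = 0 on [0, L] with u(0) = 0 and a Dirichlet condition u = g at an interface that falls between grid points, closing the stencil by eliminating the ghost value through linear interpolation across the interface. The problem, grid, and interface location below are assumptions for illustration; the paper's method handles 2D jump conditions with curvature-dependent data.

    ```python
    import numpy as np

    n, L = 50, 1.0
    h = L / n
    x = np.linspace(0.0, L, n + 1)
    x_if, g = 0.73 * L, 2.0            # interface location and boundary value there
    i_if = int(x_if / h)               # last grid index left of the interface
    theta = (x_if - x[i_if]) / h       # fractional position of interface in the cell

    # Assemble the standard 3-point Laplacian on points 0..i_if.
    A = np.zeros((i_if + 1, i_if + 1))
    rhs = np.zeros(i_if + 1)
    A[0, 0] = 1.0                      # boundary condition u(0) = 0
    for i in range(1, i_if):
        A[i, i-1:i+2] = [1.0, -2.0, 1.0]
    # Ghost-cell closure at i_if: linear interpolation through (x_if, g) gives
    # u_ghost = (g - (1 - theta) * u_i) / theta, which we substitute into the stencil.
    A[i_if, i_if-1] = 1.0
    A[i_if, i_if] = -2.0 + (theta - 1.0) / theta
    rhs[i_if] = -g / theta

    u = np.linalg.solve(A, rhs)
    print("computed slope:", u[i_if] / x[i_if], "exact:", g / x_if)
    ```

    Because the closure interpolates through the true interface location rather than snapping the boundary to the nearest grid point, the linear exact solution is reproduced to rounding error, the property that underlies the sharp (non-smeared) treatment of boundary data.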

    Feature Selection in Single-Cell RNA-seq Data via a Genetic Algorithm

    Big data methods prevail in the biomedical domain, leading to effective and scalable data-driven approaches. Biomedical data are known for their ultra-high dimensionality, especially those coming from molecular biology experiments. This property also characterizes the emerging technique of single-cell RNA-sequencing (scRNA-seq), where we obtain sequence information from individual cells. A reliable way to uncover their complexity is by using machine learning approaches, including dimensionality reduction and feature selection methods. Although the former has seen remarkable progress on scRNA-seq data, only the latter can offer deeper interpretability at the gene level, since it highlights the dominant gene features in the given data. Towards tackling this challenge, we propose a feature selection framework that utilizes genetic optimization principles and identifies low-dimensional combinations of gene lists in order to enhance the classification performance of any off-the-shelf classifier (e.g., LDA or SVM). Our intuition is that by identifying an optimal gene subset we can enhance the prediction power of scRNA-seq data, even if these genes are unrelated to each other. We showcase our proposed framework's effectiveness in two real scRNA-seq experiments with gene dimensions up to 36,708. Our framework can identify very low-dimensional subsets of genes (fewer than 200) while boosting the classifiers' performance. Finally, we provide a biological interpretation of the selected genes, thus providing evidence of our method's utility towards explainable artificial intelligence.
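
    A minimal GA feature-selection sketch in the spirit of the framework: binary chromosomes mask gene columns, and fitness is the cross-validated accuracy of an off-the-shelf classifier (here LDA, via scikit-learn) on the selected genes. The truncation selection, one-point crossover, and bit-flip mutation below are generic operators, not the paper's exact configuration, and the data are synthetic.

    ```python
    import numpy as np
    from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
    from sklearn.model_selection import cross_val_score

    rng = np.random.default_rng(0)

    def fitness(mask, X, y):
        if mask.sum() == 0:
            return 0.0
        clf = LinearDiscriminantAnalysis()
        return cross_val_score(clf, X[:, mask], y, cv=3).mean()

    def ga_select(X, y, pop=20, gens=30, p_gene=0.05):
        n = X.shape[1]
        popn = rng.random((pop, n)) < p_gene        # sparse initial gene masks
        for _ in range(gens):
            scores = np.array([fitness(m, X, y) for m in popn])
            order = np.argsort(scores)[::-1]
            parents = popn[order[:pop // 2]]        # truncation selection
            cut = rng.integers(1, n, size=pop // 2)
            children = np.array([np.concatenate([parents[i % len(parents)][:c],
                                                 parents[(i+1) % len(parents)][c:]])
                                 for i, c in enumerate(cut)])   # one-point crossover
            flip = rng.random(children.shape) < 1.0 / n         # bit-flip mutation
            popn = np.vstack([parents, children ^ flip])
        scores = np.array([fitness(m, X, y) for m in popn])
        return popn[np.argmax(scores)]

    # Toy usage with synthetic "expression" data: 2 informative genes out of 100.
    X = rng.normal(size=(120, 100))
    y = (X[:, 3] + X[:, 42] > 0).astype(int)
    best = ga_select(X, y)
    print("selected genes:", np.flatnonzero(best))
    ```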

    Adaptive algorithms for neural network supervised learning: a deterministic optimization approach

    Networks of neurons can perform computations that even modern computers find very difficult to simulate. Most of the existing artificial neurons and artificial neural networks are considered biologically unrealistic; nevertheless, the practical success of the backpropagation algorithm and the powerful capabilities of feedforward neural networks have made neural computing very popular in several application areas. A challenging issue in this context is learning internal representations by adjusting the weights of the network connections. To this end, several first-order and second-order algorithms have been proposed in the literature. This paper provides an overview of approaches to backpropagation training, emphasizing first-order adaptive learning algorithms that build on the theory of nonlinear optimization, and proposes a framework for their analysis in the context of deterministic optimization.
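
    As one concrete instance of the class of algorithms surveyed, here is a batch gradient-descent step with a "bold driver" learning-rate adaptation: grow the rate after a step that reduces the batch error, shrink it after an increase. This is a generic illustrative scheme, not the paper's specific framework.

    ```python
    import numpy as np

    def train(error, grad, w, lr=0.1, up=1.1, down=0.5, iters=100):
        e = error(w)
        for _ in range(iters):
            w_new = w - lr * grad(w)
            e_new = error(w_new)
            if e_new < e:                 # successful step: accept and grow rate
                w, e, lr = w_new, e_new, lr * up
            else:                         # error increased: reject and shrink rate
                lr *= down
        return w, e

    # Toy quadratic batch error E(w) = 0.5 * ||w - 1||^2 and its gradient:
    error = lambda w: 0.5 * np.sum((w - 1.0) ** 2)
    grad = lambda w: w - 1.0
    w, e = train(error, grad, np.zeros(5))
    print("final error:", e)
    ```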