7 research outputs found

    Large Scale Evolution of Convolutional Neural Networks Using Volunteer Computing

    This work presents a new algorithm called evolutionary exploration of augmenting convolutional topologies (EXACT), which is capable of evolving the structure of convolutional neural networks (CNNs). EXACT is in part modeled after the neuroevolution of augmenting topologies (NEAT) algorithm, with notable exceptions that allow it to scale to large distributed computing environments and to evolve networks with convolutional filters. In addition to multithreaded and MPI versions, EXACT has been implemented as part of a BOINC volunteer computing project, enabling large-scale evolution. Over a period of two months, more than 4,500 volunteered computers on the Citizen Science Grid trained over 120,000 CNNs and evolved networks reaching 98.32% test accuracy on the MNIST handwritten digits dataset. These results are all the more notable because the backpropagation strategy used to train the CNNs was fairly rudimentary (ReLU units, L2 regularization, and Nesterov momentum) and these were initial test runs done without tuning of the backpropagation hyperparameters. Further, the EXACT evolutionary strategy is independent of the method used to train the CNNs, so the networks could be further improved by advanced techniques such as elastic distortions, pretraining, and dropout. The evolved networks are also quite interesting, showing "organic" structures and significant differences from standard human-designed architectures.
    Comment: 17 pages, 13 figures. Submitted to the 2017 Genetic and Evolutionary Computation Conference (GECCO 2017).
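
    As a rough illustration of the NEAT-style "augmenting topologies" idea the abstract describes, the hedged Python sketch below mutates a minimal CNN genome by splitting an existing edge and inserting a new convolutional filter node. All names here (Genome, add_conv_node, the filter sizes) are hypothetical stand-ins, not taken from the EXACT code.

```python
import random

class Genome:
    """Minimal CNN genome: nodes are layers, edges carry connectivity.
    A hypothetical stand-in for EXACT's genome, not the actual implementation."""
    def __init__(self):
        self.nodes = {0: "input", 1: "softmax_output"}
        self.edges = {(0, 1): True}   # (src, dst) -> enabled flag
        self.next_id = 2

    def add_conv_node(self, filter_sizes=(3, 5)):
        """NEAT-style 'add node' mutation: split a random enabled edge by
        inserting a new convolutional filter node between its endpoints."""
        enabled = [e for e, on in self.edges.items() if on]
        if not enabled:
            return
        src, dst = random.choice(enabled)
        self.edges[(src, dst)] = False          # disable the split edge
        node_id = self.next_id
        self.next_id += 1
        size = random.choice(filter_sizes)
        self.nodes[node_id] = f"conv{size}x{size}"
        self.edges[(src, node_id)] = True       # reconnect through the new node
        self.edges[(node_id, dst)] = True

g = Genome()
for _ in range(3):
    g.add_conv_node()
print(g.nodes)
```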

    Distributed evolutionary algorithms and their models: A survey of the state-of-the-art

    The increasing complexity of real-world optimization problems raises new challenges for evolutionary computation. Responding to these challenges, distributed evolutionary computation has received considerable attention over the past decade. This article provides a comprehensive survey of state-of-the-art distributed evolutionary algorithms and models, classified into two groups according to their task-division mechanism. Population-distributed models are presented with master-slave, island, cellular, hierarchical, and pool architectures, which parallelize an evolution task at the population, individual, or operation level. Dimension-distributed models include coevolution and multi-agent models, which focus on dimension reduction. Insights into the models, such as synchronization, homogeneity, communication, topology, speedup, and advantages and disadvantages, are also presented and discussed. The study of these models helps guide the future development of different and/or improved algorithms. Also highlighted are recent hotspots in this area, including cloud- and MapReduce-based implementations, GPU- and CUDA-based implementations, distributed evolutionary multiobjective optimization, and real-world applications. Further, a number of future research directions are discussed, with the conclusion that the development of distributed evolutionary computation will continue to flourish.
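
    To make one of the surveyed population-distributed models concrete, here is a hedged sketch of an island model: several subpopulations evolve independently and periodically exchange their best individuals over a ring topology. The objective function, migration policy, and all parameters are illustrative assumptions, not drawn from any particular surveyed system.

```python
import random

def fitness(x):
    """Placeholder objective: maximising this minimises the sphere function."""
    return -sum(v * v for v in x)

def evolve(pop, generations=10):
    """Trivial (mu + mu) evolution step with Gaussian mutation."""
    for _ in range(generations):
        children = [[v + random.gauss(0, 0.1) for v in ind] for ind in pop]
        pop = sorted(pop + children, key=fitness, reverse=True)[:len(pop)]
    return pop

def island_model(n_islands=4, pop_size=20, dim=5, epochs=5):
    """Island model: independent evolution with periodic ring migration."""
    islands = [[[random.uniform(-1, 1) for _ in range(dim)]
                for _ in range(pop_size)] for _ in range(n_islands)]
    for _ in range(epochs):
        islands = [evolve(pop) for pop in islands]        # parallelisable step
        for i, pop in enumerate(islands):                 # ring migration
            best = max(pop, key=fitness)
            neighbour = islands[(i + 1) % n_islands]
            neighbour[neighbour.index(min(neighbour, key=fitness))] = best[:]
    return max((ind for pop in islands for ind in pop), key=fitness)

print(island_model())
```

    In a real deployment the per-island evolve step would run on separate processors (master-slave, MPI ranks, or GPU blocks), with only the migrants crossing the communication topology.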

    Microwave Tomography Using Stochastic Optimization And High Performance Computing

    This thesis discusses the application of parallel computing to microwave tomography for the detection and imaging of dielectric objects. The main focus is on microwave tomography using a parallelized Finite Difference Time Domain (FDTD) forward solver in conjunction with non-linear stochastic optimization based inverse solvers. Because such solvers require very heavy computation, their investigation has been limited in favour of deterministic inverse solvers that rely on assumptions and approximations about the imaging target. Without linearization assumptions, a non-linear stochastic microwave tomography system can resolve targets with arbitrary permittivity contrast profiles while avoiding convergence to local minima of the microwave tomography optimization space. This work focuses on ameliorating the resulting computational load through heavy parallelization. The presented microwave tomography system is capable of modelling complex, heterogeneous, and dispersive media using the Debye model; a detailed explanation of the dispersive FDTD method is presented herein. The system uses scattered-field data from multiple excitation angles, frequencies, and observation angles to improve target resolution, reduce the ill-posedness of the microwave tomography inverse problem, and improve the accuracy of the recovered complex permittivity profile of the imaging target. The FDTD forward solver is parallelized using the Compute Unified Device Architecture (CUDA) programming model developed by the NVIDIA corporation, so that the time stepping of the fields is computed on a Graphics Processing Unit (GPU). In addition, the inverse solver uses the Message Passing Interface (MPI) to distribute computation across multiple workstations. The FDTD method was chosen for its ease of parallelization on GPUs and for its ability to simulate wideband excitation signals in a single forward simulation. We investigated the use of distributed Particle Swarm Optimization (PSO) and Differential Evolution (DE) methods in the inverse solver of this microwave tomography system. In these optimization algorithms, candidate solutions are farmed out to separate workstations for evaluation. As fitness evaluations return asynchronously, the optimization algorithm updates the population of candidate solutions and assigns new candidates to idle workstations; in this manner, a total of eight graphics processing units were used during optimization with minimal downtime. Presented in this thesis is a microwave tomography algorithm that does not rely on linearization assumptions and that can image a target in a reasonable amount of time for clinical applications. The proposed algorithm was tested using numerical phantoms with material parameters similar to those found in normal and malignant human tissue.
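
    The thesis distributes fitness evaluations over MPI workstations and GPUs; as a loose single-machine analogue of that asynchronous farming scheme, the sketch below uses a Python process pool in place of MPI and a toy misfit function in place of the FDTD forward solve. The DE-style proposal rule, steady-state replacement, and every name here are illustrative assumptions, not the thesis's actual solver.

```python
import random
from concurrent.futures import ProcessPoolExecutor, FIRST_COMPLETED, wait

def forward_misfit(candidate):
    """Stand-in for an expensive FDTD forward solve plus scattered-field
    misfit: distance of a candidate permittivity profile from a fixed target."""
    target = [2.0, 1.5, 3.0]
    return sum((c - t) ** 2 for c, t in zip(candidate, target))

def propose(vectors):
    """DE/rand/1-style trial vector from three distinct population members."""
    a, b, c = random.sample(vectors, 3)
    return [ai + 0.8 * (bi - ci) for ai, bi, ci in zip(a, b, c)]

def async_optimise(dim=3, pop_size=10, budget=200, workers=4):
    """Asynchronous steady-state optimisation: refill each idle worker as
    soon as its evaluation returns, instead of waiting for a full generation."""
    seeds = [[random.uniform(1.0, 4.0) for _ in range(dim)]
             for _ in range(pop_size)]
    population = []                      # list of (vector, misfit) pairs
    with ProcessPoolExecutor(max_workers=workers) as pool:
        pending = {pool.submit(forward_misfit, s): s for s in seeds}
        evaluated = 0
        while pending and evaluated < budget:
            done, _ = wait(pending, return_when=FIRST_COMPLETED)
            for fut in done:
                vec = pending.pop(fut)
                population.append((vec, fut.result()))
                evaluated += 1
                population.sort(key=lambda p: p[1])
                population = population[:pop_size]   # keep the best pop_size
                if len(population) >= 3:
                    trial = propose([v for v, _ in population])
                else:
                    trial = [random.uniform(1.0, 4.0) for _ in range(dim)]
                pending[pool.submit(forward_misfit, trial)] = trial
    return population[0]

if __name__ == "__main__":
    print(async_optimise())
```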

    Blackbox Stencil Interpolation Method for model reduction

    Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Aeronautics and Astronautics, 2012. Cataloged from the department-submitted PDF version of the thesis; this electronic version was submitted and approved by the author's academic department as part of an electronic thesis pilot project, and the certified thesis is available in the Institute Archives and Special Collections. Includes bibliographical references (p. 87-89).
    Model reduction often requires modifications to the simulation code. In many circumstances, developing and maintaining these modifications can be cumbersome, so non-intrusive methods that do not require modification of the source code are often preferred. This thesis proposes a new machine-learning formulation, the Black-box Stencil Interpolation Method, for this purpose. It is a non-intrusive, data-oriented method that infers the underlying physics governing a simulation and can be combined with conventional intrusive model-reduction techniques. The method is tested on several problems to investigate its accuracy, robustness, and applicability.
    by Han Chen. S.M.
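
    The thesis defines the method precisely; as a rough, hypothetical illustration of the "learn the local update rule from black-box input/output data" idea, the sketch below fits a linear map from each 3-point neighbourhood of a 1D diffusion simulation to the next-step value, then advances the state with the learned stencil. The diffusion example and the least-squares fit are assumptions for illustration, not the thesis's formulation.

```python
import numpy as np

def blackbox_step(u, r=0.25):
    """The 'simulation code' treated as a black box: explicit 1D diffusion."""
    v = u.copy()
    v[1:-1] = u[1:-1] + r * (u[2:] - 2.0 * u[1:-1] + u[:-2])
    return v

# 1. Sample input/output pairs from the black box.
rng = np.random.default_rng(0)
X, y = [], []
for _ in range(50):
    u = rng.standard_normal(64)
    v = blackbox_step(u)
    for i in range(1, 63):
        X.append([u[i - 1], u[i], u[i + 1]])   # local stencil neighbourhood
        y.append(v[i])
X, y = np.array(X), np.array(y)

# 2. Infer the stencil weights by least squares (should recover r, 1-2r, r).
w, *_ = np.linalg.lstsq(X, y, rcond=None)
print("learned stencil:", np.round(w, 3))

# 3. Advance the state with the learned stencil instead of the source code.
def learned_step(u):
    v = u.copy()
    v[1:-1] = w[0] * u[:-2] + w[1] * u[1:-1] + w[2] * u[2:]
    return v

u0 = rng.standard_normal(64)
print("max step error:", np.abs(learned_step(u0) - blackbox_step(u0)).max())
```

    Once the local update rule is captured this way, conventional intrusive model-reduction techniques can be applied to the learned surrogate rather than to the original source code.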

    Scalable parallel evolutionary optimisation based on high performance computing

    Evolutionary algorithms (EAs) have been successfully applied to solve various challenging optimisation problems. Due to their stochastic nature, EAs typically require considerable time to find desirable solutions, especially for increasingly complex and large-scale problems. As a result, many works have studied implementing EAs on parallel computing facilities to accelerate these time-consuming processes. Recently, the rapid development of modern parallel computing facilities such as high performance computing (HPC) brings not only unprecedented computational capabilities but also challenges in designing parallel algorithms. This thesis focuses on designing scalable parallel evolutionary optimisation (SPEO) frameworks that run efficiently on HPC. Motivated by the interesting phenomenon that many EAs have begun to employ increasingly large population sizes, this thesis first studies the effect of a large population size through comprehensive experiments. Numerical results indicate that a large population benefits the solving of complex problems but requires a large number of fitness evaluations (FEs). Since sequential EAs usually require considerable computing time to perform extensive FEs, we propose a scalable parallel evolutionary optimisation framework that can efficiently deploy parallel EAs over many CPU cores on CPU-only HPC. On the other hand, since EAs using a large number of FEs produce a wealth of useful information in the course of evolution, we design a surrogate-based approach that learns from this historical information to better solve complex problems; this approach is then parallelized within the proposed scalable framework to achieve remarkable speedups. Because demanding great computing power on CPU-only HPC is usually very expensive, we also design a framework based on GPU-enabled HPC to improve the cost-effectiveness of parallel EAs. The proposed framework can efficiently accelerate parallel EAs using many GPUs and achieves superior cost-effectiveness. However, since it is very challenging to implement parallel EAs correctly on the GPU, we propose a set of guidelines to verify the correctness of GPU-based EAs; to examine these guidelines, they are employed to verify a GPU-based brain storm optimisation algorithm that is also proposed in this thesis. In conclusion, a comprehensive experimental study is first conducted to investigate the impact of a large population; a SPEO framework based on CPU-only HPC is then proposed and employed to accelerate a time-consuming EA; finally, correctness verification for single-GPU EA implementations is discussed and the SPEO framework is extended to GPU-enabled HPC.
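
    The thesis details its own surrogate and frameworks; as a generic, hypothetical illustration of learning from historical fitness evaluations, the sketch below archives every true evaluation and uses a nearest-neighbour surrogate to pre-screen offspring, spending the expensive evaluations only on the candidates the surrogate ranks best. The objective, the k-NN surrogate, and all parameters are assumptions for illustration.

```python
import random

def expensive_fitness(x):
    """Placeholder for a costly objective (lower is better)."""
    return sum(v * v for v in x)

def surrogate(x, archive, k=3):
    """Nearest-neighbour surrogate: average of the k closest archived scores."""
    dist = lambda a: sum((ai - xi) ** 2 for ai, xi in zip(a, x))
    nearest = sorted(archive, key=lambda rec: dist(rec[0]))[:k]
    return sum(score for _, score in nearest) / len(nearest)

def surrogate_assisted_ea(dim=5, pop_size=12, gens=30, screened=4):
    pop = [[random.uniform(-5, 5) for _ in range(dim)] for _ in range(pop_size)]
    archive = [(ind, expensive_fitness(ind)) for ind in pop]   # history of FEs
    for _ in range(gens):
        offspring = [[v + random.gauss(0, 0.5) for v in random.choice(pop)]
                     for _ in range(3 * pop_size)]
        # Pre-screen cheaply with the surrogate; truly evaluate only the best.
        offspring.sort(key=lambda c: surrogate(c, archive))
        for cand in offspring[:screened]:
            archive.append((cand, expensive_fitness(cand)))
        archive.sort(key=lambda rec: rec[1])
        pop = [vec for vec, _ in archive[:pop_size]]
    return archive[0]

print(surrogate_assisted_ea())
```

    The pre-screening step is embarrassingly parallel, which is what makes schemes of this kind attractive on CPU-only or GPU-enabled HPC.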