Search CORE

1,563 research outputs found

A GPU-based Evolution Strategy for Optic Disk Detection in Retinal Images

Author: González-Calederón Guillermo
Sánchez-Torres Germán
Publication venue: 'Universidad de Medellin'
Publication date: 01/01/2016
Field of study

La ejecución paralela de aplicaciones usando unidades de procesamiento gráfico (gpu) ha ganado gran interés en la comunidad académica en los años recientes. La computación paralela puede ser aplicada a las estrategias evolutivas para procesar individuos dentro de una población, sin embargo, las estrategias evolutivas se caracterizan por un significativo consumo de recursos computacionales al resolver problemas de gran tamaño o aquellos que se modelan mediante funciones de aptitud complejas. Este artículo describe la implementación de una estrategia evolutiva para la detección del disco óptico en imágenes de retina usando Compute Unified Device Architecture (cuda). Los resultados experimentales muestran que el tiempo de ejecución para la detección del disco óptico logra una aceleración de 5 a 7 veces, comparado con la ejecución secuencial en una cpu convencional.Parallel processing using graphic processing units (GPUs) has attracted much research interest in recent years. Parallel computation can be applied to evolution strategy (ES) for processing individuals in a population, but evolutionary strategies are time consuming to solve large computational problems or complex fitness functions. In this paper we describe the implementation of an improved ES for optic disk detection in retinal images using the Compute Unified Device Architecture (CUDA) environment. In the experimental results we show that the computational time for optic disk detection task has a speedup factor of 5x and 7x compared to an implementation on a mainstream CPU

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Universidad de Medellín: Revistas Científicas

Repositorio Institucional Universidad de Medellín

DIALNET

High-speed detection of emergent market clustering via an unsupervised parallel genetic algorithm

Author: Gebbie Tim
Hendricks Dieter
Wilcox Diane
Publication venue: 'Academy of Science of South Africa'
Publication date: 02/08/2015
Field of study

We implement a master-slave parallel genetic algorithm (PGA) with a bespoke log-likelihood fitness function to identify emergent clusters within price evolutions. We use graphics processing units (GPUs) to implement a PGA and visualise the results using disjoint minimal spanning trees (MSTs). We demonstrate that our GPU PGA, implemented on a commercially available general purpose GPU, is able to recover stock clusters in sub-second speed, based on a subset of stocks in the South African market. This represents a pragmatic choice for low-cost, scalable parallel computing and is significantly faster than a prototype serial implementation in an optimised C-based fourth-generation programming language, although the results are not directly comparable due to compiler differences. Combined with fast online intraday correlation matrix estimation from high frequency data for cluster identification, the proposed implementation offers cost-effective, near-real-time risk assessment for financial practitioners.Comment: 10 pages, 5 figures, 4 tables, More thorough discussion of implementatio

arXiv.org e-Print Archive

Crossref

Academy of Science of South Africa (ASSAf): Open Journal Systems

Directory of Open Access Journals

Graphics Processing Unit–Enhanced Genetic Algorithms for Solving the Temporal Dynamics of Gene Regulatory Networks

Author: Córdoba Zurita Antonio
Díaz del Río Fernando
García Calvo Agustín
Guisado Lízar José Luís
Jiménez-Morales Francisco de Paula
Publication venue: 'SAGE Publications'
Publication date: 01/01/2018
Field of study

Understanding the regulation of gene expression is one of the key problems in current biology. A promising method for that purpose is the determination of the temporal dynamics between known initial and ending network states, by using simple acting rules. The huge amount of rule combinations and the nonlinear inherent nature of the problem make genetic algorithms an excellent candidate for finding optimal solutions. As this is a computationally intensive problem that needs long runtimes in conventional architectures for realistic network sizes, it is fundamental to accelerate this task. In this article, we study how to develop efficient parallel implementations of this method for the fine-grained parallel architecture of graphics processing units (GPUs) using the compute unified device architecture (CUDA) platform. An exhaustive and methodical study of various parallel genetic algorithm schemes—master-slave, island, cellular, and hybrid models, and various individual selection methods (roulette, elitist)—is carried out for this problem. Several procedures that optimize the use of the GPU’s resources are presented. We conclude that the implementation that produces better results (both from the performance and the genetic algorithm fitness perspectives) is simulating a few thousands of individuals grouped in a few islands using elitist selection. This model comprises 2 mighty factors for discovering the best solutions: finding good individuals in a short number of generations, and introducing genetic diversity via a relatively frequent and numerous migration. As a result, we have even found the optimal solution for the analyzed gene regulatory network (GRN). In addition, a comparative study of the performance obtained by the different parallel implementations on GPU versus a sequential application on CPU is carried out. In our tests, a multifold speedup was obtained for our optimized parallel implementation of the method on medium class GPU over an equivalent sequential single-core implementation running on a recent Intel i7 CPU. This work can provide useful guidance to researchers in biology, medicine, or bioinformatics in how to take advantage of the parallelization on massively parallel devices and GPUs to apply novel metaheuristic algorithms powered by nature for real-world applications (like the method to solve the temporal dynamics of GRNs)

idUS. Depósito de Investigación Universidad de Sevilla

Distributed evolutionary algorithms and their models: A survey of the state-of-the-art

Author: Alba
Alba
Alba
Alba
Alba
Anglano
Apolloni
Bai
Bollini
Bouvry
Branke
Burczynski
Burczyński
Cahon
Cahon
Cantu-Paz
Cantu-Paz
Cantú-Paz
Chatzimilioudis
Chen
Creput
Danoy
Davis
de Toro Negro
Dean
Deb
Decraene
Desell
Dorronsoro
Du
Dubreuil
Durillo
Durillo
Durillo
Durillo
Durillo
Epitropakis
Escuela
Ewald
Fok
Folino
Folino
Gagné
Garcia-Arenas
García-Arenas
Giacobini
Giacobini
Giacobini
Giacobini
Giacobini
Giacobini
Goh
Goldberg
Gong
Gonzalez
Herrera
Herrera
Hidalgo
Hidalgo
Hosseini
Iimura
Ishimizu
Ismail
Jin
Jing-Jing Li
Johar
Jun Zhang
Kattan
Kattan
Kirley
Kirley
Kwok
Laredo
Li
Liang
Lienig
Lim
Lim
Liu
Llora
Lorion
Manfrin
McNabb
Melab
Melab
Mendiburu
Merelo
Merelo-Guervos
Merelo-Guervós
Merelo-Guervós
Michel
Mostaghim
Mussi
Nebro
Nebro
Nesmachnow
Nojima
Ordeshook
Pedemonte
Pendharkar
Pierreval
Piriyakumar
Potter
Qingfu Zhang
Ray
Robilliard
Roy
Ruiz-Andino
Said
Said
Sarma
Schutte
Schönfisch
Scriven
Sefrioui
Seredynski
Seredynski
Seredynski
Sherry
Soca
Starzynski
Stützle
Su
Subbu
Subbu
Suganthan
Tagawa
Tan
Tan
Tan
Tan
Tan
Tasoulis
Tomassini
Tomassini
Umbarkar
Van Veldhuizen
Verma
Veronese
Vlachogiannis
Weber
Weber
Wei-Neng Chen
Whitley
Wickramasinghe
Wu
Xiong
Xu
Yang
Yu
Yu
Yue-Jiao Gong
Yun Li
Zhang
Zhang
Zhang
Zhao
Zhao
Zhi-Hui Zhan
Zhou
Zhou
Zhu
Publication venue: 'Elsevier BV'
Publication date: 11/05/2015
Field of study

The increasing complexity of real-world optimization problems raises new challenges to evolutionary computation. Responding to these challenges, distributed evolutionary computation has received considerable attention over the past decade. This article provides a comprehensive survey of the state-of-the-art distributed evolutionary algorithms and models, which have been classified into two groups according to their task division mechanism. Population-distributed models are presented with master-slave, island, cellular, hierarchical, and pool architectures, which parallelize an evolution task at population, individual, or operation levels. Dimension-distributed models include coevolution and multi-agent models, which focus on dimension reduction. Insights into the models, such as synchronization, homogeneity, communication, topology, speedup, advantages and disadvantages are also presented and discussed. The study of these models helps guide future development of different and/or improved algorithms. Also highlighted are recent hotspots in this area, including the cloud and MapReduce-based implementations, GPU and CUDA-based implementations, distributed evolutionary multiobjective optimization, and real-world applications. Further, a number of future research directions have been discussed, with a conclusion that the development of distributed evolutionary computation will continue to flourish

University of Essex Research Repository

Crossref

Enlighten

Parallel Genetic Algorithms with GPU Computing

Author: Cheng John Runwei
Gen Mitsuo
Publication venue: 'IntechOpen'
Publication date: 05/02/2020
Field of study

Genetic algorithms (GAs) are powerful solutions to optimization problems arising from manufacturing and logistic fields. It helps to find better solutions for complex and difficult cases, which are hard to be solved by using strict optimization methods. Accelerating parallel GAs with GPU computing have received significant attention from both practitioners and researchers, ever since the emergence of GPU-CPU heterogeneous architectures. Designing a parallel algorithm on GPU is different fundamentally from designing one on CPU. On CPU architecture, typically data or tasks are distributed across tens of threads or processes, while on GPU architecture, more than hundreds of thousands of threads run. In order to fully utilize the computing power of GPUs, the design approaches and implementation strategies of parallel GAs should be re-probed. In the chapter, a concise overview of parallel GAs on GPU is given from the perspective of GPU architecture. The concept of parallelism granularity is redefined, the aspect of data layout is discussed on how it will affect the kernel performance, and the hierarchy of threads is examined on how threads are organized in the grid and blocks to expose sufficient parallelism to GPU. Some future research is discussed. A hybrid parallel model, based on the feature of GPU architecture, is suggested to build up efficient parallel GAs for hyper-scale problems

IntechOpen

Solving the Uncapacitated Single Allocation p-Hub Median Problem on GPU

Author: A Ilic
AT Ernst
AT Ernst
D Bryan
EG Talbi
H Damgacioglu
H Topcuoglu
I Contreras
J Kratica
J Sohn
JF Campbell
JF Campbell
JF Campbell
JF Chen
M Labbe
M Maric
MR Silva
MW Horner
R Abyazi-Sani
RS Camargo de
S Abdinnour-Helm
T Meyer
TV Luong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/04/2017
Field of study

A parallel genetic algorithm (GA) implemented on GPU clusters is proposed to solve the Uncapacitated Single Allocation p-Hub Median problem. The GA uses binary and integer encoding and genetic operators adapted to this problem. Our GA is improved by generated initial solution with hubs located at middle nodes. The obtained experimental results are compared with the best known solutions on all benchmarks on instances up to 1000 nodes. Furthermore, we solve our own randomly generated instances up to 6000 nodes. Our approach outperforms most well-known heuristics in terms of solution quality and time execution and it allows hitherto unsolved problems to be solved

arXiv.org e-Print Archive

Crossref

Parallel Multi-Objective Evolutionary Algorithms: A Comprehensive Survey

Author: Castillo Tapia M.G.
Coello C.A.
Falcón-Cardona J.G.
Hernández Gómez R.
Publication venue: 'Elsevier BV'
Publication date: 01/12/2021
Field of study

Multi-Objective Evolutionary Algorithms (MOEAs) are powerful search techniques that have been extensively used to solve difficult problems in a wide variety of disciplines. However, they can be very demanding in terms of computational resources. Parallel implementations of MOEAs (pMOEAs) provide considerable gains regarding performance and scalability and, therefore, their relevance in tackling computationally expensive applications. This paper presents a survey of pMOEAs, describing a refined taxonomy, an up-to-date review of methods and the key contributions to the field. Furthermore, some of the open questions that require further research are also briefly discussed

BCAM's Institutional Repository Data