Predicting model training time to optimize distributed machine learning applications
Despite major advances in recent years, the field of Machine Learning continues to face research and technical challenges. These stem mostly from big data and streaming data, which require models to be frequently updated or re-trained at the expense of significant computational resources. One solution is the use of distributed learning algorithms, which can learn in a distributed manner from distributed datasets. In this paper, we describe CEDEs, a distributed learning system in which models are heterogeneous distributed Ensembles, i.e., complex models composed of different base models trained with different, distributed subsets of data. Specifically, we address the issue of predicting the training time of a given model, given its characteristics and those of the data. Since the creation of an Ensemble may imply the training of hundreds of base models, information about the predicted duration of each of these individual tasks is paramount for efficient management of the cluster's computational resources and for minimizing makespan, i.e., the time it takes to train the whole Ensemble. Results show that the proposed approach is able to predict the training time of Decision Trees with an average error of 0.103 s, and the training time of Neural Networks with an average error of 21.263 s. We also show how results depend significantly on the hyperparameters of the model and on the characteristics of the input data. This work has been supported by national funds through FCT – Fundação para a Ciência e Tecnologia through projects UIDB/04728/2020, EXPL/CCI-COM/0706/2021, and CPCA-IAC/AV/475278/2022.
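The core idea, estimating a task's training time from model hyperparameters and dataset characteristics, can be sketched as a simple regression. The features (row count, feature count, tree depth), the linear model, and the synthetic, exactly linear timing samples below are illustrative assumptions, not the paper's actual meta-model:

```python
def fit_linear(X, y):
    """Ordinary least squares via the normal equations (pure Python)."""
    n, d = len(X), len(X[0])
    xtx = [[sum(X[k][i] * X[k][j] for k in range(n)) for j in range(d)] for i in range(d)]
    xty = [sum(X[k][i] * y[k] for k in range(n)) for i in range(d)]
    # Gaussian elimination with partial pivoting
    for col in range(d):
        pivot = max(range(col, d), key=lambda r: abs(xtx[r][col]))
        xtx[col], xtx[pivot] = xtx[pivot], xtx[col]
        xty[col], xty[pivot] = xty[pivot], xty[col]
        for r in range(col + 1, d):
            f = xtx[r][col] / xtx[col][col]
            for c in range(col, d):
                xtx[r][c] -= f * xtx[col][c]
            xty[r] -= f * xty[col]
    w = [0.0] * d
    for r in range(d - 1, -1, -1):
        w[r] = (xty[r] - sum(xtx[r][c] * w[c] for c in range(r + 1, d))) / xtx[r][r]
    return w

def predict(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

# Hypothetical features: [bias, rows/1e4, n_features, max_depth] -> seconds.
# Timings here are synthetic and exactly linear, purely for illustration.
X = [[1, 1.0, 10, 5], [1, 2.0, 10, 5], [1, 1.0, 20, 5],
     [1, 1.0, 10, 10], [1, 4.0, 40, 10], [1, 3.0, 30, 8]]
y = [0.15, 0.20, 0.19, 0.20, 0.47, 0.36]

w = fit_linear(X, y)
est = predict(w, [1, 2.0, 20, 8])  # estimate for an unseen configuration
print(round(est, 3))  # → 0.27 on this synthetic data
```

In practice the meta-model would be trained on measured wall-clock times and would likely need a nonlinear learner; the linear fit merely illustrates the mapping from task features to predicted duration.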
Metaheuristic design of feedforward neural networks: a review of two decades of research
Over the past two decades, feedforward neural network (FNN) optimization has been a key interest among researchers and practitioners of multiple disciplines. FNN optimization is often viewed from various perspectives: the optimization of weights, network architecture, activation nodes, learning parameters, learning environment, etc. Researchers adopted such different viewpoints mainly to improve the FNN's generalization ability. Gradient-descent algorithms such as backpropagation have been widely applied to optimize FNNs, and their success is evident from the FNN's application to numerous real-world problems. However, due to the limitations of gradient-based optimization methods, metaheuristic algorithms, including evolutionary algorithms, swarm intelligence, etc., are still being widely explored by researchers aiming to obtain a generalized FNN for a given problem. This article attempts to summarize a broad spectrum of FNN optimization methodologies, including conventional and metaheuristic approaches. It also tries to connect the various research directions that have emerged from FNN optimization practices, such as evolving neural networks (NN), cooperative coevolution NN, complex-valued NN, deep learning, extreme learning machines, quantum NN, etc. Additionally, it provides interesting research challenges for future research to cope with the present information-processing era.
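As a minimal illustration of the metaheuristic viewpoint, the sketch below trains a tiny FNN without gradients using a (1+1) evolution strategy on XOR. The network size, mutation step, and iteration budget are arbitrary demonstration choices, not drawn from any surveyed method:

```python
import math, random

random.seed(0)

def forward(w, x):
    # 2-2-1 feedforward net with tanh hidden units; w holds all 9 weights/biases
    h1 = math.tanh(w[0]*x[0] + w[1]*x[1] + w[2])
    h2 = math.tanh(w[3]*x[0] + w[4]*x[1] + w[5])
    return w[6]*h1 + w[7]*h2 + w[8]

DATA = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]  # XOR

def loss(w):
    return sum((forward(w, x) - t) ** 2 for x, t in DATA)

# (1+1)-ES: mutate every weight with Gaussian noise, keep the child only if
# it is no worse (elitist selection), so the loss never increases
w = [random.uniform(-1, 1) for _ in range(9)]
initial = loss(w)
for _ in range(3000):
    child = [wi + random.gauss(0, 0.2) for wi in w]
    if loss(child) <= loss(w):
        w = child
final = loss(w)
print(final <= initial)  # elitism guarantees no regression → True
```

The same loop works for any fitness function, which is precisely the appeal of metaheuristics when gradients are unavailable or unreliable.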
CM-CASL: Comparison-based Performance Modeling of Software Systems via Collaborative Active and Semisupervised Learning
Configuration tuning for large software systems is generally challenging due to the complex configuration space and expensive performance evaluation. Most existing approaches follow a two-phase process, first learning a regression-based performance prediction model on available samples and then searching for the configurations with satisfactory performance using the learned model. Such regression-based models often suffer from the scarcity of samples due to the enormous time and resources required to run a large software system with a specific configuration. Moreover, previous studies have shown that even a highly accurate regression-based model may fail to discern the relative merit between two configurations, whereas performance comparison is actually one fundamental strategy for configuration tuning. To address these issues, this paper proposes CM-CASL, a Comparison-based performance Modeling approach for software systems via Collaborative Active and Semisupervised Learning. CM-CASL learns a classification model that compares the performance of two given configurations, and enhances the samples through a collaborative labeling process by both human experts and classifiers, using an integration of active and semisupervised learning. Experimental results demonstrate that CM-CASL outperforms two state-of-the-art performance modeling approaches in terms of both classification accuracy and rank accuracy, and thus provides a better performance model for the subsequent work of configuration tuning.
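The comparison-based idea (classify which of two configurations performs better, rather than regress absolute performance) can be illustrated with a toy pairwise model. The sketch below is not CM-CASL: it uses a perceptron over configuration feature differences and a synthetic ground-truth performance function, both assumptions for demonstration:

```python
import random

random.seed(1)

def true_perf(c):
    # Hidden ground-truth performance (hypothetical); lower is better
    return 3.0*c[0] + 1.5*c[1] + 0.5*c[2]

def label(a, b):
    return 1 if true_perf(a) < true_perf(b) else -1  # +1 means a outperforms b

def rand_cfg():
    return [random.random() for _ in range(3)]

pairs = [(rand_cfg(), rand_cfg()) for _ in range(400)]

# Perceptron over the feature-difference vector a - b: a comparison is just
# a binary classification of the difference between two configurations
w = [0.0, 0.0, 0.0]
for _ in range(20):
    for a, b in pairs:
        d = [ai - bi for ai, bi in zip(a, b)]
        y = label(a, b)
        if y * sum(wi*di for wi, di in zip(w, d)) <= 0:  # misclassified
            w = [wi + y*di for wi, di in zip(w, d)]

# Rank accuracy on unseen configuration pairs
test_pairs = [(rand_cfg(), rand_cfg()) for _ in range(200)]
correct = sum(1 for a, b in test_pairs
              if (sum(wi*(ai - bi) for wi, ai, bi in zip(w, a, b)) > 0)
              == (label(a, b) == 1))
print(round(correct / 200, 2))
```

Note that the model never predicts an absolute performance value; it only orders configurations, which is exactly what a tuning search needs.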
Optimizing Data-Intensive Computing with Efficient Configuration Tuning
As the complexity of distributed analytics systems evolves over time, more configuration parameters are exposed for tuning. While these numerous parameters allow users more control over how their workloads are executed, this flexibility comes at a cost, since finding the right configurations for such systems in a cost-effective way becomes challenging. In practice, several factors contribute to the complexity of tuning the configuration of these systems: the large configuration space, the diversity of the served workloads (each workload possibly requiring a different resource allocation strategy to run optimally), and the dynamic characteristics of these systems' environment (e.g., increases in input data size, changes in the allocation of resources). Paradoxically, existing solutions for workload tuning either assume a static tuning environment or workloads that are inexpensive to run (i.e., requiring hundreds of execution samples). Recently, Bayesian Optimisation (BO) strategies have been applied as a solution to enable efficient autotuning. They build a probabilistic model incrementally to predict the impact of the parameters on performance using a small number of execution samples. The incrementally constructed BO model is used to guide the tuning process and accelerate convergence to a near-optimal configuration. Unfortunately, for distributed analytics systems, the configuration space is too large to construct a good model using traditional BO, which fails to provide quick convergence in high-dimensional configuration spaces.
I argue that cost-effective tuning strategies can only be developed when taking into account the frequent changes that can happen in the analytics workload/environment, the amortization of tuning costs and how this influences tuning profitability, the high dimensionality of the configuration space, and the need to cater for diverse workloads. To tackle these challenges, I propose Tuneful, an efficient configuration tuning framework for such expensive-to-tune systems. It works efficiently both initially (when little data is available) and later (as more tuning knowledge is acquired). It starts by learning workload-specific influential parameters incrementally and tunes those only; then, when more tuning knowledge becomes available, it detects similarity across workloads and uses multitask BO to share tuning knowledge across similar workloads. I show how augmenting the BO approach with parameter significance and workload similarity characteristics enables efficient configuration tuning in high-dimensional configuration spaces. Over diverse analytics workloads, this significantly accelerates both configuration tuning and cost amortization, saving search time by 2.7-3.7X at median compared to state-of-the-art approaches.
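The dimensionality-reduction step (identify the influential parameters first, then tune only those) can be sketched as follows. This is a simplified illustration, not Tuneful's algorithm: the synthetic cost function stands in for an expensive workload execution, and plain random search replaces multitask BO:

```python
import random

random.seed(2)
DIM = 10

def run_workload(cfg):
    # Synthetic "execution cost": only parameters 0 and 3 really matter
    return (cfg[0] - 0.7)**2 + 2.0*(cfg[3] - 0.2)**2 + 0.001*sum(cfg)

base = [0.5] * DIM

# Phase 1: one-at-a-time screening to rank each parameter's influence
influence = []
for i in range(DIM):
    vals = []
    for v in (0.0, 0.25, 0.5, 0.75, 1.0):
        cfg = list(base)
        cfg[i] = v
        vals.append(run_workload(cfg))
    influence.append(max(vals) - min(vals))  # spread = sensitivity proxy

top = sorted(range(DIM), key=lambda i: influence[i], reverse=True)[:2]

# Phase 2: search only over the influential parameters, leaving the rest fixed
best_cost = run_workload(base)
for _ in range(200):
    cfg = list(base)
    for i in top:
        cfg[i] = random.random()
    best_cost = min(best_cost, run_workload(cfg))
print(sorted(top))  # the two truly influential parameters: [0, 3]
```

Shrinking a 10-dimensional search to 2 dimensions is what makes the sample budget of a model-based tuner affordable; the screening phase itself costs only a handful of executions per parameter.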
Bio-inspired computation for big data fusion, storage, processing, learning and visualization: state of the art and future directions
This overview focuses on research achievements that have recently emerged from the confluence between Big Data technologies and bio-inspired computation. A manifold of reasons can be identified for the profitable synergy between these two paradigms, all rooted in the adaptability, intelligence and robustness that biologically inspired principles can provide to technologies aimed at managing, retrieving, fusing and processing Big Data efficiently. We delve into this research field by first analyzing in depth the existing literature, with a focus on advances reported in the last few years. This prior literature analysis is complemented by an identification of the new trends and open challenges in Big Data that remain unsolved to date, and that can be effectively addressed by bio-inspired algorithms. As a second contribution, this work elaborates on how bio-inspired algorithms need to be adapted for their use in a Big Data context, in which data fusion becomes crucial as a previous step to allow processing and mining several, potentially heterogeneous data sources. This analysis allows exploring and comparing the scope and efficiency of existing approaches across different problems and domains, with the purpose of identifying new potential applications and research niches. Finally, this survey highlights open issues that remain unsolved to date in this research avenue, alongside a prescription of recommendations for future research. This work has received funding support from the Basque Government (Eusko Jaurlaritza) through the Consolidated Research Group MATHMODE (IT1294-19), EMAITEK and ELK ARTEK programs. D. Camacho also acknowledges support from the Spanish Ministry of Science and Education under PID2020-117263GB-100 grant (FightDIS), the Comunidad Autonoma de Madrid under S2018/TCS-4566 grant (CYNAMON), and the CHIST ERA 2017 BDSI PACMEL Project (PCI2019-103623, Spain).
Bio-inspired computation: where we stand and what's next
In recent years, the research community has witnessed an explosion of literature dealing with the adaptation of behavioral patterns and social phenomena observed in nature towards efficiently solving complex computational tasks. This trend has been especially dramatic in what relates to optimization problems, mainly due to the unprecedented complexity of problem instances, arising from a diverse spectrum of domains such as transportation, logistics, energy, climate, social networks, health and industry 4.0, among many others. Notwithstanding this upsurge of activity, research in this vibrant topic should be steered towards certain areas that, despite their eventual value and impact on the field of bio-inspired computation, still remain insufficiently explored to date. The main purpose of this paper is to outline the state of the art and to identify open challenges concerning the most relevant areas within bio-inspired optimization. An analysis and discussion are also carried out over the general trajectory followed in recent years by the community working in this field, thereby highlighting the need for reaching a consensus and joining forces towards achieving valuable insights into the understanding of this family of optimization techniques.
Deep learning-based predictive models for massive time series data
Doctoral Programme in Biotechnology, Engineering and Chemical Technology. Research Line: Engineering, Data Science and Bioinformatics. Programme Code: DBI. Line Code: 111. Advances in hardware have revolutionized the field of artificial intelligence, opening new fronts and areas that until recently were limited. The field of deep learning is perhaps one of the most affected by this advance, since these models require great computational capacity due to the number and complexity of their operations, which is why they had fallen into disuse until recent years.
This doctoral thesis has been presented as a compendium of publications, with a total of ten scientific contributions in international conferences and in journals with a high impact factor in the Journal of Citation Reports (JCR). It gathers research oriented to the study, analysis and development of the deep learning architectures most widespread in the literature for time series forecasting, mainly in the energy domain, such as electricity demand and solar power generation. In addition, much of the research focused on the optimization of these models, an essential task for obtaining a reliable predictive model.
In a first phase, the thesis focuses on the development of deep learning-based predictive models for time series forecasting applied to two real data sources.
First, a methodology was designed for multi-step forecasting with a feed-forward model, whose results were published at the International Work-Conference on the Interplay Between Natural and Artificial Computation (IWINAC). The same methodology was then applied and compared with other classical models, implemented in a distributed manner, with results published at the 14th International Work-Conference on Artificial Neural Networks (IWANN). Given the difference in computation time and scalability between the deep learning method and the other models compared, a distributed version was designed, whose results were published in two Q1-indexed journals, Integrated Computer-Aided Engineering and Information Sciences. All of these contributions were tested on a Spanish electricity demand dataset. In parallel, and in order to verify the generality of the methodology, the same approach was applied to a dataset of solar power generation in Australia, in two versions: univariate, with results published at the International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO), and multivariate, published in the Q2-indexed journal Expert Systems.
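The general windowing technique behind multi-step forecasting (an assumption about the approach, not the thesis's exact methodology) can be sketched as follows: the series is cut into pairs mapping a history window to an h-step horizon, and a model predicts all h future values at once. A toy mean-predictor stands in for the feed-forward network:

```python
def make_windows(series, w, h):
    """Build supervised pairs: w past values -> next h values."""
    X, Y = [], []
    for i in range(len(series) - w - h + 1):
        X.append(series[i:i+w])       # history window
        Y.append(series[i+w:i+w+h])   # multi-step target
    return X, Y

def predict_mean(x, h):
    # Toy stand-in model: forecast every horizon step as the window mean
    m = sum(x) / len(x)
    return [m] * h

series = [float(i % 4) for i in range(20)]  # repeating pattern 0,1,2,3,...
X, Y = make_windows(series, w=4, h=2)
print(len(X), X[0], Y[0])  # → 15 [0.0, 1.0, 2.0, 3.0] [0.0, 1.0]
```

Emitting the whole horizon in one shot avoids the error accumulation of feeding one-step predictions back into the model, which is one common motivation for this formulation.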
Despite the good results obtained, the model optimization strategy was not suitable for big data environments due to its exhaustive nature and computational cost. Motivated by this, the second phase of the doctoral thesis focused on the optimization of deep learning models.
A random search strategy applied to the methodology proposed in the first phase was designed, with results published at IWANN. Attention then turned to heuristic-based optimization, where a genetic algorithm was developed to optimize the feed-forward model. The results of this research were published in the Q2-indexed journal Applied Sciences. In addition, influenced by the 2020 pandemic, a heuristic based on the COVID-19 propagation model was designed and implemented. This optimization strategy was integrated with a Long Short-Term Memory network, yielding highly competitive results that were published in Big Data, a JCR Q1-indexed journal.
To conclude the thesis work, all the information and knowledge acquired were compiled in a survey article, published in the Q1-indexed journal Big Data. Universidad Pablo de Olavide de Sevilla. Departamento de Deporte e Informática.
Data distribution and task scheduling for distributed computing of all-to-all comparison problems
This research studied distributed computing of all-to-all comparison problems with big data sets. The thesis formalised the problem and developed a high-performance, scalable computing framework with a programming model, data distribution strategies, and task scheduling policies to solve it. The study considered storage usage, data locality, and load balancing for performance improvement. The research outcomes can be applied in bioinformatics, biometrics, data mining, and other domains in which all-to-all comparisons are a typical computing pattern.
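The computing pattern itself is easy to sketch: n items generate n*(n-1)/2 pairwise comparison tasks that must be spread across workers. The round-robin assignment below balances task counts only; the thesis's actual strategies additionally account for storage usage and data locality:

```python
from itertools import combinations

def distribute_pairs(items, n_workers):
    """Round-robin assignment of all pairwise comparison tasks to workers."""
    assignment = {w: [] for w in range(n_workers)}
    for k, pair in enumerate(combinations(items, 2)):
        assignment[k % n_workers].append(pair)
    return assignment

items = list(range(9))            # 9 items -> 9*8/2 = 36 comparison tasks
plan = distribute_pairs(items, 4)
loads = [len(v) for v in plan.values()]
print(sum(loads), loads)          # → 36 [9, 9, 9, 9]
```

Round-robin balances task counts but ignores which worker already holds which item's data; a locality-aware policy would instead try to co-locate the pairs that share an input, which is where the data distribution strategy matters.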