Search CORE

6 research outputs found

Evolutionary Computation, Optimization and Learning Algorithms for Data Science

Author: A Agrawal
A Imteaj
C Blum
D Karaboga
D Wunsch
D Zhang
E Hancer
F Harfouchi
F Zabihi
F Zhuang
FG Mohammadi
FG Mohammadi
FG Mohammadi
H Shi
H Wang
H Yoshida
I Rahman
ILS Russo
J Kennedy
J Kennedy
J Pierezan
J Yang
JR Koza
K Ahmed
K Chen
K Socha
LJ Fogel
MA Abido
MH Amini
MH Amini
MH Amini
MH Amini
MH Amini
MH Amini
MH Amini
MJLF Cruyff
MM Kabir
N Altman
NP Patel
P Marrow
P Moscato
R Balamurugan
R Vanaja
S Jiang
S Mirjalili
SF Razavi
SL Gupta
SN Karpagam
T Bäck
U Khurana
V Rostami
W Shi
Wai Keen Vong
X Meng
X-B Meng
X-L Li
X-Y Liu
Y Cao
Y Chen
Y Xue
Y Zhang
Z-F Hao
Publication venue: FIU Digital Commons
Publication date: 01/08/2019
Field of study

A large number of engineering, science and computational problems have yet to be solved in a computationally efficient way. One of the emerging challenges is how evolving technologies grow towards autonomy and intelligent decision making. This leads to collection of large amounts of data from various sensing and measurement technologies, e.g., cameras, smart phones, health sensors, smart electricity meters, and environment sensors. Hence, it is imperative to develop efficient algorithms for generation, analysis, classification, and illustration of data. Meanwhile, data is structured purposefully through different representations, such as large-scale networks and graphs. We focus on data science as a crucial area, specifically focusing on a curse of dimensionality (CoD) which is due to the large amount of generated/sensed/collected data. This motivates researchers to think about optimization and to apply nature-inspired algorithms, such as evolutionary algorithms (EAs) to solve optimization problems. Although these algorithms look un-deterministic, they are robust enough to reach an optimal solution. Researchers do not adopt evolutionary algorithms unless they face a problem which is suffering from placement in local optimal solution, rather than global optimal solution. In this chapter, we first develop a clear and formal definition of the CoD problem, next we focus on feature extraction techniques and categories, then we provide a general overview of meta-heuristic algorithms, its terminology, and desirable properties of evolutionary algorithms

arXiv.org e-Print Archive

Crossref

DigitalCommons@Florida International University

Malware Detection using Artificial Bee Colony Algorithm

Author: Amini M. Hadi
Arabnia Hamid R.
Mohammadi Farid Ghareh
Shenavarmasouleh Farzan
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/12/2020
Field of study

Malware detection has become a challenging task due to the increase in the number of malware families. Universal malware detection algorithms that can detect all the malware families are needed to make the whole process feasible. However, the more universal an algorithm is, the higher number of feature dimensions it needs to work with, and that inevitably causes the emerging problem of Curse of Dimensionality (CoD). Besides, it is also difficult to make this solution work due to the real-time behavior of malware analysis. In this paper, we address this problem and aim to propose a feature selection based malware detection algorithm using an evolutionary algorithm that is referred to as Artificial Bee Colony (ABC). The proposed algorithm enables researchers to decrease the feature dimension and as a result, boost the process of malware detection. The experimental results reveal that the proposed method outperforms the state-of-the-art

arXiv.org e-Print Archive

Crossref

Improvement on KNN using genetic algorithm and combined feature extraction to identify COVID-19 sufferers based on CT scan image

Author: Nugraha Arie Sapta
Nugroho Radityo Adi
Rahayu Fenny Winda
Rasyid Aylwin Al
Publication venue: 'Universitas Ahmad Dahlan'
Publication date: 01/10/2021
Field of study

Coronavirus disease 2019 (COVID-19) has spread throughout the world. The detection of this disease is usually carried out using the reverse transcriptase polymerase chain reaction (RT-PCR) swab test. However, limited resources became an obstacle to carrying out the massive test. To solve this problem, computerized tomography (CT) scan images are used as one of the solutions to detect the sufferer. This technique has been used by researchers but mostly using classifiers that required high resources, such as convolutional neural network (CNN). In this study, we proposed a way to classify the CT scan images by using the more efficient classifier, k-nearest neighbors (KNN), for images that are processed using a combination of these feature extraction methods, Haralick, histogram, and local binary pattern. Genetic algorithm is also used for feature selection. The results showed that the proposed method was able to improve KNN performance, with the best accuracy of 93.30% for the combination of Haralick and local binary pattern feature extraction, and the best area under the curve (AUC) for the combination of Haralick, histogram, and local binary pattern with a value of 0.948. The best accuracy of our models also outperforms CNN by a 4.3% margin

Journal of Education and Learning (EduLearn)

TELKOMNIKA (Telecommunication Computing Electronics and Control)

UAD Journal Management System

Exploring the Time-efficient Evolutionary-based Feature Selection Algorithms for Speech Data under Stressful Work Condition

Author: Adi Derry Pramono
Frismanda
Gumelar Agustinus Bimo
Junaedi Lukman
Kristanto Andreas Agung
Publication venue: 'EMITTER International Journal of Engineering Technology'
Publication date: 26/02/2021
Field of study

Initially, the goal of Machine Learning (ML) advancements is faster computation time and lower computation resources, while the curse of dimensionality burdens both computation time and resource. This paper describes the benefits of the Feature Selection Algorithms (FSA) for speech data under workload stress. FSA contributes to reducing both data dimension and computation time and simultaneously retains the speech information. We chose to use the robust Evolutionary Algorithm, Harmony Search, Principal Component Analysis, Genetic Algorithm, Particle Swarm Optimization, Ant Colony Optimization, and Bee Colony Optimization, which are then to be evaluated using the hierarchical machine learning models. These FSAs are explored with the conversational workload stress data of a Customer Service hotline, which has daily complaints that trigger stress in speaking. Furthermore, we employed precisely 223 acoustic-based features. Using Random Forest, our evaluation result showed computation time had improved 3.6 faster than the original 223 features employed. Evaluation using Support Vector Machine beat the record with 0.001 seconds of computation time

EMITTER - International Journal of Engineering Technology

Integrating supercomputing clusters into education: a case study in biotechnology

Author: Conde Miguel Á.
Fernández Camino
Fernández Álvaro
Miguel-Dávila José-Ángel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/06/2022
Field of study

The integration of a Supercomputer in the educational process improves student’s technological skills. The aim of the paper is to study the interaction between sci-ence, technology, engineering, and mathematics (STEM) and non-STEM subjects for developing a course of study related to Supercomputing training. We propose a flowchart of the process to improve the performance of students attending courses related to Supercomputing. As a final result, this study highlights the analysis of the information obtained by the use of HPC infrastructures in courses implemented in higher education through a questionnaire that provides useful information about their attitudes, beliefs and evaluations. The results help us to understand how the collaboration between institutions enhances outcomes in the education context. The conclusion provides a description of the resources needed for the improvement of Supercomputing Education (SE), proposing future research directions. 2018-1-ES01-KA201-05093SIComisión EuropeaMinisterio de Ciencia e InnovaciónMinisterio de Economía y CompetitividadFundación Centro de Supercomputación de Castilla y Leó

Leon University (Spain)

Análisis y evaluación del uso de la supercomputación en la mejora del desempeño formativo = Analysis and evaluation of supercomputing for training performance improvement

Author: Fernández González Álvaro
Publication venue: 'University of Leon'
Publication date: 08/10/2020
Field of study

205 p.Los recursos de supercomputación son en la actualidad el pilar fundamental para el desarrollo de la investigación en diversos campos. Su impacto se basa en la capacidad de cálculo, que permite realizar simulaciones computacionales que permiten mejorar la precisión de los experimentos. La presente Tesis Doctoral pretende, en primer lugar, realizar un estudio de la evolución de la supercomputación y su aplicación a diversos campos para, posteriormente, estudiar los factores determinantes que permitan analizar los aspectos más relevantes a la hora de estudiar la relación existente entre los estudios de supercomputación con los aspectos pedagógicos, de conocimiento y de contenido, basándose en el modelo TPACK. El estudio se realizó con información procedente de la base de datos de estudiantes del Centro de Supercomputación de Castilla y León (SCAYLE), de la que se obtuvieron 97 participantes. En el estudio se realizó un análisis factorial para comprobar que la estructura de datos obtenida era coherente con el modelo TPACK usado como referencia. Los resultados obtenidos del análisis relacionan las dimensiones tecnológicas con las de conocimiento, pedagógicas y de contenido

Crossref

Leon University (Spain)