Search CORE

104 research outputs found

Gene selection for cancer classification with the help of bees

Author: A Balmain
A Banharnsakun
A Bhattacharjee
A Brazma
A Choudhary
A Dussutour
A Farji-Brener
A Statnikov
A Statnikov
AG Karegowda
AI Su
AV Tinker
B Wu
BJ Norton
BK Verma
C Giallourakis
C Lazar
C Xu
CA Markowski
CC Chang
CJ Tu
CL Nutt
CM Bishop
D Chen
D Karaboga
D Karaboga
D Karaboga
D Karaboga
D Karaboga
D Karaboga
D Singh
D Teodorovic
DM Gordon
DM Gordon
DM Gordon
DV Nguyen
EL Lehmann
ER Dougherty
F Ahmade
F Emmert-Streib
F Kang
F Kang
F Roces
F Roces
F Wilcoxon
FJ Rodriguez
G George
G Li
G Stephanopoulos
G Xu
G Yan
G Zhu
GEP Box
H Drias
H Hu
H Liu
H Shah
H Sharma
H Torres-Contreras
H Yu
H Zhang
HF Wedde
I Eksin
I Guyon
I Guyon
I Inza
J Hamidi
J Ji
J Kennedy
J Khan
J Kiefer
J Wang
J Xu
J-Q Li
JC Bansal
JC Bansal
JC Chang
JD Gibbons
JE Staunton
JG Zhang
JH Cho
JJ Howard
JJ Liu
JL Deneubourg
Johra Muhammad Moosa
JW Lee
L Breiman
L Deng
L Lan
L Li
L Wang
LW Jacobs
LY Chuang
LY Chuang
LY Chuang
LY Chuang
M Bollazzi
M Dorigo
M Hollander
M Kefayat
M Mohamad
M Pirooznia
M Schena
MA Shipp
MA Tahir
MH Kashan
MJ Greene
Mohammad Kaykobad
Mohammad Sohel Rahman
MS Mohamad
MS Mohamad
MS Mohamad
N Todorovic
OK Erol
P Mukherjee
PA Devijver
PE Lønning
PW TSai
PY Kumbhar
Q Shen
Q Zhou
QK Pan
QK Pan
R Akbari
R Cai
R Debnath
R Díaz-Uriarte
R Hooke
R Kohavi
R Kohavi
R Mallika
R Murugan
R Ruiz
Rameen Shakur
RJ Schafer
RN Khushaba
S Bicciato
S Bitam
S Dudoit
S Guo
S Knudsen
S Kumar
S Kumar
S Li
S Omkar
S Pavlidis
S Ramaswamy
S Siegel
S Sundar
S Wang
S Yang
SA Armstrong
SL Pomeroy
SL Wang
SP Fodor
SS Jadon
SS Jeffrey
T Davidović
T Li
T Stützle
TK Sharma
TM Cover
TR Golub
TS Furey
V Saravanan
V Tereshko
V Tereshko
V Tereshko
VN Vapnik
W Li
W Li
W Szeto
W-F Gao
WH Au
WH Kruskal
WH Press
X Wang
X Yan
X Yu
X Zhou
Y Leung
Y Lu
Y Saeys
Y Tan
Y Wang
Y Wang
Y Xu
Y Zhang
Y Zhang
Z Liu
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

The importance of data classification using machine learning methods in microarray data

Author: Deris Safaai
Jaber Aws Naser
Machap Logenthiran
Moorthy Kohbalan
Publication venue: 'Universitas Ahmad Dahlan'
Publication date: 01/04/2021
Field of study

The detection of genetic mutations has attracted global attention. several methods have proposed to detect diseases such as cancers and tumours. One of them is microarrays, which is a type of representation for gene expression that is helpful in diagnosis. To unleash the full potential of microarrays, machine-learning algorithms and gene selection methods can be implemented to facilitate processing on microarrays and to overcome other potential challenges. One of these challenges involves high dimensional data that are redundant, irrelevant, and noisy. To alleviate this problem, this representation should be simplified. For example, the feature selection process can be implemented by reducing the number of features adopted in clustering and classification. A subset of genes can be selected from a pool of gene expression data recorded on DNA micro-arrays. This paper reviews existing classification techniques and gene selection methods. The effectiveness of emerging techniques, such as the swarm intelligence technique in feature selection and classification in microarrays, are reported as well. These emerging techniques can be used in detecting cancer. The swarm intelligence technique can be combined with other statistical methods for attaining better results

Journal of Education and Learning (EduLearn)

TELKOMNIKA (Telecommunication Computing Electronics and Control)

UAD Journal Management System

HYBRID FLOWER POLLINATION ALGORITHM AND SUPPORT VECTOR MACHINE FOR BREAST CANCER CLASSIFICATION

Author: Dankolo Muhammad Nasiru
Mustaffa Noorfa Haszlinna
Radzi Nor Haizan Mohamed
Salehuddin Roselina
Publication venue: 'Penerbit UTHM'
Publication date: 12/02/2018
Field of study

Microarray technology is a system that enable experts to examine gene profile at molecular level for early disease detection. Machine learning algorithms such as classification are used in detection of dieses from data generated by microarray. It increases the potentials of classification and diagnosis of many diseases such as cancer at gene expression level. Though, numerous difficulties may affect the performance of machine learning algorithms which includes vast number of genes features comprised in the original data. Many of these features may be unrelated to the intended analysis. Therefore, feature selection is necessary to be performed in the data preprocessing. Many feature selection algorithms are developed and applied on microarray which including the metaheuristic optimization algorithms. This paper proposed a new technique for feature selection and classification of breast cancer based on Flower Pollination algorithm (FPA) and Support Vector machine (SVM) using microarray data. The result for this research reveals that FPA-SVM is promising by outperforming the state of the earth Particle Swam Optimization algorithm with 80.11% accuracy. Â

Journals of Universiti Tun Hussein Onn Malaysia (UTHM)

Survey analysis for optimization algorithms applied to electroencephalogram

Author: Al-Shammary Dhiah
Hakem Ekram
Mahdi Ahmed M.
Publication venue: Institute of Advanced Engineering and Science
Publication date: 01/12/2023
Field of study

This paper presents a survey for optimization approaches that analyze and classify Electroencephalogram (EEG) signals. The automatic analysis of EEG presents a significant challenge due to the high-dimensional data volume. Optimization algorithms seek to achieve better accuracy by selecting practical features and reducing unwanted features. Forty-seven reputable research papers are provided in this work, emphasizing the developed and executed techniques divided into seven groups based on the applied optimization algorithm particle swarm optimization (PSO), ant colony optimization (ACO), artificial bee colony (ABC), grey wolf optimizer (GWO), Bat, Firefly, and other optimizer approaches). The main measures to analyze this paper are accuracy, precision, recall, and F1-score assessment. Several datasets have been utilized in the included papers like EEG Bonn University, CHB-MIT, electrocardiography (ECG) dataset, and other datasets. The results have proven that the PSO and GWO algorithms have achieved the highest accuracy rate of around 99% compared with other techniques

Institute of Advanced Engineering and Science

Mutable composite firefly algorithm for gene selection in microarray based cancer classification

Author: Fajila Mohamed Nisper Fathima
Publication venue
Publication date: 01/01/2022
Field of study

Cancer classification is critical due to the strenuous effort required in cancer treatment and the rising cancer mortality rate. Recent trends with high throughput technologies have led to discoveries in terms of biomarkers that successfully contributed to cancerrelated issues. A computational approach for gene selection based on microarray data analysis has been applied in many cancer classification problems. However, the existing hybrid approaches with metaheuristic optimization algorithms in feature selection (specifically in gene selection) are not generalized enough to efficiently classify most cancer microarray data while maintaining a small set of genes. This leads to the classification accuracy and genes subset size problem. Hence, this study proposed to modify the Firefly Algorithm (FA) along with the Correlation-based Feature Selection (CFS) filter for the gene selection task. An improved FA was proposed to overcome FA slow convergence by generating mutable size solutions for the firefly population. In addition, a composite position update strategy was designed for the mutable size solutions. The proposed strategy was to balance FA exploration and exploitation in order to address the local optima problem. The proposed hybrid algorithm known as CFS-Mutable Composite Firefly Algorithm (CFS-MCFA) was evaluated on cancer microarray data for biomarker selection along with the deployment of Support Vector Machine (SVM) as the classifier. Evaluation was performed based on two metrics: classification accuracy and size of feature set. The results showed that the CFS-MCFA-SVM algorithm outperforms benchmark methods in terms of classification accuracy and genes subset size. In particular, 100 percent accuracy was achieved on all four datasets and with only a few biomarkers (between one and four). This result indicates that the proposed algorithm is one of the competitive alternatives in feature selection, which later contributes to the analysis of microarray data

Universiti Utara Malaysia: UUM eTheses

Elephant Search with Deep Learning for Microarray Data Analysis

Author: Panda Mrutyunjaya
Publication venue
Publication date: 12/07/2017
Field of study

Even though there is a plethora of research in Microarray gene expression data analysis, still, it poses challenges for researchers to effectively and efficiently analyze the large yet complex expression of genes. The feature (gene) selection method is of paramount importance for understanding the differences in biological and non-biological variation between samples. In order to address this problem, a novel elephant search (ES) based optimization is proposed to select best gene expressions from the large volume of microarray data. Further, a promising machine learning method is envisioned to leverage such high dimensional and complex microarray dataset for extracting hidden patterns inside to make a meaningful prediction and most accurate classification. In particular, stochastic gradient descent based Deep learning (DL) with softmax activation function is then used on the reduced features (genes) for better classification of different samples according to their gene expression levels. The experiments are carried out on nine most popular Cancer microarray gene selection datasets, obtained from UCI machine learning repository. The empirical results obtained by the proposed elephant search based deep learning (ESDL) approach are compared with most recent published article for its suitability in future Bioinformatics research.Comment: 12 pages, 5 Tabl

arXiv.org e-Print Archive

Swarm Intelligence Based Feature Selection for High Dimensional Classification: A Literature Survey

Author: Hnin Myint Phyu
Saw Thinzar
Publication venue: 'International Journal of Computer Engineering and Applications'
Publication date: 16/05/2019
Field of study

Feature selection is an important and challenging task in machine learning and data mining techniques to avoid the curse of dimensionality and maximize the classification accuracy. Moreover, feature selection helps to reduce computational complexity of learning algorithm, improve prediction performance, better data understanding and reduce data storage space. Swarm intelligence based feature selection approach enables to find an optimal feature subset from an extremely large dimensionality of features for building the most accurate classifier model. There is still a type of researches that is not done yet in data mining. In this paper, the utilization of swarm intelligence algorithms for feature selection process in high dimensional data focusing on medical data classification is form the subject matter. The results shows that swarm intelligence algorithms reviewed based on state-of-the-art literature have a promising capability that can be applied in feature selections techniques. The significance of this work is to present the comparison and various alternatives of swarm algorithms to be applied in feature selections for high dimensional classification

International Journal of Computer (IJC - Global Society of Scientific Research and Researchers, GSSRR)

Hybrid Feature Selection Approach Based on GRASP for Cancer Microarray Data

Author: Arpita Nagpal
Deepti Gaur
Publication venue: 'Faculty of Electrical Engineering and Computing, Univ. of Zagreb'
Publication date: 01/01/2017
Field of study

Microarray data usually contain a large number of genes, but a small number of samples. Feature subset selection for microarray data aims at reducing the number of genes so that useful information can be extracted from the samples. Reducing the dimension of data sets further helps in improving the computational efficiency of the learning model. In this paper, we propose a modified algorithm based on the tabu search as local search procedures to a Greedy Randomized Adaptive Search Procedure (GRASP) for high dimensional microarray data sets. The proposed Tabu based Greedy Randomized Adaptive Search Procedure algorithm is named as TGRASP. In TGRASP, a new parameter has been introduced named as Tabu Tenure and the existing parameters, NumIter and size have been modified. We observed that different parameter settings affect the quality of the optimum. The second proposed algorithm known as FFGRASP (Firefly Greedy Randomized Adaptive Search Procedure) uses a firefly optimization algorithm in the local search optimzation phase of the greedy randomized adaptive search procedure (GRASP). Firefly algorithm is one of the powerful algorithms for optimization of multimodal applications. Experimental results show that the proposed TGRASP and FFGRASP algorithms are much better than existing algorithm with respect to three performance parameters viz. accuracy, run time, number of a selected subset of features. We have also compared both the approaches with a unified metric (Extended Adjusted Ratio of Ratios) which has shown that TGRASP approach outperforms existing approach for six out of nine cancer microarray datasets and FFGRASP performs better on seven out of nine datasets

Directory of Open Access Journals

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Hybrid feature selection of breast cancer gene expression microarray data based on metaheuristic methods: a comprehensive review

Author: Ab. Aziz Nor Azlina
Besar Rosli
Mohd Ali Nursabillilah
Publication venue: 'MDPI AG'
Publication date: 20/09/2022
Field of study

Breast cancer (BC) remains the most dominant cancer among women worldwide. Numerous BC gene expression microarray-based studies have been employed in cancer classification and prognosis. The availability of gene expression microarray data together with advanced classification methods has enabled accurate and precise classification. Nevertheless, the microarray datasets suffer from a large number of gene expression levels, limited sample size, and irrelevant features. Additionally, datasets are often asymmetrical, where the number of samples from different classes is not balanced. These limitations make it difficult to determine the actual features that contribute to the existence of cancer classification in the gene expression profiles. Various accurate feature selection methods exist, and they are being widely applied. The objective of feature selection is to search for a relevant, discriminant feature subset from the basic feature space. In this review, we aim to compile and review the latest hybrid feature selection methods based on bio-inspired metaheuristic methods and wrapper methods for the classification of BC and other types of cancer

Universiti Teknikal Malaysia Melaka (UTeM) Repository