Search CORE

34 research outputs found

Improved feature selection using a hybrid side-blotched lizard algorithm and genetic algorithm approach

Author: Abdel-aal Amr
El-Henawy Ibrahim
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/10/2023
Field of study

Feature selection entails choosing the significant features among a wide collection of original features that are essential for predicting test data using a classifier. Feature selection is commonly used in various applications, such as bioinformatics, data mining, and the analysis of written texts, where the dataset contains tens or hundreds of thousands of features, making it difficult to analyze such a large feature set. Removing irrelevant features improves the predictor performance, making it more accurate and cost-effective. In this research, a novel hybrid technique is presented for feature selection that aims to enhance classification accuracy. A hybrid binary version of side-blotched lizard algorithm (SBLA) with genetic algorithm (GA), namely SBLAGA, which combines the strengths of both algorithms is proposed. We use a sigmoid function to adapt the continuous variables values into a binary one, and evaluate our proposed algorithm on twenty-three standard benchmark datasets. Average classification accuracy, average number of selected features and average fitness value were the evaluation criteria. According to the experimental results, SBLAGA demonstrated superior performance compared to SBLA and GA with regards to these criteria. We further compare SBLAGA with four wrapper feature selection methods that are widely used in the literature, and find it to be more efficient

Institute of Advanced Engineering and Science

Exploiting heterogeneous features to improve in silico prediction of peptide status – amyloidogenic or non-amyloidogenic

Author: A Caflisch
AE Eiben
I Levner
J Cui
J Han
J Tian
KE Marshall
KK Frousios
KS Hareesha
L Goldschmidt
M López de la Paz
MJ Thompson
NV Subba Reddy
O Conchillo-Solé
OV Galzitskaya
P Han
P Moscato
S Bandyopadhyay
S Kawashima
Smitha Sunil Kumaran Nair
SO Garbuzynskiy
SSK Nair
SSK Nair
VS Mathura
WL Huang
Y Peng
Y Saeys
Z Zhang
Z Zhu
ZR Li
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

CLUSTERING ARTIKEL BERITA BERBAHASA INDONESIA MENGGUNAKAN UNSUPERVISED FEATURE SELECTION

Author: Baizal ZK Abdurahman
Firdaus A.W. Yanuar
Langgeni Diah Pudi
Publication venue: Jurusan Teknik Informatika, Universitas Palangka Raya
Publication date: 30/07/2015
Field of study

Meningkatnya penggunaan internet telah memicu pertumbuhan dan pertukaran informasi menjadi jauh lebih pesat dibandingkan era sebelumnya. Volume berita elektronik berbahasa Indonesia semakin bertambah besar dan menyimpan informasi yang berharga di dalamnya. Pengelompokkan berita berbahasa Indonesia merupakan salah satu solusi yang dapat digunakan untuk mempermudah mencerna informasi penting yang ada di dalamnya. Clustering dapat digunakan untuk membantu menganalisis berita dengan mengelompokkan secara otomatis berita yang memiliki kesamaan. Pada text clustering terdapat suatu permasalahan yaitu adanya fitur – fitur yang berdimensi tinggi. Diperlukan metode Feature selection untuk mengurangi dimensi fitur ini. Feature selection memiliki kemampuan mengurangi dimensionalitas suatu data sehingga dapat meningkatkan performansi clustering. Ada beberapa pendekatan sebagai teknik dari implementasi feature selection, salah satunya adalah filter based feature selection. Pada penelitian ini, dilakukan analisis perbandingan metode feature selection antara Term contribution dan Document Frequency. Metode-metode feature selection tersebut diterapkan secara filter feature selection. Pada akhir pengujian, dapat dibuktikan bahwa metode Term contribution lebih baik daripada Document Frequency karena memperhitungkan frekuensi kemunculan term pada suatu dokumen dan jumlah dokumen yang dimiliki term tersebut, sehingga term yang terpilih adalah term yang khas atau bersifat diskriminator. Hal ini dapat meningkatkan performansi clustering dokumen berdasarkan precision dan entropy

Seminar Nasional Informatika (SEMNASIF)

CLUSTERING ARTIKEL BERITA BERBAHASA INDONESIA MENGGUNAKAN UNSUPERVISED FEATURE SELECTION

Author: Baizal ZK Abdurahman
Firdaus A.W. Yanuar
Langgeni Diah Pudi
Publication venue: Jurusan Teknik Informatika, Universitas Palangka Raya
Publication date: 30/07/2015
Field of study

UPN (Universitas Pembangunan Nasional) Veteran Yogyakarta: Portal Journals

Seminar Nasional Informatika (SEMNASIF)

A modified memetic algorithm with an application to gene selection in a sheep body weight study

Author: Cai Fengjing
Miao Maoxuan
Wang You-Gan
Wu Jinran
Publication venue: 'MDPI AG'
Publication date: 01/01/2022
Field of study

Selecting the minimal best subset out of a huge number of factors for influencing the response is a fundamental and very challenging NP-hard problem because the presence of many redundant genes results in over-fitting easily while missing an important gene can more detrimental impact on predictions, and computation is prohibitive for exhaust search. We propose a modified memetic algorithm (MA) based on an improved splicing method to overcome the problems in the traditional genetic algorithm exploitation capability and dimension reduction in the predictor variables. The new algorithm accelerates the search in identifying the minimal best subset of genes by incorporating it into the new local search operator and hence improving the splicing method. The improvement is also due to another two novel aspects: (a) updating subsets of genes iteratively until the no more reduction in the loss function by splicing and increasing the probability of selecting the true subsets of genes; and (b) introducing add and del operators based on backward sacrifice into the splicing method to limit the size of gene subsets. Additionally, according to the experimental results, our proposed optimizer can obtain a better minimal subset of genes with a few iterations, compared with all considered algorithms. Moreover, the mutation operator is replaced by it to enhance exploitation capability and initial individuals are improved by it to enhance efficiency of search. A dataset of the body weight of Hu sheep was used to evaluate the superiority of the modified MA against the genetic algorithm. According to our experimental results, our proposed optimizer can obtain a better minimal subset of genes with a few iterations, compared with all considered algorithms including the most advanced adaptive best-subset selection algorithm

Multidisciplinary Digital Publishing Institute

ACU Research Bank

Directory of Open Access Journals

PubMed Central

Queensland University of Technology ePrints Archive

CLUSTERING ARTIKEL BERITA BERBAHASA INDONESIA MENGGUNAKAN UNSUPERVISED FEATURE SELECTION

Author: Diah Pudi Langgeni
Yanuar Firdaus A.W.
ZK. Abdurahman Baizal
Publication venue
Publication date
Field of study

UPN (Universitas Pembangunan Nasional) Veteran Yogyakarta: Institutional Repository