Search CORE

6 research outputs found

Aco-based feature selection algorithm for classification

Author: Al-mazini Hassan Fouad Abbas
Publication venue
Publication date: 01/01/2022
Field of study

Dataset with a small number of records but big number of attributes represents a phenomenon called “curse of dimensionality”. The classification of this type of dataset requires Feature Selection (FS) methods for the extraction of useful information. The modified graph clustering ant colony optimisation (MGCACO) algorithm is an effective FS method that was developed based on grouping the highly correlated features. However, the MGCACO algorithm has three main drawbacks in producing a features subset because of its clustering method, parameter sensitivity, and the final subset determination. An enhanced graph clustering ant colony optimisation (EGCACO) algorithm is proposed to solve the three (3) MGCACO algorithm problems. The proposed improvement includes: (i) an ACO feature clustering method to obtain clusters of highly correlated features; (ii) an adaptive selection technique for subset construction from the clusters of features; and (iii) a genetic-based method for producing the final subset of features. The ACO feature clustering method utilises the ability of various mechanisms such as intensification and diversification for local and global optimisation to provide highly correlated features. The adaptive technique for ant selection enables the parameter to adaptively change based on the feedback of the search space. The genetic method determines the final subset, automatically, based on the crossover and subset quality calculation. The performance of the proposed algorithm was evaluated on 18 benchmark datasets from the University California Irvine (UCI) repository and nine (9) deoxyribonucleic acid (DNA) microarray datasets against 15 benchmark metaheuristic algorithms. The experimental results of the EGCACO algorithm on the UCI dataset are superior to other benchmark optimisation algorithms in terms of the number of selected features for 16 out of the 18 UCI datasets (88.89%) and the best in eight (8) (44.47%) of the datasets for classification accuracy. Further, experiments on the nine (9) DNA microarray datasets showed that the EGCACO algorithm is superior than the benchmark algorithms in terms of classification accuracy (first rank) for seven (7) datasets (77.78%) and demonstrates the lowest number of selected features in six (6) datasets (66.67%). The proposed EGCACO algorithm can be utilised for FS in DNA microarray classification tasks that involve large dataset size in various application domains

Universiti Utara Malaysia: UUM eTheses

Systematic study of the Arabidopsis Circadian Clock

Author: Saithong Treenut
Publication venue: The University of Edinburgh
Publication date: 01/01/2009
Field of study

Edinburgh Research Archive

Machine Learning Methods with Noisy, Incomplete or Small Datasets

Author
Publication venue: 'MDPI AG'
Publication date: 11/01/2022
Field of study

In many machine learning applications, available datasets are sometimes incomplete, noisy or affected by artifacts. In supervised scenarios, it could happen that label information has low quality, which might include unbalanced training sets, noisy labels and other problems. Moreover, in practice, it is very common that available data samples are not enough to derive useful supervised or unsupervised classifiers. All these issues are commonly referred to as the low-quality data problem. This book collects novel contributions on machine learning methods for low-quality datasets, to contribute to the dissemination of new ideas to solve this challenging problem, and to provide clear examples of application in real scenarios

Directory of Open Access Books (DOAB)

Ανάπτυξη Ευφυούς Συστήματος Υποστήριξης Κλινικής Απόφασης για την Έγκαιρη Διάγνωση και την Εξατομικευμένη Διαχείριση Γυναικών που βρίσκονται σε κίνδυνο ανάπτυξης Καρκίνου του Μαστού

Author: Tarousi Marilena
Ταρούση Μαριλένα
Publication venue
Publication date: 23/11/2022
Field of study

DSpace at NTUA

Exploration and Reduction of the Feature Space by Hierarchical Clustering

Author: Ienco Dino
Meo Rosa
Publication venue: 'The Japan Society for Industrial and Applied Mathematics'
Publication date: 01/01/2008
Field of study

Institutional Research Information System University of Turin