3 research outputs found

INPUT SELECTION BY EPR-MOGA

Author: Aryal
Bannerman
Barraud
Berardi
Bowden
Bowden
D'Heygere
D. Savic
Dembélé
Dotto
E. Creaco
Galelli
Giustolisi
Giustolisi
Giustolisi
Gupta
Haykin
Koza
L. Berardi
Lacour
Laucelli
Ljung
Luke
Metadier
Mourad
O. Giustolisi
Savic
Savic
Siao Sun
Sun
Sun
Sun
Tirelli
Vaze
Wan Jaafar
Yang
Publication date: 01/01/2016

The growing availability of field data from information and communication technologies (ICTs) in "smart" urban infrastructures enables data-driven modeling that helps to understand complex phenomena and to support management decisions. Among the phenomena analyzed, those related to storm water quality modeling have recently been gaining interest in the scientific literature. Nonetheless, the large amount of available data poses the problem of selecting the variables that are relevant for describing a phenomenon and for enabling robust data modeling. This paper presents a procedure for selecting relevant input variables using the multi-objective evolutionary polynomial regression (EPR-MOGA) paradigm. The procedure is based on scrutinizing the explanatory variables that appear in the set of EPR-MOGA symbolic model expressions of increasing complexity and goodness of fit to the target output. The strategy also allows the selection to be validated by engineering judgement. In this context, the multiple case study extension of EPR-MOGA, called MCS-EPR-MOGA, is adopted. Applying the proposed procedure to the modeling of storm water quality parameters in two French catchments shows that it significantly reduces the number of explanatory variables for subsequent analyses. Finally, the EPR-MOGA models obtained after the input selection are compared with those obtained using the same technique without input selection, and with those obtained in previous works where other data-modeling techniques were applied to the same data. The comparison highlights the effectiveness of both EPR-MOGA and the input selection procedure.
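The abstract does not state the exact selection criterion, so the following is only a minimal sketch of the general idea of scrutinizing which explanatory variables appear across a set of symbolic model expressions. The variable names, the example expressions, and the 0.5 frequency threshold are all hypothetical; in the paper the selection is also checked against engineering judgement.

```python
import re
from collections import Counter


def input_frequency(expressions, candidates):
    """Return, for each candidate input, the fraction of expressions that use it."""
    counts = Counter()
    for expr in expressions:
        # Identifiers found in the expression (coefficients such as a1 are ignored
        # later because they are not in the candidate list).
        tokens = set(re.findall(r"[A-Za-z_]\w*", expr))
        for name in candidates:
            if name in tokens:
                counts[name] += 1
    n = len(expressions)
    return {name: counts[name] / n for name in candidates}


def select_inputs(expressions, candidates, threshold=0.5):
    """Keep candidates that appear in at least `threshold` of the expressions.

    The threshold is an assumption for illustration, not a value from the paper.
    """
    freq = input_frequency(expressions, candidates)
    return [name for name in candidates if freq[name] >= threshold]


# Hypothetical model set of increasing complexity (not taken from the paper).
models = [
    "a1 * rain_depth",
    "a1 * rain_depth + a2 * dry_days",
    "a1 * rain_depth * max_intensity + a2 * dry_days",
]
candidates = ["rain_depth", "dry_days", "max_intensity", "duration"]

selected = select_inputs(models, candidates)
```

A variable that never appears in any model, such as the hypothetical `duration` here, is a natural candidate for removal before further analysis.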

Crossref

Archivio Istituzionale della Ricerca - Università degli Studi di Pavia

Institutional Repository of Institute of Geographic Sciences and Natural Resources Research, CAS

Open Research Exeter

Open Access Repository

ACO-based feature selection algorithm for classification

Author: Al-mazini Hassan Fouad Abbas
Publication date: 01/01/2022

A dataset with a small number of records but a large number of attributes exhibits the “curse of dimensionality”. Classifying this type of dataset requires feature selection (FS) methods to extract useful information. The modified graph clustering ant colony optimisation (MGCACO) algorithm is an effective FS method based on grouping highly correlated features. However, the MGCACO algorithm has three main drawbacks in producing a feature subset, arising from its clustering method, its parameter sensitivity, and its final subset determination. An enhanced graph clustering ant colony optimisation (EGCACO) algorithm is proposed to solve these three problems. The proposed improvements include: (i) an ACO feature clustering method to obtain clusters of highly correlated features; (ii) an adaptive selection technique for subset construction from the clusters of features; and (iii) a genetic-based method for producing the final subset of features. The ACO feature clustering method uses mechanisms such as intensification and diversification for local and global optimisation to identify clusters of highly correlated features. The adaptive technique for ant selection enables the parameter to change adaptively based on feedback from the search space. The genetic method determines the final subset automatically, based on crossover and subset quality calculation. The performance of the proposed algorithm was evaluated on 18 benchmark datasets from the University of California, Irvine (UCI) repository and nine deoxyribonucleic acid (DNA) microarray datasets against 15 benchmark metaheuristic algorithms. On the UCI datasets, the EGCACO algorithm is superior to the other benchmark optimisation algorithms in terms of the number of selected features for 16 of the 18 datasets (88.89%) and achieves the best classification accuracy in eight of them (44.44%). Further, experiments on the nine DNA microarray datasets showed that the EGCACO algorithm is superior to the benchmark algorithms in terms of classification accuracy (first rank) for seven datasets (77.78%) and yields the lowest number of selected features in six datasets (66.67%). The proposed EGCACO algorithm can be used for FS in DNA microarray classification tasks that involve large datasets in various application domains.
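To illustrate what "grouping highly correlated features" can mean, here is a minimal sketch that links features whose absolute Pearson correlation exceeds a threshold and returns the resulting connected groups. This is a plain threshold-based grouping, not the ACO clustering of EGCACO or MGCACO; the feature names, the data, and the 0.9 threshold are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def correlated_groups(columns, threshold=0.9):
    """Group features whose |correlation| >= threshold (connected components).

    `columns` maps a feature name to its list of values. The threshold is an
    illustrative assumption, not a value from the thesis.
    """
    names = list(columns)
    parent = {name: name for name in names}

    def find(name):
        # Union-find root lookup with path halving.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if abs(pearson(columns[a], columns[b])) >= threshold:
                parent[find(a)] = find(b)

    groups = {}
    for name in names:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical data: f2 is roughly 2 * f1, f3 is only weakly related to either.
data = {
    "f1": [1.0, 2.0, 3.0, 4.0],
    "f2": [2.0, 4.1, 5.9, 8.0],
    "f3": [4.0, 1.0, 3.0, 2.0],
}

groups = correlated_groups(data)
```

Once features are grouped this way, a subset can be built by drawing representatives from each group, which is the role the abstract assigns to the adaptive selection and genetic steps.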

Universiti Utara Malaysia: UUM eTheses

Feature selection using probabilistic prediction of support vector regression

Author: Ong C.-J.
Yang J.-B.
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2010

DOI: 10.1109/TNN.2011.2128342. IEEE Transactions on Neural Networks (ITNN), 22(6), 954-962.

Crossref

ScholarBank@NUS