Search CORE

24 research outputs found

Identify error-sensitive patterns by decision tree

Author: E Alpaydin
IA Gheyas
IH Witten
J Han
JR Quinlan
L Breiman
L Breiman
L Breiman
LI Kuncheva
M Hall
P Yang
RE Schapire
S Tabakhi
W Wu
Y Saeys
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

© Springer International Publishing Switzerland 2015. When errors are inevitable during data classification, finding a particular part of the classification model which may be more susceptible to error than others, when compared to finding an Achilles’ heel of the model in a casual way, may help uncover specific error-sensitive value patterns and lead to additional error reduction measures. As an initial phase of the investigation, this study narrows the scope of problem by focusing on decision trees as a pilot model, develops a simple and effective tagging method to digitize individual nodes of a binary decision tree for node-level analysis, to link and track classification statistics for each node in a transparent way, to facilitate the identification and examination of the potentially “weakest” nodes and error-sensitive value patterns in decision trees, to assist cause analysis and enhancement development. This digitization method is not an attempt to re-develop or transform the existing decision tree model, but rather, a pragmatic node ID formulation that crafts numeric values to reflect the tree structure and decision making paths, to expand post-classification analysis to detailed node-level. Initial experiments have shown successful results in locating potentially high-risk attribute and value patterns; this is an encouraging sign to believe this study worth further exploration

Crossref

OPUS - University of Technology Sydney

Feature Selection using Tabu Search with Learning Memory: Learning Tabu Search

Author: B Xue
CJ Tan
D Corne
D Schindl
I Guyon
IA Gheyas
J Yang
LS Oliveira
M Dorigo
N Long
R Kohavi
TM Hamdani
Z Zhu
Publication venue: HAL CCSD
Publication date: 29/05/2016
Field of study

International audienceFeature selection in classification can be modeled as a com-binatorial optimization problem. One of the main particularities of this problem is the large amount of time that may be needed to evaluate the quality of a subset of features. In this paper, we propose to solve this problem with a tabu search algorithm integrating a learning mechanism. To do so, we adapt to the feature selection problem, a learning tabu search algorithm originally designed for a railway network problem in which the evaluation of a solution is time-consuming. Experiments are conducted and show the benefit of using a learning mechanism to solve hard instances of the literature

Crossref

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Engine Misfire Detection with Pervasive Mobile Audio

Author: A Sujono
AD Carvalho Jr. de
E Galloni
H Liu
IA Gheyas
J Merkisz
S Vulli
SN Dandare
SS Merola
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2016
Field of study

We address the problem of detecting whether an engine is misfiring by using machine learning techniques on transformed audio data collected from a smartphone. We recorded audio samples in an uncontrolled environment and extracted Fourier, Wavelet and Mel-frequency Cepstrum features from normal and abnormal engines. We then implemented Fisher Score and Relief Score based variable ranking to obtain an informative reduced feature set for training and testing classification algorithms. Using this feature set, we were able to obtain a model accuracy of over 99 % using a linear SVM applied to outsample data. This application of machine learning to vehicle subsystem monitoring simplifies traditional engine diagnostics, aiding vehicle owners in the maintenance process and opening up new avenues for pervasive mobile sensing and automotive diagnostics. Keywords: Pervasive sensing, Mobile phones, Sound classification, Audio processing, Fault detection, Machine learnin

DSpace@MIT

Crossref

A filter-dominating hybrid sequential forward floating search method for feature subset selection in high-dimensional space

Author: A Jain
DL Tong
H Liu
H Peng
IA Gheyas
J Desmar
J Handl
J Huang
JC Bezdek
JL Davies
M Sebban
O Boehm
O Uncu
P Bermejo
P Pudil
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2014
Field of study

Sequential forward floating search (SFFS) has been well recognized as one of the best feature selection methods. This paper proposes a filter-dominating hybrid SFFS method, aiming at high efficiency and insignificant accuracy sacrifice for high-dimensional feature subset selection. Experiments with this new hybrid approach have been conducted on five feature data sets, with different combinations of classifier and separability index as alternative criteria for evaluating the performance of potential feature subsets. The classifiers under consideration include linear discriminate analysis classifier, support vector machine, and K-nearest neighbors classifier, and the separability indexes include the Davies-Bouldin index and a mutual information based index. Experimental results have demonstrated the advantages and usefulness of the proposed method in high-dimensional feature subset selection. © 2012 Springer-Verlag Berlin Heidelberg

University of Essex Research Repository

Crossref

Imputation of Missing Data in Electronic Health Records Based on Patients’ Similarities

Author: A Zeileis
AEW Johnson
BJ Wells
G Hripcsak
IA Gheyas
J Lee
JM Jerez
K Strike
MJ Azur
N Menachemi
PL Peissig
R Rahman
S Ajami
S Moritz
S van Buuren
Y Luo
Z Che
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A Multi-objective hybrid filter-wrapper evolutionary approach for feature selection

Author: A Mukhopadhyay
B Xue
B Xue
B Xue
Chih-Cheng Hung
IA Gheyas
J Derrac
J Huang
K Deb
Lamjed Ben Said
Marwa Hammami
P Bermejo
R Kohavi
Slim Bechikh
T Abeel
T Cover
Z Zhu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

An optimized artificial neural network model for the prediction of rate of hazardous chemical and healthcare waste generation at the national level

Author: A Karpušenkaitė
A Mustapha
A Sözen
A Šiljić
AJ Granados
AL Shannon
Aleksandra A. Perić-Grujić
B Zakaria
CD Court
CJ Willmott
D Antanasijević
D Antanasijević
D Eleyan
D Komilis
D Mmereki
D Tomandl
Davor Z. Antanasijević
DF Millie
DF Specht
DF Specht
E Dogan
E Elimelech
E Insa
G Jacobs
H Liu
H Weisz
H-T Pao
I Rimaityte
IA Al-Khatib
IA Gheyas
IA Gheyas
J Gusca
K Harttgen
LC Hamilton
M Sartaj
M Schuhmacher
M Tripathy
ME Birpinar
Mirjana Đ. Ristić
MR Sabour
O Kisi
O Renaud
OECD
P Beigl
Q Zhou
R Noori
R Pahlavan
RA Rudel
S Jahandideh
S Palani
S Walczak
SS Sawant
T Chai
V Adamović
Viktor V. Pocajt
Vladimir M. Adamović
Y-Y Kang
ZM Yaseen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

This paper presents a development of general regression neural network (a form of artificial neural network) models for the prediction of annual quantities of hazardous chemical and healthcare waste at the national level. Hazardous waste is being generated from many different sources and therefore it is not possible to conduct accurate predictions of the total amount of hazardous waste using traditional methodologies. Since they represent about 40% of the total hazardous waste in the European Union, chemical and healthcare waste were specifically selected for this research. Broadly available social, economic, industrial and sustainability indicators were used as input variables and the optimal sets were selected using correlation analysis and sensitivity analysis. The obtained values of coefficients of determination for the final models were 0.999 for the prediction of chemical hazardous waste and 0.975 for the prediction of healthcare and biological hazardous waste. The predicting capabilities of the models for both types of waste are high, since there were no predictions with errors greater than 25%. Also, results of this research demonstrate that the human development index can replace gross domestic product and in this context even represent a better indicator of socio-economic conditions at the national level

Crossref

TechnoRep

Diagnostic classification of solitary pulmonary nodules using dual time 18F-FDG PET/CT image texture features in granuloma-endemic regions

Author: A Matthies
BJ Hillman
DW Kim
ER Delong
F Han
F Orlhac
FH Velden van
FJ Cloran
G Cheng
GC Cawley
H Yu
IA Gheyas
K Alkhawaldeh
K Miwa
K Suga
M Amadasun
M Soussan
MK Gould
PE Galavis
R Maani
R Xu
S Chen
SA Deppen
SJ Swensen
T Sun
W Chen
XP Zhang
YT Sim
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Feature construction as a bi-level optimization problem

Author: A Statnikov
AE Eiben
Ali Louati
B Colson
B Pes
B Tran
B Tran
B Xue
B Xue
C Shannon
CA Gallo
D Peralta
D Sahin
G Brock
IA Gheyas
J Derrac
J He
J Vergara
K Neshatian
L Kaufman
Lamjed Ben Said
M Cerrada
M Ghosh
M Hammami
M Muharram
Marwa Hammami
Mohamed Makhlouf
P Bermejo
Slim Bechikh
U Kamath
ZX Zhu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Neural network method for automatic data generation in adaptive information systems

Author: A Obukhov
AD Obukhov
C Deb
C Wang
D Laredo
D Shu
E Christodoulou
E Moen
ES Silva
F Kang
F Ye
G Aquino
G Cybenko
G Fan
G Hernández
H Alqahtani
H Li
H Liu
HS Chiang
IA Gheyas
J de Jesús Rubio
JA Meda-Campaña
M Boopathi
M Mei
O Popova
R Budjač
R Hecht-Nielsen
S Goswami
S Nagarajaiah
S Pouyanfar
SJ Choudhury
T Tian
Y Xie
YS Sysoev
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref