6 research outputs found

Efficient Discovery of Expressive Multi-label Rules using Relaxed Pruning

Author: EL Mencía
F Charte
FA Thabtah
G Tsoumakas
J. Arunadevi
JL Ávila-Jiménez
K Dembczyński
M Allamanis
M Rapp
Publication date: 19/08/2019

Modeling correlations between labels is considered crucial in multi-label classification. Rule-based models can expose such dependencies, e.g., implications, subsumptions, or exclusions, in an interpretable and human-comprehensible manner. Although the number of possible label combinations increases exponentially with the number of available labels, it has been shown that rules with multiple labels in their heads, which are a natural form for modeling local label dependencies, can be induced efficiently by exploiting certain properties of rule evaluation measures and pruning the label search space accordingly. However, experiments have revealed that multi-label heads are unlikely to be learned by existing methods due to their restrictiveness. To overcome this limitation, we propose a plug-in approach that relaxes the search space pruning used by existing methods in order to introduce a bias towards larger multi-label heads, resulting in more expressive rules. We further demonstrate the effectiveness of our approach empirically and show that it does not come with drawbacks in terms of training time or predictive performance. Comment: Preprint version. To appear in Proceedings of the 22nd International Conference on Discovery Science, 201

arXiv.org e-Print Archive

TUbiblio

Crossref

Effective elimination of redundant association rules

Author: A Ceglar
CC Aggarwal
FA Thabtah
GK Palshikar
J-F Boulicaut
James Cheng
MJ Zaki
N Kumar
N Pasquier
Q Yang
Wilfred Ng
Yiping Ke
Publication venue: Springer Science and Business Media LLC

Crossref

Fast rule-based bioactivity prediction using associative classification mining

Author: A Nayyar
A Schwaighofer
B Liu
C Becquet
C Borgelt
C Creighton
C Marinica
CY Liew
David J Wild
F Nigsch
F Tao
F Thabtah
FA Thabtah
Fatih
H Wang
I Bouzouita
I Takigawa
J Dougherty
J Han
J Hastings
J Kazius
J van den Boogaard
JA Mohr
JH Xiaoxin Yin
JL Durant
K-S Leung
L Dehaspe
L Han
M Deshpande
M Tamura
M Vogt
MJ Zaki
P Gramatica
P Prathipati
Pulan Yu
Q Li
R Agrawal
R Martinez
RL Bartzatt
S Park
S Sommer
S Soni
T Horváth
W Li
W Tong
XH Ma
Publication venue: BMC
Publication date: 01/01/2012

Relating chemical features to bioactivities is critical in molecular design and is used extensively in the lead discovery and optimization process. A variety of techniques from statistics, data mining and machine learning have been applied to this process. In this study, we utilize a collection of methods, called associative classification mining (ACM), which are popular in the data mining community, but so far have not been applied widely in cheminformatics. More specifically, classification based on predictive association rules (CPAR), classification based on multiple association rules (CMAR) and classification based on association rules (CBA) are employed on three datasets using various descriptor sets. Experimental evaluations on anti-tuberculosis (antiTB), mutagenicity and hERG (the human Ether-a-go-go-Related Gene) blocker datasets show that these three methods are computationally scalable and appropriate for high speed mining. Additionally, they provide comparable accuracy and efficiency to the commonly used Bayesian and support vector machines (SVM) methods, and produce highly interpretable models.

Crossref

Springer - Publisher Connector

IUScholarWorks (University of Indiana)

Directory of Open Access Journals

Evaluating associative classification algorithms for Big Data

Author: A Bechini
A Ben-David
A Gumbus
A Segatori
C Cortes
C Lam
D DeWitt
F Padillo
FA Thabtah
G Valdes
I Triguero
J Dean
J Han
KC Tan
L Oneto
L Venturini
M Zaharia
N Siddique
P Clark
R Agrawal
R Quinlan
RC Holte
SG Kim
X Wu
Publication venue: Springer Science and Business Media LLC

Crossref

Top 10 algorithms in data mining

Crossref

A Multi-Label Learning Based Kernel Automatic Recommendation Method for Support Vector Machine

Crossref