Search CORE

11 research outputs found

Towards a semantic and statistical selection of association rules

Author: Bouker Slim
Nguifo Engelbert Mephu
Saidi Rabie
Yahia Sadok Ben
Publication venue
Publication date: 01/01/2013
Field of study

The increasing growth of databases raises an urgent need for more accurate methods to better understand the stored data. In this scope, association rules were extensively used for the analysis and the comprehension of huge amounts of data. However, the number of generated rules is too large to be efficiently analyzed and explored in any further process. Association rules selection is a classical topic to address this issue, yet, new innovated approaches are required in order to provide help to decision makers. Hence, many interesting- ness measures have been defined to statistically evaluate and filter the association rules. However, these measures present two major problems. On the one hand, they do not allow eliminating irrelevant rules, on the other hand, their abun- dance leads to the heterogeneity of the evaluation results which leads to confusion in decision making. In this paper, we propose a two-winged approach to select statistically in- teresting and semantically incomparable rules. Our statis- tical selection helps discovering interesting association rules without favoring or excluding any measure. The semantic comparability helps to decide if the considered association rules are semantically related i.e comparable. The outcomes of our experiments on real datasets show promising results in terms of reduction in the number of rules

arXiv.org e-Print Archive

HAL Clermont Université

Hal-Diderot

Prediction of Metabolic Pathways Involvement in Prokaryotic UniProtKB Data by Association Rule Mining

Author: Boudellioua Imane
Hoehndorf Robert
Martin Maria
Saidi Rabie
Solovyev Victor
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 19/09/2016
Field of study

The widening gap between known proteins and their functions has encouraged the development of methods to automatically infer annotations. Automatic functional annotation of proteins is expected to meet the conflicting requirements of maximizing annotation coverage, while minimizing erroneous functional assignments. This trade-off imposes a great challenge in designing intelligent systems to tackle the problem of automatic protein annotation. In this work, we present a system that utilizes rule mining techniques to predict metabolic pathways in prokaryotes. The resulting knowledge represents predictive models that assign pathway involvement to UniProtKB entries. We carried out an evaluation study of our system performance using cross-validation technique. We found that it achieved very promising results in pathway identification with an F1-measure of 0.982 and an AUC of 0.987. Our prediction models were then successfully applied to 6.2 million UniProtKB/TrEMBL reference proteome entries of prokaryotes. As a result, 663,724 entries were covered, where 436,510 of them lacked any previous pathway annotations

arXiv.org e-Print Archive

Directory of Open Access Journals

ARM-AMO: An Efficient Association Rule Mining Algorithm Based on Animal Migration Optimization

Author: Le Hoang Son
Chiclana Francisco
Kumar Raghavendra
Mittal Mamta
Khari Manju
Chatterjee Jyotir Moy
Baik Sung Wook
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI linkAssociation rule mining (ARM) aims to find out association rules that satisfy predefined minimum support and confidence from a given database. However, in many cases ARM generates extremely large number of association rules, which are impossible for end users to comprehend or validate, thereby limiting the usefulness of data mining results. In this paper, we propose a new mining algorithm based on Animal Migration Optimization (AMO), called ARM-AMO, to reduce the number of association rules. It is based on the idea that rules which are not of high support and unnecessary are deleted from the data. Firstly, Apriori algorithm is applied to generate frequent itemsets and association rules. Then, AMO is used to reduce the number of association rules with a new fitness function that incorporates frequent rules. It is observed from the experiments that, in comparison with the other relevant techniques, ARM-AMO greatly reduces the computational time for frequent item set generation, memory for association rule generation, and the number of rules generated

Repositorio Institucional Universidad César Vallejo: Página de inicio

Registro Nacional de Trabajos de Investigación y Proyectos

De Montfort University Open Research Archive

ARM-AMO: An Efficient Association Rule Mining Algorithm Based on Animal Migration Optimization

Author: Baik Sung Wook
Chatterjee Jyotir Moy
Chiclana Francisco
Khari Manju
Kumar Raghavendra
Le Hoang Son
Mittal Mamta
Publication venue: 'Elsevier BV'
Publication date: 29/04/2018
Field of study

De Montfort University Open Research Archive

STAIRS 2014:proceedings of the 7th European Starting AI Researcher Symposium

Author
Publication venue: 'IOS Press'
Publication date: 01/01/2014
Field of study

International Migration, Integration and Social Cohesion online publications

Eighth International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI at ECAI 2020)

Author: Kuznetsov Sergei O.
Napoli Amedeo
Rudolph Sebastian
Publication venue: HAL CCSD
Publication date: 06/11/2020
Field of study

International audienceProceedings of the 8th International Workshop "What can FCA do for Artificial Intelligence?" (FCA4AI 2020)co-located with 24th European Conference on Artificial Intelligence (ECAI 2020), Santiago de Compostela, Spain, August 29, 202

INRIA a CCSD electronic archive server

Mining Undominated Association Rules Through Interestingness Measures

Author: Bouker Slim
Mephu Nguifo Engelbert
Saidi Rabie
Yahia Sadok
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/01/2014
Field of study

The increasing growth of databases raises an urgent need for more accurate methods to better understand the stored data. In this scope, association rules were extensively used for the analysis and the comprehension of huge amounts of data. However, the number of generated rules is too large to be efficiently analyzed and explored in any further process. In order to bypass this hamper, an efficient selection of rules has to be performed. Since selection is necessarily based on evaluation, many interestingness measures have been proposed. However, the abundance of these measures gave rise to a new problem, namely the heterogeneity of the evaluation results and this created confusion to the decision. In this respect, we propose a novel approach to discover interesting association rules without favoring or excluding any measure by adopting the notion of dominance between association rules. Our approach bypasses the problem of measure heterogeneity and unveils a compromise between their evaluations. Interestingly enough, the proposed approach also avoids another non-trivial problem which is the threshold value specification. Extensive carried out experiments on benchmark datasets show the benefits of the introduced approach. </jats:p

Crossref

HAL Clermont Université

Hal-Diderot