Search CORE

17,156 research outputs found

A Constraint Programming Approach for Mining Sequential Patterns in a Sequence Database

Author: Charnois Thierry
Loudni Samir
Métivier Jean-Philippe
Publication venue
Publication date: 23/09/2013
Field of study

Constraint-based pattern discovery is at the core of numerous data mining tasks. Patterns are extracted with respect to a given set of constraints (frequency, closedness, size, etc). In the context of sequential pattern mining, a large number of devoted techniques have been developed for solving particular classes of constraints. The aim of this paper is to investigate the use of Constraint Programming (CP) to model and mine sequential patterns in a sequence database. Our CP approach offers a natural way to simultaneously combine in a same framework a large set of constraints coming from various origins. Experiments show the feasibility and the interest of our approach

arXiv.org e-Print Archive

HAL - Normandie Université

HAL-Paris 13

Web Usage Mining with Evolutionary Extraction of Temporal Fuzzy Association Rules

Author: Ahmadi Samad
Gongora Mario A.
Hopgood Adrian A.
Matthews Stephen G.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013
Field of study

In Web usage mining, fuzzy association rules that have a temporal property can provide useful knowledge about when associations occur. However, there is a problem with traditional temporal fuzzy association rule mining algorithms. Some rules occur at the intersection of fuzzy sets' boundaries where there is less support (lower membership), so the rules are lost. A genetic algorithm (GA)-based solution is described that uses the flexible nature of the 2-tuple linguistic representation to discover rules that occur at the intersection of fuzzy set boundaries. The GA-based approach is enhanced from previous work by including a graph representation and an improved fitness function. A comparison of the GA-based approach with a traditional approach on real-world Web log data discovered rules that were lost with the traditional approach. The GA-based approach is recommended as complementary to existing algorithms, because it discovers extra rules. (C) 2013 Elsevier B.V. All rights reserved

Open Repository and Bibliography - Liège

De Montfort University Open Research Archive

Explore Bristol Research

XML Schema Clustering with Semantic and Hierarchical Similarity Measures

Author: Iryadi Wina
Nayak Richi
Publication venue: 'Elsevier BV'
Publication date: 01/01/2007
Field of study

With the growing popularity of XML as the data representation language, collections of the XML data are exploded in numbers. The methods are required to manage and discover the useful information from them for the improved document handling. We present a schema clustering process by organising the heterogeneous XML schemas into various groups. The methodology considers not only the linguistic and the context of the elements but also the hierarchical structural similarity. We support our findings with experiments and analysis

Crossref

Queensland University of Technology ePrints Archive

Prefix-Projection Global Constraint for Sequential Pattern Mining

Author: B Negrevergne
G Pesant
G Yang
MJ Zaki
MN Garofalakis
N Beldiceanu
P Fournier-Viger
T Guns
Publication venue
Publication date: 23/06/2015
Field of study

Sequential pattern mining under constraints is a challenging data mining task. Many efficient ad hoc methods have been developed for mining sequential patterns, but they are all suffering from a lack of genericity. Recent works have investigated Constraint Programming (CP) methods, but they are not still effective because of their encoding. In this paper, we propose a global constraint based on the projected databases principle which remedies to this drawback. Experiments show that our approach clearly outperforms CP approaches and competes well with ad hoc methods on large datasets

arXiv.org e-Print Archive

HAL - Normandie Université

Crossref

Evolving temporal fuzzy association rules from quantitative data with a multi-objective evolutionary algorithm

Author: C. Carmona
C.A.C. Coello
E. Corchado
K. Deb
M. Kaya
M. Kaya
S.G. Matthews
T.-P. Hong
Y. Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

A novel method for mining association rules that are both quantitative and temporal using a multi-objective evolutionary algorithm is presented. This method successfully identifies numerous temporal association rules that occur more frequently in areas of a dataset with specific quantitative values represented with fuzzy sets. The novelty of this research lies in exploring the composition of quantitative and temporal fuzzy association rules and the approach of using a hybridisation of a multi-objective evolutionary algorithm with fuzzy sets. Results show the ability of a multi-objective evolutionary algorithm (NSGA-II) to evolve multiple target itemsets that have been augmented into synthetic datasets

CiteSeerX

Crossref

Sheffield Hallam University Research Archive

De Montfort University Open Research Archive

Open Repository and Bibliography - Liège

Explore Bristol Research