20,732 research outputs found
Using Answer Set Programming for pattern mining
Serial pattern mining consists in extracting the frequent sequential patterns
from a unique sequence of itemsets. This paper explores the ability of a
declarative language, such as Answer Set Programming (ASP), to solve this issue
efficiently. We propose several ASP implementations of the frequent sequential
pattern mining task: a non-incremental and an incremental resolution. The
results show that the incremental resolution is more efficient than the
non-incremental one, but both ASP programs are less efficient than dedicated
algorithms. Nonetheless, this approach can be seen as a first step toward a
generic framework for sequential pattern mining with constraints.Comment: Intelligence Artificielle Fondamentale (2014
Mining Weighted Frequent Closed Episodes over Multiple Sequences
Frequent episode discovery is introduced to mine useful and interesting temporal patterns from sequential data. The existing episode mining methods mainly focused on mining from a single long sequence consisting of events with time constraints. However, there can be multiple sequences of different importance as the persons or entities associated with each sequence can be of different importance. Aiming to mine episodes in multiple sequences of different importance, we first define a new kind of episodes, i.e., the weighted frequent closed episodes, to take sequence importance, episode distribution and occurrence frequency into account together. Secondly, to facilitate the mining of such new episodes, we present a new concept called maximal duration serial episodes to cut a whole sequence into multiple maximum episodes using duration constraints, and discuss its properties for episode shrinking processing. Finally, based on the theoretical properties, we propose a two-phase approach to efficiently mine these new episodes. In Phase I, we adopt a level-wise episode shrinking framework to discover the candidate frequent closed episodes with the same prefixes, and in Phase II, we match the candidates with different prefixes to find the frequent close episodes. Experiments on simulated and real datasets demonstrate that the proposed episode mining strategy has good mining effectiveness and efficiency
On mining complex sequential data by means of FCA and pattern structures
Nowadays data sets are available in very complex and heterogeneous ways.
Mining of such data collections is essential to support many real-world
applications ranging from healthcare to marketing. In this work, we focus on
the analysis of "complex" sequential data by means of interesting sequential
patterns. We approach the problem using the elegant mathematical framework of
Formal Concept Analysis (FCA) and its extension based on "pattern structures".
Pattern structures are used for mining complex data (such as sequences or
graphs) and are based on a subsumption operation, which in our case is defined
with respect to the partial order on sequences. We show how pattern structures
along with projections (i.e., a data reduction of sequential structures), are
able to enumerate more meaningful patterns and increase the computing
efficiency of the approach. Finally, we show the applicability of the presented
method for discovering and analyzing interesting patient patterns from a French
healthcare data set on cancer. The quantitative and qualitative results (with
annotations and analysis from a physician) are reported in this use case which
is the main motivation for this work.
Keywords: data mining; formal concept analysis; pattern structures;
projections; sequences; sequential data.Comment: An accepted publication in International Journal of General Systems.
The paper is created in the wake of the conference on Concept Lattice and
their Applications (CLA'2013). 27 pages, 9 figures, 3 table
Incremental Mining of Frequent Serial Episodes Considering Multiple Occurrences
The need to analyze information from streams arises in a variety of
applications. One of its fundamental research directions is to mine sequential
patterns over data streams. Current studies mine series of items based on the
presence of the pattern in transactions but pay no attention to the series of
itemsets and their multiple occurrences. The pattern over a window of itemsets
stream and their multiple occurrences, however, provides additional capability
to recognize the essential characteristics of the patterns and the
inter-relationships among them that are unidentifiable by the existing
presence-based studies. In this paper, we study such a new sequential pattern
mining problem and propose a corresponding sequential miner with novel
strategies to prune the search space efficiently. Experiments on both real and
synthetic data show the utility of our approach
Financial crises and financial reforms in Spain : what have we learned?.
Like the rest of the world, Spain has suffered frequent financial crises and undergone several changes in its regulatory framework. There have been crises that have been followed by reforms of the financial structure, and also troubled financial times with no modification of the regulatory and supervisory regime. In various instances, regulatory changes have predated financial crises, but in others banking crises have occurred without reference to changes in the regulatory regime. Regulation and supervision has been usually absent in the XIXth century, while in the XXth century policy makers have been more active and diligent. Moreover, all major financial crises have been followed by intense financial restructuring, although as elsewhere banking restructuring and interventions not always have been successful (in fact, the cases of failures and mixed results overcome the successful cases). The paper provides a short history of the major financial crises in Spain from 1856 to the present, and also reviews the main financial reforms and the distinctive regulatory regimes that have been in place in this last 150 years time span.Spanish banking; Financial crisis; Financial regulations; Banking reforms;
- …