2 research outputs found
Characterization and extraction of condensed representation of correlated patterns based on formal concept analysis
Correlated pattern mining has increasingly become an important task in data
mining since these patterns allow conveying knowledge about meaningful and
surprising relations among data. Frequent correlated patterns were thoroughly
studied in the literature. In this thesis, we propose to benefit from both
frequent correlated as well as rare correlated patterns according to the bond
correlation measure. We propose to extract a subset without information loss of
the sets of frequent correlated and of rare correlated patterns, this subset is
called ``Condensed Representation``. In this regard, we are based on the
notions derived from the Formal Concept Analysis FCA, specifically the
equivalence classes associated to a closure operator fbond dedicated to the
bond measure, to introduce new concise representations of both frequent
correlated and rare correlated patterns
Motifs corr\'el\'es rares : Caract\'erisation et nouvelles repr\'esentations concises
Recently, rare pattern mining proves to be of added-value in different data
mining applications since these patterns allow conveying knowledge on rare and
unexpected events. However, the extraction of rare patterns suffers from two
main limits, namely the large number of mined patterns in real-life
applications, as well as the low informativeness quality of several rare
patterns. In this situation, we propose to use the correlation measure, bond,
in the mining process in order to only retain those rare patterns having a
certain degree of correlation between their respective items. A
characterization of the resulting set, of rare correlated patterns, is then
carried out based on the study of constraints of distinct types induced by the
rarity and the correlation. In addition, based on the equivalence classes
associated to a closure operator dedicated to the bond measure, we propose
concise representations of rare correlated patterns. We then design a new
algorithm CRP_Miner dedicated to the extraction of the whole set of rare
correlated patterns. We also introduce the CRPR_Miner algorithm allowing an
efficient extraction of the proposed concise representations. In addition, we
design two other algorithms which allow to us the query and the regeneration of
the whole set of rare correlated patterns. The carried out experimental studies
show the effectiveness of the algorithm CRPR_Miner and prove the compactness
rate offered by the proposed concise representations.Comment: in French. Master's thesis 201