2 research outputs found

    Characterization and extraction of condensed representation of correlated patterns based on formal concept analysis

    Full text link
    Correlated pattern mining has increasingly become an important task in data mining since these patterns allow conveying knowledge about meaningful and surprising relations among data. Frequent correlated patterns were thoroughly studied in the literature. In this thesis, we propose to benefit from both frequent correlated as well as rare correlated patterns according to the bond correlation measure. We propose to extract a subset without information loss of the sets of frequent correlated and of rare correlated patterns, this subset is called ``Condensed Representation``. In this regard, we are based on the notions derived from the Formal Concept Analysis FCA, specifically the equivalence classes associated to a closure operator fbond dedicated to the bond measure, to introduce new concise representations of both frequent correlated and rare correlated patterns

    Motifs corr\'el\'es rares : Caract\'erisation et nouvelles repr\'esentations concises

    Full text link
    Recently, rare pattern mining proves to be of added-value in different data mining applications since these patterns allow conveying knowledge on rare and unexpected events. However, the extraction of rare patterns suffers from two main limits, namely the large number of mined patterns in real-life applications, as well as the low informativeness quality of several rare patterns. In this situation, we propose to use the correlation measure, bond, in the mining process in order to only retain those rare patterns having a certain degree of correlation between their respective items. A characterization of the resulting set, of rare correlated patterns, is then carried out based on the study of constraints of distinct types induced by the rarity and the correlation. In addition, based on the equivalence classes associated to a closure operator dedicated to the bond measure, we propose concise representations of rare correlated patterns. We then design a new algorithm CRP_Miner dedicated to the extraction of the whole set of rare correlated patterns. We also introduce the CRPR_Miner algorithm allowing an efficient extraction of the proposed concise representations. In addition, we design two other algorithms which allow to us the query and the regeneration of the whole set of rare correlated patterns. The carried out experimental studies show the effectiveness of the algorithm CRPR_Miner and prove the compactness rate offered by the proposed concise representations.Comment: in French. Master's thesis 201
    corecore