MINING HYBRID SEQUENTIAL PATTERNS BY HIERARCHICAL MINING TECHNIQUE

Abstract

Unlike sequential patterns, hybrid sequential patterns display not only the path but also the relationship among transaction items. The information provided by the collection of hybrid sequential patterns is useful in improving the analysis of marketing strategies, such as, browsing web pages, discovering customers' behaviors and so on. The process of mining hybrid sequential patterns in a database, however, becomes Complicated by the huge number of candidate patterns. In this paper, we propose a hierarchical mining technique to deal with this complexity. The unique features of this new technique include: counting hybrid sequential patterns by class, and examining database transactions in a top-down manner. This results in scanning a database, at most, twice. Using the technique, we develop an efficient mining algorithm, and conduct a simulation to study its performance. There are three major contributions in this paper. First, our proposed pattern-class concept provides a new way to count a group of patterns simultaneously. Second, we propose a novel decomposition model to lower the I/O cost in counting patterns from a large database. And third, we prove the correctness of counting patterns in the pattern decomposition model in this paper

    Similar works