Choosing the Right Patterns : An Experimental Comparison between Different Tree Inclusion Relations

Feelders, A.; Knijf, J. de

Choosing the Right Patterns : An Experimental Comparison between Different Tree Inclusion Relations

Authors: A. Feelders
J. de Knijf
Publication date: 1 January 2007
Publisher

Abstract

In recent years a variety of mining algorithms has been developed, to derive all frequent subtrees from a database of labeled ordered rooted trees. These algorithms share properties such as enumeration strategies and pruning techniques. They differ however in the tree inclusion relation used and how attribute values are dealt with. In this work we investigate the different approaches with respect to ‘usefulness’ of the derived patterns, in particular, the performance of classifiers that use the derived patterns as features. In order to find a good tradeoff between expressiveness and runtime performance of the different approaches, we also take the complexity of the different classifiers into account, as well as the run time and memory usage of the different approaches. The experiments are performed on two real datasets. The results show that significant improvement in both predictive performance and computational efficiency can be gained by choosing the right tree mining approach

Similar works

Full text

Available Versions

Utrecht University Repository

oai:dspace.library.uu.nl:1874/...

Last time updated on 14/06/2016

NARCIS

Last time updated on 14/10/2017