Learning Small Trees and Graphs that Generalize

Kääriäinen, Matti

thesis

Authors: Matti Kääriäinen
Publication date: 1 October 2004
Publisher: Helsingfors universitet

Abstract

In this thesis we study issues related to learning small tree- and graph-formed classifiers. First, we study reduced error pruning of decision trees and branching programs. We analyze the behavior of a reduced error pruning algorithm for decision trees under various probabilistic assumptions on the pruning data. As a result we obtain, for example, new upper bounds for the probability of replacing a tree that fits random noise by a leaf. In the case of branching programs, we show that the existence of an efficient approximation algorithm for reduced error pruning would imply P=NP. This indicates that reduced error pruning of branching programs is most likely impossible in practice, even though the corresponding problem for decision trees is easily solvable in linear time. The latter part of the thesis is concerned with generalization error analysis, more specifically with Rademacher penalization applied to small or otherwise restricted decision trees. We develop a progressive sampling method based on Rademacher penalization that yields reasonable data-dependent sample complexity estimate

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.96.90...

Last time updated on 23/10/2014

Helsingin yliopiston digitaalinen arkisto

oai:helda.helsinki.fi:10138/21...

Last time updated on 30/08/2013