Search CORE

5 research outputs found

Kernels and Distances for Structured Data

Author: John W. Lloyd
Peter A. Flach
Thomas Gärtner
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Naive Bayesian Classification of Structured Data

Author: Nicolas Lachiche
Peter A. Flach
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

First order random forests: Learning relational classifiers with complex aggregates

Author: Anneleen Van Assche
Celine Vens
F. J. Provost
G. Plotkin
H. Blockeel
H. Blockeel
Hendrik Blockeel
J. Quinlan
L. Breiman
L. Breiman
L. Hansen
R. E. Schapire
R. Michalski
S. Džeroski
S. Muggleton
Sašo Džeroski
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Random Relational Rules

Author: Anderson Grant
Publication venue: The University of Waikato
Publication date: 01/01/2008
Field of study

In the field of machine learning, methods for learning from single-table data have received much more attention than those for learning from multi-table, or relational data, which are generally more computationally complex. However, a significant amount of the world's data is relational. This indicates a need for algorithms that can operate efficiently on relational data and exploit the larger body of work produced in the area of single-table techniques. This thesis presents algorithms for learning from relational data that mitigate, to some extent, the complexity normally associated with such learning. All algorithms in this thesis are based on the generation of random relational rules. The assumption is that random rules enable efficient and effective relational learning, and this thesis presents evidence that this is indeed the case. To this end, a system for generating random relational rules is described, and algorithms using these rules are evaluated. These algorithms include direct classification, classification by propositionalisation, clustering, semi-supervised learning and generating random forests. The experimental results show that these algorithms perform competitively with previously published results for the datasets used, while often exhibiting lower runtime than other tested systems. This demonstrates that sufficient information for classification and clustering is retained in the rule generation process and that learning with random rules is efficient. Further applications of random rules are investigated. Propositionalisation allows single-table algorithms for classification and clustering to be applied to the resulting data, reducing the amount of relational processing required. Further results show that techniques for utilising additional unlabeled training data improve accuracy of classification in the semi-supervised setting. The thesis also develops a novel algorithm for building random forests by makingefficient use of random rules to generate trees and leaves in parallel

Research Commons@Waikato

Fungal fermentation for food protein production in upcycled agro-industrial side-streams.

Author: Lübeck Mette
Stephensen Lübeck Peter
Publication venue
Publication date: 01/01/2023
Field of study

VBN