Search CORE

2 research outputs found

Autocorrelation and linkage cause bias in evaluation of relational learners

Author: D. Jensen
J. Kleinberg
J. R. Quinlan
R. Lipton
Publication venue: ScholarWorks@UMass Amherst
Publication date: 01/01/2002
Field of study

Two common characteristics of relational data sets — concentrated linkage and relational auto-correlation — can cause traditional methods of evaluation to greatly overestimate the accuracy of induced models on test sets. We identify these characteristics, define quantitative measures of their severity, and explain how they produce this bias. We show how linkage and autocorrelation affect estimates of model accuracy by applying FOIL to synthetic data and to data drawn from the Internet Movie Database. We show how a modified sampling procedure can eliminate the bias

CiteSeerX

Crossref

ScholarWorks@UMass Amherst

Recommended from our members

SRL2003 IJCAI 2003 Workshop on Learning Statistical Models from Relational Data

Author: Getoor Lise
Publication venue: ScholarWorks@UMass Amherst
Publication date: 01/01/2003
Field of study

ScholarWorks@UMass Amherst