Learning Tuple Probabilities

Dylla, Maximilian; Theobald, Martin

research

Learning Tuple Probabilities

Authors: Maximilian Dylla
Martin Theobald
Publication date: 1 January 2016
Publisher

Abstract

Learning the parameters of complex probabilistic-relational models from labeled training data is a standard technique in machine learning, which has been intensively studied in the subfield of Statistical Relational Learning (SRL), but---so far---this is still an under-investigated topic in the context of Probabilistic Databases (PDBs). In this paper, we focus on learning the probability values of base tuples in a PDB from labeled lineage formulas. The resulting learning problem can be viewed as the inverse problem to confidence computations in PDBs: given a set of labeled query answers, learn the probability values of the base tuples, such that the marginal probabilities of the query answers again yield in the assigned probability labels. We analyze the learning problem from a theoretical perspective, cast it into an optimization problem, and provide an algorithm based on stochastic gradient descent. Finally, we conclude by an experimental evaluation on three real-world and one synthetic dataset, thus comparing our approach to various techniques from SRL, reasoning in information extraction, and optimization

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Open Repository and Bibliography - Luxembourg

oai:orbilu.uni.lu:10993/34037

Last time updated on 08/02/2018