13,234 research outputs found

Multi-task Neural Network for Non-discrete Attribute Prediction in Knowledge Graphs

Author: Bollacker Kurt D.
Diederik
Guo Shu
Gupta Abhijeet
Hoffart Johannes
Jenatton Rodolphe
Ji Guoliang
Li Ya
Lin Yankai
Liu Pengfei
Luu Anh Tuan
Mikolov Tomas
Nickel Maximilian
Qiu Xipeng
Tay Yi
Wang Zhen
Xiao Han
Xie Ruobing
Zhong Huaping
Publication date: 16/08/2017

Many popular knowledge graphs such as Freebase, YAGO or DBPedia maintain a list of non-discrete attributes for each entity. Intuitively, these attributes, such as height, price or population count, are able to richly characterize entities in knowledge graphs. This additional source of information may help to alleviate the inherent sparsity and incompleteness problems that are prevalent in knowledge graphs. Unfortunately, many state-of-the-art relational learning models ignore this information due to the challenging nature of dealing with non-discrete data types in the inherently binary-natured knowledge graphs. In this paper, we propose a novel multi-task neural network approach for both encoding and prediction of non-discrete attribute information in a relational setting. Specifically, we train a neural network for triplet prediction along with a separate network for attribute value regression. Via multi-task learning, we are able to learn representations of entities, relations and attributes that encode information about both tasks. Moreover, such attributes are central to many predictive tasks not only as an information source but also as a prediction target. Therefore, models that are able to encode, incorporate and predict such information in a relational learning context are highly attractive as well. We show that our approach outperforms many state-of-the-art methods for the tasks of relational triplet classification and attribute value prediction. Comment: Accepted at CIKM 201
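
The multi-task idea in this abstract (one network for triplet prediction, a separate one for attribute value regression, both learning shared representations) can be illustrated with a minimal sketch. This is not the authors' architecture: the linear heads, the `params` layout, the function names and the `weight` between the two losses are all assumptions made for illustration. The only point it shows is that both heads read the same entity embedding table, so training on either task would update the shared representation.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def triplet_probability(params, head, relation, tail):
    """Relational head (assumed linear): probability that the triplet holds."""
    # Concatenate head, relation and tail embeddings into one feature vector.
    features = (params["entity"][head]
                + params["relation"][relation]
                + params["entity"][tail])
    return sigmoid(dot(params["w_triplet"], features))

def attribute_value(params, entity, attribute):
    """Regression head (assumed linear): predicted value of an attribute."""
    # Reads the same entity table as the relational head.
    features = params["entity"][entity] + params["attribute"][attribute]
    return dot(params["w_attribute"], features)

def multitask_loss(params, triplets, attribute_facts, weight=1.0):
    """Mean cross-entropy on triplets plus weighted mean squared error on values."""
    cross_entropy = 0.0
    for head, relation, tail, label in triplets:
        p = triplet_probability(params, head, relation, tail)
        cross_entropy -= label * math.log(p) + (1 - label) * math.log(1.0 - p)
    squared_error = 0.0
    for entity, attribute, value in attribute_facts:
        squared_error += (attribute_value(params, entity, attribute) - value) ** 2
    return (cross_entropy / len(triplets)
            + weight * squared_error / len(attribute_facts))
```

In this sketch the attribute values are assumed to be normalised beforehand; a gradient step on `multitask_loss` would move the entity embeddings using signal from both tasks.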

arXiv.org e-Print Archive

Crossref

Zero-Truncated Poisson Tensor Factorization for Massive Binary Tensors

Author: Carin Lawrence
Hu Changwei
Rai Piyush
Publication date: 18/08/2015

We present a scalable Bayesian model for low-rank factorization of massive tensors with binary observations. The proposed model has the following key properties: (1) in contrast to the models based on the logistic or probit likelihood, using a zero-truncated Poisson likelihood for binary data allows our model to scale up in the number of ones in the tensor, which is especially appealing for massive but sparse binary tensors; (2) side-information in the form of binary pairwise relationships (e.g., an adjacency network) between objects in any tensor mode can also be leveraged, which can be especially useful in "cold-start" settings; and (3) the model admits simple Bayesian inference via batch, as well as online, MCMC; the latter allows scaling up even for dense binary data (i.e., when the number of ones in the tensor/network is also massive). In addition, non-negative factor matrices in our model provide easy interpretability, and the tensor rank can be inferred from the data. We evaluate our model on several large-scale real-world binary tensors, achieving excellent computational scalability, and also demonstrate its usefulness in leveraging side-information provided in the form of mode-network(s). Comment: UAI (Uncertainty in Artificial Intelligence) 201
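
Property (1) can be made concrete with a small sketch. It assumes the usual thresholded-count reading of a zero-truncated Poisson likelihood for binary data: a cell is one when a latent Poisson count with rate lambda is at least one, so P(b = 1) = 1 - exp(-lambda) and P(b = 0) = exp(-lambda). It also assumes a non-negative CP form for the rates. The function names and the point-estimate log-likelihood are illustrative only; the paper itself describes Bayesian inference via MCMC, which is not reproduced here.

```python
import math
from itertools import product

def rate(factors, index):
    """CP rate of one cell: sum over components of one factor entry per mode."""
    rank = len(factors[0][0])
    return sum(
        math.prod(factors[mode][i][r] for mode, i in enumerate(index))
        for r in range(rank)
    )

def total_rate(factors):
    """Sum of the rates over every cell, without enumerating the cells."""
    rank = len(factors[0][0])
    return sum(
        math.prod(sum(row[r] for row in mode_factor) for mode_factor in factors)
        for r in range(rank)
    )

def loglik_sparse(factors, ones):
    """Log-likelihood that visits only the cells observed as one."""
    ll = -total_rate(factors)  # as if every cell were zero
    for index in ones:
        lam = rate(factors, index)
        # Correct each one-cell: log P(b=1) - log P(b=0) = log(1 - exp(-lam)) + lam
        ll += math.log(-math.expm1(-lam)) + lam
    return ll

def loglik_dense(factors, ones):
    """Reference implementation that loops over every cell of the tensor."""
    shape = [len(mode_factor) for mode_factor in factors]
    ones = set(ones)
    ll = 0.0
    for index in product(*(range(n) for n in shape)):
        lam = rate(factors, index)
        ll += math.log(-math.expm1(-lam)) if index in ones else -lam
    return ll
```

The two functions return the same value, but `loglik_sparse` does work proportional to the number of ones plus the size of the factor matrices, because the zeros contribute only through the sum of all rates, which factorises across modes.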

arXiv.org e-Print Archive

CiteSeerX

Link Prediction via Generalized Coupled Tensor Factorisation

Author: Acar Evrim
Cemgil A. Taylan
Ermiş Beyza
Publication date: 01/01/2012

This study deals with the missing link prediction problem: the problem of predicting the existence of missing connections between entities of interest. We address link prediction using coupled analysis of relational datasets represented as heterogeneous data, i.e., datasets in the form of matrices and higher-order tensors. We propose to use an approach based on a probabilistic interpretation of tensor factorisation models, i.e., Generalised Coupled Tensor Factorisation, which can simultaneously fit a large class of tensor models to higher-order tensors/matrices with common latent factors using different loss functions. Numerical experiments demonstrate that joint analysis of data from multiple sources via coupled factorisation improves link prediction performance, and that the selection of the right loss function and tensor model is crucial for accurately predicting missing links.
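
The coupling described in this abstract (several datasets fitted at once through common latent factors, each with its own loss) can be sketched as a single objective. The example below is an illustration under assumptions, not the Generalised Coupled Tensor Factorisation algorithm itself: it couples one matrix and one three-way tensor through a shared factor `A`, uses a squared-error loss for the matrix and a generalised KL divergence for the tensor, and only evaluates the objective. All names are hypothetical, and the fitting procedure is omitted.

```python
import math

def matrix_model(A, B):
    """X_hat[i][j] = sum_r A[i][r] * B[j][r]."""
    return [[sum(a * b for a, b in zip(row_a, row_b)) for row_b in B]
            for row_a in A]

def tensor_model(A, C, D):
    """Y_hat[i][k][l] = sum_r A[i][r] * C[k][r] * D[l][r] (a CP model)."""
    return [[[sum(a * c * d for a, c, d in zip(row_a, row_c, row_d))
              for row_d in D] for row_c in C] for row_a in A]

def squared_error(x, x_hat):
    return 0.5 * (x - x_hat) ** 2

def kl_divergence(x, x_hat):
    """Generalised KL divergence for non-negative x and positive x_hat."""
    return (x * math.log(x / x_hat) if x > 0 else 0.0) - x + x_hat

def coupled_objective(X, Y, A, B, C, D):
    """Sum of the two data-fit terms; A appears in both models."""
    X_hat = matrix_model(A, B)
    Y_hat = tensor_model(A, C, D)
    matrix_term = sum(
        squared_error(x, xh)
        for row, row_h in zip(X, X_hat)
        for x, xh in zip(row, row_h)
    )
    tensor_term = sum(
        kl_divergence(y, yh)
        for slab, slab_h in zip(Y, Y_hat)
        for row, row_h in zip(slab, slab_h)
        for y, yh in zip(row, row_h)
    )
    return matrix_term + tensor_term
```

Because `A` enters both terms, a change to `A` that improves the fit to the matrix also changes the predictions for the tensor, which is how information from one source can inform missing links in another.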

arXiv.org e-Print Archive

Copenhagen University Research Information System