Search CORE

7 research outputs found

Enriching Knowledge Bases with Counting Quantifiers

Author: F Darari
HT Dang
L Galárraga
Marc Denecker
S Auer
S Neumaier
S Riedel
XL Dong
Publication venue
Publication date: 01/01/2018
Field of study

Information extraction traditionally focuses on extracting relations between identifiable entities, such as . Yet, texts often also contain Counting information, stating that a subject is in a specific relation with a number of objects, without mentioning the objects themselves, for example, "California is divided into 58 counties". Such counting quantifiers can help in a variety of tasks such as query answering or knowledge base curation, but are neglected by prior work. This paper develops the first full-fledged system for extracting counting information from text, called CINEX. We employ distant supervision using fact counts from a knowledge base as training seeds, and develop novel techniques for dealing with several challenges: (i) non-maximal training seeds due to the incompleteness of knowledge bases, (ii) sparse and skewed observations in text sources, and (iii) high diversity of linguistic patterns. Experiments with five human-evaluated relations show that CINEX can achieve 60% average precision for extracting counting information. In a large-scale experiment, we demonstrate the potential for knowledge base enrichment by applying CINEX to 2,474 frequent relations in Wikidata. CINEX can assert the existence of 2.5M facts for 110 distinct relations, which is 28% more than the existing Wikidata facts for these relations.Comment: 16 pages, The 17th International Semantic Web Conference (ISWC 2018

arXiv.org e-Print Archive

Crossref

MPG.PuRe

Extracting novel facts from tables for Knowledge Graph completion

Author: CS Bhagavatula
G Limaye
I Ermilov
J Pearl
J Wang
M Cafarella
M Nickel
M Pham
P Venetis
S Auer
S Neumaier
V Efthymiou
V Mulwad
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

We propose a new end-to-end method for extending a Knowledge Graph (KG) from tables. Existing techniques tend to interpret tables by focusing on information that is already in the KG, and therefore tend to extract many redundant facts. Our method aims to find more novel facts. We introduce a new technique for table interpretation based on a scalable graphical model using entity similarities. Our method further disambiguates cell values using KG embeddings as additional ranking method. Other distinctive features are the lack of assumptions about the underlying KG and the enabling of a fine-grained tuning of the precision/recall trade-off of extracted facts. Our experiments show that our approach has a higher recall during the interpretation process than the state-of-the-art, and is more resistant against the bias observed in extracting mostly redundant facts since it produces more novel extractions

VU Research Portal

Crossref

CWI's Institutional Repository

Extracting Novel Facts from Tables for Knowledge Graph Completion (Extended version)

Author: Boncz P.A. (Peter)
Kruit B.B. (Benno)
Urbani J. (Jacopo)
Publication venue
Publication date: 15/07/2019
Field of study

arXiv.org e-Print Archive

CWI's Institutional Repository

Making Sense of Entities and Quantities in {Web} Tables

Author: Ibrahim Y.
Riedewald M.
Weikum G.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2016
Field of study

CISPA – Helmholtz-Zentrum für Informationssicherheit

MPG.PuRe