Search CORE

1 research outputs found

Triplifying Wikipedia's tables

Author: Hogan Aidan
Mileo Alessandra
Muñoz Emir
Publication venue: CEUR-WS.org
Publication date: 13/09/2016
Field of study

We are currently investigating methods to triplify the content of Wikipedia's tables. We propose that existing knowledge-bases can be leveraged to semi-automatically extract high-quality facts (in the form of RDF triples) from tables embedded in Wikipedia articles (henceforth called \Wikitables"). We present a survey of Wikitables and their content in a recent dump of Wikipedia. We then discuss some ongoing work on using DBpedia to mine novel RDF triples from these tables: we present methods that automatically extract 24.4 million raw triples from the Wikitables at an estimated precision of 52.2%. We believe this precision can be (greatly) improved through machine learning methods and sketch ideas for features that should help classify (in)correct triples.This paper was funded in part by Science Foundation Ireland under Grant No. SFI/08/CE/I1380 (Lion-2).peer-reviewe

Access to Research at National University of Ireland, Galway