Search CORE

61 research outputs found

Using Crowdsourcing for Fine-Grained Entity Type Completion in Knowledge Bases

Author: A Gangemi
A Melo
A Palmero Aprosio
AP Dawid
B Mozafari
H Paulheim
H Paulheim
H Paulheim
J Lehmann
T Rebele
Z Dong
Publication venue: Springer International Publishing
Publication date: 01/01/2018
Field of study

Recent years have witnessed the proliferation of large-scale Knowledge Bases (KBs). However, many entities in KBs have incomplete type information, and some are totally untyped. Even worse, fine-grained types (e.g., BasketballPlayer) containing rich semantic meanings are more likely to be incomplete, as they are more difficult to be obtained. Existing machine-based algorithms use predicates (e.g., birthPlace) of entities to infer their missing types, and they have limitations that the predicates may be insufficient to infer fine-grained types. In this paper, we utilize crowdsourcing to solve the problem, and address the challenge of controlling crowdsourcing cost. To this end, we propose a hybrid machine-crowdsourcing approach for fine-grained entity type completion. It firstly determines the types of some “representative” entities via crowdsourcing and then infers the types for remaining entities based on the crowdsourcing results. To support this approach, we first propose an embedding-based influence for type inference which considers not only the distance between entity embeddings but also the distances between entity and type embeddings. Second, we propose a new difficulty model for entity selection which can better capture the uncertainty of the machine algorithm when identifying the entity types. We demonstrate the effectiveness of our approach through experiments on real crowdsourcing platforms. The results show that our method outperforms the state-of-the-art algorithms by improving the effectiveness of fine-grained type completion at affordable crowdsourcing cost.Peer reviewe

Crossref

Helsingin yliopiston digitaalinen arkisto

LD4IE - Linked data for information extraction

Author: D'Amato C.
Gentile A. L.
Paulheim H.
Zhang Z.
Publication venue: CEUR-WS
Publication date: 01/01/2013
Field of study

The World Wide Web provides access to tens of billions of pages, mostly containinginformation that is largely unstructured and only intended for human readability. Onthe other hand, the LOD provide billions of pieces of information linked together andmade available for automated processing. However, there is the lack of interconnectionbetween the information in the Web pages and that in LOD. A number of initiatives,like RDFa (supported by W3C) or Microformats (used by schema.org and supported bymajor search engines) are trying to enable machines to make sense of the informationcontained in human readable pages by providing the ability to annotate webpage contentwith links into LOD

Archivio istituzionale della ricerca - Università di Bari

Canonicalizing Knowledge Base Literals

Author: A Dimou
A Gangemi
A Zaveri
CN Silla
D Fleischhacker
D Krompaß
H Paulheim
H Paulheim
H Paulheim
I Dongo
J Debattista
J Pujara
J Raad
J Sleeman
K Gunaratna
M Färber
S Auer
S Auer
V Efthymiou
X Niu
Z Abedjan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Ontology-based knowledge bases (KBs) like DBpedia are very valuable resources, but their usefulness and usability is limited by various quality issues. One such issue is the use of string literals instead of semantically typed entities. In this paper we study the automated canonicalization of such literals, i.e., replacing the literal with an existing entity from the KB or with a new entity that is typed using classes from the KB. We propose a framework that combines both reasoning and machine learning in order to predict the relevant entities and types, and we evaluate this framework against state-of-the-art baselines for both semantic typing and entity matching

arXiv.org e-Print Archive

City Research Online

Crossref

NORA - Norwegian Open Research Archives

Recommended from our members

Results of the ontology alignment evaluation initiative 2019

Author: Algergawy A.
Faria D.
Ferrara A.
Fundulaki I.
Harrow I.
Hertling S.
Jimenez-Ruiz E.
Karam N.
Khiat A.
Lambrix P.
Li H.
Montanelli S.
Paulheim H.
Pesquita C.
Saveta T.
Shvaiko P.
Splendiani A.
Thiéblin E.
Trojahn C.
Vataščinová J.
Zamazal O.
Zhou L.
Publication venue
Publication date: 01/01/2019
Field of study

The Ontology Alignment Evaluation Initiative (OAEI) aims at comparing ontology matching systems on precisely defined test cases. These test cases can be based on ontologies of different levels of complexity (from simple thesauri to expressive OWL ontologies) and use different evaluation modalities (e.g., blind evaluation, open evaluation, or consensus). The OAEI 2019 campaign offered 11 tracks with 29 test cases, and was attended by 20 participants. This paper is an overall presentation of that campaign

City Research Online

MAnnheim DOCument Server (Univ. Mannheim)

Supporting the Linked Data Life Cycle Using an Integrated Tool Stack

Author: A Khalili
H Paulheim
M Samwald
V Janev
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Task-Oriented Uncertainty Evaluation for Linked Data Based on Graph Interlinks

Author: A Miles
AEA Djebri
H Paulheim
K Christodoulou
P Shvaiko
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/09/2020
Field of study

International audienceFor data sources to ensure providing reliable linked data, they need to indicate information about the (un)certainty of their data based on the views of their consumers. In Addition, uncertainty information in terms of Semantic Web has also to be encoded into a readable, publishable, and exchangeable format to increase the interoperability of systems. This paper introduces a novel approach to evaluate the uncertainty of data in an RDF dataset based on its links with other datasets. We propose to evaluate uncertainty for sets of statements related to user-selected resources by exploiting their similarity interlinks with external resources. Our data-driven approach translates each interlink into a set of links referring to the position of a target dataset from a reference dataset, based on both object and predicate similarities. We show how our approach can be implemented and present an evaluation with real-world datasets. Finally, we discuss updating the publishable uncertainty values

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

QueDI: From Knowledge Graph Querying to Data Visualization

Author: A Harth
A Soylu
D Damljanovic
H Paulheim
JR Lewis
L Rietveld
M Alonen
MG Skjæveland
N Bikakis
S Ferré
S Ferré
Publication venue
Publication date: 01/01/2020
Field of study

Abstract While Open Data (OD) publishers are spur in providing data as Linked Open Data (LOD) to boost innovation and knowledge creation, the complexity of RDF querying languages, such as SPARQL, threatens their exploitation. We aim to help lay users (by focusing on experts in table manipulation, such as OD experts) in querying and exploiting LOD by taking advantage of our target users' expertise in table manipulation and chart creation. We propose QueDI (Query Data of Interest), a question-answering and visualization tool that implements a scaffold transitional approach to 1) query LOD without being aware of SPARQL and representing results by data tables; 2) once reached our target user comfort zone, users can manipulate and 3) visually represent data by exportable and dynamic visualizations. The main novelty of our approach is the split of the querying phase in SPARQL query building and data table manipulation. In this article, we present the QueDI operating mechanism, its interface supported by a guided use-case over DBpedia, and the evaluation of its accuracy and usability level

Crossref

Open Access Repository

A collection of benchmark datasets for systematic evaluations of machine learning on the Semantic Web

Author: A Rettinger
CN Silla Jr
GKD Vries
H Paulheim
J Demšar
M Schmachtenberg
P Ristoski
P Ristoski
S Bloehdorn
V Boer
V Tresp
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Crossref

MAnnheim DOCument Server (Univ. Mannheim)

Results of the ontology alignment evaluation initiative 2023

Author: Algergawy A
Buche P
Castro Lj
Chen J
Coulet A
Cufi J
Dong H
Fallatah O
Faria D
Fundulaki I
He Yuan
Hertling S
Horrocks Ian
Huschka M
Ibanescu L
Jain S
Jiménez-Ruiz E
Karam N
Lambrix P
Li H
Li Y
Monnin P
Nasr E
Paulheim H
Pesquita C
Pour Man
Saveta T
Shvaiko P
Sousa G
Trojahn C
Vatascinova J
Wu M
Yaman B
Zamazal O
Zhou L
Publication venue: CEUR Workshop Proceedings
Publication date: 14/12/2023
Field of study

The Ontology Alignment Evaluation Initiative (OAEI) aims at comparing ontology matching systems on precisely defined test cases. These test cases can be based on ontologies of different levels of complexity and use different evaluation modalities. The OAEI 2023 campaign offered 15 tracks and was attended by 16 participants. This paper is an overall presentation of that campaign

Oxford University Research Archive

Universal design of ICT for emergency management: A systematic literature review and research agenda

Author: A Engelman
A Malizia
A Malizia
B Wentz
Briony J. Gray
C Stary
C-M Huang
D Bennett
E Bromley
H Maryam
H Paulheim
J Cinnamon
J Doyle
JT Morris
M Lichter
MH McSweeney-Feld
MK Lindell
Paloma Díaz
S Jan
T Onorati
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

acceptedVersionNivå

Crossref

NORA - Norwegian Open Research Archives

Agder University Research Archive