    Knowledge Graphs Evolution and Preservation -- A Technical Report from ISWS 2019

    One of the grand challenges discussed during the Dagstuhl Seminar "Knowledge Graphs: New Directions for Knowledge Representation on the Semantic Web" and described in its report is that of a "Public FAIR Knowledge Graph of Everything": "We increasingly see the creation of knowledge graphs that capture information about the entirety of a class of entities. [...] This grand challenge extends this further by asking if we can create a knowledge graph of 'everything', ranging from common-sense concepts to location-based entities. This knowledge graph should be 'open to the public' in a FAIR manner, democratizing this mass amount of knowledge." Although linked open data (LOD) is one knowledge graph, it is the closest realisation (and probably the only one) of a public FAIR Knowledge Graph (KG) of everything. Certainly, LOD provides a unique testbed for experimenting with and evaluating research hypotheses on open and FAIR KGs. One of the most neglected FAIR issues concerning KGs is their ongoing evolution and long-term preservation. We want to investigate this problem, that is, to understand what preserving and supporting the evolution of KGs means and how these problems can be addressed. Clearly, the problem can be approached from different perspectives and may require the development of different approaches, including new theories, ontologies, metrics, strategies, procedures, etc. This document reports a collaborative effort performed by nine teams of students, each guided by a senior researcher as their mentor, attending the International Semantic Web Research School (ISWS 2019). Each team provides a different perspective on the problem of knowledge graph evolution, substantiated by a set of research questions as the main subject of their investigation. In addition, each team provides its working definition of KG preservation and evolution.

    Relation Classification using Semantically-Enhanced Syntactic Dependency Paths : Combining Semantic and Syntactic Dependencies for Relation Classification using Long Short-Term Memory Networks

    Many approaches to solving tasks in the field of Natural Language Processing (NLP) use syntactic dependency trees (SDTs) as a feature to represent the latent non-linear structure within sentences. Recently, work on parsing sentences into graph-based structures that encode semantic relationships between words, called semantic dependency graphs (SDGs), has gained interest. This thesis explores the use of SDGs in place of, and alongside, SDTs within a relation classification system based on long short-term memory (LSTM) neural networks. Two methods for handling the information in these graphs are presented and compared across two SDG formalisms. Three new relation extraction system architectures based on these methods are compared to a recent state-of-the-art LSTM-based system, showing comparable results when semantic dependencies are used to enhance syntactic dependencies, but with significantly fewer training parameters.
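    The SDT feature the thesis builds on is typically the shortest path between the two entity tokens in the sentence's dependency tree, which is then fed to the LSTM. A minimal sketch of extracting that path, using a hand-written toy parse rather than the output of any specific parser or the thesis's actual pipeline:

    ```python
    # Sketch of the classic shortest-dependency-path feature used by
    # SDT-based relation classifiers. The parse below is a hand-written
    # toy example, not output from a real parser.

    def dependency_path(heads, a, b):
        """Return token indices on the tree path from token a to token b.

        heads[i] is the index of token i's head (-1 for the root).
        """
        # Collect a's chain of ancestors up to the root.
        ancestors = []
        node = a
        while node != -1:
            ancestors.append(node)
            node = heads[node]
        # Walk up from b until we meet that chain; the meeting point is
        # the lowest common ancestor (LCA).
        path_b = []
        node = b
        while node not in ancestors:
            path_b.append(node)
            node = heads[node]
        lca = node
        path_a = ancestors[:ancestors.index(lca) + 1]
        return path_a + list(reversed(path_b))

    # "The burst has been caused by pressure" (toy head indices):
    tokens = ["The", "burst", "has", "been", "caused", "by", "pressure"]
    heads = [1, 4, 4, 4, -1, 6, 4]  # token 4 ("caused") is the root
    path = dependency_path(heads, 1, 6)  # entities: "burst", "pressure"
    print([tokens[i] for i in path])  # → ['burst', 'caused', 'pressure']
    ```

    The words along this path ("burst caused pressure") compress the sentence to the material most relevant to the relation, which is why it is a popular input representation for LSTM-based classifiers.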

    BERT is as Gentle as a Sledgehammer: Too Powerful or Too Blunt? It Depends on the Benchmark

    In this position statement, we wish to contribute to the discussion about how to assess quality and coverage of a model. We believe that BERT's prominence as a single-step pipeline for contextualization and classification highlights the need for benchmarks to evolve concurrently with models. Much recent work has touted BERT's raw power for solving natural language tasks, so we used a 12-layer uncased BERT pipeline with a linear classifier as a quick-and-dirty model to score well on the SemEval 2010 Task 8 dataset for relation classification between nominals. We initially expected there to be significant enough bias from BERT's training to influence downstream tasks, since it is well-known that biased training corpora can lead to biased language models (LMs). Gender bias is the most common example, where gender roles are codified within language models. To handle such training data bias, we took inspiration from work in the field of computer vision. Tang et al. (2020) mitigate human reporting bias over the labels of a scene graph generation task using a form of causal reasoning based on counterfactual analysis. They extract the total direct effect of the context image on the prediction task by "blanking out" detected objects, intuitively asking "What if these objects were not here?" If the system still predicts the same label, then the original prediction is likely caused by bias in some form. Our goal was to remove any effects from biases learned during BERT's pre-training, so we analyzed total effect (TE) instead. However, across several experimental configurations we found no noticeable effects from using TE analysis. One disappointing possibility was that BERT might be resistant to causal analysis due to its complexity. Another was that BERT is so powerful (or blunt?) that it can find unanticipated trends in its input, rendering any human-generated causal analysis of its predictions useless. 
We nearly concluded that what we expected to be delicate experimentation was more akin to trying to carve a masterpiece sculpture with a self-driven sledgehammer. We then found related work where BERT fooled humans by exploiting unexpected characteristics of a benchmark. When we used BERT to predict a relation for random words in the benchmark sentences, it guessed the same label as it would have for the corresponding marked entities roughly half of the time. Since the task had nineteen roughly balanced labels, we expected much less consistency. This finding was repeated across all pipeline configurations: BERT was treating the benchmark as a sequence classification task! Our final conclusion was that the benchmark is inadequate: every sentence appeared exactly once with exactly one pair of entities, so the task was equivalent to simply labeling each sentence. We passionately claim from our experience that the current trend of using larger and more complex LMs must include concurrent evolution of benchmarks. We as researchers need to be diligent in keeping our tools for measuring as sophisticated as the models being measured, as any scientific domain does.
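    The probe the authors describe, comparing the label predicted for the marked entities against the label predicted for randomly chosen words, can be sketched as an agreement rate measured against chance. The `predict` function below is a stand-in for the BERT pipeline (hypothetical, not the paper's code); it deliberately ignores the entity span, which is exactly the entity-blind behaviour the probe is designed to expose:

    ```python
    # Sketch of the random-entity consistency probe described above.
    # `predict` is a hypothetical stand-in for a relation classifier; it
    # keys only on the sentence, so the probe should flag it as
    # entity-insensitive (agreement rate far above chance).

    import random

    N_LABELS = 19  # label count in SemEval-2010 Task 8

    def predict(sentence, entity_span):
        # Entity-blind stand-in model: the span argument is ignored,
        # which is the failure mode the probe is meant to catch.
        return hash(sentence) % N_LABELS

    def agreement_rate(sentences, spans, rng):
        hits = 0
        for sent, span in zip(sentences, spans):
            original = predict(sent, span)
            words = sent.split()
            # Re-predict with a random word standing in as the "entity".
            random_span = (rng.randrange(len(words)),)
            if predict(sent, random_span) == original:
                hits += 1
        return hits / len(sentences)

    rng = random.Random(0)
    sents = [f"sentence number {i} about two things" for i in range(200)]
    spans = [(2,) for _ in sents]  # pretend token 2 is the marked entity
    rate = agreement_rate(sents, spans, rng)
    print(rate)          # 1.0 for this entity-blind stand-in
    print(1 / N_LABELS)  # ~0.053: chance agreement for balanced labels
    ```

    For a model that genuinely uses the entities, the agreement rate should fall toward the ~1/19 chance level; the paper's observation of roughly 50% agreement sits far above it, which is what motivated the benchmark critique.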
