Tackling scalability issues in mining path patterns from knowledge graphs: a preliminary study

Author: Bresso Emmanuel
Couceiro Miguel
Coulet Adrien
Monnin Pierre
Napoli Amedeo
Smaïl-Tabbone Malika
Publication date: 07/08/2020

Features mined from knowledge graphs are widely used within multiple knowledge discovery tasks such as classification or fact-checking. Here, we consider a given set of vertices, called seed vertices, and focus on mining their associated neighboring vertices, paths, and, more generally, path patterns that involve classes of ontologies linked with knowledge graphs. Due to the combinatorial nature and the increasing size of real-world knowledge graphs, the task of mining these patterns immediately entails scalability issues. In this paper, we address these issues by proposing a pattern mining approach that relies on a set of constraints (e.g., support or degree thresholds) and the monotonicity property. As our motivation comes from the mining of real-world knowledge graphs, we illustrate our approach with PGxLOD, a biomedical knowledge graph.
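The role of a support threshold combined with monotonicity can be illustrated with a minimal sketch. This is not the authors' algorithm: the function name `mine_path_patterns`, the toy triples, and the choice of predicate sequences as "patterns" (ontology classes are ignored) are all assumptions made for illustration. The point is only that a pattern below the support threshold is never extended, because extending a path pattern cannot increase the number of seed vertices that match it.

```python
from collections import defaultdict


def mine_path_patterns(triples, seeds, min_support, max_length):
    """Return {predicate sequence: support} for patterns reaching min_support.

    Illustrative sketch only: a pattern is a tuple of predicates followed
    from a seed vertex, and its support is the number of seeds that have
    at least one matching path.
    """
    # Adjacency list: subject -> list of (predicate, object).
    out_edges = defaultdict(list)
    for s, p, o in triples:
        out_edges[s].append((p, o))

    # frontier maps a pattern to {seed: vertices reached via that pattern}.
    frontier = {(): {seed: {seed} for seed in seeds}}
    frequent = {}
    for _ in range(max_length):
        candidates = defaultdict(lambda: defaultdict(set))
        for pattern, reached in frontier.items():
            for seed, vertices in reached.items():
                for vertex in vertices:
                    for p, o in out_edges.get(vertex, ()):
                        candidates[pattern + (p,)][seed].add(o)
        # Monotonicity: an extension never matches more seeds than its
        # prefix, so patterns below the threshold are dropped here and
        # are never extended in later iterations.
        frontier = {
            pattern: dict(reached)
            for pattern, reached in candidates.items()
            if len(reached) >= min_support
        }
        for pattern, reached in frontier.items():
            frequent[pattern] = len(reached)
    return frequent


# Hypothetical toy graph, loosely biomedical in flavour.
toy_triples = [
    ("drugA", "targets", "gene1"),
    ("drugB", "targets", "gene2"),
    ("drugC", "causes", "effect1"),
    ("gene1", "involvedIn", "pathway1"),
    ("gene2", "involvedIn", "pathway1"),
    ("effect1", "type", "Phenotype"),
]
patterns = mine_path_patterns(
    toy_triples, ["drugA", "drugB", "drugC"], min_support=2, max_length=2
)
print(patterns)  # {('targets',): 2, ('targets', 'involvedIn'): 2}
```

In this toy run the pattern `('causes',)` matches a single seed, so it is pruned and `('causes', 'type')` is never generated.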

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Semantic Web in data mining and knowledge discovery: A comprehensive survey

Author: Abel
Auer
Bellandi
Bernstein
Bhagavatula
Bicer
Bicharra Garcia
Bizer
Bloehdorn
Bloem
Blum
Bollacker
Bontcheva
Brunetti
Brüggemann
Chen
Dadzie
Daiber
De Clercq
de Vries
Diamantini
Ding
Di Noia
Dou
Džeroski
d’Aquin
Eronen
Euzenat
Fanizzi
Fayyad
Finin
Fürber
Fürnkranz
Gabriel
Gruber
Han
Hand
Hassanzadeh
Heckmann
Heiko Paulheim
Hienert
Hilario
Huang
Huynh
Jay
John
Kauppinen
Kedad
Kietz
Klösgen
Kramer
Langegger
Lavrač
Liaw
Limaye
Lösch
Marinica
Mendes
Milano
Miller
Moss
Mulwad
Muñoz
Nigro
Pan
Pang-Ning
Panov
Passant
Paulheim
Pennacchiotti
Petar Ristoski
Phillips
Pinto
Podpečan
Pérez-Rey
Qu
Quboa
Rettinger
Ristoski
Rizzo
Scerri
Schmachtenberg
Schuhmacher
Schulz
Serban
Shervashidze
Spanos
Srikant
Stumme
Suchanek
Suyama
Svátek
Tiddi
Trajkovski
Tresp
Tummarello
Unbehauen
van Hage
Vavpetič
Venetis
Wang
Wu
Zhang
Zhou
Žáková
Publication venue: Elsevier BV

Crossref

A Comparison of Propositionalization Strategies for Creating Features from Linked Open Data

Author: Paulheim Heiko
Ristoski Petar
Publication venue: RWTH
Publication date: 01/01/2014

Linked Open Data has been recognized as a valuable source for background information in data mining. However, most data mining tools require features in propositional form, i.e., binary, nominal or numerical features associated with an instance, while Linked Open Data sources are usually graphs by nature. In this paper, we compare different strategies for creating propositional features from Linked Open Data (a process called propositionalization), and present experiments on different tasks, i.e., classification, regression, and outlier detection. We show that the choice of the strategy can have a strong influence on the results.
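To make the idea of propositionalization concrete, here is a minimal sketch that flattens graph triples into one feature vector per instance. The function name `propositionalize`, the feature naming scheme, the toy triples, and the two strategies shown ("binary" and "count") are assumptions chosen for illustration; they are not claimed to be the strategies evaluated in the paper.

```python
from collections import Counter


def propositionalize(triples, instances, strategy="binary"):
    """Build one feature dict per instance from its outgoing triples.

    Illustrative sketch only. Two feature families are generated:
    "rel:<p>" (the relation occurs) and "val:<p>=<o>" (this specific
    relation-value pair occurs).
    """
    rows = {instance: Counter() for instance in instances}
    for s, p, o in triples:
        if s in rows:
            rows[s][f"rel:{p}"] += 1
            rows[s][f"val:{p}={o}"] += 1
    if strategy == "binary":
        # Presence or absence only: every observed feature becomes 1.
        return {i: {f: 1 for f in feats} for i, feats in rows.items()}
    if strategy == "count":
        # Numerical features: how often each feature was observed.
        return {i: dict(feats) for i, feats in rows.items()}
    raise ValueError(f"unknown strategy: {strategy}")


# Hypothetical toy triples.
toy_graph = [
    ("Berlin", "type", "City"),
    ("Berlin", "twinnedWith", "Paris"),
    ("Berlin", "twinnedWith", "London"),
    ("Paris", "type", "City"),
]
binary_rows = propositionalize(toy_graph, ["Berlin", "Paris"], "binary")
count_rows = propositionalize(toy_graph, ["Berlin", "Paris"], "count")
print(binary_rows["Berlin"]["rel:twinnedWith"])  # 1
print(count_rows["Berlin"]["rel:twinnedWith"])   # 2
```

The same instance gets a different value for `rel:twinnedWith` under the two strategies, which is the kind of difference that can change what a downstream learner sees.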

CiteSeerX

MAnnheim DOCument Server

Explainable methods for knowledge graph refinement and exploration via symbolic reasoning

Author: Gad-Elrab Mohamed Hassan Mohamed
Publication venue: Walter de Gruyter GmbH
Publication date: 01/01/2021

Knowledge Graphs (KGs) have applications in many domains such as finance, manufacturing, and healthcare. While recent efforts have created large KGs, their content is far from complete and sometimes includes invalid statements. Therefore, it is crucial to refine the constructed KGs to enhance their coverage and accuracy via KG completion and KG validation. It is also vital to provide human-comprehensible explanations for such refinements, so that humans have trust in the KG quality. Enabling KG exploration, by search and browsing, is also essential for users to understand the KG value and limitations towards downstream applications. However, the large size of KGs makes KG exploration very challenging. While the type taxonomy of KGs is a useful asset along these lines, it remains insufficient for deep exploration. In this dissertation we tackle the aforementioned challenges of KG refinement and KG exploration by combining logical reasoning over the KG with other techniques such as KG embedding models and text mining. Through this combination, we introduce methods that provide human-understandable output. Concretely, we introduce methods to tackle KG incompleteness by learning exception-aware rules over the existing KG. Learned rules are then used to infer missing links in the KG accurately. Furthermore, we propose a framework for constructing human-comprehensible explanations for candidate facts from both KG and text. Extracted explanations are used to ensure the validity of KG facts. Finally, to facilitate KG exploration, we introduce a method that combines KG embeddings with rule mining to compute informative entity clusters with explanations.
Knowledge graphs have many applications in various domains, for example in finance and healthcare. However, knowledge graphs are incomplete and also contain invalid data. High coverage and correctness require new methods for knowledge graph completion and knowledge graph validation. Together, these two tasks are referred to as knowledge graph refinement. An important aspect here is the explainability and comprehensibility of knowledge graph content for users. In applications, user-side exploration of knowledge graphs is also of particular importance. Searching and navigating the graph helps users better understand the knowledge content and its limitations. Because of the huge number of entities and facts, knowledge graph exploration is a challenge. Taxonomic type systems help, but they are not sufficient for deeper exploration. This dissertation addresses the challenges of knowledge graph refinement and knowledge graph exploration through algorithmic inference over the knowledge graph. It extends logical reasoning and combines it with other methods, in particular neural knowledge graph embeddings and text mining. These new methods produce output with explanations for users. In particular, the dissertation makes the following contributions:
• For knowledge graph completion, we present ExRuL, a method for revising Horn rules by adding exception conditions to the rule bodies. The extended rules can infer new facts and thus close gaps in the knowledge graph. Experiments with large knowledge graphs show that this method substantially reduces errors in inferred facts and provides user-friendly explanations.
• With RuLES, we present a rule-learning method based on probabilistic representations of missing facts. The method iteratively extends the rules induced from a knowledge graph by combining neural knowledge graph embeddings with information from text corpora. New metrics for rule quality are used during rule generation. Experiments show that RuLES substantially improves the quality of the learned rules and their predictions.
• To support knowledge graph validation, we present ExFaKT, a framework for constructing explanations for candidate facts. The method uses rules to transform candidates into a set of statements that are easier to find and to validate or refute. The output of ExFaKT is a set of semantic evidence for candidate facts, extracted from text corpora and the knowledge graph. Experiments show that the transformations substantially improve the yield and quality of the discovered explanations. The generated explanations support both manual knowledge graph validation by curators and automatic validation.
• To support knowledge graph exploration, we present ExCut, a method for producing informative entity clusters with explanations, using knowledge graph embeddings and automatically induced rules. A cluster explanation consists of a combination of relations between the entities that identify the cluster. ExCut simultaneously improves cluster quality and cluster explainability by iteratively interleaving the learning of embeddings and rules. Experiments show that ExCut computes high-quality clusters and that the cluster explanations are informative for users.

Universaar

Proceedings of the 1st International Conference on Algebras, Graphs and Ordered Sets (ALGOS 2020)

Author: Couceiro Miguel
Monnin Pierre
Napoli Amedeo
Publication venue: HAL CCSD
Publication date: 01/08/2020

Originating in arithmetic and logic, the theory of ordered sets is now a field of combinatorics that is intimately linked to graph theory, universal algebra and multiple-valued logic, and that has a wide range of classical applications such as formal calculus, classification, decision aid and social choice.
This international conference “Algebras, Graphs and Ordered Sets” (ALGOS) brings together specialists in the theory of graphs, relational structures and ordered sets, topics that are omnipresent in artificial intelligence and in knowledge discovery, and with concrete applications in biomedical sciences, security, social networks and e-learning systems. One of the goals of this event is to provide a common ground for mathematicians and computer scientists to meet, to present their latest results, and to discuss original applications in related scientific fields. On this basis, we hope for fruitful exchanges that can motivate multidisciplinary projects.
The first edition of ALgebras, Graphs and Ordered Sets (ALGOS 2020) has a particular motivation, namely, an opportunity to honour Maurice Pouzet on his 75th birthday! For this reason, we have particularly welcomed submissions in areas related to Maurice’s many scientific interests:
• Lattices and ordered sets
• Combinatorics and graph theory
• Set theory and theory of relations
• Universal algebra and multiple-valued logic
• Applications: formal calculus, knowledge discovery, biomedical sciences, decision aid and social choice, security, social networks, web semantics...

INRIA a CCSD electronic archive server