881 research outputs found
Time-Aware Probabilistic Knowledge Graphs
The emergence of open information extraction as a tool for constructing and expanding knowledge graphs has aided the growth of temporal data, for instance, YAGO, NELL and Wikidata. While YAGO and Wikidata maintain the valid time of facts, NELL records the time point at which a fact is retrieved from some Web corpora. Collectively, these knowledge graphs (KG) store facts extracted from Wikipedia and other sources. Due to the imprecise nature of the extraction tools that are used to build and expand KG, such as NELL, the facts in the KG are weighted (a confidence value representing the correctness of a fact). Additionally, NELL can be considered as a transaction time KG because every fact is associated with extraction date. On the other hand, YAGO and Wikidata use the valid time model because they maintain facts together with their validity time (temporal scope). In this paper, we propose a bitemporal model (that combines transaction and valid time models) for maintaining and querying bitemporal probabilistic knowledge graphs. We study coalescing and scalability of marginal and MAP inference. Moreover, we show that complexity of reasoning tasks in atemporal probabilistic KG carry over to the bitemporal setting. Finally, we report our evaluation results of the proposed model
Political Text Scaling Meets Computational Semantics
During the last fifteen years, automatic text scaling has become one of the
key tools of the Text as Data community in political science. Prominent text
scaling algorithms, however, rely on the assumption that latent positions can
be captured just by leveraging the information about word frequencies in
documents under study. We challenge this traditional view and present a new,
semantically aware text scaling algorithm, SemScale, which combines recent
developments in the area of computational linguistics with unsupervised
graph-based clustering. We conduct an extensive quantitative analysis over a
collection of speeches from the European Parliament in five different languages
and from two different legislative terms, and show that a scaling approach
relying on semantic document representations is often better at capturing known
underlying political dimensions than the established frequency-based (i.e.,
symbolic) scaling method. We further validate our findings through a series of
experiments focused on text preprocessing and feature selection, document
representation, scaling of party manifestos, and a supervised extension of our
algorithm. To catalyze further research on this new branch of text scaling
methods, we release a Python implementation of SemScale with all included data
sets and evaluation procedures.Comment: Updated version - accepted for Transactions on Data Science (TDS
Reasoning and Change Management in Modular Ontologies
The benefits of modular representations are well known from many areas of computer science. In this paper, we concentrate on the benefits of modular ontologies with respect to local containment of terminological reasoning. We define an architecture for modular ontologies that supports local reasoning by compiling implied subsumption relations. We further address the problem of guaranteeing the integrity of a modular ontology in the presence of local changes. We propose a strategy for analyzing changes and guiding the process of updating compiled information
Towards Log-Linear Logics with Concrete Domains
We present (M denotes Markov logic networks) an
extension of the log-linear description logics -LL with
concrete domains, nominals, and instances. We use Markov logic networks (MLNs)
in order to find the most probable, classified and coherent
ontology from an knowledge base. In particular, we develop
a novel way to deal with concrete domains (also known as datatypes) by
extending MLN's cutting plane inference (CPI) algorithm.Comment: StarAI201
Representation of Semantic Mappings
The aim of this breakout session was to chart the landscape of existing approaches
for representing mappings between heterogeneous models, identify common
ideas and formulate research questions to be addressed in the future. In the
session, the discussion mainly concerned three aspects: The nature of mappings,
existing proposals for mappings and open research questions
Towards Mapping-Based Document Retrieval in Heterogeneous Digital Libraries
In many scientific domains, researchers depend on a timely and
efficient access to available publications in their particular
area. The increasing availability of publications in electronic
form via digital libraries is a reaction to this need. A remaining
problem is the fact that the pool of all available publications is
distributed between different libraries. In order to increase the
availability of information, these different libraries should be
linked in such a way, that all the information is available via
any one of them. Peer-to-peer technologies provide sophisticated
solutions for this kind of loose integration of information
sources. In our work, we consider digital libraries that organize
documents according to a dedicated classification hierarchy or
provide access to information on the basis of a thesaurus. These
kinds of access mechanisms have proven to increase the retrieval
result and are therefore widely used. On the other hand, this
causes new problems as different sources will use different
classifications and thesauri to organize information. This means,
that we have to be able to mediate between these different
structures. Integrating this mediation into the information
retrieval process is a problem that to the best of our knowledge
has not been addressed before
Ontology Alignment: An annotated Bibliography
Ontology mapping, alignment, and translation has been an active research component of the general research on semantic integration and interoperability. In our talk, we gave our own classification of different topics in this research. We talked about types of heterogeneity between ontologies, various mapping representations, classified methods for discovering methods both between ontology concepts and data, and talked about various tasks where mappings are used. In this extended abstract of our talk, we provide an annotated bibliography for this area of research, giving readers brief pointers on representative papers in each of the topics mentioned above. We did not attempt to compile a comprehensive bibliography and hence the list in this abstract is necessarily incomplete. Rather, we tried to sketch a map of the field, with some specific reference to help interested readers in their exploration of the work to-date
- …
