Search CORE

1,395 research outputs found

Hierarchical Losses and New Resources for Fine-grained Entity Typing and Linking

Author: McCallum Andrew
Murty* Shikhar
Radovanovic Irena
Verga* Patrick
Vilnis Luke
Publication venue
Publication date: 01/01/2018
Field of study

Extraction from raw text to a knowledge base of entities and fine-grained types is often cast as prediction into a flat set of entity and type labels, neglecting the rich hierarchies over types and entities contained in curated ontologies. Previous attempts to incorporate hierarchical structure have yielded little benefit and are restricted to shallow ontologies. This paper presents new methods using real and complex bilinear mappings for integrating hierarchical information, yielding substantial improvement over flat predictions in entity linking and fine-grained entity typing, and achieving new state-of-the-art results for end-to-end models on the benchmark FIGER dataset. We also present two new human-annotated datasets containing wide and deep hierarchies which we will release to the community to encourage further research in this direction: MedMentions, a collection of PubMed abstracts in which 246k mentions have been mapped to the massive UMLS ontology; and TypeNet, which aligns Freebase types with the WordNet hierarchy to obtain nearly 2k entity types. In experiments on all three datasets we show substantial gains from hierarchy-aware training.Comment: ACL 201

arXiv.org e-Print Archive

Crossref

Link prediction in very large directed graphs: Exploiting hierarchical properties in parallel

Author: Cortés García Claudio Ulises
Garcia Gasulla Dario
Publication venue: CEUR-WS.org
Publication date: 01/01/2014
Field of study

Link prediction is a link mining task that tries to find new edges within a given graph. Among the targets of link prediction there is large directed graphs, which are frequent structures nowadays. The typical sparsity of large graphs demands of high precision predictions in order to obtain usable results. However, the size of those graphs only permits the execution of scalable algorithms. As a trade-off between those two problems we recently proposed a link prediction algorithm for directed graphs that exploits hierarchical properties. The algorithm can be classified as a local score, which entails scalability. Unlike the rest of local scores, our proposal assumes the existence of an underlying model for the data which allows it to produce predictions with a higher precision. We test the validity of its hierarchical assumptions on two clearly hierarchical data sets, one of them based on RDF. Then we test it on a non-hierarchical data set based on Wikipedia to demonstrate its broad applicability. Given the computational complexity of link prediction in very large graphs we also introduce some general recommendations useful to make of link prediction an efficiently parallelized problem.Peer ReviewedPostprint (published version

UPCommons. Portal del coneixement obert de la UPC

Bottom-up construction of ontologies

Author: Mars N.J.I.
Vet P.E. van der
Publication venue: IEEE
Publication date: 01/01/1998
Field of study

Presents a particular way of building ontologies that proceeds in a bottom-up fashion. Concepts are defined in a way that mirrors the way their instances are composed out of smaller objects. The smaller objects themselves may also be modeled as being composed. Bottom-up ontologies are flexible through the use of implicit and, hence, parsimonious part-whole and subconcept-superconcept relations. The bottom-up method complements current practice, where, as a rule, ontologies are built top-down. The design method is illustrated by an example involving ontologies of pure substances at several levels of detail. It is not claimed that bottom-up construction is a generally valid recipe; indeed, such recipes are deemed uninformative or impossible. Rather, the approach is intended to enrich the ontology developer's toolki

University of Twente Research Information

Recommended from our members

Geometric Representation Learning

Author: Vilnis Luke
Publication venue: ScholarWorks@UMass Amherst
Publication date: 06/04/2021
Field of study

Vector embedding models are a cornerstone of modern machine learning methods for knowledge representation and reasoning. These methods aim to turn semantic questions into geometric questions by learning representations of concepts and other domain objects in a lower-dimensional vector space. In that spirit, this work advocates for density- and region-based representation learning. Embedding domain elements as geometric objects beyond a single point enables us to naturally represent breadth and polysemy, make asymmetric comparisons, answer complex queries, and provides a strong inductive bias when labeled data is scarce. We present a model for word representation using Gaussian densities, enabling asymmetric entailment judgments between concepts, and a probabilistic model for weighted transitive relations and multivariate discrete data based on a lattice of axis-aligned hyperrectangle representations (boxes). We explore the suitability of these embedding methods in different regimes of sparsity, edge weight, correlation, and independence structure, as well as extensions of the representation and different optimization strategies. We make a theoretical investigation of the representational power of the box lattice, and propose extensions to address shortcomings in modeling difficult distributions and graphs

ScholarWorks@UMass Amherst

The Dynamics of Interfirm Networks along the Industry Life Cycle: The Case of the Global Video Games Industry 1987-2007

Author: Mathijs de Vaan
Pierre-Alexandre Balland
Ron Boschma
Publication venue
Publication date
Field of study

In this paper, we study the formation of network ties between firms along the life cycle of a creative industry. We focus on three drivers of network formation: i) network endogeneity which stresses a path-dependent change originating from previous network structures, ii) five forms of proximity (e.g. geographical proximity) which ascribe tie formation to the similarity of actors' attributes; and (iii) individual characteristics which refer to the heterogeneity in actors capabilities to exploit external knowledge. The paper employs a stochastic actor-oriented model to estimate the - changing - effects of these drivers on inter-firm network formation in the global video game industry from 1987 to 2007. Our findings indicate that the effects of the drivers of network formation change with the degree of maturity of the industry. To an increasing extent, video game firms tend to partner over shorter distances and with more cognitively similar firms as the industry evolves.network dynamics, industry life cycle, proximity, creative industry, video game industry, stochastic actor-oriented model

Research Papers in Economics

Representation Learning for Words and Entities

Author: Rastogi Pushpendre
Publication venue
Publication date: 12/06/2019
Field of study

This thesis presents new methods for unsupervised learning of distributed representations of words and entities from text and knowledge bases. The first algorithm presented in the thesis is a multi-view algorithm for learning representations of words called Multiview Latent Semantic Analysis (MVLSA). By incorporating up to 46 different types of co-occurrence statistics for the same vocabulary of english words, I show that MVLSA outperforms other state-of-the-art word embedding models. Next, I focus on learning entity representations for search and recommendation and present the second method of this thesis, Neural Variational Set Expansion (NVSE). NVSE is also an unsupervised learning method, but it is based on the Variational Autoencoder framework. Evaluations with human annotators show that NVSE can facilitate better search and recommendation of information gathered from noisy, automatic annotation of unstructured natural language corpora. Finally, I move from unstructured data and focus on structured knowledge graphs. I present novel approaches for learning embeddings of vertices and edges in a knowledge graph that obey logical constraints.Comment: phd thesis, Machine Learning, Natural Language Processing, Representation Learning, Knowledge Graphs, Entities, Word Embeddings, Entity Embedding

arXiv.org e-Print Archive

JScholarship

Dual tensor model for detecting asymmetric lexico-semantic relations

Author: Glavaš Goran
Ponzetto Simone Paolo
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

Crossref

MAnnheim DOCument Server