Joint Entity Recognition and Linking in Technical Domains Using Undirected Probabilistic Graphical Models

Abstract

ter Horst H, Hartung M, Cimiano P. Joint Entity Recognition and Linking in Technical Domains Using Undirected Probabilistic Graphical Models. In: Gracia J, Bond F, McCrae JP, Buitelaar P, Chiarcos C, Hellmann S, eds. Language, Data, and Knowledge (Proceedings of the 1st International LDK Conference). Lecture Notes in Artificial Intelligence. Vol 10318. Springer; 2017: 166-180.The problems of recognizing mentions of entities in texts and linking them to unique knowledge base identifiers have received considerable attention in recent years. In this paper we present a probabilistic system based on undirected graphical models that jointly addresses both the entity recognition and the linking task. Our framework considers the span of mentions of entities as well as the corresponding knowledge base identifier as random variables and models the joint assignment using a factorized distribution. We show that our approach can be easily applied to different technical domains by merely exchanging the underlying ontology. On the task of recognizing and linking disease names, we show that our approach outperforms the state-of-the-art systems DNorm and TaggerOne, as well as two strong lexicon-based baselines. On the task of recognizing and linking chemical names, our system achieves comparable performance to the state-of-the-art

    Similar works