Analysing entity context in multilingual Wikipedia to support entity-centric retrieval applications

Cristea, Alexandra I.; Demidova, Elena; Zhou, Yiwei

research

Analysing entity context in multilingual Wikipedia to support entity-centric retrieval applications

Authors: Alexandra I. Cristea
Elena Demidova
Yiwei Zhou
Publication date: 7 January 2016
Publisher: Springer International Publishing

Abstract

Representation of influential entities, such as famous people and multinational corporations, on the Web can vary across languages, reflecting language-specific entity aspects as well as divergent views on these entities in different communities. A systematic analysis of language specific entity contexts can provide a better overview of the existing aspects and support entity-centric retrieval applications over multilingual Web data. An important source of cross-lingual information about influential entities is Wikipedia — an online community-created encyclopaedia — containing more than 280 language editions. In this paper we focus on the extraction and analysis of the language-specific entity contexts from different Wikipedia language editions over multilingual data. We discuss alternative ways such contexts can be built, including graph-based and article-based contexts. Furthermore, we analyse the similarities and the differences in these contexts in a case study including 80 entities and five Wikipedia language editions

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Warwick Research Archives Portal Repository

oai:wrap.warwick.ac.uk:78613

Last time updated on 02/08/2016