2,905 research outputs found

    Extracting corpus specific knowledge bases from Wikipedia

    Thesauri are useful knowledge structures for assisting information retrieval. Yet their production is labor-intensive, and few domains have comprehensive thesauri that cover domain-specific concepts and contemporary usage. One approach, which has been attempted without much success for decades, is to seek statistical natural language processing algorithms that work on free text. Instead, we propose to replace costly professional indexers with thousands of dedicated amateur volunteers, namely those who are producing Wikipedia. This vast, open encyclopedia represents a rich tapestry of topics and semantics and a huge investment of human effort and judgment. We show how this can be directly exploited to provide WikiSauri: manually defined yet inexpensive thesaurus structures that are specifically tailored to expose the topics, terminology and semantics of individual document collections. We also offer concrete evidence of the effectiveness of WikiSauri for assisting information retrieval.
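
    A minimal sketch of the underlying idea, assuming a toy stand-in for Wikipedia's title and redirect data; the WIKI_TITLES table, the frequency threshold, and the n-gram extraction below are illustrative assumptions, not the authors' method:

        # Sketch: seed a corpus-specific thesaurus from Wikipedia titles.
        # Hypothetical data stands in for a real Wikipedia dump.
        import re
        from collections import Counter

        # Toy stand-in for Wikipedia: canonical title -> redirect/synonym titles.
        WIKI_TITLES = {
            "information retrieval": ["document retrieval", "ir"],
            "thesaurus": ["synonym dictionary"],
            "natural language processing": ["nlp", "computational linguistics"],
        }

        def candidate_terms(corpus, max_len=3):
            """Collect word n-grams (up to max_len words) with corpus frequencies."""
            counts = Counter()
            for doc in corpus:
                tokens = re.findall(r"[a-z]+", doc.lower())
                for n in range(1, max_len + 1):
                    for i in range(len(tokens) - n + 1):
                        counts[" ".join(tokens[i:i + n])] += 1
            return counts

        def build_wikisaurus(corpus, min_freq=2):
            """Keep candidate terms that name a Wikipedia concept; attach redirects as synonyms."""
            counts = candidate_terms(corpus)
            return {term: WIKI_TITLES[term]
                    for term, freq in counts.items()
                    if freq >= min_freq and term in WIKI_TITLES}

        docs = [
            "Information retrieval systems benefit from a thesaurus.",
            "A thesaurus supports information retrieval and natural language processing.",
            "Natural language processing methods index documents.",
        ]
        print(build_wikisaurus(docs))
        # {'information retrieval': ['document retrieval', 'ir'], ...}

    The point of the sketch is that the thesaurus vocabulary is restricted to concepts that actually occur in the document collection, which is what tailors a WikiSaurus to a specific corpus.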

    Follow-up question handling in the IMIX and Ritel systems: A comparative study

    One of the basic topics of question answering (QA) dialogue systems is how follow-up questions should be interpreted by a QA system. In this paper, we shall discuss our experience with the IMIX and Ritel systems, for both of which a follow-up question handling scheme has been developed and corpora have been collected. These two systems are each other's opposites in many respects: IMIX is multimodal, non-factoid, black-box QA, while Ritel is speech-based, factoid, keyword-based QA. Nevertheless, we will show that they are quite comparable, and that it is fruitful to examine the similarities and differences. We shall look at how the systems are composed, and how real, non-expert users interact with the systems. We shall also provide comparisons with systems from the literature where possible, and indicate where open issues lie and in what areas existing systems may be improved. We conclude that most systems share a common architecture with a set of common subtasks, in particular detecting follow-up questions and finding referents for them. We characterise these tasks using the typical techniques used for performing them, and data from our corpora. We also identify a special type of follow-up question, the discourse question, which is asked when the user is trying to understand an answer, and propose some basic methods for handling it.
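
    As a rough illustration of the follow-up detection subtask, here is a minimal heuristic sketch; the cue-word list and length threshold are invented for illustration and are not the rules used by IMIX or Ritel:

        # Heuristic sketch of follow-up question detection (illustrative only;
        # the cues and threshold are assumptions, not IMIX or Ritel rules).
        ANAPHORIC_CUES = {"it", "he", "she", "they", "that", "this", "there", "those"}

        def is_follow_up(question: str, max_elliptic_len: int = 3) -> bool:
            tokens = question.lower().rstrip("?").split()
            # Very short questions are often elliptic ("And in Germany?").
            if len(tokens) <= max_elliptic_len:
                return True
            # Pronouns and demonstratives usually need a referent from the dialogue.
            return any(tok in ANAPHORIC_CUES for tok in tokens)

        print(is_follow_up("Who invented the telephone?"))  # False
        print(is_follow_up("When did he patent it?"))       # True
        print(is_follow_up("And in Germany?"))              # True

    A real system would then run the second subtask, resolving "he" and "it" against entities from the preceding question-answer pair.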

    TiFi: Taxonomy Induction for Fictional Domains [Extended version]

    Taxonomies are important building blocks of structured knowledge bases, and their construction from text sources and Wikipedia has received much attention. In this paper we focus on the construction of taxonomies for fictional domains, using noisy category systems from fan wikis or text extraction as input. Such fictional domains are archetypes of entity universes that are poorly covered by Wikipedia, as are enterprise-specific knowledge bases and highly specialized verticals. Our fiction-targeted approach, called TiFi, consists of three phases: (i) category cleaning, by identifying candidate categories that truly represent classes in the domain of interest, (ii) edge cleaning, by selecting subcategory relationships that correspond to class subsumption, and (iii) top-level construction, by mapping classes onto a subset of high-level WordNet categories. A comprehensive evaluation shows that TiFi is able to construct taxonomies for a diverse range of fictional domains such as Lord of the Rings, The Simpsons or Greek Mythology with very high precision, and that it outperforms state-of-the-art baselines for taxonomy induction by a substantial margin.
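
    A skeleton of such a three-phase pipeline might look as follows; the filtering heuristics and the WordNet mapping table are illustrative placeholders, since the abstract does not specify TiFi's actual models:

        # Skeleton of a three-phase taxonomy-induction pipeline in the spirit of
        # TiFi. The rules below are illustrative placeholders, not TiFi's models.
        NON_CLASS = {"images", "stubs", "articles"}  # typical wiki maintenance categories

        def clean_categories(categories):
            """Phase 1: drop maintenance categories; keep plural head nouns as a toy class test."""
            return {c for c in categories
                    if c.lower() not in NON_CLASS and c.split()[-1].endswith("s")}

        def clean_edges(edges, classes):
            """Phase 2: keep subcategory edges only where both ends survived phase 1."""
            return {(child, parent) for child, parent in edges
                    if child in classes and parent in classes}

        def attach_top_level(classes, edges, wordnet_map):
            """Phase 3: link parentless classes to a high-level WordNet category."""
            has_parent = {child for child, _ in edges}
            for cls in classes - has_parent:
                edges.add((cls, wordnet_map.get(cls.split()[-1].lower(), "entity")))
            return edges

        cats = {"Hobbits", "Swords", "Weapons", "Images", "Middle-earth"}
        raw_edges = {("Swords", "Weapons"), ("Hobbits", "Images")}
        classes = clean_categories(cats)  # {'Hobbits', 'Swords', 'Weapons'}
        taxonomy = attach_top_level(classes, clean_edges(raw_edges, classes),
                                    {"weapons": "artifact"})
        print(sorted(taxonomy))
        # [('Hobbits', 'entity'), ('Swords', 'Weapons'), ('Weapons', 'artifact')]
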

    The SciQA Scientific Question Answering Benchmark for Scholarly Knowledge

    Knowledge graphs have gained increasing popularity in the last decade in science and technology. However, knowledge graphs are currently relatively simple-to-moderate semantic structures that are mainly collections of factual statements. Question answering (QA) benchmarks and systems have so far been mainly geared towards encyclopedic knowledge graphs such as DBpedia and Wikidata. We present SciQA, a scientific QA benchmark for scholarly knowledge. The benchmark leverages the Open Research Knowledge Graph (ORKG), which includes almost 170,000 resources describing research contributions of almost 15,000 scholarly articles from 709 research fields. Following a bottom-up methodology, we first manually developed a set of 100 complex questions that can be answered using this knowledge graph. Furthermore, we devised eight question templates with which we automatically generated a further 2,465 questions that can also be answered with the ORKG. The questions cover a range of research fields and question types and are translated into corresponding SPARQL queries over the ORKG. Based on two preliminary evaluations, we show that the resulting SciQA benchmark represents a challenging task for next-generation QA systems. This task is part of the open competitions at the 22nd International Semantic Web Conference 2023 as the Scholarly Question Answering over Linked Data (QALD) Challenge.
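
    Because every SciQA question is paired with a SPARQL query over the ORKG, answers can be reproduced by executing the gold queries against a SPARQL endpoint. Below is a minimal sketch using the SPARQLWrapper library; the endpoint URL, the class URI, and the example query shape are assumptions for illustration, not taken from the benchmark:

        # Sketch: execute a SPARQL query of the kind paired with SciQA questions.
        # Endpoint URL and query are illustrative assumptions, not benchmark data.
        from SPARQLWrapper import SPARQLWrapper, JSON

        ENDPOINT = "https://orkg.org/triplestore"  # assumed public ORKG SPARQL endpoint

        def run_query(sparql_text):
            client = SPARQLWrapper(ENDPOINT)
            client.setQuery(sparql_text)
            client.setReturnFormat(JSON)
            return client.query().convert()["results"]["bindings"]

        # Hypothetical gold-query shape: list a few paper resources with labels.
        QUERY = """
        SELECT ?paper ?label WHERE {
          ?paper a <http://orkg.org/orkg/class/Paper> ;
                 <http://www.w3.org/2000/01/rdf-schema#label> ?label .
        } LIMIT 5
        """

        for row in run_query(QUERY):
            print(row["label"]["value"])

    Comparing a system's retrieved bindings against those of the gold query is one straightforward way to score the benchmark's 2,565 questions.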
