Location of Repository

Context-informed Knowledge Extraction from Document Collections to Support User Navigation

By Mario Cataldi, Claudio Schifanella, K. Selçuk C, Maria Luisa Sapino and Luigi Di Caro

Abstract

Most of the existing document and web search engines rely on keyword-based queries. To find matches, these queries are processed using retrieval algorithms that rely on word frequencies, topic recentness, document authority, and (in some cases) avail-able ontologies. In this paper, we propose an innovative approach to exploring text collections using a novel keywords-by-concepts (KbC) graph, which supports nav-igation using domain-specific concepts as well as keywords that are characterizing the text corpus. The KbC graph is a weighted graph, created by tightly integrat-ing keywords extracted from documents and concepts obtained from domain tax-onomies. Documents in the corpus are associated to the nodes of the graph based on evidence supporting contextual relevance; thus, the KbC graph supports con-textually informed access to these documents. The construction of the KbC graph relies on a spreading-activation like technique which mimics the way the brain links and constructs knowledge. In this paper, we also present CoSeNa (Context-based Search and Navigation) system that leverages the KbC model as the basis for doc-ument exploration as well as contextually-informed media integration

Topics: Knowledge Management, HCI, Navigation System, Keywords Prox
Year: 2015
OAI identifier: oai:CiteSeerX.psu:10.1.1.635.3067
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://www.di.unito.it/~dicaro... (external link)
  • http://www.di.unito.it/~dicaro... (external link)
  • http://citeseerx.ist.psu.edu/v... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.