Skip to main content
Article thumbnail
Location of Repository

Semantische Annotation von Dokumenten in verteilten Unternehmensdatenquellen zur Verbesserung ihrer Durchsuchbarkeit

By Tobias Wunderlich

Abstract

The goal of this thesis was to create an extension for the Enterprise-Search-System of the Fraunhofer Institute for Computer Graphics Research (IGD), enabling the semantical linking of documents to improve the overall search-quality. For this purpose a concept for recognition and classification of proper names and for their visualization within the search-platform was created. Additionally, classes of proper names were identified, depending on their potential to improve search-quality. Based on an existing rule-based approach of Named Entity Recognition (NER), a new NER-Application, focussed on characteristical documents used at the IGD, was developed. To be able to use the identified proper names as facets in the search-platform, the own NER-approach was integrated into the indexing-pipeline of the search-platform. At the end, the own NER-approach was evaluated and compared to other existing NER-approaches

Topics: natural language processing, semantic search, indexing
Year: 2012
OAI identifier: oai:fraunhofer.de:N-229239
Provided by: Fraunhofer-ePrints
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://publica.fraunhofer.de/d... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.