Semantische Annotation von Dokumenten in verteilten Unternehmensdatenquellen zur Verbesserung ihrer Durchsuchbarkeit

Abstract

The goal of this thesis was to create an extension for the Enterprise-Search-System of the Fraunhofer Institute for Computer Graphics Research (IGD), enabling the semantical linking of documents to improve the overall search-quality. For this purpose a concept for recognition and classification of proper names and for their visualization within the search-platform was created. Additionally, classes of proper names were identified, depending on their potential to improve search-quality. Based on an existing rule-based approach of Named Entity Recognition (NER), a new NER-Application, focussed on characteristical documents used at the IGD, was developed. To be able to use the identified proper names as facets in the search-platform, the own NER-approach was integrated into the indexing-pipeline of the search-platform. At the end, the own NER-approach was evaluated and compared to other existing NER-approaches

Similar works

Full text

thumbnail-image

Fraunhofer-ePrints

redirect
Last time updated on 15/11/2016

This paper was published in Fraunhofer-ePrints.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.