Skip to main content
Article thumbnail
Location of Repository

Entity Knowledge Base Creation from Czech Wikipedia

By Martin Sychra

Abstract

The aim of this thesis is to propose and implement a system for an automatic extraction of named entities from Czech Wikipedia, to create a knowledge base consisting of these entities and to evaluate results of the created system. The first part explains basic notions of this field and discusses related work. The main part proposes several methods of extraction and details their implementation. The following types of entities are extracted: people, places, events and organizations. The final part of the thesis presents results, i.e., the success of the individual methods for each entity type and statistics on extraction of the individual entities in the whole Czech Wikipedia context

Topics: česká Wikipedie; automatická extrakce; Extraction of named entities; zpracování přirozeného jazyka; Czech Wikipedia; znalostní báze; natural language processing; Extrakce pojmenovaných entit; automatic extraction; knowledge base
Publisher: Vysoké učení technické v Brně. Fakulta informačních technologií
Year: 2014
OAI identifier: oai:invenio.nusl.cz:239708
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://www.nusl.cz/ntk/nusl-23... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.