Location of Repository

Derivation of Dictionary for Process Inspector Tool on SharePoint Platform

By Václav Pavlín

Abstract

This master's thesis presents methods for mining important pieces of information from text. It analyses the problem of terms extraction from large document collection and describes the implementation using C# language and Microsoft SQL Server. The system uses stemming and a number of statistical methods for term extraction. This project also compares used methods and suggests the process of the dictionary derivation

Topics: MSSQL.; Text mining; frekvenční analýza; C#; lemmatizace; frequency analysis; stemming; tf-idf; Perl; extrakce pojmů; Dolování z textu; chí kvadrát; MySQL; chi-square; term extraction
Publisher: Vysoké učení technické v Brně. Fakulta informačních technologií
Year: 2012
OAI identifier: oai:invenio.nusl.cz:236591
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://www.nusl.cz/ntk/nusl-23... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.