Dictionary writing system (DWS) plus corpus query package (CQP): the case of TshwaneLex

DE PAUW, Guy; de Schryver, Gilles-Maurice

research

Dictionary writing system (DWS) plus corpus query package (CQP): the case of TshwaneLex

Authors: Guy DE PAUW
Gilles-Maurice de Schryver
Publication date: 1 January 2007
Publisher

Abstract

In this article the integrated corpus query functionality of the dictionary compilation software TshwanelLex is analysed. Attention is given to the handling of both raw corpus data and annotated corpus data. With regard to the latter it is shown how, with a minimum of human effort, machine learning techniques can be employed to obtain part-of-speech tagged corpora that can be used for lexicographic purposes. All points are illustrated with data drawn from English and Northern Sotho. The tools and techniques themselves, however, are language-independent, and as Such the encouraging outcomes of this study are far-reaching

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Ghent University Academic Bibliography

oai:archive.ugent.be:384304

Last time updated on 12/11/2016