Tagging the Teleman Corpus

Brants, Thorsten; Samuelsson, Christer

Authors: Thorsten Brants, Christer Samuelsson
Publication date: 1 January 1995

Abstract

Experiments were carried out comparing the Swedish Teleman and the English Susanne corpora using an HMM-based and a novel reductionistic statistical part-of-speech tagger. They indicate that tagging the Teleman corpus is the more difficult task, and that the performance of the two different taggers is comparable.

Comment: 14 pages, LaTeX, to appear in Proceedings of the 10th Nordic Conference of Computational Linguistics, Helsinki, Finland, 199

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.44.34...

Last time updated on 22/10/2014