1 research outputs found

    Towards creation of a Lithuanian lemmatizer for open online collaborative machine translation

    No full text
    Online ISBN 978-3-319-67642-5This paper describes the baseline dictionary-based Lithuanian lemmatizer designed for an open online collaborative Machine Translation system. We evaluated our tool on the gold standard corpus composed of four different domains (official documents, fiction texts, scientific texts, and periodicals) containing ~1 million running words in total and obtained an encouraging accuracy equal to ~85.7%. Afterwards, we have made an error analysis, which will be used for the further improvements of our lemmatizerLituanistikos katedraTaikomosios informatikos katedraVytauto Didžiojo universiteta
    corecore