    MONDILEX – towards the research infrastructure for digital resources in Slavic lexicography

    Lexicographic Tools and Techniques.

    We briefly describe what grid technologies are and how they could contribute to language technologies, in particular to lexicographic activities. Based on our participation in the EC international project MULTEXT-East, we present some aspects of language resource compatibility: unification and standardisation. We underline the importance of the harmonised lexical (morphosyntactic) specifications and descriptions of language data developed in machine-readable form in a common standard encoding format, the Corpus Encoding Standard format, for six Central and East European (CEE) languages, as well as the language-independence of the tools employed. The study and preparation of these results have received funding from the EC's Seventh Framework Programme [FP7/2007-2013] under grant agreement 211938 MONDILEX.

    Looking back while moving forward: the impact of societal and technological developments on Flemish sign language lexicographic practices

    This paper presents an overview of lexicographic projects related to Flemish Sign Language (Vlaamse Gebarentaal or VGT). VGT consists of five regional varieties. There was a time when this lexical variation was considered a disadvantage in deaf education and L2 teaching. That is why, in the 1980s and early 1990s, the very first lexicographic project related to Flemish signs concentrated on lexical unification in Signed Dutch. However, when the Flemish Deaf community officially rejected the use of Signed Dutch and started promoting VGT, the focus of sign language lexicography changed as well. In 2004, the first electronic bilingual Flemish Sign Language/Dutch dictionary was published online. Since by then the Flemish Deaf community had decided not to opt for standardisation from above, regional variation was also included in the dictionary, with the regional variants labelled as such. Looking at past and current projects and at initiatives planned for the near future makes clear that both societal and technological developments have been shaping lexicographic work related to Flemish Sign Language. We also look ahead to future developments that will open up new possibilities for sign language lexicography.

    Automatic Extraction of Subcategorization from Corpora

    We describe a novel technique and implemented system for constructing a subcategorization dictionary from textual corpora. Each dictionary entry encodes the relative frequency of occurrence of a comprehensive set of subcategorization classes for English. An initial experiment, on a sample of 14 verbs which exhibit multiple complementation patterns, demonstrates that the technique achieves accuracy comparable to previous approaches, which are all limited to a highly restricted set of subcategorization classes. We also demonstrate that a subcategorization dictionary built with the system improves the accuracy of a parser by an appreciable amount. Comment: 8 pages; requires aclap.sty. To appear in ANLP-9

    Multilingual language resources and interoperability

    This article introduces the topic of "Multilingual language resources and interoperability". We start with a taxonomy and parameters for classifying language resources. We then give examples of interoperability issues and of resource architectures that address them. Finally, we discuss aspects of linguistic formalisms and interoperability.

    Constraint Grammar Applications

    Proceedings of the NODALIDA 2011 Workshop Constraint Grammar Applications. Editors: Eckhard Bick, Kristin Hagen, Kaili Müürisep, Trond Trosterud. NEALT Proceedings Series, Vol. 14 (2011), iv. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/19231

    Comprehension Assistant for Languages of Baltic States

    Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Joakim Nivre, Heiki-Jaan Kaalep, Kadri Muischnek and Mare Koit. University of Tartu, Tartu, 2007. ISBN 978-9985-4-0513-0 (online) ISBN 978-9985-4-0514-7 (CD-ROM) pp. 167-174