Lexicographic Tools and Techniques.
We briefly describe what grid technologies are and how they could contribute to language
technologies, in particular to lexicographic activities. Based on our participation in the EC international
project MULTEXT-East, we present some aspects of language resource compatibility: unification and
standardisation. We underline the importance of the developed harmonised lexical (morphosyntactic)
specifications and descriptions of language data in machine-readable form in a common standard
encoding format – Corpus Encoding Standard format – for six Central and East European (CEE)
languages, as well as the language-independence of the tools employed.
The study and preparation of these results have received funding from the EC's Seventh Framework Programme
[FP7/2007-2013] under grant agreement 211938 MONDILEX
Looking back while moving forward : the impact of societal and technological developments on Flemish sign language lexicographic practices
This paper presents an overview of lexicographic projects related to Flemish Sign Language (Vlaamse Gebarentaal or VGT). VGT consists of five regional varieties. There was a time when this lexical variation was considered a disadvantage in deaf education and L2 teaching. That is why in the 1980s and early 1990s the very first lexicographic project related to Flemish signs concentrated on lexical unification in Signed Dutch. However, when the Flemish Deaf community officially rejected the use of Signed Dutch and started promoting VGT, this also led to a change in focus with respect to sign language lexicography. In 2004, the first electronic bilingual Flemish Sign Language/Dutch dictionary was published online. Since by then the Flemish Deaf community had decided not to opt for standardisation from above, it was decided to also include regional variation in the dictionary and to label the regional variants as such.
A look at past and current projects, and at initiatives planned for the near future, makes clear that both societal and technological developments have been shaping lexicographic work related to Flemish Sign Language. We also look forward to future developments that will pave the way for exciting possibilities with respect to sign language lexicography.
Automatic Extraction of Subcategorization from Corpora
We describe a novel technique and implemented system for constructing a
subcategorization dictionary from textual corpora. Each dictionary entry
encodes the relative frequency of occurrence of a comprehensive set of
subcategorization classes for English. An initial experiment, on a sample of 14
verbs which exhibit multiple complementation patterns, demonstrates that the
technique achieves accuracy comparable to previous approaches, which are all
limited to a highly restricted set of subcategorization classes. We also
demonstrate that a subcategorization dictionary built with the system improves
the accuracy of a parser by an appreciable amount.
Comment: 8 pages; requires aclap.sty. To appear in ANLP-9
Multilingual language resources and interoperability
This article introduces the topic of "Multilingual language resources and interoperability". We start with a taxonomy and parameters for classifying language resources. We then provide examples of interoperability issues and of resource architectures designed to solve them. Finally, we discuss aspects of linguistic formalisms and interoperability.
Constraint Grammar Applications
Proceedings of the NODALIDA 2011 Workshop
Constraint Grammar Applications.
Editors: Eckhard Bick, Kristin Hagen, Kaili Müürisep, Trond Trosterud.
NEALT Proceedings Series, Vol. 14 (2011), iv.
© 2011 The editors and contributors.
Published by
Northern European Association for Language
Technology (NEALT)
http://omilia.uio.no/nealt
Electronically published at
Tartu University Library (Estonia)
http://hdl.handle.net/10062/19231
Comprehension Assistant for Languages of Baltic States
Proceedings of the 16th Nordic Conference
of Computational Linguistics NODALIDA-2007.
Editors: Joakim Nivre, Heiki-Jaan Kaalep, Kadri Muischnek and Mare Koit.
University of Tartu, Tartu, 2007.
ISBN 978-9985-4-0513-0 (online)
ISBN 978-9985-4-0514-7 (CD-ROM)
pp. 167-174