6 research outputs found
Univerzális dependencia és morfológia magyar nyelvre
Ebben a cikkben beszámolunk az univerzális dependencia Ă©s morfolĂłgia elveinek magyarra törtĂ©nĹ‘ alkalmazásárĂłl, Ă©s bemutatjuk a kihĂvást jelentĹ‘ nyelvi jelensĂ©geket Ă©s az azokra nyĂşjtott megoldásainkat. A kidolgozott elvek alapján rĂ©szben automatikus, rĂ©szben kĂ©zi átalakĂtás segĂtsĂ©gĂ©vel lĂ©trehozzuk a Szeged Treebank egy Ăşjabb változatát
Parallel Syntactic Annotation of Multiple Languages
This paper describes an effort to investigate the incrementally deepening development of an interlingua notation, validated by human annotation of texts in English plus six languages. We begin with deep syntactic annotation, and in this paper present a series of annotation manuals for six different languages at the deep-syntactic level of representation. Many syntactic differences between languages are removed in the proposed syntactic annotation, making them useful resources for multilingual NLP project with semantic components. 1. Background: Goals of Annotation The IAMTC project (Farwell et al., 2004) aims at defining a level of interlingual annotation (the information needed to translate a text from one language to the next) based on annotating parallel multilingual texts (i.e., multiple translations into English of source texts in six foreign languages). As a first step in the sequence of annotations, we annotate texts for syntax. This level of annotation is called IL0. Subsequently, we augment IL0 with semantic disambiguation annotations, namely concepts from an ontology and semanti