Search CORE

1,133 research outputs found

Multilingual projection for parsing truly low resource languages

Author: Agic Zeljko
Johannsen Anders Trærup
Martinez Alonso Hector
Plank Barbara
Schluter Natalie Elaine
Søgaard Anders
Publication venue
Publication date: 01/01/2016
Field of study

International audienceWe propose a novel approach to cross-lingual part-of-speech tagging and dependency parsing for truly low-resource languages. Our annotation projection-based approach yields tagging and parsing models for over 100 languages. All that is needed are freely available parallel texts, and taggers and parsers for resource-rich languages. The empirical evaluation across 30 test languages shows that our method consistently provides top-level accuracies , close to established upper bounds, and outperforms several competitive baselines

University of Groningen

Hal-Diderot

Proceedings - University of Groningen

ARTS repository - University of Groningen

INRIA a CCSD electronic archive server

Copenhagen University Research Information System

Dissertations of the University of Groningen

Multilingual Projection for Parsing Truly Low-Resource Languages

Author: Agić Željko
Johannsen Anders
Martínez Alonso Héctor
Plank Barbara
Schluter Natalie
Søgaard Anders
Publication venue
Publication date: 01/01/2016
Field of study

University of Groningen

Deep machine learning for syntactic annotation projection

Author: Marić Tina
Publication venue: University of Zagreb. Faculty of Science. Department of Mathematics.
Publication date: 01/03/2018
Field of study

U ovom radu istražuje se prijenosno učenje kroz više jezika s ciljem omogućavanja sintaktičke analize jezika koji nemaju dovoljno označenih podataka za učenje. Najbolji pristupi rješavanju problema uključuju projekciju oznaka sintaktičkih ovisnosti preko paralelnih tekstova, iz jezika koji imaju mnogo označenih podataka za učenje u jezike koji imaju nedovoljno. U prvom poglavlju opisuju se osnovni pojmovi morfološkog označivanja rečenica i parsanja njihovih ovisnosnih stabala kao i način označivanja sintaktičkih ovisnosti. Prvi pristup rješavanja problema projekcije oznaka sintaktičkih ovisnosti je opisan u drugom poglavlju. Zasnovan je na algoritmu predstavljenom u znanstvenom radu Multilingual Projection for Parsing Truly Low-Resource Languages [10]. Predložene su prilagodbe algoritma koje vode poboljšanju rezultata. U trećem poglavlju predstavljena je ideja o upotrebi neuronskih mreža za projekcije oznaka sintaktičkih ovisnosti te nekoliko ideja kojima rad u budućnosti može biti unaprijeđen.The purpose of this thesis was to explore cross-lingual transfer learning to dependency parsing, with a goal of enabling syntactic analysis for low-resource languages. The best approaches involve annotation projection: the transfer of dependency structures via parallel texts, from resource-rich to low-resource languages. In the first chapter, basic concepts of part of speech tagging and dependency parsing are described as well as the way of annotating texts. The first approach to solving an annotation projection problem is described in the second chapter. It is based on the algorithm presented in the paper Multilingual Projection for Parsing Truly Low-Resource Languages [10]. We propose the way of adjusting the existing algorithm which leads to the improvement of results. In the third chapter, the idea how to use neural networks for annotation projection is presented, and also some of the ideas how the work done in this thesis can be extended in the future

Repository of Faculty of Science, University of Zagreb

University of Zagreb Repository

Croatian Digital Thesis Repository

Deep machine learning for syntactic annotation projection

Author: Marić Tina
Publication venue: University of Zagreb. Faculty of Science. Department of Mathematics.
Publication date: 01/03/2018
Field of study

Croatian Digital Thesis Repository

Deep machine learning for syntactic annotation projection

Author: Marić Tina
Publication venue: University of Zagreb. Faculty of Science. Department of Mathematics.
Publication date: 01/03/2018
Field of study

University of Zagreb Repository