5,740 research outputs found

Compositional Morphology for Word Representations and Language Modelling

Author: Blunsom Phil
Botha Jan A.
Publication date: 01/01/2014

This paper presents a scalable method for integrating compositional morphological representations into a vector-based probabilistic language model. Our approach is evaluated in the context of log-bilinear language models, rendered suitably efficient for implementation inside a machine translation decoder by factoring the vocabulary. We perform both intrinsic and extrinsic evaluations, presenting results on a range of languages which demonstrate that our model learns morphological representations that both perform well on word similarity tasks and lead to substantial reductions in perplexity. When used for translation into morphologically rich languages with large vocabularies, our models obtain improvements of up to 1.2 BLEU points relative to a baseline system using back-off n-gram models. Comment: Proceedings of the 31st International Conference on Machine Learning (ICML

arXiv.org e-Print Archive

CiteSeerX

Oxford University Research Archive
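
To make "compositional morphological representations" in the abstract above concrete, here is a minimal sketch. The abstract does not say how morpheme vectors are combined, so the summation used here is an assumption, and the morpheme table, its values, and the function name `compose` are hypothetical.

```python
# Hypothetical toy morpheme embeddings; the values are illustrative only.
MORPHEME_VECTORS = {
    "un": [0.1, -0.2, 0.0],
    "fortunate": [0.4, 0.3, -0.1],
    "ly": [0.0, 0.1, 0.2],
}


def compose(morphemes, table=MORPHEME_VECTORS):
    """Build a word vector by summing its morpheme vectors.

    Summation is an assumed composition function, chosen for simplicity.
    """
    dim = len(next(iter(table.values())))
    total = [0.0] * dim
    for morpheme in morphemes:
        for i, value in enumerate(table[morpheme]):
            total[i] += value
    return total


# A word that may never occur in training still gets a vector
# as long as its morphemes are known.
word_vector = compose(["un", "fortunate", "ly"])
```

Under this assumption, rare or unseen word forms share parameters with frequent words built from the same morphemes, which is one way such a model could cope with large vocabularies.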

Factored Translation Models

Author: Hoang Hieu
Koehn Philipp
Publication date: 01/06/2007

Edinburgh Research Explorer

Word Representation Models for Morphologically Rich Languages in Neural Machine Translation

Author: Cohn Trevor
Haffari Gholamreza
He Xuanli
Vylomova Ekaterina
Publication date: 14/06/2016

Dealing with the complex word forms in morphologically rich languages is an open problem in language processing, and is particularly important in translation. In contrast to most modern neural systems of translation, which discard the identity for rare words, in this paper we propose several architectures for learning word representations from character and morpheme level word decompositions. We incorporate these representations in a novel machine translation model which jointly learns word alignments and translations via a hard attention mechanism. Evaluating on translating from several morphologically rich languages into English, we show consistent improvements over strong baseline methods, of between 1 and 1.5 BLEU points.

arXiv.org e-Print Archive

Monash University Research Portal

Domain adaptation strategies in statistical machine translation: a brief overview

Author: Ruiz Costa-Jussà Marta
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/01/2015

© Cambridge University Press, 2015. Statistical machine translation (SMT) is gaining interest given that it can easily be adapted to any pair of languages. One of the main challenges in SMT is domain adaptation because the performance in translation drops when testing conditions deviate from training conditions. Many research works are arising to face this challenge. Research is focused on trying to exploit all kinds of material, if available. This paper provides an overview of research, which copes with the domain adaptation challenge in SMT. Peer Reviewed. Postprint (author's final draft)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Exploring different representational units in English-to-Turkish statistical machine translation

Author: Durgar El-Kahlout İlknur
Oflazer Kemal
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2007

We investigate different representational granularities for sub-lexical representation in statistical machine translation work from English to Turkish. We find that (i) representing both Turkish and English at the morpheme-level but with some selective morpheme-grouping on the Turkish side of the training data, (ii) augmenting the training data with “sentences” comprising only the content words of the original training data to bias root word alignment, (iii) reranking the n-best morpheme-sequence outputs of the decoder with a word-based language model, and (iv) using model iteration all provide a non-trivial improvement over a fully word-based baseline. Despite our very limited training data, we improve from 20.22 BLEU points for our simplest model to 25.08 BLEU points for an improvement of 4.86 points or 24% relative.

CiteSeerX

Crossref

Sabanci University Research Database
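
The improvement figures quoted in the abstract above can be checked with a few lines of arithmetic. This sketch uses only the two BLEU scores given there; the variable names are illustrative.

```python
# BLEU scores as reported in the abstract.
baseline_bleu = 20.22  # simplest model
best_bleu = 25.08      # best reported model

# Absolute gain in BLEU points and gain relative to the baseline.
absolute_gain = best_bleu - baseline_bleu
relative_gain = absolute_gain / baseline_bleu

print(f"{absolute_gain:.2f} BLEU absolute, {relative_gain:.0%} relative")
# prints "4.86 BLEU absolute, 24% relative"
```

The relative figure is the absolute gain divided by the baseline score, which matches the 24% stated in the abstract.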

Modeling Target-Side Inflection in Neural Machine Translation

Author: Fraser Alexander
Weller-Di Marco Marion
Tamchyna Aleš
Publication date: 01/01/2017

NMT systems have problems with large vocabulary sizes. Byte-pair encoding (BPE) is a popular approach to solving this problem, but while BPE allows the system to generate any target-side word, it does not enable effective generalization over the rich vocabulary in morphologically rich languages with strong inflectional phenomena. We introduce a simple approach to overcome this problem by training a system to produce the lemma of a word and its morphologically rich POS tag, which is then followed by a deterministic generation step. We apply this strategy for English-Czech and English-German translation scenarios, obtaining improvements in both settings. We furthermore show that the improvement is not due to only adding explicit morphological information. Comment: Accepted as a research paper at WMT17. (Updated version with corrected references.)

arXiv.org e-Print Archive

Crossref
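
The two-step idea in the abstract above (predict a lemma and a morphological tag, then generate the surface form deterministically) can be illustrated with a toy sketch. The lookup table, the tag strings, the fallback to the bare lemma, and the function name `generate` are all assumptions made for illustration; the abstract does not describe the actual generation component.

```python
# Hypothetical inflection table: (lemma, tag) -> surface form.
# The tag format is made up for this example.
INFLECTION_TABLE = {
    ("Haus", "N.Nom.Pl"): "Häuser",
    ("Haus", "N.Dat.Pl"): "Häusern",
    ("gehen", "V.3.Sg.Pres"): "geht",
}


def generate(pairs, table=INFLECTION_TABLE):
    """Map predicted (lemma, tag) pairs to surface forms.

    Falls back to the bare lemma when the pair is not in the table
    (an assumed policy, not taken from the paper).
    """
    return [table.get(pair, pair[0]) for pair in pairs]


# Pretend the translation system emitted these (lemma, tag) pairs.
predicted = [("Haus", "N.Nom.Pl"), ("gehen", "V.3.Sg.Pres"), ("Berlin", "PN")]
surface = generate(predicted)
```

Because the second step is a deterministic mapping, the translation model only has to choose among lemmas and tags, which is a smaller output space than the full set of inflected forms.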