Neural Machine Translation Decoding with Terminology Constraints

Author: Byrne WJ
de Gispert Adrià
Hasler Eva
Iglesias Gonzalo
Publication venue: Association for Computational Linguistics
Publication date: 10/09/2018
Despite the impressive quality improvements yielded by neural machine translation (NMT) systems, controlling their translation output to adhere to user-provided terminology constraints remains an open problem. We describe our approach to constrained neural decoding based on finite-state machines and multi-stack decoding which supports target-side constraints as well as constraints with corresponding aligned input text spans. We demonstrate the performance of our framework on multiple translation tasks and motivate the need for constrained decoding with attentions as a means of reducing misplacement and duplication when translating user constraints.

Apollo (Cambridge)

Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search

Author: Hokamp Chris
Liu Qun
Publication date: 01/01/2017
We present Grid Beam Search (GBS), an algorithm which extends beam search to allow the inclusion of pre-specified lexical constraints. The algorithm can be used with any model that generates a sequence $\mathbf{\hat{y}} = \{y_{0} \ldots y_{T}\}$ by maximizing $p(\mathbf{y} | \mathbf{x}) = \prod\limits_{t} p(y_{t} | \mathbf{x}; \{y_{0} \ldots y_{t-1}\})$. Lexical constraints take the form of phrases or words that must be present in the output sequence. This is a very general way to incorporate additional knowledge into a model's output without requiring any modification of the model parameters or training data. We demonstrate the feasibility and flexibility of Lexically Constrained Decoding by conducting experiments on Neural Interactive-Predictive Translation, as well as Domain Adaptation for Neural Machine Translation. Experiments show that GBS can provide large improvements in translation quality in interactive scenarios, and that, even without any user input, GBS can be used to achieve significant gains in performance in domain adaptation scenarios.

Comment: Accepted as a long paper at ACL 201
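
The idea in the abstract above can be illustrated with a small sketch. It is not the authors' implementation: it assumes a single constraint phrase, a hypothetical toy bigram table (`BIGRAMS`) standing in for the model's next-token distribution (the source sentence x is ignored), and a simplified grid that keeps one beam per number of covered constraint tokens. All function and variable names here are made up for illustration.

```python
import math

# Hypothetical toy "model": next-token probabilities depend only on the last token.
BIGRAMS = {
    "<s>": {"the": 0.9, "cat": 0.05, "dog": 0.05},
    "the": {"cat": 0.7, "dog": 0.3},
    "cat": {"sat": 0.8, "</s>": 0.2},
    "dog": {"sat": 0.8, "</s>": 0.2},
    "sat": {"</s>": 1.0},
}

def toy_next_log_probs(prefix):
    """Return log p(y_t | y_0..y_{t-1}) for every candidate next token."""
    last = prefix[-1] if prefix else "<s>"
    return {tok: math.log(p) for tok, p in BIGRAMS[last].items()}

def constrained_beam_search(next_log_probs, constraint, beam_size=2,
                            max_len=6, eos="</s>"):
    """Simplified grid-style search: one beam per count of covered constraint tokens."""
    n_cons = len(constraint)
    grid = {0: [(0.0, ())]}  # coverage count -> list of (log-prob, tokens)
    finished = []
    for _ in range(max_len):
        new_grid = {c: [] for c in range(n_cons + 1)}
        for c, beam in grid.items():
            for score, toks in beam:
                dist = next_log_probs(toks)
                # Free generation is allowed before the constraint starts
                # or after it has been fully covered.
                if c == 0 or c == n_cons:
                    for tok, lp in dist.items():
                        if tok == eos:
                            # A hypothesis may only finish once the constraint is covered.
                            if c == n_cons:
                                finished.append((score + lp, toks + (tok,)))
                            continue
                        new_grid[c].append((score + lp, toks + (tok,)))
                # Start or continue the constraint by forcing its next token.
                if c < n_cons and constraint[c] in dist:
                    tok = constraint[c]
                    new_grid[c + 1].append((score + dist[tok], toks + (tok,)))
        # Prune each coverage level to the top beam_size hypotheses.
        grid = {c: sorted(b, reverse=True)[:beam_size]
                for c, b in new_grid.items() if b}
        if not grid:
            break
    return max(finished) if finished else None

unconstrained = constrained_beam_search(toy_next_log_probs, ())
constrained = constrained_beam_search(toy_next_log_probs, ("dog",))
print(unconstrained[1], constrained[1])
```

With an empty constraint the search reduces to ordinary beam search, whereas passing `("dog",)` forces the lower-probability word into the output without touching the toy model's parameters, which is the property the abstract emphasizes.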

arXiv.org e-Print Archive

Crossref