Syntactic discriminative language model rerankers for statistical machine translation
This article describes a method that successfully exploits syntactic features for n-best translation candidate reranking using perceptrons. We motivate the utility of syntax by demonstrating the superior performance of parsers over n-gram language models in differentiating between statistical machine translation output and human translations. Our approach uses discriminative language modelling to rerank the n-best translations generated by a statistical machine translation system. Performance is evaluated for Arabic-to-English translation using NIST's MT-Eval benchmarks. While deep features extracted from parse trees do not consistently help, we show how features extracted from a shallow Part-of-Speech annotation layer outperform a competitive baseline and a state-of-the-art comparative reranking approach, leading to significant BLEU improvements on three different test sets.
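As a rough illustration of the setup this abstract describes, the sketch below trains a plain perceptron over shallow POS n-gram features to pick one candidate from each n-best list. The feature choice, data layout, and function names are assumptions for illustration, not the authors' implementation.

    from collections import defaultdict

    def pos_ngram_features(pos_tags, n=3):
        """Count POS n-grams in a candidate translation's tag sequence."""
        feats = defaultdict(int)
        for i in range(len(pos_tags) - n + 1):
            feats[" ".join(pos_tags[i:i + n])] += 1
        return feats

    def train_reranker(nbest_lists, oracle_indices, epochs=5):
        """nbest_lists: one list of POS-tag sequences per source sentence;
        oracle_indices: index of the best candidate (e.g., by BLEU) per list."""
        w = defaultdict(float)
        score = lambda f: sum(w[k] * v for k, v in f.items())
        for _ in range(epochs):
            for nbest, oracle in zip(nbest_lists, oracle_indices):
                feats = [pos_ngram_features(cand) for cand in nbest]
                pred = max(range(len(nbest)), key=lambda i: score(feats[i]))
                if pred != oracle:  # perceptron update toward the oracle candidate
                    for k, v in feats[oracle].items():
                        w[k] += v
                    for k, v in feats[pred].items():
                        w[k] -= v
        return w

In practice, a discriminative score of this kind is typically interpolated with the base MT system's own model score rather than used alone.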
Aligning Neural Machine Translation Models: Human Feedback in Training and Inference
Reinforcement learning from human feedback (RLHF) is a recent technique for improving the quality of the text generated by a language model, making it closer to what humans would produce. A core ingredient in RLHF's success in aligning and improving large language models (LLMs) is its reward model, trained on human feedback about model outputs. In machine translation (MT), where metrics trained from human annotations can readily be used as reward models, recent methods using minimum Bayes risk decoding and reranking have succeeded in improving the final quality of translation. In this study, we comprehensively explore and compare techniques for integrating quality metrics as reward models into the MT pipeline. This includes using the reward model for data filtering, during the training phase through RL, and at inference time through reranking techniques, and we assess the effects of combining these in a unified approach. Our experimental results, obtained across multiple translation tasks, underscore the crucial role of effective data filtering, based on estimated quality, in harnessing the full potential of RL for enhancing MT quality. Furthermore, our findings demonstrate the effectiveness of combining RL training with reranking techniques, showing substantial improvements in translation quality.
Comment: 14 pages, work-in-progress
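For the inference-time side, the sketch below shows minimum Bayes risk decoding over an n-best list, using a learned metric as the utility function, one of the reranking techniques the abstract refers to. The metric signature and the toy lexical-overlap scorer are stand-ins, not any specific trained metric such as a COMET-style model.

    def mbr_decode(candidates, metric):
        """Pick the candidate with the highest expected utility, where every
        other candidate serves as a pseudo-reference (uniform weights)."""
        def expected_utility(hyp):
            others = [c for c in candidates if c is not hyp]
            if not others:
                return 0.0
            return sum(metric(hyp, ref) for ref in others) / len(others)
        return max(candidates, key=expected_utility)

    # Toy lexical-overlap scorer standing in for a learned quality metric:
    def toy_metric(hyp, ref):
        h, r = set(hyp.split()), set(ref.split())
        return len(h & r) / max(len(h | r), 1)

    best = mbr_decode(["the cat sat", "a cat sat", "the dog ran"], toy_metric)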
Large-scale Dictionary Construction via Pivot-based Statistical Machine Translation with Significance Pruning and Neural Network Features
We present our ongoing work on large-scale Japanese-Chinese bilingual dictionary construction via pivot-based statistical machine translation. We utilize statistical significance pruning to control noisy translation pairs that are induced by pivoting. We construct a large dictionary which we manually verify to be of high quality. We then use this dictionary and a parallel corpus to learn bilingual neural network language models to obtain features for reranking the n-best list, which leads to an absolute improvement of 5% in accuracy compared to a setting that does not use significance pruning and reranking.
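A compressed sketch of the pivoting step follows, assuming phrase tables stored as nested probability dicts (ja2en[ja][en] = p(en|ja), en2zh[en][zh] = p(zh|en)); the simple probability threshold below is only a stand-in for the statistical significance test the paper actually uses to prune noisy pairs.

    from collections import defaultdict

    def pivot_dictionary(ja2en, en2zh, threshold=0.05):
        ja2zh = defaultdict(lambda: defaultdict(float))
        for ja, en_probs in ja2en.items():
            for en, p_en in en_probs.items():          # marginalize over the pivot
                for zh, p_zh in en2zh.get(en, {}).items():
                    ja2zh[ja][zh] += p_en * p_zh       # p(zh|ja) = sum_en p(zh|en)p(en|ja)
        # prune pairs whose pivoted score is too weak to trust
        return {ja: {zh: p for zh, p in zh_probs.items() if p >= threshold}
                for ja, zh_probs in ja2zh.items()}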
LIUM Machine Translation Systems for WMT17 News Translation Task
This paper describes LIUM's submissions to the WMT17 News Translation Task for the English-German, English-Turkish, English-Czech and English-Latvian language pairs. We train BPE-based attentive Neural Machine Translation systems, with and without factored outputs, using the open-source nmtpy framework. Competitive scores were obtained by ensembling various systems and exploiting the availability of target-side monolingual corpora for back-translation. The impact of back-translation quantity and quality is also analyzed for English-Turkish, where our post-deadline submission surpassed the best entry by +1.6 BLEU.
Comment: News Translation Task system description paper for WMT17
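The back-translation step exploited here amounts to a few lines of data plumbing; `translate` below is a hypothetical handle to a trained target-to-source model, not part of nmtpy's API.

    def back_translate(target_monolingual, translate):
        """Return synthetic (source, target) pairs: the machine-translated
        sentence becomes the source, the authentic sentence the target."""
        return [(translate(t), t) for t in target_monolingual]

    # Synthetic pairs are then mixed with the authentic bitext for training:
    # train_data = authentic_pairs + back_translate(mono_target, reverse_model)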
Neural Reranking for Named Entity Recognition
We propose a neural reranking system for named entity recognition (NER). The basic idea is to leverage recurrent neural network models to learn sentence-level patterns that involve named entity mentions. In particular, given an output sentence produced by a baseline NER model, we replace all entity mentions, such as "Barack Obama", with their entity types, such as "PER". The resulting sentence patterns carry direct output information, yet are less sparse because specific named entities are abstracted away. For example, "PER was born in LOC" is such a pattern. LSTM and CNN structures are used to learn deep representations of these patterns for reranking. Results show that our system significantly improves NER accuracy over two different baselines, giving the best reported results on a standard benchmark.
Comment: Accepted as a regular paper at RANLP 2017
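The pattern-construction step described above is easy to make concrete; the span format (start, end, type) in the sketch below is an assumption for illustration, not the paper's data format.

    def to_pattern(tokens, entities):
        """tokens: list of words; entities: list of (start, end, type) spans,
        end exclusive and non-overlapping."""
        out, i = [], 0
        for start, end, etype in sorted(entities):
            out.extend(tokens[i:start])
            out.append(etype)        # e.g., ["Barack", "Obama"] -> "PER"
            i = end
        out.extend(tokens[i:])
        return " ".join(out)

    # to_pattern("Barack Obama was born in Hawaii".split(),
    #            [(0, 2, "PER"), (5, 6, "LOC")])  ->  "PER was born in LOC"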