59 research outputs found
An introduction to statistical methods in machine translation
The intention of this article is to provide a concise introduction to the basic mathematical concepts of statistical translation models as they were introduced by Brown et al. (1993) in their groundbreaking work The Mathematics of Statistical Machine Translation: Parameter Estimation. We concentrate on a simplified description of the first two translation models, known as IBM Models 1 and 2. One major aim of this work is to serve as tutorial material for students of computational linguistics, mathematics or computer science; therefore many comments, additional examples and step-by-step explanations are given, augmenting the original formulas by Brown et al. (1993). For both discussed models the calculations for a small parallel corpus are described in detail.
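Since these formulas are the article's focus, it may help to recall the central Model 1 equation from Brown et al. (1993): after summing over all alignments, the probability of a foreign sentence f = f_1 ... f_m given an English sentence e = e_0 ... e_l (with e_0 the empty/NULL word) reduces to

    P(f \mid e) = \frac{\epsilon}{(l+1)^{m}} \prod_{j=1}^{m} \sum_{i=0}^{l} t(f_j \mid e_i)

where the t(f_j | e_i) are lexical translation probabilities estimated with EM. Model 2 replaces the uniform alignment factor 1/(l+1) with learned alignment probabilities a(i | j, m, l).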
A finite-state model of German compounds
This paper summarizes the results of my Master's thesis and the main points of a talk I presented at the seminar of the Department of Applied Logic at the Adam Mickiewicz University in Poznań. It gives a short overview of the structure of German compounds and of newer research concerning the role of the so-called interfixes. After an introduction to the concept of finite-state transducers, the construction of a transducer used for naive compound segmentation is described. Tag-based finite-state methods for the further analysis of the found segments are presented and discussed. Distributional transducer rules, whose construction assumes the existence of local and global morphological contexts, are proposed as a means of disambiguating the naive segmentation results.
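As a rough illustration of the naive segmentation step, here is a minimal Python sketch (the toy lexicon and the interfix list are illustrative assumptions; the thesis implements this step as a finite-state transducer, not as Python code):

def segment(word, lexicon, interfixes=("", "s", "es", "n", "en")):
    # Return every way to split `word` into lexicon entries, where
    # consecutive entries may be joined by an interfix (Fugenelement).
    if word == "":
        return [[]]
    analyses = []
    for i in range(1, len(word) + 1):
        head, rest = word[:i], word[i:]
        if head not in lexicon:
            continue
        for fix in interfixes:
            if rest.startswith(fix):
                for tail in segment(rest[len(fix):], lexicon, interfixes):
                    analyses.append([head] + tail)
    return analyses

lexicon = {"schiff", "fahrt", "kapitän"}  # toy lexicon, lowercased
print(segment("schifffahrtskapitän", lexicon))
# -> [['schiff', 'fahrt', 'kapitän']]; the 's' is consumed as an interfix

A naive segmenter of this kind deliberately overgenerates on larger lexicons, which is exactly why the distributional disambiguation rules are needed afterwards.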
Target-Side Context for Discriminative Models in Statistical Machine Translation
Discriminative translation models utilizing source context have been shown to
help statistical machine translation performance. We propose a novel extension
of this work using target context information. Surprisingly, we show that this
model can be efficiently integrated directly in the decoding process. Our
approach scales to large training data sizes and results in consistent
improvements in translation quality on four language pairs. We also provide an
analysis comparing the strengths of the baseline source-context model with our
extended source-context and target-context model, and we show that our extension
allows us to better capture morphological coherence. Our work is freely
available as part of Moses. Comment: Accepted as a long paper for ACL 2016.
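A minimal sketch of how such a discriminative score can be queried during decoding (the feature templates, weights and PhrasePair structure here are hypothetical; the paper's actual classifier and its integration into Moses are more involved):

from collections import namedtuple

PhrasePair = namedtuple("PhrasePair", "source target")

def score_option(weights, src_context, tgt_context, option):
    # Linear model over features that pair the candidate target phrase
    # with words from the source context AND with the target words
    # already generated by the partial hypothesis.
    feats = [("src", w, option.target) for w in src_context]
    feats += [("tgt", w, option.target) for w in tgt_context]  # target-side extension
    return sum(weights.get(f, 0.0) for f in feats)

weights = {("tgt", "der", "Hund"): 0.7}  # hypothetical learned weight
print(score_option(weights, ["the", "dog"], ["der"], PhrasePair("dog", "Hund")))  # 0.7

Because the target context consists only of words the decoder has already produced, the score is available at hypothesis-expansion time, which is what makes direct integration into decoding (rather than n-best reranking) possible.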
Near Human-Level Performance in Grammatical Error Correction with Hybrid Machine Translation
We combine two of the most popular approaches to automated Grammatical Error
Correction (GEC): GEC based on Statistical Machine Translation (SMT) and GEC
based on Neural Machine Translation (NMT). The hybrid system achieves new
state-of-the-art results on the CoNLL-2014 and JFLEG benchmarks. This GEC
system preserves the accuracy of SMT output and, at the same time, generates
more fluent sentences, as is typical for NMT. Our analysis shows that the
created systems are closer to reaching human-level performance than any other
GEC system reported so far. Comment: Accepted for oral presentation, research
track, short papers, at NAACL 2018.
An Exploration of Neural Sequence-to-Sequence Architectures for Automatic Post-Editing
In this work, we explore multiple neural architectures adapted for the task
of automatic post-editing of machine translation output. We focus on neural
end-to-end models that combine both inputs, mt (raw MT output) and src
(source language input), in a single neural architecture, modeling
{mt, src} -> pe directly. Apart from that, we investigate the influence of
hard-attention models which seem to be well-suited for monolingual tasks, as
well as combinations of both ideas. We report results on data sets provided
during the WMT-2016 shared task on automatic post-editing and can demonstrate
that dual-attention models that incorporate all available data in the APE
scenario in a single model improve on the best shared task system and on all
other published results after the shared task. Dual-attention models that are
combined with hard attention remain competitive despite applying fewer changes
to the input. Comment: Accepted for presentation at IJCNLP 2017.
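A toy sketch of the dual-attention idea: two soft-attention reads, one over the source encoder states and one over the raw-MT encoder states, whose contexts are concatenated before the post-editing decoder step (the dot-product scoring and the dimensions are illustrative assumptions, not the paper's exact architecture):

import numpy as np

def soft_attention(query, states):
    # Dot-product attention: softmax weights over `states`,
    # weighted sum of the states as the returned context vector.
    scores = states @ query
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ states

rng = np.random.default_rng(0)
d = 4
src_states = rng.normal(size=(6, d))  # encoded source sentence (src)
mt_states = rng.normal(size=(5, d))   # encoded raw MT output (mt)
query = rng.normal(size=d)            # decoder state at the current pe step

# Dual attention: one context per input, concatenated for the decoder.
context = np.concatenate([soft_attention(query, src_states),
                          soft_attention(query, mt_states)])
print(context.shape)  # (8,)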
On-the-Fly Fusion of Large Language Models and Machine Translation
We propose the on-the-fly ensembling of a machine translation model with an
LLM, prompted on the same task and input. We perform experiments on 4 language
pairs (both directions) with varying data amounts. We find that a slightly
weaker-at-translation LLM can improve translations of an NMT model, and
ensembling with an LLM can produce better translations than ensembling two
stronger MT models. We combine our method with various techniques from LLM
prompting, such as in-context learning and translation context.
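The core of such on-the-fly ensembling can be sketched as a token-level interpolation of the two models' next-token distributions (a minimal sketch under the assumptions of a shared vocabulary and a fixed interpolation weight; real systems must also reconcile differing tokenizers):

import numpy as np

def ensemble_step(p_nmt, p_llm, lam=0.5):
    # Interpolate the next-token distributions of the NMT model and the
    # LLM, both conditioned on the same prefix, and renormalize.
    p = lam * p_nmt + (1.0 - lam) * p_llm
    return p / p.sum()

# toy 5-token vocabulary
p_nmt = np.array([0.60, 0.20, 0.10, 0.05, 0.05])
p_llm = np.array([0.30, 0.40, 0.15, 0.10, 0.05])
print(ensemble_step(p_nmt, p_llm))  # argmax gives the next token at this step

At each beam-search step both models would be queried on the same partial hypothesis, so the fusion happens during decoding rather than by reranking finished translations.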