190 research outputs found
Linguistic Structure in Statistical Machine Translation
This thesis investigates the influence of linguistic structure in statistical machine translation. We develop a word reordering model based on syntactic parse trees and address the issues of pronouns and morphological agreement with a source discriminative word lexicon predicting the translation for individual words using structural features. When used in phrase-based machine translation, the models improve the translation for language pairs with different word order and morphological variation
When a Good Translation is Wrong in Context: Context-Aware Machine Translation Improves on Deixis, Ellipsis, and Lexical Cohesion
Though machine translation errors caused by the lack of context beyond one
sentence have long been acknowledged, the development of context-aware NMT
systems is hampered by several problems. Firstly, standard metrics are not
sensitive to improvements in consistency in document-level translations.
Secondly, previous work on context-aware NMT assumed that the sentence-aligned
parallel data consisted of complete documents while in most practical scenarios
such document-level data constitutes only a fraction of the available parallel
data. To address the first issue, we perform a human study on an
English-Russian subtitles dataset and identify deixis, ellipsis and lexical
cohesion as three main sources of inconsistency. We then create test sets
targeting these phenomena. To address the second shortcoming, we consider a
set-up in which a much larger amount of sentence-level data is available
compared to that aligned at the document level. We introduce a model that is
suitable for this scenario and demonstrate major gains over a context-agnostic
baseline on our new benchmarks without sacrificing performance as measured with
BLEU.Comment: ACL 2019 (camera-ready
Anaphora resolution for Arabic machine translation :a case study of nafs
PhD ThesisIn the age of the internet, email, and social media there is an increasing need for processing online information, for example, to support education and business. This has led to the rapid development of natural language processing technologies such as computational linguistics, information retrieval, and data mining. As a branch of computational linguistics, anaphora resolution has attracted much interest. This is reflected in the large number of papers on the topic published in journals such as Computational Linguistics. Mitkov (2002) and Ji et al. (2005) have argued that the overall quality of anaphora resolution systems remains low, despite practical advances in the area, and that major challenges include dealing with real-world knowledge and accurate parsing.
This thesis investigates the following research question: can an algorithm be found for the resolution of the anaphor nafs in Arabic text which is accurate to at least 90%, scales linearly with text size, and requires a minimum of knowledge resources? A resolution algorithm intended to satisfy these criteria is proposed. Testing on a corpus of contemporary Arabic shows that it does indeed satisfy the criteria.Egyptian Government
Incorporating pronoun function into statistical machine translation
Pronouns are used frequently in language, and perform a range of functions.
Some pronouns are used to express coreference, and others are not. Languages
and genres differ in how and when they use pronouns and this poses a problem
for Statistical Machine Translation (SMT) systems (Le Nagard and Koehn,
2010; Hardmeier and Federico, 2010; NovĂĄk, 2011; Guillou, 2012; Weiner, 2014;
Hardmeier, 2014). Attention to date has focussed on coreferential (anaphoric)
pronouns with NP antecedents, which when translated from English into a language
with grammatical gender, must agree with the translation of the head of
the antecedent. Despite growing attention to this problem, little progress has
been made, and little attention has been given to other pronouns.
The central claim of this thesis is that pronouns performing different functions
in text should be handled differently by SMT systems and when evaluating
pronoun translation. This motivates the introduction of a new framework to
categorise pronouns according to their function: Anaphoric/cataphoric reference,
event reference, extra-textual reference, pleonastic, addressee reference, speaker
reference, generic reference, or other function. Labelling pronouns according to
their function also helps to resolve instances of functional ambiguity arising from
the same pronoun in the source language having multiple functions, each with different
translation requirements in the target language. The categorisation framework
is used in corpus annotation, corpus analysis, SMT system development and
evaluation.
I have directed the annotation and conducted analyses of a parallel corpus of
English-German texts called ParCor (Guillou et al., 2014), in which pronouns
are manually annotated according to their function. This provides a first step
toward understanding the problems that SMT systems face when translating pronouns.
In the thesis, I show how analysis of manual translation can prove useful in
identifying and understanding systematic differences in pronoun use between two
languages and can help inform the design of SMT systems. In particular, the analysis
revealed that the German translations in ParCor contain more anaphoric and
pleonastic pronouns than their English originals, reflecting differences in pronoun
use. This raises a particular problem for the evaluation of pronoun translation.
Automatic evaluation methods that rely on reference translations to assess pronoun
translation, will not be able to provide an adequate evaluation when the
reference translation departs from the original source-language text. I also show
how analysis of the output of state-of-the-art SMT systems can reveal how well
current systems perform in translating different types of pronouns and indicate
where future efforts would be best directed. The analysis revealed that biases
in the training data, for example arising from the use of âitâ and âesâ as both
anaphoric and pleonastic pronouns in both English and German, is a problem
that SMT systems must overcome. SMT systems also need to disambiguate the
function of those pronouns with ambiguous surface forms so that each pronoun
may be translated in an appropriate way.
To demonstrate the value of this work, I have developed an automated post-editing
system in which automated tools are used to construct ParCor-style annotations
over the source-language pronouns. The annotations are then used to resolve
functional ambiguity for the pronoun âitâ with separate rules applied to the
output of a baseline SMT system for anaphoric vs. non-anaphoric instances. The
system was submitted to the DiscoMT 2015 shared task on pronoun translation
for English-French. As with all other participating systems, the automatic post-editing
system failed to beat a simple phrase-based baseline. A detailed analysis,
including an oracle experiment in which manual annotation replaces the automated
tools, was conducted to discover the causes of poor system performance.
The analysis revealed that the design of the rules and their strict application to
the SMT output are the biggest factors in the failure of the system.
The lack of automatic evaluation metrics for pronoun translation is a limiting
factor in SMT system development. To alleviate this problem, Christian Hardmeier
and I have developed a testing regimen called PROTEST comprising (1)
a hand-selected set of pronoun tokens categorised according to the different problems
that SMT systems face and (2) an automated evaluation script. Pronoun
translations can then be automatically compared against a reference translation,
with mismatches referred for manual evaluation. The automatic evaluation was
applied to the output of systems submitted to the DiscoMT 2015 shared task
on pronoun translation. This again highlighted the weakness of the post-editing
system, which performs poorly due to its focus on producing gendered pronoun
translations, and its inability to distinguish between pleonastic and event reference
pronouns
Towards Multilingual Coreference Resolution
The current work investigates the problems that occur when coreference resolution is considered as a multilingual task. We assess the issues that arise when a framework using the mention-pair coreference resolution model and memory-based learning for the resolution process are used. Along the way, we revise three essential subtasks of coreference resolution: mention detection, mention head detection and feature selection. For each of these aspects we propose various multilingual solutions including both heuristic, rule-based and machine learning methods. We carry out a detailed analysis that includes eight different languages (Arabic, Catalan, Chinese, Dutch, English, German, Italian and Spanish) for which datasets were provided by the only two multilingual shared tasks on coreference resolution held so far: SemEval-2 and CoNLL-2012. Our investigation shows that, although complex, the coreference resolution task can be targeted in a multilingual and even language independent way. We proposed machine learning methods for each of the subtasks that are affected by the transition, evaluated and compared them to the performance of rule-based and heuristic approaches. Our results confirmed that machine learning provides the needed flexibility for the multilingual task and that the minimal requirement for a language independent system is a part-of-speech annotation layer provided for each of the approached languages. We also showed that the performance of the system can be improved by introducing other layers of linguistic annotations, such as syntactic parses (in the form of either constituency or dependency parses), named entity information, predicate argument structure, etc. Additionally, we discuss the problems occurring in the proposed approaches and suggest possibilities for their improvement
Incremental Coreference Resolution for German
The main contributions of this thesis are as follows:
1. We introduce a general model for coreference and explore its application to German.
⢠The model features an incremental discourse processing algorithm which allows it to coherently address issues caused by underspecification of mentions, which is an especially pressing problem regarding certain German pronouns.
⢠We introduce novel features relevant for the resolution of German pronouns. A subset of these features are made accessible through the incremental architecture of the discourse processing model.
⢠In evaluation, we show that the coreference model combined with our features provides new state-of-the-art results for coreference and pronoun resolution for German.
2. We elaborate on the evaluation of coreference and pronoun resolution.
⢠We discuss evaluation from the view of prospective downstream applications that benefit from coreference resolution as a preprocessing component. Addressing the shortcomings of the general evaluation framework in this regard, we introduce an alternative framework, the Application Related Coreference Scores (ARCS).
⢠The ARCS framework enables a thorough comparison of different system outputs and the quantification of their similarities and differences beyond the common coreference evaluation. We demonstrate how the framework is applied to state-of-the-art coreference systems. This provides a method to track specific differences in system outputs, which assists researchers in comparing their approaches to related work in detail.
3. We explore semantics for pronoun resolution.
⢠Within the introduced coreference model, we explore distributional approaches to estimate the compatibility of an antecedent candidate and the occurrence context of a pronoun. We compare a state-of-the-art approach for word embeddings to syntactic co-occurrence profiles to this end.
⢠In comparison to related work, we extend the notion of context and thereby increase the applicability of our approach. We find that a combination of both compatibility models, coupled with the coreference model, provides a large potential for improving pronoun resolution performance.
We make available all our resources, including a web demo of the system, at: http://pub.cl.uzh.ch/purl/coreference-resolutio
- âŚ