
    A New Look at Translation: Teaching tools for language and literature

    Does translation have a place in the modern language or literature classroom? This article argues that, as long as translation is recognized as a distinct skill rather than a path to language acquisition, it can and should play a role in language instruction. The rising popularity of Web-based machine translation (WBMT) sites among students points to a need to help foreign language learners better understand the translation process. Along with a discussion of how instructors can minimize inappropriate use of WBMT, the article provides examples of how translation, in the proper context, can be used productively to teach both language and literature. It also shows that teachers have much to gain by supporting translation and interpretation as professional options for advanced language learners. Examples are given in Spanish.

    Approximating solution structure of the Weighted Sentence Alignment problem

    We study the complexity of approximating the solution structure of the bijective weighted sentence alignment problem of DeNero and Klein (2008). In particular, we consider the complexity of finding an alignment that has a significant overlap with an optimal alignment. We discuss ways of representing the solution for the general weighted sentence alignment problem as well as the phrases-to-words alignment problem, and show that computing a string which agrees with the optimal sentence partition on more than half of the positions (plus an arbitrarily small polynomial fraction) is NP-hard for the phrases-to-words alignment. For the general weighted sentence alignment, we obtain such a bound from agreement on a little over 2/3 of the bits. Additionally, we generalize the Hamming-distance approximation of solution structure to approximation with respect to the edit distance metric, obtaining similar lower bounds.
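    To make the agreement notion concrete, the following minimal sketch (illustrative only; the encoding of a sentence partition as a 0/1 string is an assumed convention, not taken from the paper) computes the Hamming-agreement fraction between a candidate solution string and an optimal one. The hardness results say that guaranteeing agreement above 1/2 (phrases-to-words) or a little over 2/3 (general weighted sentence alignment) is already NP-hard.

        # Assumed encoding: bit i marks whether position i starts a new
        # phrase in the sentence partition.
        def hamming_agreement(candidate: str, optimal: str) -> float:
            """Fraction of positions on which two equal-length bit-strings agree."""
            assert len(candidate) == len(optimal)
            matches = sum(c == o for c, o in zip(candidate, optimal))
            return matches / len(optimal)

        # Agrees on 6 of 8 positions (0.75), above both hardness thresholds.
        print(hamming_agreement("10110010", "10100110"))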

    A Formal Model of Ambiguity and its Applications in Machine Translation

    Systems that process natural language must cope with and resolve ambiguity. In this dissertation, a model of language processing is advocated in which multiple inputs and multiple analyses of inputs are considered concurrently and a single analysis is chosen only as a last resort. Compared to conventional models, this approach can be understood as replacing single-element inputs and outputs with weighted sets of inputs and outputs. Although processing components must deal with sets (rather than individual elements), constraints are imposed on the elements of these sets, and the representations from existing models may be reused. However, to deal efficiently with large (or infinite) sets, compact representations that share structure between elements, such as weighted finite-state transducers and synchronous context-free grammars, are necessary. These representations, and algorithms for manipulating them, are discussed in depth. To establish the effectiveness and tractability of the proposed processing model, it is applied to several problems in machine translation. Starting with spoken language translation, it is shown that translating a set of transcription hypotheses yields better translations than a baseline in which a single (1-best) transcription hypothesis is selected and then translated, independent of the translation model formalism used. More subtle forms of ambiguity that arise even in text-only translation (such as decisions conventionally made during system development about how to preprocess text) are then discussed, and it is shown that the ambiguity-preserving paradigm can be employed in these cases as well, again leading to improved translation quality. Finally, a model for supervised learning from training data in which sets (rather than single elements) of correct labels are provided for each training instance is introduced and used to learn a model of compound word segmentation, which serves as a preprocessing step in machine translation.
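    As a rough illustration of the set-based paradigm (a toy sketch under assumed data; the dissertation's actual machinery uses weighted finite-state transducers, not an explicit score table), the snippet below contrasts a 1-best pipeline, which commits to the single most probable transcription before translating, with a set-based approach that weights each candidate translation by every transcription hypothesis. All transcriptions, translations, and scores here are hypothetical.

        from collections import defaultdict

        # Hypothetical ASR output: (transcription, probability) pairs.
        hypotheses = [("wreck a nice beach", 0.40),
                      ("recognize speech", 0.35),
                      ("recognise speech", 0.25)]

        # Hypothetical translation scores conditioned on each transcription.
        toy_table = {
            "wreck a nice beach": {"destrozar una playa": 0.80, "reconocer el habla": 0.20},
            "recognize speech":   {"destrozar una playa": 0.10, "reconocer el habla": 0.90},
            "recognise speech":   {"destrozar una playa": 0.15, "reconocer el habla": 0.85},
        }

        # 1-best baseline: pick the most probable transcription, then translate it.
        best_hyp, _ = max(hypotheses, key=lambda h: h[1])
        one_best = max(toy_table[best_hyp], key=toy_table[best_hyp].get)

        # Set-based approach: marginalize translation scores over the whole
        # weighted set of hypotheses before committing to a translation.
        totals = defaultdict(float)
        for hyp, p in hypotheses:
            for translation, score in toy_table[hyp].items():
                totals[translation] += p * score
        set_based = max(totals, key=totals.get)

        print(one_best)   # "destrozar una playa" (misled by the 1-best error)
        print(set_based)  # "reconocer el habla"  (recovered by the set)

    In this toy example the misrecognized 1-best hypothesis drags the baseline to the wrong translation, while marginalizing over the weighted set recovers the intended one, which is the effect the dissertation reports for spoken language translation.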