
    Translationese and post-editese : how comparable is comparable quality?

    Whereas post-edited texts have been shown to be either of comparable quality to human translations or better, one study shows that people still seem to prefer human-translated texts. The idea of texts being inherently different despite being of high quality is not new. Translated texts, for example, are also different from original texts, a phenomenon referred to as ‘Translationese’. Research into Translationese has shown that, whereas humans cannot distinguish between translated and original text, computers have been trained to detect Translationese successfully. It remains to be seen whether the same can be done for what we call Post-editese. We first establish whether humans are capable of distinguishing post-edited texts from human translations, and then establish whether it is possible to build a supervised machine-learning model that can distinguish between translated and post-edited text.

    The universality of categories and meaning: a Coserian perspective

    Studies in linguistic typology have challenged the idea that languages can be analyzed in terms of a set of preestablished universal categories. Each language should instead be described “in its own terms,” a view consistent with the ‘old’ structuralist paradigm in linguistics. The renewed orientation toward differences between languages raises two questions: (i) How do we identify the meanings which are assumed to be crosslinguistically comparable? (ii) What is the relationship between language-particular categories and comparative concepts commonly used in linguistic typology? To answer these questions, this article focuses on a number of distinctions advocated by Eugenio Coseriu (1921–2002). Coseriu distinguishes three levels of meaning (designation, “signifiés,” and sense) and three types of universals (essential, empirical, and possible universals). Their relevance for linguistic typology is discussed with regard to the expression of possession and a particular diathesis in Japanese, viz. ukemi or “indirect passive.” As well as relating language-particular categories and comparative concepts, Coseriu’s approach offers a promising avenue to account for the ways language-specific meanings interact with extralinguistic knowledge and contents of discourse and texts, which are the object of translation.

    Learning Language Representations for Typology Prediction

    One central mystery of neural NLP is what neural models "know" about their subject matter. When a neural machine translation system learns to translate from one language to another, does it learn the syntax or semantics of the languages? Can this knowledge be extracted from the system to fill holes in human scientific knowledge? Existing typological databases contain relatively full feature specifications for only a few hundred languages. Exploiting the existence of parallel texts in more than a thousand languages, we build a massive many-to-one neural machine translation (NMT) system from 1017 languages into English, and use this to predict information missing from typological databases. Experiments show that the proposed method is able to infer not only syntactic, but also phonological and phonetic inventory features, and improves over a baseline that has access to information about the languages' geographic and phylogenetic neighbors. Comment: EMNLP 201

    Towards a corpus-based, statistical approach of translation quality : measuring and visualizing linguistic deviance in student translations

    In this article we present a corpus-based statistical approach to measuring translation quality, more particularly translation acceptability, by comparing the features of translated and original texts. We discuss initial findings that aim to support and objectify formative quality assessment. To that end, we extract a multitude of linguistic and textual features from both student and professional translation corpora that consist of many different translations by several translators in two different genres (fiction, news) and in two translation directions (English to French and French to Dutch). The numerical information gathered from these corpora is exploratively analysed with Principal Component Analysis, which enables us to identify stable, language-independent linguistic and textual indicators of student translations compared to translations produced by professionals. The differences between these types of translation are subsequently tested by means of ANOVA. The results clearly indicate that the proposed methodology is indeed capable of distinguishing between student and professional translations. It is claimed that this deviant behaviour indicates an overall lower translation quality in student translations: student translations tend to score lower at the acceptability level, that is, they deviate significantly from target-language norms and conventions. In addition, the proposed methodology is capable of assessing the acceptability of an individual student’s translation: a smaller linguistic distance between a given student translation and the norm set by the professional translations correlates with higher quality. The methodology is also able to provide objective and concrete feedback about the divergent linguistic dimensions in their text.

    On the Similarities Between Native, Non-native and Translated Texts

    We present a computational analysis of three language varieties: native, advanced non-native, and translation. Our goal is to investigate the similarities and differences between non-native language productions and translations, contrasting both with native language. Using a collection of computational methods we establish three main results: (1) the three types of texts are easily distinguishable; (2) non-native language and translations are closer to each other than each of them is to native language; and (3) some of these characteristics depend on the source or native language, while others do not, reflecting, perhaps, unified principles that similarly affect translations and non-native language. Comment: ACL2016, 12 pages

    Rethought Forms: How Do They Work?

    This paper is a critical evaluation of Holger Thesleff’s thinking on Plato’s Forms, especially of his “rethinking” of the matter, as he puts it in the title of his most recent contribution. It lays out a broadly sympathetic perspective through dialectical engagement with the main lines of his interpretation and reconstruction of Plato’s world. The aim is to launch the formal academic reception of that reconstruction (rethinking), which Thesleff cautiously and modestly presents as a “proposal”, his teaser to elicit a reaction, positive or negative. The exegetical focus is on tracing the inspiration and reasoning behind his “two-level” model of Plato’s ontology, which, in turn, supports his tripartite classification of Forms. The critical focus is on identifying potential areas of misunderstanding and supplying any explanations, analyses, or arguments that may enhance the clarity of the respective positions.

    Priming as cognitive motivation for the "first translational response universal"

    This article first surveys the current, somewhat unproductive state of research into potential universals of translation. Then it considers specifically the “first translational response universal” (Malmkjær 2011), suggesting that it may be rooted in the cognitive mechanism of priming. Empirical evidence for this is next sought in the analysis of a set of 34 novice translations of the same short passage from Swedish into Polish, which are shown to exhibit the effects of priming to a considerable extent. Overall, the objective is to illustrate a possible way of investigating postulated translation universals: first identifying a cluster of cognitive mechanisms to motivate the universal, then determining the linguistic structures that are concrete manifestations of such mechanisms in languages meeting in translation. The proposed research procedure thus proceeds from a cognitive process to a detailed language structure, allowing for the examination of phenomena observed in the “third code” on the supra-cultural level.

    Rethinking Plato’s Forms

    This is a proposal for rethinking the main lines of Plato’s philosophy, including some of the conceptual tools he uses for building and maintaining it. Drawing on a new interpretive paradigm for Plato’s overall vision, the central focus is on the so-called Forms. Regarding the guiding paradigm, we propose replacing the dualism of a world of Forms separated from a world of particulars, with the monistic model of a hierarchically structured universe comprising interdependent levels of reality. Regarding the tools of the trade, we distinguish between three constructs that have come, one and all, and largely indiscriminately, to be regarded as Forms: Ideal Forms, Conceptual Forms, and Relational Forms. This recalibration of what we know of Plato’s outlook, tools, and methods, together with a realignment of these with his general aims, will also help restore the philosopher’s emphasis on that which is good, a perspective often blurred in the structure of two worlds.