21,640 research outputs found

    Skills and Profile of the New Role of the Translator as MT Post-editor

    Get PDF
    This paper explores the skills and profile of the new role of the translator as MT post-editor in view of the rising interest and use of MT in the translation industry. After a brief review of the relevant literature declaring post-editing (PE) as a profession on its own, the paper goes on to identify the different tasks involved in PE processes, following the work of Krings (Krings, 2001). Then, a series of competences are defined and grouped into three main categories: core competences, linguistic skills and instrumental competences. Finally, a description of the controlled translation scenario of MT PE is advanced taking into account the overall scenario of any translation project, including client description, text domain, text description, use of glossaries, MT engine, MT output quality and purpose of the translated text.Aquest article aborda les habilitats i les característiques del perfil del nou rol del traductor com a posteditor de traducció automàtica, tot i tenint en compte l'augment de l'interès en i l'ús de la traducció automàtica per part de la industria de la traducció. Després d'una breu revisió de la literatura més rellevant sobre postedició (PE) en tant que professió per ella mateixa, l'article identifica les diferents tasques implicades en els processos de PE, segons la proposta de Krings (2001). A continuació es defineix una sèrie de competències que s'agrupen en tres categories principals: competències nuclears, habilitats lingüístiques i competències instrumentals. Finalment el artículo proposa una descripció de l'escenari de traducció controlada propi de la PE de traducció automàtica, sense perdre de vista l'escenari general de qualsevol projecte de traducció, que inclou la descripció del client, el domini del text, la descripció del text, l'ús de glossaris, el motor de traducció automàtica, la qualitat de la traducció automàtica resultant i el propòsit del text traduït.Este artículo aborda las habilidades y las características del perfil del nuevo rol del traductor como poseditor de traducción automática, a la luz del aumento del interés en y del uso de la traducción automática por parte de la industria de la traducción. Después de una breve revisión de la literatura más relevante sobre posedición (PE) en tanto que profesión por sí misma, en el artículo se identifican las diferentes tareas implicadas en los procesos de PE, según la propuesta de Krings (2001). A continuación se define una serie de competencias que se agrupan en tres categorías principales: competencias nucleares, habilidades lingüísticas y competencias instrumentales. Finalmente el artículo propone una descripción del escenario de traducción controlada propio de la PE de traducción automática, sin perder de vista el marco general de cualquier proyecto de traducción, que incluye la descripción del cliente, el dominio del texto, la descripción del texto, el uso de glosarios, el motor de traducción automática, la calidad de la traducción automática resultante y el propósito del texto traducido

    A Neural, Interactive-predictive System for Multimodal Sequence to Sequence Tasks

    Full text link
    We present a demonstration of a neural interactive-predictive system for tackling multimodal sequence to sequence tasks. The system generates text predictions to different sequence to sequence tasks: machine translation, image and video captioning. These predictions are revised by a human agent, who introduces corrections in the form of characters. The system reacts to each correction, providing alternative hypotheses, compelling with the feedback provided by the user. The final objective is to reduce the human effort required during this correction process. This system is implemented following a client-server architecture. For accessing the system, we developed a website, which communicates with the neural model, hosted in a local server. From this website, the different tasks can be tackled following the interactive-predictive framework. We open-source all the code developed for building this system. The demonstration in hosted in http://casmacat.prhlt.upv.es/interactive-seq2seq.Comment: ACL 2019 - System demonstration

    Translation methods and experience : a comparative analysis of human translation and post-editing with students and professional translators

    Get PDF
    While the benefits of using post-editing for technical texts have been more or less acknowledged, it remains unclear whether post-editing is a viable alternative to human translation for more general text types. In addition, we need a better understanding of both translation methods and how they are performed by students as well as professionals, so that pitfalls can be determined and translator training can be adapted accordingly. In this article, we aim to get a better understanding of the differences between human translation and post-editing for newspaper articles. Processes were registered by means of eye tracking and keystroke logging, which allows us to study translation speed, cognitive load, and the usage of external resources. We also look at the final quality of the product as well as translators' attitude towards both methods of translation

    Translation and human-computer interaction

    Get PDF
    This paper seeks to characterise translation as a form of human-computer interaction. The evolution of translator-computer interaction is explored and the challenges and benefits are enunciated. The concept of cognitive ergonomics is drawn on to argue for a more caring and inclusive approach towards the translator by developers of translation technology. A case is also made for wider acceptance by the translation community of the benefits of the technology at their disposal and for more humanistic research on the impact of technology on the translator, the translation profession and the translation process

    Improving the translation environment for professional translators

    Get PDF
    When using computer-aided translation systems in a typical, professional translation workflow, there are several stages at which there is room for improvement. The SCATE (Smart Computer-Aided Translation Environment) project investigated several of these aspects, both from a human-computer interaction point of view, as well as from a purely technological side. This paper describes the SCATE research with respect to improved fuzzy matching, parallel treebanks, the integration of translation memories with machine translation, quality estimation, terminology extraction from comparable texts, the use of speech recognition in the translation process, and human computer interaction and interface design for the professional translation environment. For each of these topics, we describe the experiments we performed and the conclusions drawn, providing an overview of the highlights of the entire SCATE project

    Multi-modal post-editing of machine translation

    Get PDF
    As MT quality continues to improve, more and more translators switch from traditional translation from scratch to PE of MT output, which has been shown to save time and reduce errors. Instead of mainly generating text, translators are now asked to correct errors within otherwise helpful translation proposals, where repetitive MT errors make the process tiresome, while hard-to-spot errors make PE a cognitively demanding activity. Our contribution is three-fold: first, we explore whether interaction modalities other than mouse and keyboard could well support PE by creating and testing the MMPE translation environment. MMPE allows translators to cross out or hand-write text, drag and drop words for reordering, use spoken commands or hand gestures to manipulate text, or to combine any of these input modalities. Second, our interviews revealed that translators see value in automatically receiving additional translation support when a high CL is detected during PE. We therefore developed a sensor framework using a wide range of physiological and behavioral data to estimate perceived CL and tested it in three studies, showing that multi-modal, eye, heart, and skin measures can be used to make translation environments cognition-aware. Third, we present two multi-encoder Transformer architectures for APE and discuss how these can adapt MT output to a domain and thereby avoid correcting repetitive MT errors.Angesichts der stetig steigenden Qualität maschineller Übersetzungssysteme (MÜ) post-editieren (PE) immer mehr Übersetzer die MÜ-Ausgabe, was im Vergleich zur herkömmlichen Übersetzung Zeit spart und Fehler reduziert. Anstatt primär Text zu generieren, müssen Übersetzer nun Fehler in ansonsten hilfreichen Übersetzungsvorschlägen korrigieren. Dennoch bleibt die Arbeit durch wiederkehrende MÜ-Fehler mühsam und schwer zu erkennende Fehler fordern die Übersetzer kognitiv. Wir tragen auf drei Ebenen zur Verbesserung des PE bei: Erstens untersuchen wir, ob andere Interaktionsmodalitäten als Maus und Tastatur das PE unterstützen können, indem wir die Übersetzungsumgebung MMPE entwickeln und testen. MMPE ermöglicht es, Text handschriftlich, per Sprache oder über Handgesten zu verändern, Wörter per Drag & Drop neu anzuordnen oder all diese Eingabemodalitäten zu kombinieren. Zweitens stellen wir ein Sensor-Framework vor, das eine Vielzahl physiologischer und verhaltensbezogener Messwerte verwendet, um die kognitive Last (KL) abzuschätzen. In drei Studien konnten wir zeigen, dass multimodale Messung von Augen-, Herz- und Hautmerkmalen verwendet werden kann, um Übersetzungsumgebungen an die KL der Übersetzer anzupassen. Drittens stellen wir zwei Multi-Encoder-Transformer-Architekturen für das automatische Post-Editieren (APE) vor und erörtern, wie diese die MÜ-Ausgabe an eine Domäne anpassen und dadurch die Korrektur von sich wiederholenden MÜ-Fehlern vermeiden können.Deutsche Forschungsgemeinschaft (DFG), Projekt MMP

    The Professional Profile of a Post-editor according to LSCs and Linguists: a Survey-Based Research

    Get PDF
    The boundaries between translation technologies are fading and language professionals are heading towards a pluri- and transdisciplinary job description, for which the use of CAT tools, translation management systems, and machine translation (MT) are compulsory. “Language paraprofessionals”, “paralinguists”, “language consultants”, “digital linguists”, and a long list of other titles is emerging to refer to the professionals who master a number of features of several tools, while remaining attentive to linguistics (see Bond 2018). According to TAUS DQF Dashboard data presented in TAUS Newsletter the 1st of May of 2019, the industry averages show that 9.7% of the translation output origin comes from MT and that 1,057 words per hour are post-edited on average. This has clear repercussions on the profession from the employability perspective.With 66 submissions by LSCs and industry stakeholders, and 142 answers from individuals (in-house or freelance translators), we present the most salient subject matters from and for the translation industry regarding MT post-editing. Some represent gaps to be filled; others represent common ground already found. Thanks to this up-to-date knowledge of the globalization landscape, clear goals can be set, and the way is paved for evolution.&nbsp

    Retrieve and Refine: Improved Sequence Generation Models For Dialogue

    Full text link
    Sequence generation models for dialogue are known to have several problems: they tend to produce short, generic sentences that are uninformative and unengaging. Retrieval models on the other hand can surface interesting responses, but are restricted to the given retrieval set leading to erroneous replies that cannot be tuned to the specific context. In this work we develop a model that combines the two approaches to avoid both their deficiencies: first retrieve a response and then refine it -- the final sequence generator treating the retrieval as additional context. We show on the recent CONVAI2 challenge task our approach produces responses superior to both standard retrieval and generation models in human evaluations

    Generating Text Sequence Images for Recognition

    Full text link
    Recently, methods based on deep learning have dominated the field of text recognition. With a large number of training data, most of them can achieve the state-of-the-art performances. However, it is hard to harvest and label sufficient text sequence images from the real scenes. To mitigate this issue, several methods to synthesize text sequence images were proposed, yet they usually need complicated preceding or follow-up steps. In this work, we present a method which is able to generate infinite training data without any auxiliary pre/post-process. We tackle the generation task as an image-to-image translation one and utilize conditional adversarial networks to produce realistic text sequence images in the light of the semantic ones. Some evaluation metrics are involved to assess our method and the results demonstrate that the caliber of the data is satisfactory. The code and dataset will be publicly available soon
    corecore