    MultiMWE: building a multi-lingual multi-word expression (MWE) parallel corpora

    Multi-word expressions (MWEs) are a hot topic in natural language processing (NLP) research, including topics such as MWE detection, MWE decomposition, and research investigating the exploitation of MWEs in other NLP fields such as Machine Translation. However, the availability of bilingual or multi-lingual MWE corpora is very limited. The only bilingual MWE corpus that we are aware of is from the PARSEME (PARSing and Multi-word Expressions) EU project. This is a small collection of only 871 pairs of English-German MWEs. In this paper, we present multi-lingual and bilingual MWE corpora that we have extracted from root parallel corpora. After filtering, our collections contain 3,159,226 and 143,042 bilingual MWE pairs for German-English and Chinese-English respectively. We examine the quality of these extracted bilingual MWEs in MT experiments. Our initial experiments applying MWEs in MT show improved translation performance on MWE terms in qualitative analysis and better general evaluation scores in quantitative analysis, on both German-English and Chinese-English language pairs. We follow a standard experimental pipeline to create our MultiMWE corpora, which are available online. Researchers can use these free corpora for their own models or in a knowledge base as model features.

    Proceedings

    Proceedings of the Workshop on Annotation and Exploitation of Parallel Corpora AEPC 2010. Editors: Lars Ahrenberg, Jörg Tiedemann and Martin Volk. NEALT Proceedings Series, Vol. 10 (2010), 98 pages. © 2010 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/15893

    Findings of the WMT 2020 Biomedical Translation Shared Task: Basque, Italian and Russian as New Additional Languages

    Machine translation of scientific abstracts and terminologies has the potential to support health professionals and biomedical researchers in some of their activities. In the fifth edition of the WMT Biomedical Task, we addressed a total of eight language pairs. Five language pairs were previously addressed in past editions of the shared task, namely, English/German, English/French, English/Spanish, English/Portuguese, and English/Chinese. Three additional language pairs were also introduced this year: English/Russian, English/Italian, and English/Basque. The task addressed the evaluation of both scientific abstracts (all language pairs) and terminologies (English/Basque only). We received submissions from a total of 20 teams. For recurring language pairs, we observed an improvement in the translations in terms of automatic scores and qualitative evaluations, compared to previous years.

    A language-independent method for the alignement of parallel corpora

    PACLIC 20 / Wuhan, China / 1-3 November, 200

    Towards a Re-Definition of Government Interpreters' Agency Against a Backdrop of Sociopolitical and Cultural Evolution: A Case of Premier's Press Conferences in China

    The sociopolitical and cultural evolution resulting from the Reform and Opening up in 1978, facilitated not least by the inexorable juggernaut of globalization and technological advancement, has revolutionized the way China engages domestically and interacts with the outside world. The need for more proactive diplomacy and open engagement led to the institutionalization of the interpreter-mediated premier's press conferences. Such a discursive event provides a vital platform for China to articulate its discourse and rebrand its image in tandem with the profound changes signaled by the Dengist reform. This chapter critically investigates how political press conference interpreting and interpreters' agency in China are affected by such dramatic transformations. It is revealed that, while interpreters are confronted with seemingly conflicting expectations, in actual practice they are often able to negotiate a way as highly competent interpreting professionals with the additional missions of advancing China's global engagement and safeguarding China's national interests.