6 research outputs found

Head finalization reordering for Chinese-to-Japanese machine translation

Author: Hajime Tsukada
Han Dan
Katsuhito Sudoh
Kevin Duh
Masaaki Nagata
Xianchao Wu
Publication date: 01/01/2012

Abstract In Statistical Machine Translation, reordering rules have proved useful in extracting bilingual phrases and in decoding during translation between languages that are structurally different. Linguistically motivated rules have been incorporated into Chinese-to-Englis…

CiteSeerX

Effects of Parsing Errors on Pre-reordering Performance for Chinese-to-Japanese SMT

Author: Han Dan
Martinez-Gomez Pascual
Miyao Yusuke
Nagata Masaaki
Sudoh Katsuhito
Publication venue: Department of English, National Chengchi University
Publication date: 01/01/2013

Waseda University Repository

Empirical Dependency-Based Head Finalization for Statistical Chinese-, English-, and French-to-Myanmar (Burmese) Machine Translation

Author: Andrew Finch
Chenchen Ding
Eiichiro Sumita
Masao Utiyama
Ye Kyaw Thu
Publication date: 23/04/2020

Abstract We conduct dependency-based head finalization for statistical machine translation (SMT) for Myanmar (Burmese). Although Myanmar is an understudied language, linguistically it is a head-final language with syntax similar to that of Japanese and Korean, so applying the efficient techniques of Japanese and Korean processing to Myanmar is a natural idea. Our approach combines two methods: the first is head-driven phrase structure grammar (HPSG) based head finalization for English-to-Japanese translation; the second is dependency-based pre-ordering originally designed for English-to-Korean translation. We experiment on Chinese-, English-, and French-to-Myanmar translation, using a statistical pre-ordering approach as a comparison method. Experimental results show that dependency-based head finalization consistently improved a baseline SMT system across different source languages and different segmentation schemes for the Myanmar language.

CiteSeerX

Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation

Author: Chu Chenhui
Kurohashi Sadao
Mao Zhuoyuan
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/01/2022

Abstract In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource neural machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and English-specific sequence to sequence (ENSS) for language pairs involving English. JASS focuses on masking and reordering Japanese linguistic units known as bunsetsu, whereas ENSS is based on phrase structure masking and reordering tasks. Experiments on the ASPEC Japanese–English and Japanese–Chinese, Wikipedia Japanese–Chinese, and News English–Korean corpora demonstrate that JASS and ENSS outperform MASS and other existing language-agnostic pre-training methods by up to +2.9 BLEU points for the Japanese–English tasks, up to +7.0 BLEU points for the Japanese–Chinese tasks, and up to +1.3 BLEU points for the English–Korean tasks. Empirical analysis, which focuses on the relationship between individual parts in JASS and ENSS, reveals the complementary nature of the subtasks of JASS and ENSS. Adequacy evaluation using LASER, human evaluation, and case studies reveals that our proposed methods significantly outperform pre-training methods without injected linguistic knowledge and that they have a larger positive impact on adequacy than on fluency.

arXiv.org e-Print Archive

Kyoto University Research Information Repository

Error propagation

Author: Lê Minh Ngoc
Publication venue: Independently published
Publication date: 28/05/2021

VU Research Portal