Search CORE

27 research outputs found

Tree-to-string alignment template for statistical machine translation

Author
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2006
Field of study

Crossref

Statistical Translation Model Based On Source Syntax Structure

Author: Liu Qun
Liu Yang
Mi Haitao
Publication venue: Institute of Digital Enhancement of Cognitive Processing, Waseda University
Publication date: 01/01/2011
Field of study

Waseda University Repository

Chunk-Based Bi-Scale Decoder for Neural Machine Translation

Author: Chen Jiajun
Huang Shujian
Li Hang
Liu Xiaohua
Tu Zhaopeng
Zhou Hao
Publication venue
Publication date: 01/01/2017
Field of study

In typical neural machine translation~(NMT), the decoder generates a sentence word by word, packing all linguistic granularities in the same time-scale of RNN. In this paper, we propose a new type of decoder for NMT, which splits the decode state into two parts and updates them in two different time-scales. Specifically, we first predict a chunk time-scale state for phrasal modeling, on top of which multiple word time-scale states are generated. In this way, the target sentence is translated hierarchically from chunks to words, with information in different granularities being leveraged. Experiments show that our proposed model significantly improves the translation performance over the state-of-the-art NMT model.Comment: Accepted as a short paper by ACL 201

arXiv.org e-Print Archive

Crossref

Graph-to-Sequence Learning using Gated Graph Neural Networks

Author: Beck Daniel
Cohn Trevor
Haffari Gholamreza
Publication venue
Publication date: 01/01/2018
Field of study

Many NLP applications can be framed as a graph-to-sequence learning problem. Previous work proposing neural architectures on this setting obtained promising results compared to grammar-based approaches but still rely on linearisation heuristics and/or standard recurrent networks to achieve the best performance. In this work, we propose a new model that encodes the full structural information contained in the graph. Our architecture couples the recently proposed Gated Graph Neural Networks with an input transformation that allows nodes and edges to have their own hidden representations, while tackling the parameter explosion problem present in previous work. Experimental results show that our model outperforms strong baselines in generation from AMR graphs and syntax-based neural machine translation.Comment: ACL 201

arXiv.org e-Print Archive

Crossref

Monash University Research Portal