307 research outputs found

    BattRAE: Bidimensional Attention-Based Recursive Autoencoders for Learning Bilingual Phrase Embeddings

    Full text link
    In this paper, we propose a bidimensional attention based recursive autoencoder (BattRAE) to integrate clues and sourcetarget interactions at multiple levels of granularity into bilingual phrase representations. We employ recursive autoencoders to generate tree structures of phrases with embeddings at different levels of granularity (e.g., words, sub-phrases and phrases). Over these embeddings on the source and target side, we introduce a bidimensional attention network to learn their interactions encoded in a bidimensional attention matrix, from which we extract two soft attention weight distributions simultaneously. These weight distributions enable BattRAE to generate compositive phrase representations via convolution. Based on the learned phrase representations, we further use a bilinear neural model, trained via a max-margin method, to measure bilingual semantic similarity. To evaluate the effectiveness of BattRAE, we incorporate this semantic similarity as an additional feature into a state-of-the-art SMT system. Extensive experiments on NIST Chinese-English test sets show that our model achieves a substantial improvement of up to 1.63 BLEU points on average over the baseline.Comment: 7 pages, accepted by AAAI 201

    Translating Phrases in Neural Machine Translation

    Full text link
    Phrases play an important role in natural language understanding and machine translation (Sag et al., 2002; Villavicencio et al., 2005). However, it is difficult to integrate them into current neural machine translation (NMT) which reads and generates sentences word by word. In this work, we propose a method to translate phrases in NMT by integrating a phrase memory storing target phrases from a phrase-based statistical machine translation (SMT) system into the encoder-decoder architecture of NMT. At each decoding step, the phrase memory is first re-written by the SMT model, which dynamically generates relevant target phrases with contextual information provided by the NMT model. Then the proposed model reads the phrase memory to make probability estimations for all phrases in the phrase memory. If phrase generation is carried on, the NMT decoder selects an appropriate phrase from the memory to perform phrase translation and updates its decoding state by consuming the words in the selected phrase. Otherwise, the NMT decoder generates a word from the vocabulary as the general NMT decoder does. Experiment results on the Chinese to English translation show that the proposed model achieves significant improvements over the baseline on various test sets.Comment: Accepted by EMNLP 201

    Stochastic Modelling and Analysis for Bridges under Spatially Varying Ground Motions

    Get PDF
    Earthquake is undoubtedly one of the greatest natural disasters that can induce serious structural damage and huge losses of properties and lives. The resulting destructive consequences not only have made structural seismic analysis and design much more important but have impelled the necessity of more realistic representation of ground motions, such as inclusion of ground motion spatial variations in earthquake modelling and seismic analysis and design of structures. Recorded seismic ground motions exhibit spatial variations in their amplitudes and phases, and the spatial variabilities have an important effect on the responses of structures extended in space, such as long span bridges. Because of the multi-parametric nature and the complexity of the problems, the development of specific design provisions on spatial variabilities of ground motions in modern seismic codes has been impeded. Eurocode 8 is currently the only seismic standard worldwide that gives a set of detailed guidelines to explicitly tackle spatial variabilities of ground motions in bridge design, providing both a simplified design scheme and an analytical approach. However, there is gap between the code-specified provisions in Eurocode 8 and the realistic representation of spatially varying ground motions (SVGM) and the corresponding stochastic vibration analysis (SVA) approaches. This study is devoted to bridge this gap on modelling of SVGM and development of SVA approaches for structures extended in space under SVGM. A complete and realistic SVGM representation approach is developed by accounting for the incoherence effect, wave-passage effect, site-response effect, ground motion nonstationarity, tridirectionality, and spectra-compatibility. This effort brings together various aspects regarding rational seismic scenarios determination, comprehensive methods of accounting for varying site effects, conditional modelling of SVGM nonstationarity, and code-specified ground motion spectra-compatibility. A comprehensive, systematic, and efficient SVA methodology is derived for long span structures under tridirectional nonstationary SVGM. An absolute-response-oriented scheme of pseudo-excitation method and an improved high precision direct integration method are proposed to reduce the enormous computational effort of conventional nonstationary SVA. A scheme accounting for tridirectional varying site-response effect is incorporated in the nonstationary SVA scheme systematically. The proposed highly efficient and accurate SVA approach is implemented and verified in a general finite element analysis platform to make it readily applicable in SVA of complex structures. Based on the proposed SVA approach, parametric studies of two practical long span bridges under SVGM are conducted. To account for spatial randomness and variability of soil properties in soil-structure interaction analysis of structures under SVGM, a meshfree-Galerkin approach is proposed within the Karhunen-Loeve expansion scheme for representation of spatial soil properties modelled as a random field. The meshfree shape functions are proposed as a set of basis functions in the Galerkin scheme to solve integral equation of Karhunen-Loeve expansion, with a proposed optimization scheme in treating the compatibility between the target and analytical covariance models. The accuracy and validity of the meshfree-Galerkin scheme are assessed and demonstrated by representation of covariance models for various homogeneous and nonhomogeneous spatial fields. The developed modelling approaches of SVGM and the derived analytical SVA approaches can be applied to provide more refined solutions for quantitatively assessing code-specified design provisions and developing new design provisions. The proposed meshfree-Galerkin approach can be used to account for spatial randomness and variability of soil properties in soil-structure interaction analysis.4 month
    corecore