23,519 research outputs found

    Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input

    Full text link
    Non-autoregressive translation (NAT) models, which remove the dependence on previous target tokens from the inputs of the decoder, achieve significantly inference speedup but at the cost of inferior accuracy compared to autoregressive translation (AT) models. Previous work shows that the quality of the inputs of the decoder is important and largely impacts the model accuracy. In this paper, we propose two methods to enhance the decoder inputs so as to improve NAT models. The first one directly leverages a phrase table generated by conventional SMT approaches to translate source tokens to target tokens, which are then fed into the decoder as inputs. The second one transforms source-side word embeddings to target-side word embeddings through sentence-level alignment and word-level adversary learning, and then feeds the transformed word embeddings into the decoder as inputs. Experimental results show our method largely outperforms the NAT baseline~\citep{gu2017non} by 5.115.11 BLEU scores on WMT14 English-German task and 4.724.72 BLEU scores on WMT16 English-Romanian task.Comment: AAAI 201

    BCS thermal vacuum of fermionic superfluids and its perturbation theory

    Full text link
    The thermal field theory is applied to fermionic superfluids by doubling the degrees of freedom of the BCS theory. We construct the two-mode states and the corresponding Bogoliubov transformation to obtain the BCS thermal vacuum. The expectation values with respect to the BCS thermal vacuum produce the statistical average of the thermodynamic quantities. The BCS thermal vacuum allows a quantum-mechanical perturbation theory with the BCS theory serving as the unperturbed state. We evaluate the leading-order corrections to the order parameter and other physical quantities from the perturbation theory. A direct evaluation of the pairing correlation as a function of temperature shows the pseudogap phenomenon results from the perturbation theory. The BCS thermal vacuum is shown to be a generalized coherent and squeezed state. The correspondence between the thermal vacuum and purification of the density matrix allows a unitary transformation, and we found the geometric phase in the parameter space associated with the transformation.Comment: 14 pages, 2 figure

    Efficient high order semi-implicit time discretization and local discontinuous Galerkin methods for highly nonlinear PDEs

    Get PDF
    International audienceIn this paper, we develop a high order semi-implicit time discretization method for highly nonlinear PDEs, which consist of the surface diffusion and Willmore flow of graphs, the Cahn-Hilliard equation and the Allen-Cahn/Cahn-Hilliard system. These PDEs are high order in spatial derivatives, which motivates us to develop implicit or semi-implicit time marching methods to relax the severe time step restriction for stability of explicit methods. In addition, these PDEs are also highly nonlinear, fully implicit method will incredibly increase the difficulty of implementation. In particular, we can not well separate the stiff and non-stiff components for these problems, which leads to the traditional implicit-explicit methods nearly meaningless. In this paper, a high order semi-implicit time marching method and the local discontinuous Galerkin spatial method are coupled together to achieve high order accuracy in both space and time, and to enhance the efficiency of the proposed approaches, the resulting linear or nonlinear algebraic systems are solved by multigrid solver. Numerical simulation results in one and two dimensions are presented to illustrate that the combination of the local discontinuous Galerkin method for spatial approximation, semi-implicit temporal integration with the multigrid solver provides a practical and efficient approach when solving this family of problems
    • …
    corecore