23,519 research outputs found
Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input
Non-autoregressive translation (NAT) models, which remove the dependence on
previous target tokens from the inputs of the decoder, achieve significantly
inference speedup but at the cost of inferior accuracy compared to
autoregressive translation (AT) models. Previous work shows that the quality of
the inputs of the decoder is important and largely impacts the model accuracy.
In this paper, we propose two methods to enhance the decoder inputs so as to
improve NAT models. The first one directly leverages a phrase table generated
by conventional SMT approaches to translate source tokens to target tokens,
which are then fed into the decoder as inputs. The second one transforms
source-side word embeddings to target-side word embeddings through
sentence-level alignment and word-level adversary learning, and then feeds the
transformed word embeddings into the decoder as inputs. Experimental results
show our method largely outperforms the NAT baseline~\citep{gu2017non} by
BLEU scores on WMT14 English-German task and BLEU scores on WMT16
English-Romanian task.Comment: AAAI 201
BCS thermal vacuum of fermionic superfluids and its perturbation theory
The thermal field theory is applied to fermionic superfluids by doubling the
degrees of freedom of the BCS theory. We construct the two-mode states and the
corresponding Bogoliubov transformation to obtain the BCS thermal vacuum. The
expectation values with respect to the BCS thermal vacuum produce the
statistical average of the thermodynamic quantities. The BCS thermal vacuum
allows a quantum-mechanical perturbation theory with the BCS theory serving as
the unperturbed state. We evaluate the leading-order corrections to the order
parameter and other physical quantities from the perturbation theory. A direct
evaluation of the pairing correlation as a function of temperature shows the
pseudogap phenomenon results from the perturbation theory. The BCS thermal
vacuum is shown to be a generalized coherent and squeezed state. The
correspondence between the thermal vacuum and purification of the density
matrix allows a unitary transformation, and we found the geometric phase in the
parameter space associated with the transformation.Comment: 14 pages, 2 figure
Efficient high order semi-implicit time discretization and local discontinuous Galerkin methods for highly nonlinear PDEs
International audienceIn this paper, we develop a high order semi-implicit time discretization method for highly nonlinear PDEs, which consist of the surface diffusion and Willmore flow of graphs, the Cahn-Hilliard equation and the Allen-Cahn/Cahn-Hilliard system. These PDEs are high order in spatial derivatives, which motivates us to develop implicit or semi-implicit time marching methods to relax the severe time step restriction for stability of explicit methods. In addition, these PDEs are also highly nonlinear, fully implicit method will incredibly increase the difficulty of implementation. In particular, we can not well separate the stiff and non-stiff components for these problems, which leads to the traditional implicit-explicit methods nearly meaningless. In this paper, a high order semi-implicit time marching method and the local discontinuous Galerkin spatial method are coupled together to achieve high order accuracy in both space and time, and to enhance the efficiency of the proposed approaches, the resulting linear or nonlinear algebraic systems are solved by multigrid solver. Numerical simulation results in one and two dimensions are presented to illustrate that the combination of the local discontinuous Galerkin method for spatial approximation, semi-implicit temporal integration with the multigrid solver provides a practical and efficient approach when solving this family of problems
- …