11,080 research outputs found

The Lower The Simpler: Simplifying Hierarchical Recurrent Models

Authors: Jiang Hui, Wang Chao
Publication date: 20/05/2019

To improve the training efficiency of hierarchical recurrent models without compromising their performance, we propose a strategy named "the lower the simpler", which simplifies the baseline models by making the lower layers simpler than the upper layers. We apply this strategy to two typical hierarchical recurrent models, namely Hierarchical Recurrent Encoder-Decoder (HRED) and R-NET, whose basic building block is the GRU. Specifically, we propose the Scalar Gated Unit (SGU), a simplified variant of the GRU, and use it to replace the GRUs at the middle layers of HRED and R-NET. We also use Fixed-size Ordinally-Forgetting Encoding (FOFE), an efficient encoding method without any trainable parameters, to replace the GRUs at the bottom layers of HRED and R-NET. The experimental results show that the simplified HRED and the simplified R-NET contain significantly fewer trainable parameters, consume significantly less training time, and achieve slightly better performance than their baseline models.
Comment: NAACL-HLT 201
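The FOFE idea mentioned in this abstract can be illustrated with a minimal sketch. It assumes the commonly cited recursion z_t = alpha * z_{t-1} + e_t over one-hot token vectors, with a fixed forgetting factor alpha. The function name `fofe_encode` and its parameters are hypothetical, and the abstract does not say how the authors wire FOFE into HRED or R-NET.

```python
def fofe_encode(token_ids, vocab_size, alpha=0.5):
    """Encode a token sequence into one fixed-size vector.

    Assumed recursion: z_t = alpha * z_{t-1} + e_t, with z_0 = 0 and e_t the
    one-hot vector of the t-th token. alpha is a fixed constant, so the
    encoder has no trainable parameters.
    """
    z = [0.0] * vocab_size
    for token in token_ids:
        # Decay everything seen so far, then add the current token.
        z = [alpha * value for value in z]
        z[token] += 1.0
    return z


if __name__ == "__main__":
    # Token 0 appears first and last, token 1 in the middle.
    print(fofe_encode([0, 1, 0], vocab_size=3, alpha=0.5))
```

Because older tokens are multiplied by alpha more often, the position of each token is reflected in the size of its entry, which is how a single vector can carry word order.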

arXiv.org e-Print Archive

Explicit Utilization of General Knowledge in Machine Reading Comprehension

Authors: Jiang Hui, Wang Chao
Publication date: 20/05/2019

To bridge the gap between Machine Reading Comprehension (MRC) models and human beings, which is mainly reflected in the hunger for data and the robustness to noise, we explore how to integrate the neural networks of MRC models with the general knowledge of human beings. On the one hand, we propose a data enrichment method, which uses WordNet to extract inter-word semantic connections as general knowledge from each given passage-question pair. On the other hand, we propose an end-to-end MRC model named Knowledge Aided Reader (KAR), which explicitly uses the extracted general knowledge to assist its attention mechanisms. Based on the data enrichment method, KAR is comparable in performance with the state-of-the-art MRC models, and significantly more robust to noise than they are. When only a subset (20%-80%) of the training examples is available, KAR outperforms the state-of-the-art MRC models by a large margin, and is still reasonably robust to noise.
Comment: ACL 201

arXiv.org e-Print Archive

The Strong Decays of Orbitally Excited $B^{*}_{sJ}$ Mesons by Improved Bethe-Salpeter Method

Authors: Fu Hui-Feng, Jiang Yue, Wang Guo-Li, Wang Zhi-Hui
Publication venue: Elsevier BV
Publication date: 06/02/2012

We calculate the masses and the strong decays of the orbitally excited states $B_{s0}$, $B'_{s1}$, $B_{s1}$ and $B_{s2}$ by the improved Bethe-Salpeter method. The predicted masses of $B_{s0}$ and $B'_{s1}$ are $M_{B_{s0}}=5.723\pm0.280~{\rm GeV}$ and $M_{B'_{s1}}=5.774\pm0.330~{\rm GeV}$. We calculate the isospin symmetry violating decay processes $B_{s0}\to B_s \pi$ and $B'_{s1}\to B_s^* \pi$ through $\pi^0-\eta$ mixing and get small widths. Considering the uncertainties of the masses, for $B_{s0}$ and $B'_{s1}$ we also calculate the OZI allowed decay channels $B_{s0}\to B\bar K$ and $B'_{s1}\to B^*\bar K$. For $B_{s1}$ and $B_{s2}$, the OZI allowed decay channels $B_{s1}\to B^{*}\bar K$, $B_{s2}\to B\bar K$ and $B_{s2}\to B^{*}\bar K$ are studied. In all the decay channels, the reduction formula, PCAC relation and low energy theorem are used to estimate the decay widths. We also obtain the strong coupling constants $G_{B_{s0}B_s\pi}$, $G_{B_{s0}B\bar K}$, $G_{B'_{s1}B_s^*\pi}$, $F_{B'_{s1}B_s^*\pi}$, $G_{B'_{s1}B^*\bar K}$, $F_{B'_{s1}B^*\bar K}$, $G_{B_{s1}B^{*}\bar K}$, $F_{B_{s1}B^{*}\bar K}$, $G_{B_{s2}B\bar K}$ and $G_{B_{s2}B^{*}\bar K}$.
Comment: 21 pages, 1 figure, 4 table

arXiv.org e-Print Archive

Deep Learning for Object Saliency Detection and Image Segmentation

Authors: Jiang Hui, Pan Hengyue, Wang Bo
Publication date: 05/05/2015

In this paper, we propose several novel deep learning methods for object saliency detection based on convolutional neural networks. In our approach, we use a gradient descent method to iteratively modify an input image based on the pixel-wise gradients to reduce a cost function measuring the class-specific objectness of the image. The pixel-wise gradients can be efficiently computed using the back-propagation algorithm. The discrepancy between the modified image and the original one may be used as a saliency map for the image. Moreover, we propose several new training methods to learn saliency-specific convolutional nets for object saliency detection, in order to leverage the available pixel-wise segmentation information. Our methods are computationally efficient (processing 20-40 images per second on one GPU). In this work, we use the computed saliency maps for image segmentation. Experimental results on two benchmark tasks, namely Microsoft COCO and Pascal VOC 2012, show that our proposed methods can generate high-quality saliency maps, clearly outperforming many existing methods. In particular, our approaches excel in handling many difficult images, which contain complex backgrounds, highly variable salient objects, multiple objects, and/or very small salient objects.
Comment: 9 pages, 126 figures, technical repor
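The input-modification idea described in this abstract can be sketched on a toy problem. This is not the authors' network: the sketch assumes a stand-in "objectness" cost, a sigmoid of a weighted pixel sum, whose pixel-wise gradient is available in closed form (in a real CNN, back-propagation would supply it). The name `saliency_map`, the weights, the step count and the learning rate are all hypothetical.

```python
import math


def saliency_map(pixels, weights, steps=50, lr=0.5):
    """Toy saliency: descend a cost with respect to the input, then diff.

    Assumed cost: sigmoid(sum_i weights[i] * x[i]), standing in for a
    class-specific objectness score. Pixels are kept in [0, 1].
    """
    x = list(pixels)
    for _ in range(steps):
        score = sum(w * v for w, v in zip(weights, x))
        s = 1.0 / (1.0 + math.exp(-score))
        # d cost / d x_i = s * (1 - s) * weights[i]
        x = [min(1.0, max(0.0, v - lr * s * (1.0 - s) * w))
             for v, w in zip(x, weights)]
    # The discrepancy between modified and original input is the saliency map.
    return [abs(a - b) for a, b in zip(pixels, x)]


if __name__ == "__main__":
    # Only the first pixel influences the toy cost, so only it is salient.
    print(saliency_map([0.5, 0.5, 0.5], [2.0, 0.0, 0.0]))
```

Pixels that the cost does not depend on are left untouched, so they receive zero saliency, while pixels that drive the score are pushed hardest and stand out in the difference.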

arXiv.org e-Print Archive

The uniform local asymptotics of the total net loss process in a new time-dependent bidimensional renewal model

Authors: Jiang Tao, Wang Yuebao, Xu Hui
Publication date: 15/06/2017

In this paper, we consider a bidimensional renewal risk model with constant force of interest, in which the claim size vector, with certain local subexponential marginal distributions, and its inter-arrival time are subject to a new time-dependence structure. We obtain the uniform local asymptotics of the total net loss process in the model. Moreover, some specific examples of joint distributions satisfying the conditions of the dependence structure are given. Finally, to illustrate a condition of the above result, we find, for the first time, a local subexponential distribution whose local distribution is not almost decreasing.
Comment: 26 page

arXiv.org e-Print Archive

Derivation of the gap and Bethe-Salpeter equations at large $N_c$ limit and symmetry preserving truncations

Authors: Fu Hui-Feng, Jiang Libo, Wang Qing
Publication venue: American Physical Society (APS)
Publication date: 12/10/2017

We develop a framework for deriving the Dyson-Schwinger Equations (DSEs) and the Bethe-Salpeter Equation (BSE) in QCD in the large $N_c$ limit. The starting point is a modified form (with auxiliary fields) of the QCD generating functional. This framework provides a natural order-by-order truncation scheme for the DSEs and BSE, and the kernels of the equations up to any order are explicitly given. Chiral symmetry (in the chiral limit) is preserved at any order of truncation, so the framework exemplifies a symmetry preserving truncation scheme. It provides a method to study the DSEs and BSE beyond the Rainbow-Ladder truncation, and is especially useful for studying contributions from non-Abelian dynamics (those arising from gluon self-interactions). We also derive the equation for the quark-ghost scattering kernel, and discuss the Slavnov-Taylor identity connecting the quark-gluon vertex, the quark propagator and the quark-ghost scattering kernel.
Comment: 19 pages, 2 figure

arXiv.org e-Print Archive

On the values of representation functions II

Authors: Jiang Xing-Wang, Sandor Csaba, Yang Quan-Hui
Publication date: 23/04/2019

For a set $A$ of nonnegative integers, let $R_2(A,n)$ and $R_3(A,n)$ denote the number of solutions to $n=a+a'$ with $a,a'\in A$, $a<a'$ and $a\leq a'$, respectively. In this paper, we prove that, if $A\subseteq \mathbb{N}$ and $N$ is a positive integer such that $R_2(A,n)=R_2(\mathbb{N}\setminus A,n)$ for all $n\geq2N-1$, then for any $\theta$ with $0<\theta<\frac{2\log2-\log3}{42\log 2-9\log3}$, the set of integers $n$ with $R_2(A,n)=\frac{n}{8}+O(n^{1-\theta})$ has density one. A similar result holds for $R_3(A,n)$. These improve the results of the first author.
Comment: 12 page
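The two representation functions defined in this abstract can be computed by brute force for a finite set, which may help make the definitions concrete. The sketch below follows the stated definitions directly; the function names `representation_counts` and `theta_upper_bound` are hypothetical, and the second one only evaluates the bound on $\theta$ quoted in the abstract.

```python
import math


def representation_counts(A, n):
    """Return (R_2(A, n), R_3(A, n)) for a finite set A of nonnegative integers.

    R_2 counts solutions of n = a + a' with a, a' in A and a < a';
    R_3 counts solutions with a <= a'.
    """
    members = set(A)
    r2 = sum(1 for a in members if a < n - a and (n - a) in members)
    r3 = sum(1 for a in members if a <= n - a and (n - a) in members)
    return r2, r3


def theta_upper_bound():
    """Evaluate (2 log 2 - log 3) / (42 log 2 - 9 log 3) from the abstract."""
    return (2 * math.log(2) - math.log(3)) / (42 * math.log(2) - 9 * math.log(3))


if __name__ == "__main__":
    # 4 = 1 + 3 is the only solution with a < a'; 4 = 2 + 2 is added for a <= a'.
    print(representation_counts({0, 1, 2, 3}, 4))
    print(theta_upper_bound())
```

The two counts differ only when $n$ is even and $n/2$ belongs to $A$, since that is the single case in which $a=a'$ can occur.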

arXiv.org e-Print Archive

The Production of $X(3940)$ and $X(4160)$ in $B_c$ decays

Authors: Jiang Yue, Wang Guo-Li, Wang Tian-hong, Wang Zhi-Hui, Zhang Yi
Publication venue: IOP Publishing
Publication date: 02/09/2016

Considering $X(3940)$ and $X(4160)$ as $\eta_c(3S)$ and $\eta_c(4S)$, we study the production of $X(3940)$ and $X(4160)$ in exclusive weak decays of the $B_c$ meson by the improved Bethe-Salpeter (B-S) method. Using the relativistic B-S equation and the Mandelstam formalism, we calculate the corresponding decay form factors. The predicted branching ratios are $Br(B_c^+\to X(3940)e^+\nu_e)=1.0\times10^{-4}$ and $Br(B_c^+\to X(4160)e^+\nu_e)=2.4\times10^{-5}$. These results provide a new way to observe $X(3940)$ and $X(4160)$ in the future, as well as to improve the knowledge of $B_c$ meson decays.
Comment: 15 pages, 7 figure

arXiv.org e-Print Archive

The weak decay $B_c$ to $Z(3930)$ and $X(4160)$ by Bethe-Salpeter method

Authors: Jiang Yue, Wang Guo-Li, Wang Tian-hong, Wang Zhi-Hui, Zhang Yi
Publication date: 09/08/2020

Considering $Z(3930)$ and $X(4160)$ as $\chi_{c2}(2P)$ and $\chi_{c2}(3P)$ states, the semileptonic and nonleptonic decays of $B_c$ to $Z(3930)$ and $X(4160)$ are studied by the improved Bethe-Salpeter (B-S) method. The decay form factors are calculated through the overlap integrals of the meson wave functions over the whole accessible kinematical range. The influence of relativistic corrections is considered in the exclusive decays. Branching ratios of $B_c$ weak decays to $Z(3930)$ and $X(4160)$ are predicted. Some of the branching ratios are $Br(B_c^+\to Z(3930)e^+\nu_e)=(3.03^{+0.09}_{-0.16})\times 10^{-4}$ and $Br(B_c^+\to X(4160)e^+\nu_e)=(3.55^{+0.83}_{-0.35})\times 10^{-6}$. These results may provide useful information for discovering $Z(3930)$ and $X(4160)$, and necessary information for the phenomenological study of $B_c$ physics.
Comment: arXiv admin note: substantial text overlap with arXiv:1605.0909

arXiv.org e-Print Archive

Highly birefringent polymer terahertz fiber with honeycomb cladding

Authors: Chang Sheng-jiang, Fan Fei, Hou Yu, Jiang Zi-Wei, Wang Xiang-Hui
Publication date: 04/11/2011

Two highly birefringent polymer terahertz (THz) fibers are proposed in this paper, formed with honeycomb cladding and elliptical air holes in the fiber core. The losses and mode birefringence of the two fibers are investigated by the finite-difference time-domain method. The results show that fiber 2 can achieve both high birefringence (larger than 0.022) and low confinement loss (0.01 dB/m) over a wide THz frequency range. Moreover, compared with a round solid-core fiber, the guiding loss of the THz fiber caused by polymer material absorption can be reduced effectively, since part of the mode power is trapped in the air holes.

arXiv.org e-Print Archive