Search CORE

10,871 research outputs found

Learn to Interpret Atari Agents

Author: Bai Song
Torr Philip H. S.
Yang Zhao
Zhang Li
Publication venue
Publication date: 24/01/2019
Field of study

Deep Reinforcement Learning (DeepRL) agents surpass human-level performances in a multitude of tasks. However, the direct mapping from states to actions makes it hard to interpret the rationale behind the decision making of agents. In contrast to previous a-posteriori methods of visualizing DeepRL policies, we propose an end-to-end trainable framework based on Rainbow, a representative Deep Q-Network (DQN) agent. Our method automatically learns important regions in the input domain, which enables characterizations of the decision making and interpretations for non-intuitive behaviors. Hence we name it Region Sensitive Rainbow (RS-Rainbow). RS-Rainbow utilizes a simple yet effective mechanism to incorporate visualization ability into the learning model, not only improving model interpretability, but leading to improved performance. Extensive experiments on the challenging platform of Atari 2600 demonstrate the superiority of RS-Rainbow. In particular, our agent achieves state of the art at just 25% of the training frames. Demonstrations and code are available at https://github.com/yz93/Learn-to-Interpret-Atari-Agents

arXiv.org e-Print Archive

Integrable Open Spin Chains from Flavored ABJM Theory

Author: Bai Nan
Chen Hui-Huang
He Song
Wu Jun-Bao
Yang Wen-Li
Zhu Meng-Qi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

We compute the two-loop anomalous dimension matrix in the scalar sector of planar

{\cal N}=3

flavored ABJM theory. Using coordinate Bethe ansatz, we obtain the reflection matrix and confirm that the boundary Yang-Baxter equations are satisfied. This establishes the integrability of this theory in the scalar sector at the two-loop order.Comment: v2, 25 pages, 2 figures, minor corrections, references adde

arXiv.org e-Print Archive

Directory of Open Access Journals

MPG.PuRe

$\eta_{Q}$ meson photoproduction in ultrarelativistic heavy ion collisions

Author: Bai Zhen
Cai Yan-Bing
Wang Jian-Song
Yang Hai-Tao
Yu Gong-Ming
Zhao Gao-Gao
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2017
Field of study

The transverse momentum distributions for inclusive

\eta_{c,b}

meson described by gluon-gluon interactions from photoproduction processes in relativistic heavy ion collisions are calculated. We considered the color singlet (CS) and color octet (CO) components with the framework of non-relativistic Quantum Chromodynamics (NRQCD) into the production of heavy quarkonium. The phenomenological values of the matrix elements for the color-singlet and color-octet components give the main contribution to the production of heavy quarkonium from the gluon-gluon interaction caused by the emission of additional gluon in the initial state. The numerical results indicate that the contribution of photoproduction processes cannot be negligible for mid-rapidity in p-p and Pb-Pb collisions at the Large Hadron Collider (LHC) energies.Comment: 11 pages, 2 figure

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Two Sides of the Same Coin: White-box and Black-box Attacks for Transfer Learning

Author: Bai Kun
Liang Jian
Song Yangqiu
Yang Qiang
Zhang Yinghua
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 25/08/2020
Field of study

Transfer learning has become a common practice for training deep learning models with limited labeled data in a target domain. On the other hand, deep models are vulnerable to adversarial attacks. Though transfer learning has been widely applied, its effect on model robustness is unclear. To figure out this problem, we conduct extensive empirical evaluations to show that fine-tuning effectively enhances model robustness under white-box FGSM attacks. We also propose a black-box attack method for transfer learning models which attacks the target model with the adversarial examples produced by its source model. To systematically measure the effect of both white-box and black-box attacks, we propose a new metric to evaluate how transferable are the adversarial examples produced by a source model to a target model. Empirical results show that the adversarial examples are more transferable when fine-tuning is used than they are when the two networks are trained independently

arXiv.org e-Print Archive

Crossref