10,871 research outputs found
Learn to Interpret Atari Agents
Deep Reinforcement Learning (DeepRL) agents surpass human-level performances
in a multitude of tasks. However, the direct mapping from states to actions
makes it hard to interpret the rationale behind the decision making of agents.
In contrast to previous a-posteriori methods of visualizing DeepRL policies, we
propose an end-to-end trainable framework based on Rainbow, a representative
Deep Q-Network (DQN) agent. Our method automatically learns important regions
in the input domain, which enables characterizations of the decision making and
interpretations for non-intuitive behaviors. Hence we name it Region Sensitive
Rainbow (RS-Rainbow). RS-Rainbow utilizes a simple yet effective mechanism to
incorporate visualization ability into the learning model, not only improving
model interpretability, but leading to improved performance. Extensive
experiments on the challenging platform of Atari 2600 demonstrate the
superiority of RS-Rainbow. In particular, our agent achieves state of the art
at just 25% of the training frames. Demonstrations and code are available at
https://github.com/yz93/Learn-to-Interpret-Atari-Agents
Integrable Open Spin Chains from Flavored ABJM Theory
We compute the two-loop anomalous dimension matrix in the scalar sector of
planar flavored ABJM theory. Using coordinate Bethe ansatz, we
obtain the reflection matrix and confirm that the boundary Yang-Baxter
equations are satisfied. This establishes the integrability of this theory in
the scalar sector at the two-loop order.Comment: v2, 25 pages, 2 figures, minor corrections, references adde
meson photoproduction in ultrarelativistic heavy ion collisions
The transverse momentum distributions for inclusive meson
described by gluon-gluon interactions from photoproduction processes in
relativistic heavy ion collisions are calculated. We considered the color
singlet (CS) and color octet (CO) components with the framework of
non-relativistic Quantum Chromodynamics (NRQCD) into the production of heavy
quarkonium. The phenomenological values of the matrix elements for the
color-singlet and color-octet components give the main contribution to the
production of heavy quarkonium from the gluon-gluon interaction caused by the
emission of additional gluon in the initial state. The numerical results
indicate that the contribution of photoproduction processes cannot be
negligible for mid-rapidity in p-p and Pb-Pb collisions at the Large Hadron
Collider (LHC) energies.Comment: 11 pages, 2 figure
Two Sides of the Same Coin: White-box and Black-box Attacks for Transfer Learning
Transfer learning has become a common practice for training deep learning
models with limited labeled data in a target domain. On the other hand, deep
models are vulnerable to adversarial attacks. Though transfer learning has been
widely applied, its effect on model robustness is unclear. To figure out this
problem, we conduct extensive empirical evaluations to show that fine-tuning
effectively enhances model robustness under white-box FGSM attacks. We also
propose a black-box attack method for transfer learning models which attacks
the target model with the adversarial examples produced by its source model. To
systematically measure the effect of both white-box and black-box attacks, we
propose a new metric to evaluate how transferable are the adversarial examples
produced by a source model to a target model. Empirical results show that the
adversarial examples are more transferable when fine-tuning is used than they
are when the two networks are trained independently
- …